Mirror of Apache Tajo

Clone this repo:
  1. b68fc2e Use HTTPS instead of HTTP to resolve dependencies (#1067) by Jonathan Leitschuh · 12 months ago master
  2. c0361b9 Change the git repo to the Aapche Tajo Github (#1066) by Hyunsik Choi · 1 year, 8 months ago
  3. fb32619 TAJO-2189: Dictionary encoded text in ORC scanner may cause incorrect result. (#1055) by Jinho Kim · 2 years, 10 months ago
  4. f07f027 Merge pull request #1053 from hyunsik/TAJO-2187 by Hyunsik Choi · 2 years, 10 months ago
  5. 8b644e6 TAJO-2188: Can't start tajo daemon on HDP,CDH by Jinho Kim · 2 years, 10 months ago

Apache Tajo

Tajo is a relational and distributed data warehouse system for Hadoop. Tajo is designed for low-latency and scalable ad-hoc queries, online aggregation and ETL on large-data sets by leveraging advanced database techniques. It supports SQL standards. It has its own query engine which allows direct control of distributed execution and data flow. As a result, Tajo has a variety of query evaluation strategies and more optimization opportunities. In addition, Tajo will have a native columnar execution and and its optimizer.





  • Java 1.8 or higher
  • Hadoop 2.3.0 or higher

Mailing lists

To subscribe to the mailing lists, please send an email to:


For example, to subscribe to dev, send an email from your desired subscription address to:


and follow the instructions from there.