Apache Tajo


Tajo is a relational, distributed data warehouse system for Hadoop. It is designed for low-latency, scalable ad-hoc queries, online aggregation, and ETL on large data sets, leveraging advanced database techniques. It supports SQL standards. Tajo has its own query engine, which allows direct control of distributed execution and data flow. As a result, it has a variety of query evaluation strategies and more optimization opportunities. In addition, Tajo will have a native columnar execution engine and its own optimizer.





Requirements

  • Java 1.8 or higher
  • Hadoop 2.3.0 or higher
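
The version minimums listed above can be checked before installing. The following is a minimal sketch, not part of Tajo itself: the helper names `parse_version` and `meets_minimum` are hypothetical, and it assumes you pass in the text printed by `java -version` and `hadoop version`.

```python
import re

# Minimum versions taken from the list above.
MIN_JAVA = (1, 8)
MIN_HADOOP = (2, 3, 0)


def parse_version(text):
    """Return the first dotted version number found in text as a tuple of ints.

    Hypothetical helper: handles strings such as 'java version "1.8.0_292"'
    or 'Hadoop 2.7.3' by splitting the number on '.' and '_'.
    """
    match = re.search(r"\d+(?:[._]\d+)*", text)
    if match is None:
        raise ValueError(f"no version number in {text!r}")
    return tuple(int(part) for part in re.split(r"[._]", match.group()))


def meets_minimum(found, required):
    """Compare two version tuples after padding the shorter one with zeros."""
    width = max(len(found), len(required))
    pad = lambda v: v + (0,) * (width - len(v))
    return pad(found) >= pad(required)


if __name__ == "__main__":
    # Sample strings for illustration; replace them with real command output.
    java_ok = meets_minimum(parse_version('java version "1.8.0_292"'), MIN_JAVA)
    hadoop_ok = meets_minimum(parse_version("Hadoop 2.7.3"), MIN_HADOOP)
    print("Java OK:", java_ok)
    print("Hadoop OK:", hadoop_ok)
```

Padding with zeros means a short version string such as `2.3` is treated the same as `2.3.0`, so it is not rejected just for having fewer components.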

Mailing lists

To subscribe to the mailing lists, please send an email to:


For example, to subscribe to dev, send an email from your desired subscription address to:


and follow the instructions from there.