blob: 9a0ff17fb9093856a69e593d58c14232464346f7 [file] [log] [blame]
***************
Introduction
***************
The main goal of Apache Tajo project is to build an advanced open source
data warehouse system in Hadoop for processing web-scale data sets.
Basically, Tajo provides SQL standard as a query language.
Tajo is designed for both interactive and batch queries on data sets
stored on HDFS and other data sources. Without hurting query response
times, Tajo provides fault-tolerance and dynamic load balancing which
are necessary for long-running queries. Tajo employs a cost-based and
progressive query optimization techniques for reoptimizing running
queries in order to avoid the worst query plans.