TAJO-2189: Dictionary encoded text in ORC scanner may cause incorrect result. (#1055)

3 files changed
tree: 5ef391a80f393f950b2798de19d184151d2d41fc
  1. .gitignore
  2. .reviewboardrc
  3. .travis.yml
  4. BUILDING
  5. CHANGES
  6. LICENSE
  7. NOTICE
  8. README.md
  9. dev-support/
  10. doap_Tajo.rdf
  11. pom.xml
  12. request-patch-review.py
  13. tajo-algebra/
  14. tajo-catalog/
  15. tajo-cli/
  16. tajo-client-example/
  17. tajo-client/
  18. tajo-cluster-tests/
  19. tajo-common/
  20. tajo-core-tests/
  21. tajo-core/
  22. tajo-dist/
  23. tajo-docs/
  24. tajo-jdbc/
  25. tajo-maven-plugins/
  26. tajo-metrics/
  27. tajo-plan/
  28. tajo-project/
  29. tajo-pullserver/
  30. tajo-rpc/
  31. tajo-sql-parser/
  32. tajo-storage/
  33. tajo-tablespace-example/
  34. tajo-thirdparty/
  35. tajo-yarn/
README.md

Apache Tajo

Tajo is a relational and distributed data warehouse system for Hadoop. Tajo is designed for low-latency and scalable ad-hoc queries, online aggregation and ETL on large-data sets by leveraging advanced database techniques. It supports SQL standards. It has its own query engine which allows direct control of distributed execution and data flow. As a result, Tajo has a variety of query evaluation strategies and more optimization opportunities. In addition, Tajo will have a native columnar execution and and its optimizer.

Project

License

Documents

Requirements

  • Java 1.8 or higher
  • Hadoop 2.3.0 or higher

Mailing lists

To subscribe to the mailing lists, please send an email to:

${listname}-subscribe@tajo.apache.org

For example, to subscribe to dev, send an email from your desired subscription address to:

dev-subscribe@tajo.apache.org

and follow the instructions from there.