Merge remote-tracking branch 'origin/master' into hadoop-next

Change-Id: Ib37056d1f273f6efb5b87c9b48512f8b14ab497b
tree: 6ddeb45b567c58900e99160694977a8babb6aff6
  1. be/
  2. bin/
  3. cmake_modules/
  4. common/
  5. docs/
  6. ext-data-source/
  7. fe/
  8. infra/
  9. shell/
  10. ssh_keys/
  11. testdata/
  12. tests/
  13. www/
  14. .clang-format
  15. .clang-tidy
  16. .gitignore
  17. buildall.sh
  18. CMakeLists.txt
  19. DISCLAIMER
  20. EXPORT_CONTROL.md
  21. LICENSE.txt
  22. LOGS.md
  23. NOTICE.txt
  24. README.md
README.md

Welcome to Impala

Lightning-fast, distributed SQL queries for petabytes of data stored in Apache Hadoop clusters.

Impala is a modern, massively-distributed, massively-parallel, C++ query engine that lets you analyze, transform and combine data from a variety of data sources:

  • Best of breed performance and scalability.
  • Support for data stored in HDFS, Apache HBase and Amazon S3.
  • Wide analytic SQL support, including window functions and subqueries.
  • On-the-fly code generation using LLVM to generate CPU-efficient code tailored specifically to each individual query.
  • Support for the most commonly-used Hadoop file formats, including the Apache Parquet (incubating) project.
  • Apache-licensed, 100% open source.

More about Impala

To learn more about Impala as a business user, or to try Impala live or in a VM, please visit the Impala homepage.

If you are interested in contributing to Impala as a developer, or learning more about Impala's internals and architecture, visit the Impala wiki.

Supported Platforms

Impala only supports Linux at the moment.

Build Instructions

See bin/bootstrap_build.sh.

Export Control Notice

This distribution uses cryptographic software and may be subject to export controls. Please refer to EXPORT_CONTROL.md for more information.