| <!DOCTYPE html> |
| <html lang="en"> |
| <head> |
| <meta charset="utf-8"> |
| <title>Impala</title> |
| <meta name="viewport" content="width=device-width, initial-scale=1.0"> |
| <meta name="description" content=""> |
| <meta name="author" content=""> |
| |
| <!-- Le styles --> |
| <link href="css/bootstrap.css" rel="stylesheet"> |
| <style type="text/css"> |
| body { |
| padding-top: 20px; |
| padding-bottom: 40px; |
| } |
| |
| /* Custom container */ |
| .container-narrow { |
| margin: 0 auto; |
| max-width: 700px; |
| } |
| .container-narrow > hr { |
| margin: 30px 0; |
| } |
| |
| /* Main marketing message and sign up button */ |
| .jumbotron { |
| margin: 60px 0; |
| text-align: center; |
| } |
| .jumbotron h1 { |
| font-size: 72px; |
| line-height: 1; |
| } |
| .jumbotron .btn { |
| font-size: 21px; |
| padding: 14px 24px; |
| } |
| |
| /* Supporting marketing content */ |
| .marketing { |
| margin: 60px 0; |
| } |
| .marketing p + h4 { |
| margin-top: 28px; |
| } |
| /* Customize the navbar links to be fill the entire space of the .navbar */ |
| .navbar .navbar-inner { |
| padding: 0; |
| } |
| .navbar .nav { |
| margin: 0; |
| display: table; |
| width: 100%; |
| } |
| .navbar .nav li { |
| display: table-cell; |
| width: 1%; |
| float: none; |
| } |
| .navbar .nav li a { |
| font-weight: bold; |
| text-align: center; |
| border-left: 1px solid rgba(255,255,255,.75); |
| border-right: 1px solid rgba(0,0,0,.1); |
| } |
| .navbar .nav li:first-child a { |
| border-left: 0; |
| border-radius: 3px 0 0 3px; |
| } |
| .navbar .nav li:last-child a { |
| border-right: 0; |
| border-radius: 0 3px 3px 0; |
| } |
| </style> |
| <link href="css/bootstrap-responsive.css" rel="stylesheet"> |
| |
| <!-- HTML5 shim, for IE6-8 support of HTML5 elements --> |
| <!--[if lt IE 9]> |
| <script src="js/html5shiv.js"></script> |
| <![endif]--> |
| <script src="js/jquery-3.5.1.min.js"></script> |
| <script src="js/bootstrap-3.3.5.min.js"></script> |
| </head> |
| |
| <body> |
| |
| <div class="container"> |
| |
| <div class="masthead"> |
| <h3 class="muted">Apache Impala</h3> |
| <div class="navbar"> |
| <div class="navbar-inner"> |
| <div class="container"> |
| <ul class="nav"> |
| <li><a href="index.html">Home</a></li> |
| <li><a href="downloads.html">Downloads</a></li> |
| <li class="active"><a href="overview.html">Overview</a></li> |
| <li><a href="https://cwiki.apache.org/confluence/display/IMPALA/Contributing+to+Impala">Contribute</a></li> |
| <li class="dropdown"> |
| <a href='#' class="dropdown-toggle" data-toggle="dropdown" role="button"> |
| Source code |
| </a> |
| <ul class="dropdown-menu"> |
| <li><a href="https://git-wip-us.apache.org/repos/asf/impala.git">Official source</a></li> |
| <li><a href="https://github.com/apache/impala">GitHub mirror</a></li> |
| </ul> |
| </li> |
| <li><a href="community.html">Community</a></li> |
| <li><a href="impala-docs.html">Documentation</a></li> |
| </ul> |
| </div> |
| </div> |
| |
| |
| |
| <div class="row-fluid"> |
| <div class="span12"> |
| <h2>Overview</h2> |
| <p>Impala raises the bar for SQL query performance on Apache Hadoop while retaining a familiar user experience. With Impala, you can query data, whether stored in HDFS or Apache HBase – including SELECT, JOIN, and aggregate functions – in real time. Furthermore, Impala uses the same metadata, SQL syntax (Hive SQL), ODBC driver, and user interface (Hue Beeswax) as Apache Hive, providing a familiar and unified platform for batch-oriented or real-time queries. (For that reason, Hive users can utilize Impala with little setup overhead.) </p> |
| |
| |
| <h3>Architecture</h3> |
| <p>To avoid latency, Impala circumvents MapReduce to directly access the data through a specialized distributed query engine that is very similar to those found in commercial parallel RDBMSs. The result is order-of-magnitude faster performance than Hive, depending on the type of query and configuration.</p> |
| <p align="center"><img src="img/impala.png"></p> |
| |
| <p>There are many advantages to this approach over alternative approaches for querying Hadoop data, including:: |
| <ul> |
| <li>Thanks to local processing on data nodes, network bottlenecks are avoided.</li> |
| <li>A single, open, and unified metadata store can be utilized. |
| </li> |
| <li>Costly data format conversion is unnecessary and thus no overhead is incurred.</li> |
| <li> All data is immediately query-able, with no delays for ETL. |
| </li> |
| <li>All hardware is utilized for Impala queries as well as for MapReduce. |
| </li> |
| <li>Only a single machine pool is needed to scale. |
| </li> |
| </ul> |
| We encourage you to |
| read <a href="http://cidrdb.org/cidr2015/Papers/CIDR15_Paper28.pdf">"Impala: A Modern, |
| Open-Source SQL Engine for Hadoop"</a> for details about Impala's architecture. |
| <p> </p> |
| </div> |
| </div> |
| |
| |
| <div class="navbar"> |
| <div class="navbar-inner"> |
| <div class="container"> |
| |
| <ul class="nav"> |
| <li><a href="https://www.apache.org/licenses/">License</a></li> |
| <li><a href="https://www.apache.org/foundation/sponsorship.html">Sponsorship</a></li> |
| <li><a href="https://www.apache.org/foundation/thanks.html">Thanks</a></li> |
| <li><a href="https://www.apache.org/security/">Security</a></li> |
| <li><a href="https://www.apache.org/">Apache Software Foundation</a></li> |
| </ul> |
| </div> |
| </div> |
| </div> |
| |
| <div class="footer"> |
| <center> |
| <a href="http://www.apache.org/events/current-event.html"> |
| <img src="http://www.apache.org/events/current-event-234x60.png"/> |
| </a> |
| </center> |
| <p>Apache Impala, Impala, Apache, the Apache feather logo, and the Apache Impala |
| project logo are either registered trademarks or trademarks of The Apache Software |
| Foundation in the United States and other countries. |
| </p> |
| </div> |
| |
| </body> |
| </html> |