blob: fa875af62f1da7751687a90f2713e0c233ab73b8 [file] [log] [blame]
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="utf-8">
<title>Impala</title>
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<meta name="description" content="">
<meta name="author" content="">
<!-- Le styles -->
<link href="css/bootstrap.css" rel="stylesheet">
<style type="text/css">
body {
padding-top: 20px;
padding-bottom: 40px;
}
/* Custom container */
.container-narrow {
margin: 0 auto;
max-width: 700px;
}
.container-narrow > hr {
margin: 30px 0;
}
/* Main marketing message and sign up button */
.jumbotron {
margin: 60px 0;
text-align: center;
}
.jumbotron h1 {
font-size: 72px;
line-height: 1;
}
.jumbotron .btn {
font-size: 21px;
padding: 14px 24px;
}
/* Supporting marketing content */
.marketing {
margin: 60px 0;
}
.marketing p + h4 {
margin-top: 28px;
}
/* Customize the navbar links to be fill the entire space of the .navbar */
.navbar .navbar-inner {
padding: 0;
}
.navbar .nav {
margin: 0;
display: table;
width: 100%;
}
.navbar .nav li {
display: table-cell;
width: 1%;
float: none;
}
.navbar .nav li a {
font-weight: bold;
text-align: center;
border-left: 1px solid rgba(255,255,255,.75);
border-right: 1px solid rgba(0,0,0,.1);
}
.navbar .nav li:first-child a {
border-left: 0;
border-radius: 3px 0 0 3px;
}
.navbar .nav li:last-child a {
border-right: 0;
border-radius: 0 3px 3px 0;
}
</style>
<link href="css/bootstrap-responsive.css" rel="stylesheet">
<!-- HTML5 shim, for IE6-8 support of HTML5 elements -->
<!--[if lt IE 9]>
<script src="js/html5shiv.js"></script>
<![endif]-->
<script src="js/jquery-3.5.1.min.js"></script>
<script src="js/bootstrap-3.3.5.min.js"></script>
</head>
<body>
<div class="container">
<div class="masthead">
<h3 class="muted">Apache Impala</h3>
<div class="navbar">
<div class="navbar-inner">
<div class="container">
<ul class="nav">
<li><a href="index.html">Home</a></li>
<li><a href="downloads.html">Downloads</a></li>
<li class="active"><a href="overview.html">Overview</a></li>
<li><a href="https://cwiki.apache.org/confluence/display/IMPALA/Contributing+to+Impala">Contribute</a></li>
<li class="dropdown">
<a href='#' class="dropdown-toggle" data-toggle="dropdown" role="button">
Source code
</a>
<ul class="dropdown-menu">
<li><a href="https://git-wip-us.apache.org/repos/asf/impala.git">Official source</a></li>
<li><a href="https://github.com/apache/impala">GitHub mirror</a></li>
</ul>
</li>
<li><a href="community.html">Community</a></li>
<li><a href="impala-docs.html">Documentation</a></li>
</ul>
</div>
</div>
<div class="row-fluid">
<div class="span12">
<h2>Overview</h2>
<p>Impala raises the bar for SQL query performance on Apache Hadoop while retaining a familiar user experience. With Impala, you can query data, whether stored in HDFS or Apache HBase – including SELECT, JOIN, and aggregate functions – in real time. Furthermore, Impala uses the same metadata, SQL syntax (Hive SQL), ODBC driver, and user interface (Hue Beeswax) as Apache Hive, providing a familiar and unified platform for batch-oriented or real-time queries. (For that reason, Hive users can utilize Impala with little setup overhead.) </p>
<h3>Architecture</h3>
<p>To avoid latency, Impala circumvents MapReduce to directly access the data through a specialized distributed query engine that is very similar to those found in commercial parallel RDBMSs. The result is order-of-magnitude faster performance than Hive, depending on the type of query and configuration.</p>
<p align="center"><img src="img/impala.png"></p>
<p>There are many advantages to this approach over alternative approaches for querying Hadoop data, including::
<ul>
<li>Thanks to local processing on data nodes, network bottlenecks are avoided.</li>
<li>A single, open, and unified metadata store can be utilized.
</li>
<li>Costly data format conversion is unnecessary and thus no overhead is incurred.</li>
<li> All data is immediately query-able, with no delays for ETL.
</li>
<li>All hardware is utilized for Impala queries as well as for MapReduce.
</li>
<li>Only a single machine pool is needed to scale.
</li>
</ul>
We encourage you to
read <a href="http://cidrdb.org/cidr2015/Papers/CIDR15_Paper28.pdf">"Impala: A Modern,
Open-Source SQL Engine for Hadoop"</a> for details about Impala's architecture.
<p>&nbsp;</p>
</div>
</div>
<div class="navbar">
<div class="navbar-inner">
<div class="container">
<ul class="nav">
<li><a href="https://www.apache.org/licenses/">License</a></li>
<li><a href="https://www.apache.org/foundation/sponsorship.html">Sponsorship</a></li>
<li><a href="https://www.apache.org/foundation/thanks.html">Thanks</a></li>
<li><a href="https://www.apache.org/security/">Security</a></li>
<li><a href="https://www.apache.org/">Apache Software Foundation</a></li>
</ul>
</div>
</div>
</div>
<div class="footer">
<center>
<a href="http://www.apache.org/events/current-event.html">
<img src="http://www.apache.org/events/current-event-234x60.png"/>
</a>
</center>
<p>Apache Impala, Impala, Apache, the Apache feather logo, and the Apache Impala
project logo are either registered trademarks or trademarks of The Apache Software
Foundation in the United States and other countries.
</p>
</div>
</body>
</html>