blob: 6a05310dc3ba9ef59ccbcd137265b793718f473e [file] [log] [blame]
<!DOCTYPE html>
<html prefix="
" lang="en">
<head>
<meta name="keywords" content="hadoop, impala, sql, mpp, bi, big data, open source">
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<meta charset="utf-8">
<meta name="description" content="Apache Impala is a modern, open source, distributed SQL query engine for open data and table formats.">
<title>Apache Impala (old posts, page 1) | Apache Impala</title>
<link href="assets/css/bootstrap.min.css" rel="stylesheet" type="text/css">
<link href="assets/css/bootstrap-responsive.min.css" rel="stylesheet" type="text/css">
<!-- order is significant to prevent overwriting of some bootstrap-defined css styles --><link href="assets/css/additional_styles.css" rel="stylesheet" type="text/css">
<script src="https://ajax.googleapis.com/ajax/libs/jquery/1.11.3/jquery.min.js"></script><script src="https://maxcdn.bootstrapcdn.com/bootstrap/3.3.5/js/bootstrap.min.js"></script><meta content="#5670d4" name="theme-color">
<link rel="canonical" href="https://impala.apache.org/blog/index-1.html">
<link rel="prev" href="." type="text/html">
</head>
<body id="index" class="home">
<div class="container">
<div class="masthead">
<header id="header"><h3 id="brand">
<span id="blog-title" class="muted">Apache Impala</span>
</h3>
<nav id="menu"><div class="navbar">
<div class="navbar-inner">
<div class="container">
<ul class="nav">
<li><a href="https://impala.apache.org/index.html">Home</a></li>
<li><a href="https://impala.apache.org/downloads.html">Downloads</a></li>
<li><a href="https://impala.apache.org/overview.html">Overview</a></li>
<li class="active"><a href="https://impala.apache.org/blog/">Blog</a></li>
<li><a href="https://cwiki.apache.org/confluence/display/IMPALA/Contributing+to+Impala">Contribute</a></li>
<li class="dropdown">
<a href="#" class="dropdown-toggle" data-toggle="dropdown" role="button">
Source code</a>
<ul class="dropdown-menu">
<li>
<a href="https://git-wip-us.apache.org/repos/asf/incubator-impala.git">
Official source</a>
</li>
<li><a href="https://github.com/apache/incubator-impala">GitHub mirror</a></li>
</ul>
</li>
<li><a href="https://impala.apache.org/community.html">Community</a></li>
<li><a href="https://impala.apache.org/impala-docs.html">Documentation</a></li>
</ul>
</div>
<!-- container -->
</div>
<!-- navbar-inner -->
</div>
<!-- navbar -->
</nav></header>
</div>
<!-- masthead -->
<main id="content"><h3>All articles</h3>
<div class="postindex">
<hr>
<article class="h-entry post-text"><header><h3 class="p-name entry-title"><a href="posts/impala-25-performance-overview/" class="u-url">Impala 2.5 performance overview</a></h3>
<div class="metadata">
<p class="byline author vcard"><span class="byline-name fn">
<a href="authors/mostafa-mokhtar/">Mostafa Mokhtar</a>
</span></p>
<p class="dateline"><a href="posts/impala-25-performance-overview/" rel="bookmark"><time class="published dt-published" datetime="2016-05-24T23:00:00-07:00" title="2016-05-24 23:00">2016-05-24 23:00</time></a></p>
</div>
</header><div class="p-summary entry-summary">
<p>Impala has proven to be a high-performance analytics query engine since the beginning. Even as an
initial production release in 2013, it demonstrated performance 2x faster than a traditional DBMS,
and each subsequent release has continued to demonstrate the wide performance gap between Impala’s
analytic-database architecture and SQL-on-Apache Hadoop alternatives. Today, we are excited to
continue that track record via some important performance gains for Impala 2.5 (with more to come on
the roadmap), summarized below.</p>
<p>Overall, compared to Impala 2.3, in Impala 2.5:</p>
<ul>
<li>TPC-DS queries run on average 4.3x faster.</li>
<li>TPC-H queries run 2.2x faster on flat tables, and 1.71x faster on nested tables.</li>
</ul>
<div style="margin-bottom:5px">
<strong>
<a href="https://www.slideshare.net/slideshow/hug-meetup-impala-25-performance-overview/61689036" title="Hug meetup impala 2.5 performance overview" target="_blank">Hug meetup impala 2.5 performance
overview</a>
</strong>
from
<strong>
<a href="https://www.slideshare.net/MostafaMokhtar1" target="_blank">Mostafa Mokhtar</a>
</strong>
</div>
</div>
</article><hr>
<article class="h-entry post-text"><header><h3 class="p-name entry-title"><a href="posts/nested-types-in-impala/" class="u-url">Nested Types in Impala</a></h3>
<div class="metadata">
<p class="byline author vcard"><span class="byline-name fn">
<a href="authors/alex-behm/">Alex Behm</a>
<a href="authors/marcel-kornacker/">Marcel Kornacker</a>
<a href="authors/skye-wanderman-milne/">Skye Wanderman-Milne</a>
</span></p>
<p class="dateline"><a href="posts/nested-types-in-impala/" rel="bookmark"><time class="published dt-published" datetime="2015-03-24T23:00:00-07:00" title="2015-03-24 23:00">2015-03-24 23:00</time></a></p>
</div>
</header><div class="p-summary entry-summary">
<p>This document discusses nested data types in Impala, including structs, maps, and arrays. It
provides an example schema using these types, describes Impala's SQL syntax extensions for querying
nested data, and discusses techniques for advanced querying capabilities like correlated subqueries.
The execution model materializes minimal nested structures in memory and uses new execution nodes to
handle nested data types.</p>
<div style="margin-bottom:5px">
<strong>
<a href="https://www.slideshare.net/slideshow/nested-types-in-impala-55344174/55344174" title="Nested Types in Impala" target="_blank">Nested Types in Impala</a>
</strong>
from
<strong>
<a href="https://www.slideshare.net/cloudera" target="_blank">Cloudera, Inc.</a>
</strong>
</div>
<p><i>Presented in Impala Meetup, PA, March 24th, 2015</i></p>
</div>
</article><hr>
<article class="h-entry post-text"><header><h3 class="p-name entry-title"><a href="posts/impala-a-modern-open-source-sql-engine-for-hadoop/" class="u-url">Impala: A Modern, Open-Source SQL Engine for Hadoop</a></h3>
<div class="metadata">
<p class="byline author vcard"><span class="byline-name fn">
<a href="authors/impala-dev/">Impala Dev</a>
</span></p>
<p class="dateline"><a href="posts/impala-a-modern-open-source-sql-engine-for-hadoop/" rel="bookmark"><time class="published dt-published" datetime="2015-01-05T23:00:00-07:00" title="2015-01-05 23:00">2015-01-05 23:00</time></a></p>
</div>
</header><div class="p-summary entry-summary">
<p>
<i>Presented at
<a href="https://www.cidrdb.org/cidr2015/program.html" target="_blank">The Conference on
Innovative Data Systems Research (CIDR) 2015</a>.
</i>
</p>
<h4>ABSTRACT</h4>
<p>Cloudera Impala is a modern, open-source MPP SQL engine architected from the ground up for the
Hadoop data processing environment. Impala provides low latency and high concurrency for BI/analytic
read-mostly queries on Hadoop, not delivered by batch frameworks such as Apache Hive. This paper
presents Impala from a user’s perspective, gives an overview of its architecture and main components
and briefly demonstrates its superior performance compared against other popular SQL-on-Hadoop
systems.</p>
<p>
<a href="https://www.cidrdb.org/cidr2015/Papers/CIDR15_Paper28.pdf" target="_blank">Paper</a> |
<a href="https://www.cidrdb.org/cidr2015/Slides/28_CIDR15_Slides_Paper28.pdf" target="_blank">Slides</a>
</p>
</div>
</article>
</div>
<nav class="postindexpager"><ul class="pager">
<li class="previous">
<a href="." rel="prev">Newer posts</a>
</li>
</ul></nav></main><footer id="footer"><p> </p>
<div class="navbar">
<div class="navbar-inner">
<div class="container">
<ul class="nav">
<li><a href="https://www.apache.org/licenses/">License</a></li>
<li><a href="https://www.apache.org/foundation/sponsorship.html">Sponsorship</a></li>
<li><a href="https://www.apache.org/foundation/thanks.html">Thanks</a></li>
<li><a href="https://www.apache.org/security/">Security</a></li>
<li><a href="https://www.apache.org/">Apache Software Foundation</a></li>
</ul>
</div>
<!-- container -->
</div>
<!-- navbar-inner -->
</div>
<!-- navbar -->
<div class="footer">
<center>
<a href="https://www.apache.org/events/current-event.html">
<img src="https://www.apache.org/events/current-event-234x60.png"></a>
</center>
<p>Apache Impala, Impala, Apache, the Apache feather logo, and the Apache
Impala project logo are either registered trademarks or trademarks of The
Apache Software Foundation in the United States and other countries.
</p>
</div>
<!-- footer -->
<p>Contents © 2016-2024 <a href="mailto:dev@impala.apache.org">Impala Dev</a> - Powered by <a href="https://getnikola.com" rel="nofollow">Nikola</a> </p>
</footer>
</div>
<!-- container -->
</body>
</html>