commit | 68bd27a128c2f244bd504369dce510727ea28da7 | [log] [tgz] |
---|---|---|
author | chunhui-shi <cshi@maprtech.com> | Sun Oct 30 01:29:06 2016 -0700 |
committer | Aman Sinha <asinha@maprtech.com> | Sun Dec 04 08:35:05 2016 -0800 |
tree | 5caa40bd6e07158377f7ec9fc4f96a8c47ef40e7 | |
parent | 42006ad3324c778b3f3867079c9e75121c743c73 [diff] |
DRILL-4982: Separate Hive reader classes for different data formats to improve performance. 1, Separating Hive reader classes allows optimization to apply on different classes in optimized ways. This separation effectively avoid the performance degradation of scan. 2, Do not apply Skip footer/header mechanism on most Hive formats. This skip mechanism introduces extra checks on each incoming records. close apache/drill#638
Apache Drill is a distributed MPP query layer that supports SQL and alternative query languages against NoSQL and Hadoop data storage systems. It was inspired in part by Google's Dremel.
Please read INSTALL.md for setting up and running Apache Drill.
Please see the Apache Drill Website or the Apache Drill Documentation for more information including:
Apache Drill is an Apache Foundation project and is seeking all types of contributions. Please say hello on the Apache Drill mailing list or join our Google Hangouts for more information. (More information can be found at the Apache Drill website).