Hadoop-Scale with SQL Access

Hadoop Scale

Running out of room with your current SQL solution? Starting a new operational application? Trafodion allows you to work in SQL at Hadoop-scale levels.

Fully Integrated with HBase and Hive

Trafodion Stack

Trafodion provides SQL access to structured, semi-structured, and unstructured data allowing you to run operational, historical, and analytical workloads on a single platform.


News

About

Apache Trafodion (incubating) is a webscale SQL-on-Hadoop solution enabling transactional or operational workloads on Apache Hadoop.

The name "Trafodion" (the Welsh word for transactions, pronounced "Tra-vod-eee-on") was chosen specifically to emphasize the differentiation that Trafodion provides in closing a critical gap in the Hadoop ecosystem.

Trafodion builds on the scalability, elasticity, and flexibility of Hadoop. Trafodion extends Hadoop to provide guaranteed transactional integrity, enabling new kinds of big data applications to run on Hadoop.

Disclaimer: Apache Trafodion is an effort undergoing incubation at the Apache Software Foundation (ASF), sponsored by the Apache Incubator PMC. Incubation is required of all newly accepted projects until a further review indicates that the infrastructure, communications, and decision making process have stabilized in a manner consistent with other successful ASF projects. While incubation status is not necessarily a reflection of the completeness or stability of the code, it does indicate that the project has yet to be fully endorsed by the ASF.


Key Features

  • Full-functioned ANSI SQL language support
  • JDBC/ODBC connectivity for Linux/Windows clients
  • Distributed ACID transaction protection across multiple statements, tables and rows
  • Performance improvements for OLTP workloads with compile-time and run-time optimizations
  • Support for large data sets using a parallel-aware query optimizer

Key Benefits

  • Reuse existing SQL skills and improve developer productivity
  • Distributed ACID transactions guarantee data consistency across multiple rows and tables
  • Interoperability with existing tools and applications
  • Hadoop and Linux distribution neutral
  • Easy to add to your existing Hadoop infrastructure