tree: 0c61c2c5373beaebed360f5136853c525d3dceda [path history] [tgz]
  1. auto-setup.sh
  2. core-site-example.xml
  3. distrib-env.sh
  4. distrib-setup.sh
  5. drill-am-log.xml
  6. drill-am.sh
  7. drill-conf
  8. drill-config.sh
  9. drill-embedded
  10. drill-embedded.bat
  11. drill-env.sh
  12. drill-localhost
  13. drill-on-yarn-example.conf
  14. drill-on-yarn.sh
  15. drill-override-example.conf
  16. drill-override.conf
  17. drill-setup.sh
  18. drill-sqlline-override-example.conf
  19. drillbit
  20. drillbit.sh
  21. dumpcat
  22. hadoop-excludes.txt
  23. LICENSE
  24. logback.xml
  25. NOTICE
  26. README.md
  27. runbit
  28. saffron.properties
  29. sqlline
  30. sqlline.bat
  31. storage-plugins-override-example.conf
  32. submit_plan
  33. yarn-client-log.xml
  34. yarn-drillbit.sh
distribution/src/resources/README.md

Running Apache Drill

Prerequisites

  • Linux, Windows or OSX
  • Oracle/OpenJDK 8 (JDK, not JRE)

Additional requirements when running in clustered mode:

  • Hadoop 2.3+ distribution of Hadoop (such as Apache or MapR)
  • Zookeeper is required for a clustered installation

Installing the Tarball

  1. mkdir /opt/drill
  2. tar xvzf [tarball] --strip=1 -C /opt/drill

Running in embedded mode

  1. cd /opt/drill
  2. bin/sqlline -u jdbc:drill:zk=local
  3. Run a query (below).

Running in clustered mode

  1. Edit drill-override.conf to provide zookeeper location
  2. Start the drillbit using bin/drillbit.sh start
  3. Repeat on other nodes
  4. Connect with sqlline by using bin/sqlline -u “jdbc:drill:zk=[zk_host:port]”
  5. Run a query (below).

Run a query

Drill comes preinstalled with a number of example data files including a small copy of the TPCH data in self describing Parquet files as well as the foodmart database in JSON. You can query these files using the cp schema. For example:

USE cp;

SELECT 
  employee_id, 
  first_name
FROM `employee.json`; 

More information

For more information including how to run a Apache Drill cluster, visit the Apache Drill Documentation