
Druid extensions

Druid implements an extension system that allows for adding functionality at runtime. Extensions are commonly used to add support for deep storages (like HDFS and S3), metadata stores (like MySQL and PostgreSQL), new aggregators, new input formats, and so on.

Production clusters generally use at least two extensions: one for deep storage and one for a metadata store. Many clusters also use additional extensions.

Including extensions

Please see here.

Core extensions

Core extensions are maintained by Druid committers.

| Name | Description | Docs |
|------|-------------|------|
| druid-avro-extensions | Support for data in Apache Avro data format. | link |
| druid-caffeine-cache | A local cache implementation backed by Caffeine. | link |
| druid-datasketches | Support for approximate counts and set operations with DataSketches. | link |
| druid-hdfs-storage | HDFS deep storage. | link |
| druid-histogram | Approximate histograms and quantiles aggregator. | link |
| druid-kafka-eight | Kafka ingest firehose (high level consumer). | link |
| druid-kafka-extraction-namespace | Kafka-based namespaced lookup. Requires the namespace lookup extension. | link |
| druid-lookups-cached-global | A lookup module that provides JVM-global eager caching. It includes JDBC and URI implementations for fetching lookup data. | link |
| druid-s3-extensions | Interfacing with data in AWS S3, and using S3 as deep storage. | link |
| druid-stats | Statistics module including variance and standard deviation, using the same algorithm as Hive. | link |
| mysql-metadata-storage | MySQL metadata store. | link |
| postgresql-metadata-storage | PostgreSQL metadata store. | link |

Community Extensions

A number of community members have contributed their own extensions to Druid that are not packaged with the default Druid tarball. Community extensions are not maintained by Druid committers, although we accept patches from community members using these extensions. If you'd like to take on maintenance for a community extension, please post on the druid-development group to let us know!

All of these community extensions can be downloaded using pull-deps with the coordinate io.druid.extensions.contrib:EXTENSION_NAME:LATEST_DRUID_STABLE_VERSION.
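
As a rough illustration of how that coordinate is put together, the sketch below builds the coordinate string for one or more community extensions and assembles an argument list for pull-deps. The helper names `contrib_coordinate` and `pull_deps_args` are hypothetical, the version `0.9.2` is only a placeholder, and the `tools pull-deps` subcommand and `-c` flag are assumptions about the tool's command line that should be checked against the pull-deps documentation for your Druid release.

```python
# Sketch only: builds coordinate strings and an argument list.
# It does not download anything or invoke Druid.

# Group ID for community extensions, as given in the text above.
CONTRIB_GROUP = "io.druid.extensions.contrib"


def contrib_coordinate(extension_name: str, druid_version: str) -> str:
    """Return GROUP:EXTENSION_NAME:VERSION for a community extension."""
    if not extension_name or not druid_version:
        raise ValueError("extension name and Druid version are both required")
    return f"{CONTRIB_GROUP}:{extension_name}:{druid_version}"


def pull_deps_args(extension_names, druid_version):
    """Return pull-deps arguments with one coordinate flag per extension.

    Assumption: pull-deps is reached via a "tools pull-deps" subcommand
    and accepts "-c <coordinate>" once per extension.
    """
    args = ["tools", "pull-deps"]
    for name in extension_names:
        args += ["-c", contrib_coordinate(name, druid_version)]
    return args


if __name__ == "__main__":
    # "0.9.2" is a placeholder; substitute the stable version you run.
    print(" ".join(pull_deps_args(["druid-rabbitmq", "statsd-emitter"], "0.9.2")))
```

The printed arguments would be appended to whatever Java command launches the Druid command-line tools in your installation.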

| Name | Description | Docs |
|------|-------------|------|
| druid-azure-extensions | Microsoft Azure deep storage. | link |
| druid-cassandra-storage | Apache Cassandra deep storage. | link |
| druid-cloudfiles-extensions | Rackspace Cloudfiles deep storage and firehose. | link |
| druid-distinctcount | DistinctCount aggregator. | link |
| druid-kafka-eight-simpleConsumer | Kafka ingest firehose (low level consumer). | link |
| druid-parquet-extensions | Support for data in Apache Parquet data format. Requires druid-avro-extensions to be loaded. | link |
| druid-rabbitmq | RabbitMQ firehose. | link |
| druid-rocketmq | RocketMQ firehose. | link |
| graphite-emitter | Graphite metrics emitter. | link |
| statsd-emitter | StatsD metrics emitter. | link |

Promoting Community Extensions to Core Extensions

Please let us know if you'd like an extension to be promoted to core. If we see a community extension actively supported by the community, we can promote it to core based on community feedback.

Creating your own Extensions

For information on how to create your own extension, please see here.