[CARBONDATA-2206] support lucene index datamap

This PR is an initial effort to integrate lucene as an index datamap into carbondata.
A new module called carbondata-lucene is added to support lucene datamap:

1.Add LuceneFineGrainDataMap, implement FineGrainDataMap interface.
2.Add LuceneCoarseGrainDataMap, implement CoarseGrainDataMap interface.
3.Support writing lucene index via LuceneDataMapWriter.
4.Implement LuceneDataMapFactory
5.A UDF called TEXT_MATCH is added, use it to do filtering on string column by lucene

This closes #2003
30 files changed
tree: 68958e040b9ca9441a906914218e8c24b8806053
  1. .github/
  2. assembly/
  3. bin/
  4. build/
  5. common/
  6. conf/
  7. core/
  8. datamap/
  9. dev/
  10. docs/
  11. examples/
  12. format/
  13. hadoop/
  14. integration/
  15. processing/
  16. store/
  17. streaming/
  18. .gitignore
  19. LICENSE
  20. NOTICE
  21. pom.xml
  22. README.md
README.md

Apache CarbonData is an indexed columnar data store solution for fast analytics on big data platform, e.g.Apache Hadoop, Apache Spark, etc.

You can find the latest CarbonData document and learn more at: http://carbondata.apache.org

CarbonData cwiki

Visit count: HitCount

Status

Spark2.2: Build Status Coverage Status

Building CarbonData

CarbonData is built using Apache Maven, to build CarbonData

Online Documentation

Other Technical Material

Apache CarbonData meetup material Use Case Articles

Fork and Contribute

This is an active open source project for everyone, and we are always open to people who want to use this system or contribute to it. This guide document introduce how to contribute to CarbonData.

Contact us

To get involved in CarbonData:

About

Apache CarbonData is an open source project of The Apache Software Foundation (ASF).