commit | cc674bf0cd9e651916a19e79f045d51498113fda | [log] [tgz] |
---|---|---|
author | akashrn5 <akashnilugal@gmail.com> | Tue Apr 21 14:23:41 2020 +0530 |
committer | ajantha-bhat <ajanthabhat@gmail.com> | Mon Apr 27 21:42:52 2020 +0530 |
tree | e83336474bc0c3496af8ece7ee1665fa65871b36 | |
parent | 67714158269badec44bd90a7b940a8dab87faa48 [diff] |
[CARBONDATA-3777] Add HDFSLocalCarbonFile implementation to Use FileSystem's LocalFileSystem in cluster mode Why is this PR needed? Currently LocalFile file implementation is JAVA's file implementation, which will give problem if we want to load the local file in cluster for instance. What changes were proposed in this PR? Implement a new class HDFSLocalCarbonFile, which extends HDFSCarbonFIle and when a file with local file scheme "file://" is given and trying to load in cluster, it takes the file as HDFSLocalCarbonFile and go ahead instead of failing which is current behaviour. Does this PR introduce any user interface change? Yes. (Doc update is not needed) Is any new testcase added? No(Existing HDFSCarbonFile tests will take care) This closes #3721
Apache CarbonData is an indexed columnar data store solution for fast analytics on big data platform, e.g.Apache Hadoop, Apache Spark, etc.
You can find the latest CarbonData document and learn more at: http://carbondata.apache.org
CarbonData file format is a columnar store in HDFS, it has many features that a modern columnar format has, such as splittable, compression schema ,complex data type etc, and CarbonData has following unique features:
CarbonData is built using Apache Maven, to build CarbonData
This is an active open source project for everyone, and we are always open to people who want to use this system or contribute to it. This guide document introduce how to contribute to CarbonData.
To get involved in CarbonData:
Apache CarbonData is an open source project of The Apache Software Foundation (ASF).