commit | 3f50879971453f4ced61fa24c4f4a425cbf2631e | [log] [tgz] |
---|---|---|
author | sandeep <sandysmdl@gmail.com> | Fri Nov 25 15:54:56 2016 +0530 |
committer | Pallavi Rao <pallavi.rao@inmobi.com> | Fri Nov 25 15:54:56 2016 +0530 |
tree | 0e7c022db91dad0a3f6e87775de11fea9124b8e3 | |
parent | 1b7708fa1e2258c0aca759684dad1d0dbe91bd5b [diff] |
FALCON-2185 Falcon Client changes for Falcon user extensions. Author: sandeep <sandysmdl@gmail.com> Reviewers: @pallavi-rao Closes #306 from sandeepSamudrala/FALCON-2185 and squashes the following commits: 466705f [sandeep] FALCON-2185 Incorporated review comments.Made stage entities private method 2a3e61d [sandeep] FALCON-2185 Incorporated more review comments e3516c2 [sandeep] FALCON-2185 Incorporated review comments cfe6c57 [sandeep] FALCON-2185 Moved UTs to falcon unit and example to extensions ebac5bb [sandeep] FALCON-2185 Falcon Client changes for Falcon user extensions d680244 [sandeep] Merge branch 'master' of https://github.com/apache/falcon into FALCON-2185 8b2e0d9 [sandeep] Merge branch 'master' of https://github.com/apache/falcon into FALCON-2185 2fd05bb [sandeep] Merge branch 'master' of https://github.com/apache/falcon into FALCON-2185 fc7e9a1 [sandeep] Merge branch 'master' of https://github.com/apache/falcon into FALCON-2185 8aacd75 [sandeep] FALCON-2183 Incorporated review comments f3d7268 [sandeep] FALCON-2183 Incorporated review comments 11e7b3f [sandeep] FALCON-2183 Extension Builder changes to support new user extensions 250cc46 [sandeep] Merge branch 'master' of https://github.com/apache/falcon d0393e9 [sandeep] Merge branch 'master' of https://github.com/apache/falcon a178805 [sandeep] Merge branch 'master' of https://github.com/apache/falcon d6dc8bf [sandeep] Merge branch 'master' of https://github.com/apache/falcon 1bb8d3c [sandeep] Merge branch 'master' of https://github.com/apache/falcon c065566 [sandeep] reverting last line changes made 1a4dcd2 [sandeep] rebased and resolved the conflicts from master 271318b [sandeep] FALCON-2097. Adding UT to the new method for getting next instance time with Delay. a94d4fe [sandeep] rebasing from master 9e68a57 [sandeep] FALCON-298. Feed update with replication delay creates holes
Falcon is a feed processing and feed management system aimed at making it easier for end consumers to onboard their feed processing and feed management on hadoop clusters.
Dependencies across various data processing pipelines are not easy to establish. Gaps here typically leads to either incorrect/partial processing or expensive reprocessing. Repeated duplicate definition of a single feed multiple times can lead to inconsistencies / issues.
Input data may not arrive always on time and it is required to kick off the processing without waiting for all data to arrive and accommodate late data separately
Feed management services such as feed retention, replications across clusters, archival etc are tasks that are burdensome on individual pipeline owners and better offered as a service for all customers.
It should be easy to onboard new workflows/pipelines
Smoother integration with metastore/catalog
Provide notification to end customer based on availability of feed groups (logical group of related feeds, which are likely to be used together)
You can find the documentation on Apache Falcon website.
Before opening a pull request, please go through the Contributing to Apache Falcon wiki. It lists steps that are required before creating a PR and the conventions that we follow. If you are looking for issues to pick up then you can look at starter tasks or open tasks
You can download release notes of previous releases from the following links.