FALCON-2210 Server side changes in submit and submitAndSchedule apis to accept list of feeds and processes

Author: sandeep <sandysmdl@gmail.com>

Reviewers: @pallavi-rao

Closes #313 from sandeepSamudrala/FALCON-2210 and squashes the following commits:

3b7a3ae [sandeep] FALCON-2210 applied tags to the entities for user extensions
f08cf15 [sandeep] FALCON-2210 Fixed falcon unit client build issues
98d382e [sandeep] FALCON-2210 Incorporated review comments
4004ae7 [sandeep] FALCON-2210 Incorporated review comments
d5e4c53 [sandeep] FALCON-2210 Server side changes in submit and submitAndSchedule apis to accept list of feeds and processes
05b87fb [sandeep] Merge branch 'master' of https://github.com/apache/falcon into FALCON-2210
2bd685f [sandeep] FALCON-2201 Fixed checkstyle issues
c7422e6 [sandeep] FALCON-2201 Incorporated review comments. Removed applying tags from client. I will move that to Server side in the server side changes
86446ad [sandeep] Merge branch 'master' of https://github.com/apache/falcon into FALCON-2201
432cdfd [sandeep] FALCON-2201 Incorporated review comments
a0ce5e0 [sandeep] Merge branch 'master' of https://github.com/apache/falcon into FALCON-2201
519a877 [sandeep] FALCON-2201 Fixed checkstyle issues
c101c7b [sandeep] FALCON-2201 Incorporated review comments
bf0e6ed [sandeep] FALCON-2201 Incorporated review comments and few client side changes
adfd318 [sandeep] FALCON-2201 Falcon Unit changes for extension support and falcon unit tests for extensions and fixes.
03f0c3c [sandeep] Merge branch 'master' of https://github.com/apache/falcon into FALCON-2201
9cf36e9 [sandeep] Merge branch 'master' of https://github.com/apache/falcon
bbca081 [sandeep] Merge branch 'master' of https://github.com/apache/falcon
48f6afa [sandeep] Merge branch 'master' of https://github.com/apache/falcon
250cc46 [sandeep] Merge branch 'master' of https://github.com/apache/falcon
d0393e9 [sandeep] Merge branch 'master' of https://github.com/apache/falcon
a178805 [sandeep] Merge branch 'master' of https://github.com/apache/falcon
d6dc8bf [sandeep] Merge branch 'master' of https://github.com/apache/falcon
1bb8d3c [sandeep] Merge branch 'master' of https://github.com/apache/falcon
c065566 [sandeep] reverting last line changes made
1a4dcd2 [sandeep] rebased and resolved the conflicts from master
271318b [sandeep] FALCON-2097. Adding UT to the new method for getting next instance time with Delay.
a94d4fe [sandeep] rebasing from master
9e68a57 [sandeep] FALCON-298. Feed update with replication delay creates holes
5 files changed
tree: 254ac2b681090aa69f8b9362b79062e59fde531c
  1. acquisition/
  2. addons/
  3. archival/
  4. build-tools/
  5. cli/
  6. client/
  7. common/
  8. common-types/
  9. distro/
  10. docs/
  11. examples/
  12. extensions/
  13. falcon-regression/
  14. falcon-ui/
  15. hadoop-dependencies/
  16. html5-ui/
  17. lifecycle/
  18. messaging/
  19. metrics/
  20. monitoring/
  21. oozie/
  22. oozie-el-extensions/
  23. prism/
  24. release-docs/
  25. replication/
  26. rerun/
  27. retention/
  28. scheduler/
  29. shell/
  30. src/
  31. test-tools/
  32. test-util/
  33. titan/
  34. unit/
  35. webapp/
  36. .gitignore
  37. .reviewboardrc
  38. CHANGES.txt
  39. falcon_merge_pr.py
  40. Installation-steps.txt
  41. LICENSE.txt
  42. NOTICE.txt
  43. pom.xml
  44. README.md
README.md

Apache Falcon

Falcon is a feed processing and feed management system that makes it easier for end consumers to onboard their feed processing and feed management on Hadoop clusters.

Why Apache Falcon?

  • Dependencies across various data processing pipelines are not easy to establish. Gaps here typically lead to either incorrect/partial processing or expensive reprocessing. Defining a single feed multiple times can lead to inconsistencies and other issues.

  • Input data may not always arrive on time, so it must be possible to kick off processing without waiting for all data to arrive and to accommodate late data separately.

  • Feed management services such as feed retention, replication across clusters, and archival are burdensome for individual pipeline owners and are better offered as a service for all customers.

  • It should be easy to onboard new workflows/pipelines

  • Smoother integration with metastore/catalog

  • Provide notifications to end customers based on the availability of feed groups (logical groups of related feeds that are likely to be used together)

Online Documentation

You can find the documentation on the Apache Falcon website.

How to Contribute

Before opening a pull request, please go through the Contributing to Apache Falcon wiki. It lists the steps required before creating a PR and the conventions that we follow. If you are looking for issues to pick up, you can look at starter tasks or open tasks.

Release Notes

You can download the release notes for previous releases from the following links.

0.8

0.7