[GOBBLIN-707][GOBBLIN-694][GOBBLIN-477][GOBBLIN-719] rewrite gobblin script to combine all modes and command

[GOBBLIN-719] fix invalid git links for classes in
docs

fixing all file paths and linksA

[GOBBLIN-707]-rewrite gobblin script combining all
gobblin modes and commands along with configs

updating as per review 1

updates docs as per gobblin.sh changes + some more
improvements in gobblin.sh

updates docs as per gobblin.sh changes + some more
improvements in gobblin.sh - 2

Merge branch 'master' of
github.com:jhsenjaliya/incubator-gobblin

Merge branch 'master' of
github.com:apache/incubator-gobblin

Merge branch 'master' of
github.com:apache/incubator-gobblin

[GOBBLIN-707]-rewrite gobblin script combining all
gobblin modes and commands along with configs

updating as per review 1

updates docs as per gobblin.sh changes + some more
improvements in gobblin.sh

updates docs as per gobblin.sh changes + some more
improvements in gobblin.sh - 2

Merge branch 'GOBBLIN-707' of
github.com:jhsenjaliya/incubator-gobblin into
GOBBLIN-707

adding project version

Merge branch 'master' of
github.com:apache/incubator-gobblin into
GOBBLIN-707

arrange HADOOP classpath

restructure command vs services help

all command and cli classes under GobblinCli with
Alias, remove Cli class

uncomment required code

keep individual script that calls new gobblin
script internally

resolves findBug error, replaced system exit with
return

update in config, keeping default as before

update to yarn, MR config

updating gobblin.sh and adding gobblin-service
config

Merge branch 'master' into GOBBLIN-707

updates to gobblin.sh and help on cli classes

rename service manager to gobblin-as-service

merge upstream changes

revert to upstream config to put change later

Closes #2578 from jhsenjaliya/GOBBLIN-707
62 files changed
tree: 368debe2d23840c136bf16bc4c7dc7f7e55984a5
  1. .github/
  2. bin/
  3. buildSrc/
  4. conf/
  5. config/
  6. dev/
  7. gobblin-admin/
  8. gobblin-api/
  9. gobblin-audit/
  10. gobblin-aws/
  11. gobblin-binary-management/
  12. gobblin-cluster/
  13. gobblin-compaction/
  14. gobblin-config-management/
  15. gobblin-core/
  16. gobblin-core-base/
  17. gobblin-data-management/
  18. gobblin-distribution/
  19. gobblin-docker/
  20. gobblin-docs/
  21. gobblin-example/
  22. gobblin-hive-registration/
  23. gobblin-metastore/
  24. gobblin-metrics-libs/
  25. gobblin-modules/
  26. gobblin-oozie/
  27. gobblin-rest-service/
  28. gobblin-restli/
  29. gobblin-runtime/
  30. gobblin-runtime-hadoop/
  31. gobblin-salesforce/
  32. gobblin-service/
  33. gobblin-test/
  34. gobblin-test-harness/
  35. gobblin-test-utils/
  36. gobblin-tunnel/
  37. gobblin-utility/
  38. gobblin-yarn/
  39. gradle/
  40. ligradle/
  41. maven-nexus/
  42. maven-sonatype/
  43. travis/
  44. .codecov_bash
  45. .dockerignore
  46. .gitignore
  47. .travis.yml
  48. build.gradle
  49. CHANGELOG.md
  50. defaultEnvironment.gradle
  51. DISCLAIMER
  52. gobblin-flavored-build.gradle
  53. gradle.properties
  54. gradlew
  55. gradlew.bat
  56. HEADER
  57. LICENSE
  58. mkdocs.yml
  59. NOTICE
  60. query_github_issues.py
  61. README.md
  62. readthedocs.yml
  63. settings.gradle
README.md

Apache Gobblin Build Status Documentation Status codecov.io

Apache Gobblin is a universal data ingestion framework for extracting, transforming, and loading large volume of data from a variety of data sources, e.g., databases, rest APIs, FTP/SFTP servers, filers, etc., onto Hadoop. Apache Gobblin handles the common routine tasks required for all data ingestion ETLs, including job/task scheduling, task partitioning, error handling, state management, data quality checking, data publishing, etc. Gobblin ingests data from different data sources in the same execution framework, and manages metadata of different sources all in one place. This, combined with other features such as auto scalability, fault tolerance, data quality assurance, extensibility, and the ability of handling data model evolution, makes Gobblin an easy-to-use, self-serving, and efficient data ingestion framework.

Requirements

  • Java >= 1.8
  • gradle-wrapper.jar version 2.13

If building the distribution with tests turned on:

  • Maven version 3.5.3

Instructions to download gradle wrapper

Run the following command for downloading the gradle-wrapper.jar from Gobblin git repository to gradle/wrapper directory.

wget --no-check-certificate -P gradle/wrapper https://github.com/apache/incubator-gobblin/raw/0.12.0/gradle/wrapper/gradle-wrapper.jar (or) curl --insecure -L https://github.com/apache/incubator-gobblin/raw/0.12.0/gradle/wrapper/gradle-wrapper.jar > gradle/wrapper/gradle-wrapper.jar

Alternatively, you can download it manually from: https://github.com/apache/incubator-gobblin/blob/0.12.0/gradle/wrapper/gradle-wrapper.jar

Make sure that you download it to gradle/wrapper directory.

Instructions to run Apache RAT (Release Audit Tool)

  1. Extract the archive file to your local directory.
  2. Download gradle-wrapper.jar (version 2.13) and place it in the gradle/wrapper folder. See ‘Instructions to download gradle wrapper’ above.
  3. Run ./gradlew rat. Report will be generated under build/rat/rat-report.html

Instructions to build the distribution

  1. Extract the archive file to your local directory.
  2. Download gradle-wrapper.jar (version 2.13) and place it in the gradle/wrapper folder. See ‘Instructions to download gradle wrapper’ above.
  3. Skip tests and build the distribution: Run ./gradlew build -x findbugsMain -x test -x rat -x checkstyleMain The distribution will be created in build/gobblin-distribution/distributions directory. (or)
  4. Run tests and build the distribution (requires Maven): Run ./gradlew build The distribution will be created in build/gobblin-distribution/distributions directory.

Quick Links

  • Documentation: Check out the Gobblin documentation for a complete description of Gobblin's features
  • Powered By: Check out the list of companies known to use Gobblin
  • Architecture: The Gobblin Architecture page has a full explanation of Gobblin's architecture
  • Getting Started with Gobblin: Refer to the Getting Started Guide on how to get started with Gobblin
  • Building Gobblin (from master branch): Refer to the page Building Gobblin for directions on how to build Gobblin
  • Javadocs: The full JavaDocs for each released version of Gobblin can be found here
  • Gobblin chat room: Gitter chat room for Gobblin developers and users here
  • Gobblin Issue Tracker can be found here