Gobblin currently supports the following features:
* Different Data Sources
{|
!Source Type
!Protocol API
!Vendors
|- valign="middle"
|RDBMS
|JDBC
|MySQL/SQL Server
|- valign="middle"
|Files
|HDFS/SFTP/LocalFS
|N/A
|- valign="middle"
|Salesforce
|REST
|Salesforce
|}
<BR>
* Different Pulling Types
** SNAPSHOT-ONLY: Pull a snapshot of a dataset.
** SNAPSHOT-APPEND: Pull the delta changes (updates to the dataset since the last run) and optionally merge them into the snapshot.
** APPEND-ONLY: Pull the delta changes since the last run and append them to the dataset.
<BR>
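As a rough illustration of how the three pull types differ, the sketch below selects records from an in-memory list. It is not Gobblin code: the <code>pull</code> function, the <code>modified</code> timestamp field, and the choice to treat a first run (no previous run time) as a full pull are all assumptions made for the example.

```python
from datetime import datetime


def pull(records, pull_type, last_run=None):
    """Return the records a single run would extract (illustrative only)."""
    if pull_type == "SNAPSHOT-ONLY":
        # Every run takes the whole dataset.
        return list(records)
    if pull_type in ("SNAPSHOT-APPEND", "APPEND-ONLY"):
        if last_run is None:
            # Assumption: with no previous run, pull everything.
            return list(records)
        # Delta changes: records modified after the previous run.
        return [r for r in records if r["modified"] > last_run]
    raise ValueError("unknown pull type: " + pull_type)


rows = [
    {"id": 1, "modified": datetime(2014, 1, 1)},
    {"id": 2, "modified": datetime(2014, 1, 3)},
]
full = pull(rows, "SNAPSHOT-ONLY")
delta = pull(rows, "APPEND-ONLY", last_run=datetime(2014, 1, 2))
```

In this sketch SNAPSHOT-APPEND and APPEND-ONLY extract the same delta; they differ in what happens afterwards (optional merge into the snapshot versus appending to the dataset).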
* Different Deployment Types
** Standalone deployment on a single machine
** Cluster deployment on Hadoop 1.2.1 or Hadoop 2.3.0
<BR>
* Compaction
** Merge delta changes into the snapshot.
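The merge step can be pictured with a minimal sketch. It assumes each record carries a unique key (a hypothetical <code>id</code> field here) and that a delta record simply replaces the snapshot record with the same key. The function name and record layout are made up for illustration and do not reflect Gobblin's actual compaction implementation.

```python
def merge_delta_into_snapshot(snapshot, delta, key="id"):
    """Return a new snapshot with delta records applied (illustrative only)."""
    # Index the existing snapshot rows by their key.
    merged = {row[key]: row for row in snapshot}
    for row in delta:
        # A delta row replaces the row with the same key, or adds a new one.
        merged[row[key]] = row
    return list(merged.values())


snapshot = [{"id": 1, "v": "a"}, {"id": 2, "v": "b"}]
delta = [{"id": 2, "v": "B"}, {"id": 3, "v": "c"}]
compacted = merge_delta_into_snapshot(snapshot, delta)
```

Here the record with <code>id</code> 2 is updated and the record with <code>id</code> 3 is added, while the original <code>snapshot</code> list is left unchanged.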