| ---+ Data Governance and Metadata framework for Hadoop |
| |
| |
| ---++ Overview |
| |
| Atlas is a scalable and extensible set of core foundational governance services – enabling |
| enterprises to effectively and efficiently meet their compliance requirements within Hadoop and |
| allows integration with the whole enterprise data ecosystem. |
| |
| Apache Atlas provides open metadata management and governance capabilities for organizations |
| to build a catalog of their data assets, classify and govern these assets and provide collaboration |
| capabilities around these data assets for data scientists, analysts and the data governance team. |
| |
| ---++ Features |
| |
| ---+++ Metadata types & instances |
| * Pre-defined types for various Hadoop and non-Hadoop metadata |
| * Ability to define new types for the metadata to be managed |
| * Types can have primitive attributes, complex attributes, object references; can inherit from other types |
| * Instances of types, called entities, capture metadata object details and their relationships |
| * REST APIs to work with types and instances allow easier integration |
| |
| ---+++ Classification |
| * Ability to dynamically create classifications - like PII, EXPIRES_ON, DATA_QUALITY, SENSITIVE |
| * Classifications can include attributes - like expiry_date attribute in EXPIRES_ON classification |
| * Entities can be associated with multiple classifications, enabling easier discovery and security enforcement |
| |
| ---+++ Lineage |
| * Intuitive UI to view lineage of data as it moves through various processes |
| * REST APIs to access and update lineage |
| |
| ---+++ Search/Discovery |
| * Intuitive UI to search entities by type, classification, attribute value or free-text |
| * Rich REST APIs to search by complex criteria |
| * SQL like query language to search entities - Domain Specific Language (DSL) |
| |
| ---+++ Security & Data Masking |
| * Integration with Apache Ranger enables authorization/data-masking based on classifications associated with entities in Apache Atlas. For example: |
| * who can access data classified as PII, SENSITIVE |
| * customer-service users can only see last 4 digits of columns classified as NATIONAL_ID |
| |
| |
| ---++ Getting Started |
| |
| * [[InstallationSteps][Build & Install]] |
| * [[QuickStart][Quick Start]] |
| |
| |
| ---++ Documentation |
| |
| * [[Architecture][High Level Architecture]] |
| * [[TypeSystem][Type System]] |
| * [[Search][Search]] |
| * [[security][Security]] |
| * [[Authentication-Authorization][Authentication and Authorization]] |
| * [[Configuration][Configuration]] |
| * Notification |
| * [[Notification-Entity][Entity Notification]] |
| * Bridges |
| * [[Bridge-Hive][Hive Bridge]] |
| * [[Bridge-Sqoop][Sqoop Bridge]] |
| * [[Bridge-Falcon][Falcon Bridge]] |
| * [[StormAtlasHook][Storm Bridge]] |
| * [[HighAvailability][Fault Tolerance And High Availability Options]] |
| |
| ---++ API Documentation |
| |
| * <a href="api/v2/index.html">REST API Documentation</a> |
| * [[Import-Export-API][Export & Import REST API Documentation]] |
| * <a href="api/rest.html">Legacy API Documentation</a> |
| |
| ---++ Developer Setup Documentation |
| * [[EclipseSetup][Developer Setup: Eclipse]] |
| |
| #LicenseInfo |
| ---+ Licensing Information |
| |
| Atlas is distributed under [[http://www.apache.org/licenses/LICENSE-2.0][Apache License 2.0]]. |