Apache Atlas

Apache Atlas is a data-governance and metadata-management framework developed specifically for Hadoop and its ecosystem of services. Using Apache Atlas, you you first define a catalog of the data Assets you have. Once the catalog is in place, you start classifying these assets into various categories. Classifying the assets provides you with the ability to organize and govern a set of data assets in a consistent manner. For example, you may classify certain data collections as data available for analysis for data scientists. 

The classification of data also gives you the ability to define various different governance policies for different sets of data.  

The classification of data also results in enabling collaboration between different sets of teams working on different sets of classified data.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
3.12.153.31