Data cataloging – Glue

Welcome to what might be a new AWS service to you: AWS Glue. Glue is meant to simplify the ETL process by discovering your data and learning information about it, such as its format and schema. The workflow goes like this: we first define our data classifiers and crawlers. We then register our data sources, and Glue starts to build the data catalog. After that, we can get creative in the way we map and transform the data. Finally, we can configure regular ETL jobs for batch processing.
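To make the order of those steps concrete, here is a minimal Python sketch of the cataloging part of the workflow. It assumes a boto3 Glue client is passed in as `glue`; the function name `build_catalog` and all of its argument values (classifier name, crawler name, IAM role, database, S3 path) are hypothetical placeholders, and a CSV classifier is chosen only for illustration. The chapter itself walks through these steps in detail, so treat this as an outline rather than the definitive setup.

```python
def build_catalog(glue, *, classifier_name, crawler_name, role_arn,
                  database, s3_path, schedule=None):
    """Register a classifier and a crawler, then start the first crawl.

    `glue` is expected to behave like boto3.client("glue").
    All names passed in are illustrative assumptions.
    """
    # Step 1: define a custom classifier (here, comma-separated files
    # that have a header row).
    glue.create_classifier(
        CsvClassifier={
            "Name": classifier_name,
            "Delimiter": ",",
            "ContainsHeader": "PRESENT",
        }
    )

    # Step 2: define a crawler that points at the data source and
    # uses the classifier above.
    crawler_args = {
        "Name": crawler_name,
        "Role": role_arn,
        "DatabaseName": database,
        "Targets": {"S3Targets": [{"Path": s3_path}]},
        "Classifiers": [classifier_name],
    }
    if schedule is not None:
        # Optional cron expression, for example "cron(0 2 * * ? *)".
        crawler_args["Schedule"] = schedule
    glue.create_crawler(**crawler_args)

    # Step 3: run the crawler so Glue starts populating the data catalog.
    glue.start_crawler(Name=crawler_name)
    return crawler_args
```

With boto3 installed and AWS credentials configured, this could be called as `build_catalog(boto3.client("glue"), classifier_name=..., crawler_name=..., role_arn=..., database=..., s3_path=...)`. Passing the client in as an argument keeps the sketch easy to try against a stub before touching a real account.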

To build our data catalog, the first thing we need is to create a classifier. We'll take you through the steps in the next section.
