Welcome to what might be a new AWS service to you—AWS Glue. Glue is meant to simplify the ETL process by discovering your data and learning information about it. The idea is that we first need to define our own data classifiers and crawlers. We can then register our data sources and Glue will start to build the data catalog. After that, we can get creative in the way we map and transform the data. Then, we can configure regular ETL jobs for batch processing.
To build our data catalog, the first thing we need is to create a classifier. We'll take you through the steps in the next section.