0%

Book Description

Explore the latest Azure ETL techniques both on-premises and in the cloud using Azure services such as SQL Server Integration Services (SSIS), Azure Data Factory, and Azure Databricks

Key Features

  • Understand the key components of an ETL solution using Azure Integration Services
  • Discover the common and not-so-common challenges faced while creating modern and scalable ETL solutions
  • Program and extend your packages to develop efficient data integration and data transformation solutions

Book Description

ETL is one of the most common and tedious procedures for moving and processing data from one database to another. With the help of this book, you will be able to speed up the process by designing effective ETL solutions using the Azure services available for handling and transforming any data to suit your requirements.

With this cookbook, you'll become well versed in all the features of SQL Server Integration Services (SSIS) to perform data migration and ETL tasks that integrate with Azure. You'll learn how to transform data in Azure and understand how legacy systems perform ETL on-premises using SSIS. Later chapters will get you up to speed with connecting and retrieving data from SQL Server 2019 Big Data Clusters, and even show you how to extend and customize the SSIS toolbox using custom-developed tasks and transforms. This ETL book also contains practical recipes for moving and transforming data with Azure services, such as Data Factory and Azure Databricks, and lets you explore various options for migrating SSIS packages to Azure. Toward the end, you'll find out how to profile data in the cloud and automate service creation with Business Intelligence Markup Language (BIML).

By the end of this book, you'll have developed the skills you need to create and automate ETL solutions on-premises as well as in Azure.

What you will learn

  • Explore ETL and how it is different from ELT
  • Move and transform various data sources with Azure ETL and ELT services
  • Use SSIS 2019 with Azure HDInsight clusters
  • Discover how to query SQL Server 2019 Big Data Clusters hosted in Azure
  • Migrate SSIS solutions to Azure and solve key challenges associated with it
  • Understand why data profiling is crucial and how to implement it in Azure Databricks
  • Get to grips with BIML and learn how it applies to SSIS and Azure Data Factory solutions

Who this book is for

This book is for data warehouse architects, ETL developers, or anyone who wants to build scalable ETL applications in Azure. Those looking to extend their existing on-premise ETL applications to use big data and a variety of Azure services or others interested in migrating existing on-premise solutions to the Azure cloud platform will also find the book useful. Familiarity with SQL Server services is necessary to get the most out of this book.

Table of Contents

  1. ETL with Azure Cookbook
  2. Why subscribe?
  3. Contributors
  4. About the authors
  5. About the reviewers
  6. Packt is searching for authors like you
  7. Preface
    1. Who this book is for
    2. What this book covers
    3. To get the most out of this book
    4. Download the example code files
    5. Download the color images
    6. Conventions used
    7. Sections
      1. Getting ready
      2. How to do it…
      3. How it works…
      4. There's more…
      5. See also
    8. Get in touch
    9. Reviews
  8. Chapter 1: Getting Started with Azure and SSIS 2019
    1. Technical requirements
    2. Installing Microsoft SQL Server 2019 Integration Services
      1. Getting ready
      2. How to do it…
      3. How it works…
    3. Installing SQL Server Management Studio
      1. Getting ready
      2. How to do it…
      3. How it works…
    4. Installing SQL Server Data Tools
      1. Getting ready
      2. How to do it…
      3. How it works…
    5. Installing Azure Data Studio
      1. Getting ready
      2. How to do it…
      3. How it works…
    6. Creating an Azure subscription
      1. Getting ready
      2. How to do it…
      3. How it works…
      4. There's more…
  9. Chapter 2: Introducing ETL
    1. Creating a SQL Azure database
      1. Getting ready
      2. How to do it…
      3. How it works…
    2. Connecting SQL Management Studio to Azure SQL
      1. Getting ready
      2. How to do it…
      3. How it works…
    3. Creating a simple ETL package
      1. Getting ready
      2. How to do it…
      3. How it works…
    4. Loading data before its transformation
      1. Getting ready
      2. How to do it…
      3. How it works…
  10. Chapter 3: Creating and Using SQL Server 2019 Big Data Clusters
    1. Technical requirements
    2. Creating a SQL Server 2019 Big Data Cluster
      1. Getting ready
      2. How to do it…
      3. How it works…
    3. Stopping and starting Azure resources
      1. Getting ready
      2. How to do it…
      3. How it works…
    4. Setting up a Big Data Cluster
      1. Getting ready
      2. How to do it…
      3. How it works…
    5. Loading data into a Big Data Cluster
      1. Getting ready
      2. How to do it…
      3. How it works…
    6. Extracting data from a Big Data Cluster
      1. Getting ready
      2. How to do it…
      3. How it works…
  11. Chapter 4: Azure Data Integration
    1. Setting up the Azure Feature Pack in SSIS
      1. Getting ready
      2. How to do it…
      3. How it works…
    2. Setting up a Java Runtime Environment
      1. Getting ready
      2. How to do it…
      3. How it works…
    3. Setting up an Azure storage account
      1. Getting ready
      2. How to do it…
      3. How it works…
    4. Creating an on-demand Azure HDInsight cluster
      1. Getting ready
      2. How to do it…
      3. How it works…
    5. Transforming data with Hive
      1. Getting ready
      2. How to do it…
      3. How it works…
    6. Transforming data with Pig
      1. Getting ready
      2. How to do it…
      3. How it works…
  12. Chapter 5: Extending SSIS with Custom Tasks and Transformations
    1. Technical requirements
    2. Designing a Custom Control Flow Task
      1. Getting ready
      2. How to do it…
      3. How it works…
    3. Designing a Custom Data Flow Component
      1. Getting ready
      2. How to do it…
      3. How it works…
      4. There's more…
    4. Deploying SSIS customizations
      1. Getting ready
      2. How to do it…
      3. How it works…
    5. Upgrading a Custom Data Flow Component
      1. Getting ready
      2. How to do it…
      3. How it works…
  13. Chapter 6: Azure Data Factory
    1. Data factory creation
      1. Getting ready
      2. How to do it…
      3. How it works…
    2. Copying data from the internet
      1. Getting ready
      2. How to do it…
      3. How it works…
    3. Moving and transforming data
      1. Getting ready
      2. How to do it…
      3. How it works…
    4. Triggering and monitoring our pipeline
      1. Getting ready
      2. How to do it…
      3. How it works…
  14. Chapter 7: Azure Databricks
    1. Creating a Databricks workspace
      1. Getting ready
      2. How to do it…
      3. How it works…
    2. Creating a cluster in our workspace
      1. Getting ready
      2. How to do it…
      3. How it works…
    3. Connecting to an internet data source
      1. Getting ready
      2. How to do it…
      3. How it works…
    4. Using Delta Lake
      1. Getting ready
      2. How to do it…
      3. How it works…
      4. There's more…
    5. Writing in Azure SQL Server
      1. Getting ready
      2. How to do it…
      3. How it works…
  15. Chapter 8: SSIS Migration Strategies
    1. Running SSIS packages in ADF
      1. Getting ready
      2. How to do it…
      3. How it works…
    2. Rewriting an SSIS package using ADF
      1. Getting ready
      2. How to do it…
      3. How it works…
  16. Chapter 9: Profiling data in Azure
    1. Technical requirements
    2. Using SQL in Spark
      1. Getting ready
      2. How to do it…
      3. How it works…
    3. Using Spark built-in functions
      1. Getting ready
      2. How to do it…
      3. How it works…
    4. Using pandas profiling
      1. Getting ready
      2. How to do it…
      3. How it works…
  17. Chapter 10: Manage SSIS and Azure Data Factory with Biml
    1. Technical requirements
    2. Installing prerequisites
      1. How to do it…
      2. How it works…
    3. Generating T-SQL select statements for all tables
      1. Getting ready
      2. How to do it…
      3. How it works…
      4. There's more…
    4. Generating T-SQL to drop and create all indexes
      1. Getting ready
      2. How to do it…
      3. How it works…
    5. Generating a basic SSIS package
      1. Getting ready
      2. How to do it…
      3. How it works…
      4. There's more…
    6. Generating an SSIS package with Execute SQL Tasks
      1. Getting ready
      2. How to do it…
      3. How it works…
      4. There's more…
    7. Using Convert SSIS package to Biml
      1. Getting ready
      2. How to do it…
      3. How it works…
      4. There's more…
    8. Generating a mass change to stored procedures
      1. Getting ready
      2. How to do it…
      3. How it works…
      4. There's more…
    9. Generating an SSIS package containing a Data Flow Task
      1. Getting ready
      2. How to do it…
      3. How it works…
    10. Generating your first Azure Data Factory
      1. Getting ready
      2. How to do it…
      3. How it works…
      4. There's more…
    11. Loading multiple files using Data Factory
      1. Getting ready
      2. How to do it…
      3. How it works…
      4. See also
  18. Other Books You May Enjoy
    1. Leave a review - let other readers know what you think
54.158.138.161