0%

Over 90 recipes to help you orchestrate modern ETL/ELT workflows and perform analytics using Azure services more easily

Key Features

  • Build highly efficient ETL pipelines using the Microsoft Azure Data services
  • Create and execute real-time processing solutions using Azure Databricks, Azure Stream Analytics, and Azure Data Explorer
  • Design and execute batch processing solutions using Azure Data Factory

Book Description

Data engineering is one of the faster growing job areas as Data Engineers are the ones who ensure that the data is extracted, provisioned and the data is of the highest quality for data analysis. This book uses various Azure services to implement and maintain infrastructure to extract data from multiple sources, and then transform and load it for data analysis.

It takes you through different techniques for performing big data engineering using Microsoft Azure Data services. It begins by showing you how Azure Blob storage can be used for storing large amounts of unstructured data and how to use it for orchestrating a data workflow. You'll then work with different Cosmos DB APIs and Azure SQL Database. Moving on, you'll discover how to provision an Azure Synapse database and find out how to ingest and analyze data in Azure Synapse. As you advance, you'll cover the design and implementation of batch processing solutions using Azure Data Factory, and understand how to manage, maintain, and secure Azure Data Factory pipelines. You'll also design and implement batch processing solutions using Azure Databricks and then manage and secure Azure Databricks clusters and jobs. In the concluding chapters, you'll learn how to process streaming data using Azure Stream Analytics and Data Explorer.

By the end of this Azure book, you'll have gained the knowledge you need to be able to orchestrate batch and real-time ETL workflows in Microsoft Azure.

What you will learn

  • Use Azure Blob storage for storing large amounts of unstructured data
  • Perform CRUD operations on the Cosmos Table API
  • Implement elastic pools and business continuity with Azure SQL Database
  • Ingest and analyze data using Azure Synapse Analytics
  • Develop Data Factory data flows to extract data from multiple sources
  • Manage, maintain, and secure Azure Data Factory pipelines
  • Process streaming data using Azure Stream Analytics and Data Explorer

Who this book is for

This book is for Data Engineers, Database administrators, Database developers, and extract, load, transform (ETL) developers looking to build expertise in Azure Data engineering using a recipe-based approach. Technical architects and database architects with experience in designing data or ETL applications either on-premise or on any other cloud vendor who wants to learn Azure Data engineering concepts will also find this book useful. Prior knowledge of Azure fundamentals and data engineering concepts is needed.

Table of Contents

  1. Azure Data Engineering Cookbook
  2. Contributors
  3. About the author
  4. About the reviewers
  5. Preface
    1. Who this book is for
    2. What this book covers
    3. To get the most out of this book
    4. Download the example code files
    5. Download the color images
    6. Conventions used
    7. Sections
    8. Getting ready
    9. How to do it…
    10. How it works…
    11. There's more…
    12. See also
    13. Get in touch
    14. Reviews
  6. Chapter 1: Working with Azure Blob Storage
    1. Technical requirements
    2. Provisioning an Azure storage account using the Azure portal
    3. Getting ready
    4. How to do it…
    5. How it works…
    6. Provisioning an Azure storage account using PowerShell
    7. Getting ready
    8. How to do it…
    9. How it works…
    10. Creating containers and uploading files to Azure Blob storage using PowerShell
    11. Getting ready
    12. How to do it…
    13. How it works…
    14. Managing blobs in Azure Storage using PowerShell
    15. Getting ready
    16. How to do it…
    17. How it works…
    18. Managing an Azure blob snapshot in Azure Storage using PowerShell
    19. Getting ready
    20. How to do it…
    21. How it works…
    22. Configuring blob life cycle management for blob objects using the Azure portal
    23. Getting ready
    24. How to do it…
    25. How it works…
    26. Configuring a firewall for an Azure storage account using the Azure portal
    27. Getting ready
    28. How to do it…
    29. How it works…
    30. Configuring virtual networks for an Azure storage account using the Azure portal
    31. Getting ready
    32. How to do it…
    33. How it works…
    34. Configuring a firewall for an Azure storage account using PowerShell
    35. Getting ready
    36. How to do it…
    37. How it works…
    38. Configuring virtual networks for an Azure storage account using PowerShell
    39. Getting ready
    40. How to do it…
    41. How it works…
    42. Creating an alert to monitor an Azure storage account
    43. Getting ready
    44. How to do it…
    45. How it works…
    46. Securing an Azure storage account with SAS using PowerShell
    47. Getting ready
    48. How to do it…
    49. How it works…
  7. Chapter 2: Working with Relational Databases in Azure
    1. Provisioning and connecting to an Azure SQL database using PowerShell
    2. Getting ready
    3. How to do it…
    4. How it works…
    5. Provisioning and connecting to an Azure PostgreSQL database using the Azure CLI
    6. Getting ready
    7. How to do it…
    8. How it works…
    9. Provisioning and connecting to an Azure MySQL database using the Azure CLI
    10. Getting ready
    11. How it works…
    12. How it works…
    13. Implementing active geo-replication for an Azure SQL database using PowerShell
    14. Getting ready
    15. How to do it…
    16. How it works…
    17. Implementing an auto-failover group for an Azure SQL database using PowerShell
    18. Getting ready
    19. How to do it…
    20. How it works…
    21. Implementing vertical scaling for an Azure SQL database using PowerShell
    22. Getting ready
    23. How to do it…
    24. How it works…
    25. Implementing an Azure SQL database elastic pool using PowerShell
    26. Getting ready
    27. How it works…
    28. How it works…
    29. Monitoring an Azure SQL database using the Azure portal
    30. Getting ready
    31. How to do it…
  8. Chapter 3: Analyzing Data with Azure Synapse Analytics
    1. Technical requirements
    2. Provisioning and connecting to an Azure Synapse SQL pool using PowerShell
    3. Getting ready
    4. How to do it…
    5. How it works…
    6. Pausing or resuming a Synapse SQL pool using PowerShell
    7. Getting ready
    8. How to do it…
    9. How it works…
    10. Scaling an Azure Synapse SQL pool instance using PowerShell
    11. Getting ready
    12. How to do it…
    13. How it works…
    14. Loading data into a SQL pool using PolyBase with T-SQL
    15. Getting ready
    16. How to do it…
    17. How it works…
    18. Loading data into a SQL pool using the COPY INTO statement
    19. Getting ready
    20. How to do it…
    21. How it works…
    22. Implementing workload management in an Azure Synapse SQL pool
    23. Getting ready
    24. How to do it…
    25. How it works…
    26. Optimizing queries using materialized views in Azure Synapse Analytics
    27. Getting ready
    28. How to do it…
    29. How it works…
  9. Chapter 4: Control Flow Activities in Azure Data Factory
    1. Technical requirements
    2. Implementing control flow activities
    3. Getting ready
    4. How to do it…
    5. Implementing control flow activities – Lookup and If activities
    6. Getting ready
    7. How to do it…
    8. Triggering a pipeline in Azure Data Factory
    9. Getting ready
    10. How to do it…
  10. Chapter 5: Control Flow Transformation and the Copy Data Activity in Azure Data Factory
    1. Technical requirements
    2. Implementing HDInsight Hive and Pig activities
    3. Getting ready
    4. How to do it…
    5. Implementing an Azure Functions activity
    6. Getting ready
    7. How to do it…
    8. Implementing a Data Lake Analytics U-SQL activity
    9. Getting ready
    10. How to do it…
    11. Copying data from Azure Data Lake Gen2 to an Azure Synapse SQL pool using the copy activity
    12. Getting ready
    13. How to do it…
    14. Copying data from Azure Data Lake Gen2 to Azure Cosmos DB using the copy activity
    15. Getting ready
    16. How to do it…
  11. Chapter 6: Data Flows in Azure Data Factory
    1. Technical requirements
    2. Implementing incremental data loading with a mapping data flow
    3. Getting ready
    4. How to do it…
    5. Implementing a wrangling data flow
    6. Getting ready
    7. How to do it…
  12. Chapter 7: Azure Data Factory Integration Runtime
    1. Technical requirements
    2. Configuring a self-hosted IR
    3. Getting ready
    4. How to do it…
    5. Configuring a shared self-hosted IR
    6. Getting ready
    7. How to do it…
    8. Migrating an SSIS package to Azure Data Factory
    9. Getting ready
    10. How to do it…
    11. Executing an SSIS package with an on-premises data store
    12. Getting ready
    13. How to do it…
  13. Chapter 8: Deploying Azure Data Factory Pipelines
    1. Technical requirements
    2. Configuring the development, test, and production environments
    3. Getting ready
    4. How to do it…
    5. Deploying Azure Data Factory pipelines using the Azure portal and ARM templates
    6. Getting ready
    7. How to do it…
    8. Automating Azure Data Factory pipeline deployment using Azure DevOps
    9. Getting ready
    10. How to do it…
  14. Chapter 9: Batch and Streaming Data Processing with Azure Databricks
    1. Technical requirements
    2. Configuring the Azure Databricks environment
    3. Getting ready
    4. How to do it…
    5. Transforming data using Python
    6. Getting ready
    7. How to do it…
    8. Transforming data using Scala
    9. Getting ready
    10. How to do it…
    11. Working with Delta Lake
    12. Getting ready
    13. How to do it…
    14. Processing structured streaming data with Azure Databricks
    15. Getting ready
    16. How to do it…
    17. Why subscribe?
  15. Other Books You May Enjoy
    1. Packt is searching for authors like you
    2. Leave a review - let other readers know what you think
3.14.130.24