MS Azure Data Factory Lab Overview
Lab Overview
PREREQUISITES
Microsoft Azure
SQL Data Warehouse
Resource group
Data Factory
HDInsight
Script file
Azure SQL database
CANDIDATE DATA SET
Lab Overview
Module 1 – Setting Up ADF and Resources
Module 2 – Lift and Shift of SSIS to Azure
Module 3 – Rebuilding the Extract and Load with ADF
Module 4 – Enhancing Data with Cloud Services
Module 5 – Transform and Merge Data with ADF and HDInsight
Module 6 – Load Data into DW with ADF
Module 7 – Scheduling your ADF
Module 8 – Monitoring your ADF
Module 9 – Bringing it all Together
Module 1 – Setting Up ADF and Resources
Module 1 Goal
• Azure PowerShell
• Office 365
Module 2 – Lift and Shift of SSIS to Azure
Module 2 Goal
• Use parameters to make the pipeline easy to change and more reusable
• FAA Master and FAA Aircraft Hive Script files in Azure Storage from Module 1
• Azure Blob storage container from Module 3
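The parameter goal listed above can be sketched in Python as a plain dictionary that mirrors the general shape of a Data Factory pipeline definition. This is only an illustration under stated assumptions: the helper `build_copy_pipeline`, the pipeline name, the `containerName` parameter, and the `SourceBlobDataset` dataset are all hypothetical, and the activity is trimmed down to the parameter wiring, so it is not a deployable definition.

```python
import json


def build_copy_pipeline(name, default_container):
    """Return a pipeline-style definition as a plain dict (illustrative only)."""
    return {
        "name": name,
        "properties": {
            # Pipeline-level parameter with a default that a run can override.
            "parameters": {
                "containerName": {
                    "type": "String",
                    "defaultValue": default_container,
                },
            },
            "activities": [
                {
                    "name": "CopyFromBlob",  # hypothetical activity name
                    "type": "Copy",
                    "inputs": [
                        {
                            "referenceName": "SourceBlobDataset",  # hypothetical dataset
                            "type": "DatasetReference",
                            # The expression reads the pipeline parameter at run time.
                            "parameters": {
                                "container": "@pipeline().parameters.containerName"
                            },
                        }
                    ],
                }
            ],
        },
    }


pipeline = build_copy_pipeline("CopyStagingPipeline", "staging")
print(json.dumps(pipeline, indent=2))
```

Because the container is a parameter rather than a hard-coded value, the same definition could be pointed at a different container by changing one argument instead of editing the activity.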
Lab Task Overview
• Show the Hive activity to run Hive scripts against an HDInsight cluster
• Hive
• Create Copy activities to copy Azure DB and Azure Blob files to the staging schema
• Create Stored Procedure activities to call the load-dimensions and load-fact stored procedures
• Complete lab modules 3–7 first so that data is loaded into Azure SQL Data Warehouse
View pricing
https://azure.microsoft.com/en-us/pricing/details/data-factory/
Documentation
https://docs.microsoft.com/en-us/azure/data-factory/