Is there a way that we can build our Azure Data Factory all with parameters based on MetaData?

#ScottishSummit2021
E r w i n d e K r e u k
A z u r e D a t a F a c t o r y
E l o n 1 7 : 0 0 G M T
Is there a way that we can build our Azure Data Factory all with
parameters based on MetaData?

I n S p a r k
L e a d D a t a & A I
@ e r w i n d e k r e u k
Erwin
De Kreuk
Is there a way that we can build our Azure Data Factory all
with parameters based on MetaData?

We help organizations
accelerating their digital
transformation with impactful
Microsoft solutions & expertise
We Are InSpark

• Hybrid data integration service
• With visual tools, you can build, debug, deploy, operationalize and
monitor your (big) data pipelines
• Provides a way to transform data at scale without any coding required ELT
Platform
What is Azure Data Factory?

Scottish
Summit
Template
 Global Parameter
 Pipeline Parameter
 Dataset Parameter
 Notebook Parameter
 Linked Service Parameter
 Dataflow Parameter

Global Parameters
 Can be used across all your
Pipelines
 Can be deployment in CI/CD
pipeline().globalParameters.<parameterName>.

Pipelines
Global parameters - Azure Data Factory | Microsoft Docs
Disabled
Global Parameters

Enabled
Global Parameters
Pipelines
Global parameters - Azure Data Factory | Microsoft Docs

Dataset Parameters
 Create 1 dataset for all your
Linked Services activities
 You can’t use Global Parameters
FileSystem Directory FileName

Pipeline Parameters
Pipelines

Notebook Parameters
 Pass Parameters from ADF to
Databricks

Linked Services Parameters
 Connect to different Database
on same Server
 Connect to different Logical
Servers

• Pipeline
• Global(Only ADF)
• Linked Service
Parameters
• DataSet
• Notebooks

Is there a way that we can build our Azure Data Factory all with parameters based on MetaData?

Metadata
Load your pipelines
dynamically

Can we get answers on the
following questions?
Can we build ADF Pipelines
dynamically?
Can we extract data from my sources
based on MetaData?
Can we load the active(current) or
historical records to a DataStore?
Can we build history from extracted
data based on MetaData?
Can we log the execution of the
Pipelines?

Source Name
Source Schema
DataLake Catalog
Table Destination
Schema
IsIncremental LastLoadTime
Table Destination
Name IsActive
IsIncremental
Column
Metadata
Source Parameter table

Lookup
Get Source data
ForEach
For Each
Execute Pipeline
Load
Lookup
Get LastLoadDate
Copy
Copy Source to
ADLS
Stored Procedure
Set LastLoadDate
Command
Execute
SELECT [PipelineParameterId]
,[SourceName]
,[SourceSchema]
,[SelectQuery]
,[SelectLastLoaddate]
,[FilePath]
,[FileName]
,[TableDestinationName]
,[ProcessType]
,[IsActive]
,[IsIncremental]
,[IsIncrementalColumn]
,[LastLoadtime]
FROM [execution].[Pipeline_DataLake_Files]
SELECT case when 1=1 then
convert(varchar,max(LasteditedWhen),120) else
convert(varchar,getdate(),120) end as LastLoadDate FROM
SourceSchema.SourceTable
Metadata
Source Parameter table

Logging  Log Start and End Time of records
 Log Extracted Records
 Log Execution Failure
 Create Pipeline_ExecutionLog table
[audit].[Event_Pipeline_OnBegin] [audit].[Event_Pipeline_OnEnd]
[audit].[Event_Pipeline_OnError]
PIPELINE ACTIVITY

Logging  Log Start and End Time of records
 Log Extracted Records
 Log Execution Failure
 Create Pipeline_ExecutionLog table
Pipeline_ExecutionLog
BEGIN
Insert new Record
Insert Metadata
Insert Start time
END
End Time
Status(1)
Row Counts
Pipeline Details
ERROR
End Time
Status(2)
Failure Message

Integration
runtime
on premises
datasources
Databricks
Data Factory
Azure SQL
Azure SQL Database
Data Lake
Intermediate Zone
Parquet
Azure
Synapse
Analytics
Data Factory
Data Lake
Raw Zone
Parquet
Data Store
Delta Lake
Data Lake
EXTRACT PREP LOAD
Auditing, Logging, MetaData and Execution
Power BI
HIGH OVERVIEW ARCHITECTURE
NITROGEN Data Accelerator

Process Flow
For each
Daily Run
Data Lake
Command
Delta Lake
Command
Data Store
Command
Data Lake
Execute
For each
Delta Lake
Execute
For each
Data Store
Execute
Auditing
Pipeline_DataLake
Pipeline_ExecutionLog
Pipeline_DeltaLake Pipeline_DataStore
Begin End Error
Auditing
Begin End Error
Auditing
Begib End Error
Command
Execute

@erwindekreuk
https://www.linkedin.com/in/erwindekreuk/
Questions?
https://erwindekreuk.com
Slides will be available on my blog

Is there a way that we can build our Azure Data Factory all with parameters based on MetaData?

More Related Content

What's hot

What's hot (20)

Similar to Is there a way that we can build our Azure Data Factory all with parameters based on MetaData?

Similar to Is there a way that we can build our Azure Data Factory all with parameters based on MetaData? (20)

More from Erwin de Kreuk

More from Erwin de Kreuk (9)

Recently uploaded

Recently uploaded (20)

Is there a way that we can build our Azure Data Factory all with parameters based on MetaData?

Editor's Notes