Azure SQL Trainings: Contact: +91 90 32 82 44 67

This document outlines a training course on Azure SQL and big data technologies. Topics include introductions to Azure, Azure Storage, Azure SQL, Azure Data Factory, Azure Data Lake, U-SQL, Azure streaming, HDInsight, Databricks, Spark, database migration, Sqoop, Hive, MapReduce, and Scala. The training aims to teach these technologies through explanations and hands-on exercises.

Uploaded by

jeffa123

Azure SQL Trainings

Contact : +91 90 32 82 44 67

Introduction to Azure

1) Introduction to Azure Cloud
2) Difference between Azure Cloud and On-Premises
3) What are Subscriptions and Resource Groups
4) Different Cloud offerings: IaaS, PaaS and SaaS
5) Creation of Virtual Machine

Introduction to Storage

1) Azure Storage
Azure Blob
Azure Table
Azure Message
Azure Queue
2) Azure Data Lake Store (Gen1 and Gen2)

Introduction to Azure SQL

1) Introduction to Azure SQL Database

2) Introduction to Azure SQL Data Warehouse
3) Installation of SQL Server 2016 and above in Virtual Machine
4) Creation of External Table with PolyBase in On-Premises SQL Server
Creation of Master Key
Creation of Database Scoped Credential
Creation of External Data Source
Creation of External File Format
Creation of External Table
5) Creation of External Table with PolyBase in Azure SQL Data Warehouse
Creation of Master Key
Creation of Database Scoped Credential
Creation of External Data Source
Creation of External File Format
Creation of External Table
6) Different Distribution or Sharding Patterns
ROUND ROBIN
HASH
REPLICATION
7) Cross-Database Queries in Azure SQL Database
Creation of Master Key
Creation of Database Scoped Credential
Creation of External Data Source
Creation of External Table

8) Creation of Elastic Pools in Azure SQL Server between Databases
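
The three distribution patterns named in item 6 can be illustrated with a minimal plain-Python sketch. It does not use or model the Azure SQL Data Warehouse engine: the function names, the CRC32 hash, and the small distribution counts are assumptions chosen only to show how rows end up placed under each pattern.

```python
import zlib
from collections import defaultdict


def round_robin(rows, n):
    """Assign rows to n distributions in turn, ignoring their content."""
    buckets = defaultdict(list)
    for i, row in enumerate(rows):
        buckets[i % n].append(row)
    return dict(buckets)


def hash_distribute(rows, key, n):
    """Assign each row by a deterministic hash of its distribution column.

    CRC32 is a stand-in (assumption); the real service uses its own hash.
    """
    buckets = defaultdict(list)
    for row in rows:
        h = zlib.crc32(str(row[key]).encode("utf-8"))
        buckets[h % n].append(row)
    return dict(buckets)


def replicate(rows, n):
    """Give each of n nodes a full copy of the table."""
    return {node: list(rows) for node in range(n)}


# Hypothetical sample data for illustration only.
rows = [{"id": i, "customer": c} for i, c in enumerate("abacba")]
print(round_robin(rows, 3))
print(hash_distribute(rows, "customer", 3))
print(replicate(rows, 2))
```

Round robin spreads rows evenly but scatters related rows, hash keeps rows with the same key value together, and replication trades storage for local access to a small table.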

Azure Data Factory

1) Creation of Azure Data Factory

2) Creation of Linked Services
3) Creation of Datasets
4) Creation of Pipelines
5) Creation of Integration Runtime and different types
6) Copy Activity
7) Stored Procedure Activity
8) Lookup Activity
9) For Each Activity
10) Spark Activity
11) U-SQL Activity
12) Notebooks Activity
13) Web Activity
14) Data Flow Activity (with different transformations: join, filter, exists, condition, etc.)
15) Dynamic Queries in ADF
16) Sending mails through Logic Apps
17) Get Metadata Activity
18) If Condition Activity
19) A few more activities

Azure Data Lake and Azure Data Lake Analytics

1) How to create App Registration

2) How to create ADL and ADLA
3) How to integrate Azure Data Lake Analytics with the ADF U-SQL Activity
4) How to create jobs and submit U-SQL Queries
5) How to integrate Azure Data Lake Store with ADF

U-SQL Language

Introduction to U-SQL
U-SQL vs. SQL
Transforming Row Sets
Declare Parameters
Data Types
Expressions
Reading and Writing files
File sets
Grouping and Aggregation
U-SQL Catalog
Window functions
Set Operations
Joins
Complex Types
Extending U-SQL

Azure Streaming

Azure Event Hub

Azure Stream Analytics
Scheduling Jobs

Azure HDInsight -- Part 1

1) How to create Azure HDInsight

2) How to attach extra storage to HDInsight
3) How to SSH into the cluster and use the Spark Activity
4) How to monitor job execution in YARN
5) Introduction to Ambari, Hive and Jupyter

Azure Databricks

1) How to create a cluster

2) How to work with Databricks File System
3) How to create notebooks and Integrate with ADF
4) How to import and export the Notebooks
5) How to connect to Blob Storage and SQL DB from Databricks
6) How to read data files from Azure Blob and Azure Data Lake Store
Using Scala
Using R
Using Python
Using Spark SQL
7) Creating DataFrames
8) Converting DataFrames into a Temporary Table or Temporary View
9) Incremental and Full Load with Azure SQL Data Warehouse
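
The difference between the full and incremental loads in item 9 can be sketched in plain Python. The syllabus does not say how the incremental load is implemented; the sketch below assumes a watermark on a hypothetical `modified` column and an upsert on a hypothetical `id` key, and it uses in-memory lists in place of Databricks and Azure SQL Data Warehouse.

```python
from datetime import datetime


def full_load(source_rows):
    """Full load: the target becomes a complete copy of the source."""
    return list(source_rows)


def incremental_load(source_rows, target_rows, watermark, key="id", ts="modified"):
    """Incremental load: upsert only rows modified after the watermark.

    Returns the merged target and the new watermark to store for the next run.
    """
    merged = {row[key]: row for row in target_rows}
    new_watermark = watermark
    for row in source_rows:
        if row[ts] > watermark:
            merged[row[key]] = row  # insert new key or overwrite old version
            new_watermark = max(new_watermark, row[ts])
    return sorted(merged.values(), key=lambda r: r[key]), new_watermark


# Hypothetical sample data for illustration only.
target = [{"id": 1, "name": "a", "modified": datetime(2024, 1, 1)}]
source = [
    {"id": 1, "name": "a2", "modified": datetime(2024, 1, 3)},
    {"id": 2, "name": "b", "modified": datetime(2024, 1, 2)},
]
merged, watermark = incremental_load(source, target, datetime(2024, 1, 1))
print(merged, watermark)
```

A full load is simpler and self-correcting but rereads everything; the incremental variant moves only changed rows and depends on the source carrying a reliable modification timestamp.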

Azure Spark

1) Introduction to Apache Spark

2) Why Spark
3) Batch Vs Real Time Big Data Analytics
4) Disk and In-Memory Processing
5) Spark Execution Architecture
6) What is RDD
7) Different ways to create the RDDs
8) Transformations using map, flatMap, filter
9) Grouping using reduceByKey, groupByKey
10) Spark Actions
11) DataFrames and different ways to create them
12) Spark SQL with CSV
13) Spark SQL with Parquet
14) Spark SQL with JSON
15) Spark SQL With Database
16) Different ways of creating temp tables.
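
The transformations in items 8 and 9 can be previewed without a cluster. The sketch below is a plain-Python emulation on ordinary lists, not the Spark API: `flat_map`, `group_by_key`, and `reduce_by_key` are hypothetical helpers that only mirror the semantics of the RDD methods with similar names, and the sample lines are made up.

```python
from collections import defaultdict
from functools import reduce


def flat_map(func, items):
    """Apply func to each item and flatten the resulting sequences."""
    return [out for item in items for out in func(item)]


def group_by_key(pairs):
    """Collect all values for each key into a list."""
    groups = defaultdict(list)
    for k, v in pairs:
        groups[k].append(v)
    return dict(groups)


def reduce_by_key(func, pairs):
    """Combine the values of each key with a binary function."""
    return {k: reduce(func, vs) for k, vs in group_by_key(pairs).items()}


lines = ["spark makes rdds", "rdds are lazy", "spark is fast"]
words = flat_map(str.split, lines)                       # one line -> many words
long_words = list(filter(lambda w: len(w) > 2, words))   # keep words longer than 2
pairs = list(map(lambda w: (w, 1), long_words))          # word -> (word, 1)
counts = reduce_by_key(lambda a, b: a + b, pairs)
print(counts)
```

In Spark itself, reduceByKey combines values within each partition before the shuffle while groupByKey shuffles every value, which is why reduceByKey is usually preferred for aggregations; this single-process sketch does not model that difference.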

On-Premises Database Migration

1) DMS -- Database Migration Service
2) On-Premises SQL Server to Azure Virtual Machine
3) On-Premises SQL Server to Azure SQL Server

Azure HDInsight -- Part 2

Sqoop
Oozie
Hive
Scala
Spark
Spark SQL

BIG DATA

Evolution of Data - Introduction to Big Data - Classification - Size Hierarchy - Why Big Data is Trending
(IoT, DevOps, Cloud Computing, Enterprise Mobility) - Challenges in Big Data - Characteristics - Tools for
Big Data - Why Big Data draws attention in the IT Industry - What do we do with Big Data - How Big Data can
be analyzed - Typical Distributed System - Drawbacks of Traditional Distributed Systems

LINUX

History and Evolution - Architecture - Development Commands - Env Variables - File Management -
Directory Management - Admin Commands - Advanced Commands - Shell Scripting - Group and
User Management - Permissions - Important directory structure - Disk utilities - Compression
Techniques - Misc Commands

HADOOP: HDFS (1 and 2)

What is Hadoop? - Evolution of Hadoop - Features of Hadoop - Characteristics of Hadoop - Hadoop
compared with Traditional Distributed Systems - When to use Hadoop - When not to use Hadoop -
Components of Hadoop (HDFS & MapReduce) - Hadoop Architecture - Daemons in Hadoop Version 1 &
2 - How Data is stored in Hadoop (Cluster, Datacenter, Split, Block, Rack Awareness, Replication,
Heartbeat) - Hadoop 1.0 Limitations - NameNode High Availability - NameNode Federation - How Metadata is
stored on Disk (FSImage & Editlog file) - Role of Secondary NameNode - Anatomy of File Read & File Write
- Data Integrity - Serialization - Compression - What happens when copying data in a Hadoop cluster? -
CentOS Linux Commands Exercise - Hadoop Next Gen (ver 2) single node Pseudo mode Cluster
installation - Hadoop commands Exercise

SQOOP – RDBMS

Introduction & History - Installation and configuration - Why Sqoop - In-depth Architecture -
Sqoop Import Properties - Sqoop Export Architecture - Commands (Import - HDFS, Hive, HBase from
MySQL) - Export - Incremental Import - Saved Jobs - Import All tables - Sqoop installation and
configuration - Sqoop workouts - Sqoop best practices & performance tuning - Sqoop import/export use
cases - Mock test on Sqoop

HIVE – SQL & OLAP Layer on Hadoop

Introduction - Architecture - Hive vs RDBMS - Detailed Installation (Metastore, Integrating with Hue) -
Starting Metastore and Hive Server - Data types (Primitive, Collection) - Create Tables (Managed,
External) and DML operations (load, insert, export) - Managed vs External tables - HiveQL Queries (select,
where, group by, having, sort by, order by) - Hive access through Hive Client, Beeline and Hue - File
Formats (RC, ORC, Sequence) - Partitioning (static and dynamic), partition with external table, dropping
partitions and corresponding configuration parameters - Bucketing, Partitioning vs Bucketing - Views,
different types of joins (inner, outer) - Queries (union, union all, intersection, minus) - Add files to the
distributed cache, jars to the class path - Optimized joins (Map-side join, Bucketing join) - Compression
on tables (LZO, Snappy) - SerDe (XML SerDe, JSON SerDe) - Parallel execution, Sampling data, Speculative
execution - Two POCs using a large dataset on the above topics - Mock Test on Hive and its
Architecture

HADOOP – PROCESSING ARCHITECTURE

Hadoop Ecosystems ROAD MAP - MAP REDUCE FLOW - MapReduce Job submission in YARN Cluster in
detail - What is MapReduce? - How MapReduce works at a high level - Types of Input and Output Format
- MapReduce in detail - Different types of files supported (Text, Sequence, Map and Avro) - STORAGE &
PROCESSING DAEMONS Architecture Version 1 - PROCESSING DAEMONS Architecture Version 1 - Role of
Job Tracker and Task Tracker - PROCESSING DAEMONS Architecture Version 2 (Resource Manager,
Application Master, Node Manager), Architecture and Failure
handling - Schedulers - Resource Manager High Availability - YARN Architecture

SCALA

Scala Introduction – History - Why Scala - Scala Installation - Get deep insights into the functioning of
Scala - Execute Pattern Matching in Scala - OOPs concepts (Classes, Objects, Collections, Inheritance,
Abstraction and Encapsulation) - Functional Programming in Scala (Closures, Currying, Expressions,
Anonymous Functions) - Know the concepts of classes in Scala - Object Orientation in Scala (Primary,
Auxiliary Constructors, Singleton Objects, Companion Objects) - Traits - Abstract classes

SPARK

Introduction - Scala/Python - History - Overview - MR vs Spark - Spark Libraries - Why Spark - RDDs -
Spark Internals - Transformations - Actions - DAG - Fault Tolerance - Lineage - Terminologies - Cluster
types - Hadoop Integration - Spark SQL - DataFrames - Datasets - Optimizers - AST - Session -
Structured Streaming - RDDs to Relations - Spark Streaming - Why Spark Streaming - Data masking
techniques - SCD implementation - Real-time use cases - End-to-end real-time integration with NiFi,
Kafka, Spark Streaming, EC2, Cassandra, RDBMS, Different Filesystems, Hive, Oozie & HBase
