Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

Udemy Course DP-900 Microsoft Azure Data Fundaments Guide Part 1 of 2

Download as pdf or txt
Download as pdf or txt
You are on page 1of 12
At a glance
Powered by AI
The key takeaways are about relational databases, ELT processes, workload types and Power BI components.

The different types of analytics are descriptive, diagnostic, predictive and prescriptive.

The different types of workloads are batch, streaming and micro-batch.

Udemy Course DP- 900 Microsoft Azure Data Fundaments Guide Part 1 of 2

A relational database is appropriate for which of the below scenarios?


A: For those Scenarios where there is a high volume of Transactional writes.
Ex: A relational database can be used for OLTP purposes. This is can be used for transactional systems. Hence if
you have a high volume of transactional writes, you can make use of relational databases.

Which of the following is required for an extract, load and transform (ELT) process?
A: A target data store powerful enough to transform data
Ex: In the ELT process, the transformation happens on the target system. Hence the target system must be
powerful enough to transform the data.

A company wants to load data from a customer relationship management system onto a data
warehouse by using an extract , load and transform process.
Which of the following is the stage in the ELT process you would perform the process of Extract?
A: The CRM system
Ex: Here since the data needs to be extracted from the CRM system, this will be the stage in the process of
Extraction.

A company wants to load data from a customer relationship management system onto a data
warehouse by using an extract, load and transform process. Which of the following is the stage in the
ELT process you would perform the process of Load?
A: The Data WareHouse
Ex:Here since the data needs to be loaded into the data warehouse, this will be the stage for the Load process.

A company wants to load data from a customer relationship management system onto a data
warehouse by using an extract , load and transform process.
Which of the following is the stage in the ELT process you would perform the process of Transform?
A: An in-memory data integration tool
Ex: Since the transformation is going to occur in the destination , that is the data warehouse, there needs to be
an in-memory data integration tool to perform the transformation.

A company is planning on implementing the following workloads.


1) Storage of data for a product catalog that will be loaded every 12 hours to a data warehouse
2) Storage of data for online purchases will be loaded to a data warehouse as the purchases occur
3) Updates to the inventory data that will be loaded to a data warehouse every 1000 transactions
Which of the following is the workload type for the workload?
“Storage of data for a product catalog that will be loaded every 12 hours to a data warehouse”
A: Batch
Ex: This is a batch process that is going to occur after a duration of time.

A company is planning on implementing the following workloads.


1) Storage of data for a product catalog that will be loaded every 12 hours to a data warehouse
2) Storage of data for online purchases will be loaded to a data warehouse as the purchases occur
3) Updates to the inventory data that will be loaded to a data warehouse every 1000 transactions
Which of the following is the workload type for the workload?
“Storage of data for online purchases will be loaded to a data warehouse as the purchases occur”
A: Streaming
Ex: Here since the data needs to be loaded as soon as the purchase occur, this is an example of the streaming
workload type.
A company is planning on implementing the following workloads.
1) Storage of data for a product catalog that will be loaded every 12 hours to a data warehouse
2) Storage of data for online purchases will be loaded to a data warehouse as the purchases occur
3) Updates to the inventory data that will be loaded to a data warehouse every 1000 transactions
Which of the following is the workload type for the workload?
“Updates to the inventory data that will be loaded to a data warehouse every 1000 transactions”
A: Micro-Batch
Ex: Here since the data need to be loaded into the data warehouse after a certain threshold, this is an example
of the Micro-Batch workload type.

You need to match the right type of analytics that can be used for the following business scenario?
“Why did the sales increase last month”
A: Diagnostics
Ex: This is an example of Diagnostics analytics. This is used to understand why something happened.

You need to match the right type of analytics that can be used for the following business scenario?
“Which people are mentioned in a company’s business documents”
A: Descriptive
Ex:This is an example of Descriptive analytics. Here you are describing information about the people mentioned
in the business documents.

Which of the following is the right option when it comes to relational data?
A: Relational data is stored in a tabular form of rows and column
Ex: Relational data is normally stored in the form of tables. The tables consist of rows and columns of data.

Which of the following are two uses of data visualization? Choose 2 answers from the options given
A: Represent trends and patterns over time & Communicate the significance of data
Ex: With data visualization, you can see patterns and trends in your data over time. You can also communicate
the significance of data to your stakeholders.

A company is planning on implementing the following workloads.


1) Analyzing web traffic from the past year
2) Classifying images that were uploaded last month
3) Tracking how many people are currently using a website
Which of the following is the workload type for the workload?
“Analyzing web traffic from the past year”
A: Batch
Ex: Here since you are looking at taking data from a period of time, this qualifies as batch processing

A company is planning on implementing the following workloads.


1) Analyzing web traffic from the past year
2) Classifying images that were uploaded last month
3) Tracking how many people are currently using a website
Which of the following is the workload type for the workload?
“Tracking how many people are currently using a website”
A: Streaming
Ex: Since here the user data needs to be tracked in real-time, this classifies as Streaming data.
Area 1 : Azure Data Factory
Ex: Azure Data Factory can be used as the service to Extract , transform and load the data into the destination.

Area 2 : Azure Synapse Analytics


Ex: You can use Azure Synapse Analytics. Here you can host a SQL data warehouse.

You have an application that performs reads and writes data to an Azure SQL Database. Which of the
following is the type of processing that can be performed on the database?
A: Online Transaction Processing (OLTP)
Ex: Online Transaction Processing (OLTP) is performed on a relational database.

Which of the following are two aspects of real-time data processing?


A: Data is Processed as it is created .
Low Latency is expected.
Ex: Here data needs to be processed immediately. And here low latency is expected. You should be able to
retrieve the data immediately so that it can be processed.

Which of the following are two Azure services that can be used to provision Spark clusters?
A: Azure Databricks & Azure HDInsight
Ex: You can create Spark clusters using Azure Databricks and Azure HDInsight.

You have to state whether the following is TRUE or FALSE.


“A pipeline is a representation of a data structure within Azure Data Factory”
A: False
Ex: The data set in Azure Data Factory is used to represent the data structure.

You have to state whether the following is TRUE or FALSE.


“Azure Data Factory can execute other pipelines”
A:True
Ex: Yes, you can trigger other pipelines form Azure Data Factory. You can use the Execute Pipeline step in Azure
Data Factory.
You have to state whether the following is TRUE or FALSE.
“A processing step within an Azure Data Factory pipeline is an activity”
A:True
Ex: Yes, activities in Azure Data Factory can be used as a processing step to process the data.

You have to work with the Azure Data Factory service. You need to map the Azure Data Factory
components to the right descriptions.
Which of the following component should be mapped to the following description?
“ A representation of data structures within data stores”
A:Dataset
Ex: The Dataset is a representation of the data structured within data stores.

Which of the following component should be mapped to the following description?


“The information used to connect to external data sources”
A: Linked Services
Ex: The Linked Service can be used to connect to external data sources.

Which of the following component should be mapped to the following description?


“ A logical grouping of activities that performs a unit of work and can be scheduled”
A: Pipeline
Ex: Here you can define a pipeline of activities

You need to select the appropriate an Azure service that would be used for a particular requirement.
Which of the following would you use for the following requirement?
“Output data to Parquet format”
A: Azure Data Factory
Ex: You can use Azure Data Factory to output data in a particular format.

Which of the following would you map to the Graph data structure?
A: Gremlin API
Ex: The Gremlin API supports the usage of graph database.

You need to map the appropriate Azure Cosmos DB API’s to the appropriate data structures
Which of the following would you map to the JSON data structure?
A:MongoDB API
Ex: The MongoDB API supports the usage of documents that can be used to store JSON data.

You need to map the appropriate Azure Cosmos DB API’s to the appropriate data structures
Which of the following would you map to the Key/Value data structure?
A: Table API
Ex: The Table API supports the usage of key/value pairs.

You currently manage an application that stores data within a shared folder on a Windows server. You
have to move the shared folder to Azure Storage. Which of the following service within an Azure
storage account could you use for this purpose?
A: File
Ex: You can use the File service to create file shares. These file shares can be mounted on Windows Servers.
You have to state whether the following is TRUE or FALSE.
“When data is ingested from Azure Data Lake Storage across Azure regions, then you will incur costs for
bandwidth”
A: True
Ex: For any data transfer costs across regions, there is a cost associated.

You have to state whether the following is TRUE or FALSE.


“You can use the blob, table and file storage service in the same Azure Storage account”
A: True
Ex: Yes, you can use these services at the same time in an Azure Storage account.

You have to state whether the following is TRUE or FALSE.


“You can implement Azure Data Lake Storage by creating an Azure Storage account”
A: True
Ex:Yes, since Azure Data Lake Storage is built on Azure Storage Accounts.

Which of the following is an Azure Data Factory component that provides the compute environment for
activities?
A: An integration runtime
Ex: This is provided by the integration runtime.

You have to state whether the following is TRUE or FALSE.


“Azure Databricks is an Apache Spark-based collaborative analytics platform”
A: True
Ex:Azure Databricks is an Apache Spark based platform.

You have to state whether the following is TRUE or FALSE.


“Azure Data Factory orchestrates data Integration workflows”
A: True
Ex: Azure Data Factory is used to create data integration workflows.

You need to select the appropriate an Azure service that would be used for a particular requirement.
Which of the following would you use for the following requirement?
“Store data in Parquet format”
A: Azure Data Lake Storage
Ex: You can store files in virtually any data format in Azure Data Lake Storage.

You need to select the appropriate an Azure service that would be used for a particular requirement.
Which of the following would you use for the following requirement?
“Persist a tabular representation of data that is stored in Parquet format”
A: Azure Synapse Analytics
Ex: The tabular representation of the data is normally stored in Azure Synapse Analytics.

Which of the following is the right statement when it comes to the massively parallel processing engine
in Azure Synapse Analytics?
A: The massively parallel processing engine of Azure Synapse Analytics distributes processing across
compute nodes.
Ex: The engine actually sends all the processing to the compute nodes.
Which of the following is an Azure storage solution that provides native support for POSIX-compliant
access control lists?
A: Azure Data Lake Storage
Ex: This feature is provided by Azure Data Lake Storage

Your company currently has a transactional application that stores data in an Azure SQL Managed
instance. In which of the following circumstances would you need to implement a read-only database
replica?
A: You need to generate reports without affecting the transactional workload.
Ex: A read-only instance can be used to reduce the workload on your OLTP database. You can generate reports
from the read-only copy of the database.”

You have to state whether the following is TRUE or FALSE.


“Azure Synapse Analytics scales storage and compute independently”
A: True
Ex: Yes, these are two separate aspects when it comes to Azure Synapse Analytics

You have to state whether the following is TRUE or FALSE.


“Azure Synapse Analytics can be paused to save on compute costs”
A: True
Ex: Yes, you can pause the Synapse pool to save on costs.

You have to state whether the following is TRUE or FALSE.


“An Azure Synapse Analytics data warehouse has fixed storage capacity”
A: False
Ex: For Gen2 data warehouse, you have unlimited storage for columnstore tables.

You have to state whether the following is TRUE or FALSE.


“Azure Databricks can consume data from an Azure SQL database”
A: True
Ex:Yes, you can consume data from an Azure SQL database.

You have to state whether the following is TRUE or FALSE.


“Azure Databricks can consume data from Azure Event Hubs”
A: True
Ex: Yes, you can consume data from an Azure Event Hubs

You have to state whether the following is TRUE or FALSE.


“Azure Databricks can consume data from Azure Cosmos DB”
A: True
Ex:Yes, you can consume data from an Azure Cosmos DB

Which of the following is TRUE when it comes a key/value data store?


A: A Key value data store is optimized for simple lookups
Ex: A key value store is better for simple lookups based on the key.
Your company has an application that runs on Windows. It needs access to a mapped drive. Which of
the following is an Azure service you can use for this requirement?
A: Azure Files
Ex: Azure Files allows you to make use of mapped drives.

Which of the following is TRUE when it comes a graph database in Azure Cosmos DB?
A: You can Query a graph database in Azure Cosmos DB as Nodes and edges by using Gremlin
Language.
Ex: The graph database is based on nodes and edges. And you can use the Gremlin API in Azure Cosmos DB to
host a graph based database.

Your company needs to design a database that shows how traffic changes in one area of a network
affect other components on the network. Which of the following is a data store type you would use for
this requirement?
A: Graph
Ex: It would be ideal to define the network as nodes in a graph-based database. You can define the relationship
between the networks as edges between the nodes.

Your company needs a non-relational data store that is optimized for storing and retrieving files,
videos, audio streams and virtual disk images. The data store must be able to store data , metadata and
a unique ID for each file.
Which of the following is a data store type you would choose for this requirement?
A: Object
Ex: For this you can use an Object data store like Azure Blob Storage. Here you can have metadata for each
object and also a unique URL.

Your company wants to have an Azure storage account in place. They want to ensure that the data is
replicated in another Azure region automatically. Which of the following can be used as the replication
strategy for the storage account? Choose 2 answers from the options given below
A: Read-Access Geo-Redundant Storage(RA_GRS) & Geo-redundant Storage(GRS)
Ex: For this you can make use of Read-access geo-redundant storage (RA_GRS) and Geo-redundant storage
(GRS). This ensures that data is replicated to another region.

You have to state whether the following is TRUE or FALSE.


“The Azure Cosmos DB API is configured separately for each database in an Azure Cosmos DB account”
A: False
Ex: The Cosmos DB API is configured at the account level and not at the database level.

You have to state whether the following is TRUE or FALSE.


“Partition keys are used in Azure Cosmos DB to optimize queries”
A: True
Ex: Yes, the data in an Azure Cosmos DB database is partitioned via the partition keys.

You have to state whether the following is TRUE or FALSE.


“Items contained in the same Azure Cosmos DB logical partition can have different partition keys”
A: False
Ex: All items in the same logical partition have the same partition key.
Which of the following is the right use case for an Azure Resource Manager template?
A: To automate the creation of an interdependent group of Azure resources in a repeatable way.
Ex: Azure Resource Manager templates can be used to deploy resources to Azure. You can reuse the same
templates to deploy the same set of resources in a repeatable way.

Which of the following is a feature in Azure Cosmos DB that provides redundancy within an Azure
region?
A: Availability Zones
Ex: When Availability Zones, data is copied onto multiple Availability Zones. Even if one zone goes down, you
still have the other zones available.

At which of the following level in Azure Cosmos DB can you configure multiple write regions and read
regions?
A: Account
Ex: This is done at the account level.

Your company is planning on designing an application that will write a high volume of JSON data that
will have an application-defined schema. Which of the following can you specify as the data store type
for this requirement?
A: Document
Ex: For this you can use a document-based data store. Here the JSON data can be stored as documents.

At which of the following levels can you set the throughput for an Azure Cosmos DB account? Choose 2
answers from the options given below.
A: Container & Database
Ex: You can define the throughput at a database or container level.

A: Update
Ex: You would use the UPDATE statement for this requirement

Your company needs to implement a relational database in Azure. The solution must minimize ongoing
maintenance. Which of the following can be implemented for this requirement?
A: Azure SQL Database
Ex: With the Azure SQL database, you don’t need to maintain the underlying infrastructure.

You have to write a set of queries that will help administrators to troubleshoot an Azure SQL database.
You have to be able to embed the documents and query results into a SQL notebook. Which of the
following would you use for this requirement?
A: Azure Data Studio
Ex: You can use SQL Notebooks in Azure Data Studio.
You have to map the right term to the term description.
Which of the following would you map to the following description?
“ A database object that holds data”
A: Table
Ex: The table is used to hold data

You have to map the right term to the term description.


Which of the following would you map to the following description?
“ A database object whose content is defined by a query”
A: View
Ex: The View is defined by a query.

You have to map the right term to the term description.


Which of the following would you map to the following description?
“ A database object that helps improve the speed of data retrieval”
A: Index
Ex: The index defined on the table can be used to improve the speed of data retrieval.

Which of the following is a command line tool that can be used to query Azure SQL databases?
A: sqlcmd
Ex: The sqlcmd tool can be used to query Azure SQL databases.

You have to state whether the following is TRUE or FALSE.


“Azure SQL Database includes a fully managed backup service”
A: True
Ex: Yes, there is a fully managed backup service that takes automated backups.

You have to state whether the following is TRUE or FALSE.


“Azure SQL Database has built-in high availability”
A: True
Ex: Yes, Azure SQL Database automatically has high availability

You have to state whether the following is TRUE or FALSE.


“Azure SQL Database can use Azure Advanced Threat Protection”
A:True
Ex: Yes, Azure SQL Database also can make use of Advanced Threat Protection.

You have to state whether the following is TRUE or FALSE.


“You can use Azure Data Studio to query a Microsoft SQL Server big data cluster”
A: True
Ex: Yes, you can use Azure Data Studio to connect to a Microsoft SQL Server Big Data cluster.

You have to state whether the following is TRUE or FALSE.


“You can use Microsoft SQL Server Management Studio to query an Azure Synapse Analytics data
warehouse”
A: True
Ex: Yes, you can use Microsoft SQL Server Management Studio to query a data warehouse hosted in Azure
Synapse Analytics.
You have to state whether the following is TRUE or FALSE.
“You can use MySQL Workbench to query Azure Database for MariaDB databases”
A: True
Ex: Yes, you can use MySQL Workbench to query Azure Database for MariaDB.

You have to state whether the following is TRUE or FALSE.


“You must apply patches to Azure SQL databases regularly”
A: False
Ex: The patches are applied automatically by the Azure SQL database service.

You have to state whether the following is TRUE or FALSE.


“You need a Microsoft 365 subscription to create an Azure SQL database”
A: False
Ex: You need an Azure subscription to create an Azure SQL database.

A: Key Value
Ex: Here you can store the data as key/value pairs

Which storage solution supports role-based access control (RBAC) at the file and folder level?
A: Azure Data Lake Storage
Ex: This is possible with Azure Data Lake Storage

Your company need to store data in Azure Blob storage. The data needs to be stored for seven years.
The retrieval time of the data is unimportant. The solution must minimize storage costs. Which of the
following is the ideal storage tier to use for this requirement?
A: Archive
Ex: You can use the Archive tier for this requirement.

Which of the following is an example of an analytics bar chart that shows year-to-date sales by region?
A: Descriptive
Ex: This should be an example on descriptive analysis.
A: Predictive
Ex: This is known as predictive analysis on what is going to happen based on past data.

You need to match the right type of analytics that can be used for the following business scenario?
“How do I allocate my budget to buy different inventory items”
A: Prescriptive
Ex: This should fall under prescriptive analysis. Here you want to know how to approach to fulfil a target
requirement based on the data that is available.

Which of the following statement is right when it comes to Microsoft Power BI?
A: A Microsoft Power BI paginated report enables users to create highly formatted , fixed layout
documents.
Ex: With paginated reports, you can create highly formatted and fixed layout documents

You have to develop a solution that needs to provide data to senior executives. Below are the key
requirements
a) The solution must provide an interactive graphic interface
b) It must depict various key performance indicators
c) It must support data exploration by using drill down
Which of the following can be used within Microsoft BI for this requirement?
A: A Dashboard
Ex: For creating solutions for senior management, you would go ahead and make use of dashboards. Here you
can add key performance indicators, provides a graphic interface and you can also perform drill down as well.

Which of the following needs to be used to build a Microsoft Power BI paginated report?
A: Power BI Report Builder
Ex: To build a paginated report, you need to use the PowerBI Report builder.

You have to develop a reporting solution for your company. The solution needs to support paginated
reports. The reports should be able to query a dimensional model in a data warehouse. Which of the
following type of processing must the reporting solution use?
A: Online Analytical Processing (OLAP)
Ex: Since we are connecting to a data warehouse, this needs to be an Online Analytical Processing system.
Which of the following are activities that are performed in the Microsoft Power BI service? Choose 2
answers from the options given below
A: Report and Dashboard creation & Report Sharing and Distribution
Ex: Here you can create your reports and dashboards. You can also share reports with other users.

You have to state whether the following is TRUE or FALSE.


“A Microsoft Power BI dashboard is associated with a single workspace”
A: True
Ex: Yes , the workspace is a logical boundary for your reports and data.

You have to state whether the following is TRUE or FALSE.


“A Microsoft Power BI dashboard can only display visualizations from a single dataset”
A: False
Ex: A dashboard can use multiple data sets for visualization.

You have to state whether the following is TRUE or FALSE.


“A Microsoft Power BI dashboard can display visualizations from a Microsoft Excel workbook”
A: True
Ex: Yes , data can be visualized from Microsoft Excel

You might also like