Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
0% found this document useful (0 votes)
34 views

Data and Analytics Syllabus

data science syllabus for 6th sem
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
34 views

Data and Analytics Syllabus

data science syllabus for 6th sem
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

nB.N.M.

Institute of Technology
An Autonomous Institution under VTU, Approved by AICTE
Department of Artificial Intelligence and Machine Learning
SEMESTER – VI
Data and Analytics
Credit : 3
Course Code 21AML16652 CIA Marks 50
Teaching Hours/Week (L: T: P: J) 2:2:0:0AML SEA Marks 50
Total Number of Lecture Hours 40 Exam Hours 03
Course Learning Objectives:
This course will enable students to
• Introduce the concepts of python programming, SQL and HTML.
• Identify different cloud platforms and their characteristics for data specific core computations.
• Identify the concepts of data warehousing and big data.
• Determine the concepts of Power Business Intelligence.
• Design queries for analysis of data.
Number of Bloom’s
Hours Level
Module-1
SQL: Database and RDBMS, NoSQL, RDBMS Advanced Queries,
Functions, Aggregate and Analytical, Data Integrity, Subqueries & Views,
SQL Advanced Concepts
Python: Introduction to python, python variables and syntax, conditions,
python strings, tuples, functions in python, Matplotlib and seaborn for data
visualization, Data visualization: plots, histograms, heatmap, Data
preprocessing: missing values, outliers, encoding, Scaling and 8 Apply
normalization, multivariate analysis, packages and python connectivity
XML, Javascript and Webservices: What is XML, XML basic tags and
XML examples, What is Javascript and why Javascript Requires and
Javascript example, Introduction to Webservices, What is Client and server
technologies, HTTP Protocols and various methods of HTPP?

Module-2
Data specific core: AWS Data Services: Identity Access Management-
IAM, Elastic Cloud Compute- EC2, AWS RDS, AWS Glue, AWS
Redshift, AWS EMR, AWS - Dynamo DB, AWS QuickSight, AWS
SageMaker
8 Apply
GCP Data Services: Compute Engine, Cloud Storage, Cloud SQL,
Spanner, Datastore, Bigtable, BigQuery, Data Proc, DataFlow, Cloud
Composer, Dataprep, Data Fusion, Cloud AutoML, Looker
Azure Data Services: introduction to Azure storage, benefits of Azure
storage, Blob storage, Azure files, Azure container storage, types of storage
accounts, encryption, Azure HDInsight, What is HDInsight and the
Hadoop technology stack?, Cluster types in HDInsight, Programming
languages in HDInsight, Azure Cosmos DB - Database for the AI Era,
Simplified application development
Module-3
Data Specific - DWH, DI/BI Concepts: Introduction to NoSQL, Key
Features of NoSQL, advantages and disadvantages of NoSQL, Types of
NoSQL database, Databricks architecture overview, High-level
architecture, Serverless compute plane, Classic compute plane
8 Apply
Big Data Hadoop and Spark: History and timeline of big data, RDDs vs
DataFrames and Datasets, When to use them and why, Benefits of Dataset
APIs, Understanding Hadoop Architecture, Components, and How It
Works, Functional Programming.
Module-4
Power BI: What is PowerBI, the parts of Power BI, How Power BI
matches the role in a team or project, The flow of work in Power BI, How
Microsoft Fabric works with Power BI, Paginated reports in the Power BI 8 Apply
service, On-premises reporting with Power BI Report Server, powerBI
visuals, Preattentive Attributes in Visualization
Module-5
Snowflake: Getting started with snowflake, key concepts and architecture,
supporting cloud platforms, supported cloud regions, snowflake editions,
snowflake releases, overview of key features, overview of data life cycle,
continuous data protection, snowflake ecosystem, snowflake partner 8 Apply
connect, general configuration, snowflake architecture, snowflake virtual
warehouse overview, snowflake features, Analysis of real time applications
through snowflake.
Course outcomes:
The students will be able to:
• Apply the core concepts of SQL, Python and XML to perform analytics on data. (Apply)
• Develop a comparison study of various cloud platforms and study their services and database. (Apply)
• Apply the concept of Data warehousing, business intelligence and big data to analyze datasets.
(Apply)
• Design and develop power business intelligence and visualization for data analytics through snowflake
platform. (Apply)
• Analyze datasets with appropriate programming language, SQL and snowflake queries. (Analyze)
References:

Sl No Modules Recommended Links for reference

1 https://www.slideshare.net/search?searchfrom=header&q=SQL
Data Specific -
2 https://docs.snowflake.com/en/sql-reference-commands
SQL Deep-dive
3 https://comparecloud.in/
4 Data Specific - https://www.geeksforgeeks.org/introduction-to-nosql/
5 DWH, DI/BI https://docs.databricks.com/en/getting-started/overview.html
6 Concepts https://docs.snowflake.com/en/sql-reference-commands
7 https://www.techtarget.com/whatis/feature/A-history-and-timeline-of-big-data
https://www.databricks.com/blog/2016/07/14/a-tale-of-three-apache-spark-apis-
8 Data Specific - Big rdds-dataframes-and-datasets.html
Data Hadoop +
Spark https://medium.com/@chenglong.w1/demystifying-yarn-understanding-its-
9
architecture-components-and-how-it-works-738dd95ad453
10 https://github.com/readme/guides/functional-programming-basics
11 AWS Data https://aws.amazon.com/quickstart/
12 Services https://aws.amazon.com/about-aws/global-infrastructure/?p=ngi&loc=0
13 https://learn.microsoft.com/en-us/azure/storage/common/storage-introduction
14 https://learn.microsoft.com/en-us/training/paths/azure-sql-fundamentals/
15 https://learn.microsoft.com/en-us/azure/data-factory/quickstart-get-started
16 https://learn.microsoft.com/en-us/azure/hdinsight/hdinsight-overview
Azure Data
17 Services https://learn.microsoft.com/en-us/azure/synapse-analytics/overview-what-is
18 https://learn.microsoft.com/en-us/fabric/get-started/microsoft-fabric-overview
19 https://learn.microsoft.com/en-us/azure/cosmos-db/introduction
https://learn.microsoft.com/en-IN/azure/machine-learning/tutorial-azure-ml-in-a-
20
day?view=azureml-api-2
21 https://www.youtube.com/@googlecloudtech
22 https://thecloudgirl.dev/sketchnote.html
GCP Data Services
23 https://cloud.google.com/docs
24 https://cloud.google.com/architecture
25 https://www.youtube.com/watch?v=yKTSLffVGbk
26 https://learn.microsoft.com/en-us/power-bi/fundamentals/power-bi-overview
27 https://powerbi.microsoft.com/en-us/blog/
28 https://powerbi.microsoft.com/en-my/search/community/
29 Power BI https://www.youtube.com/watch?v=77jIzgvCIYY
30 https://appsource.microsoft.com/en-us/marketplace/apps?product=power-bi-visuals
31 https://www.perceptualedge.com/about.php
32 https://daydreamingnumbers.com/blog/preattentive-attributes-example/
33 https://www.storytellingwithdata.com/
34 https://docs.snowflake.com/en/sql-reference-commands
35 https://learn.snowflake.com/en/
36 https://quickstarts.snowflake.com/
Snowflake
https://www.snowflake.com/resource/7-snowflake-reference-architectures-
37
application-builders/#main-content
38 https://www.snowflake.com/en/data-cloud/pricing-options/

Marks Distribution for Assessment


CIA Components Description Marks
(50)
Written test • Total Number of Test:03
• Each Theory test will be conducted for 30 marks 30
• Average of 3 tests= 30 Marks
Assignment Perform data analytics with a dataset and represent them with data 10
visualization.
Presentation Presenting the data visualization and representing the concepts of 10
data analytics.
Total CIA 50
SEA Written exam • Theory exam will be conducted for 100 marks and scaled 50
(50) down to 50 marks.
• The question paper will have 10 full questions each of 20
marks. Students have to answer 5 full questions.
Total Marks for the Course 100

You might also like