Edukuron Data Engineering
Edukuron Data Engineering
Engineering
Reach Us
+91 77603 09798 3rd cross, Aswath Nagar,
info@edukron.com
+91 77602 89798 Marathahalli Bridge,
Bangalore – 560037
Data Course
Engineering Curriculum
MODULE MODULE Control Flow and Error Handling in
Variables and Data Types in Python Python
01 02
MODULE MODULE
7. Continuous assessments for skill enhancement. Data Structures in Python Functions and Scope in Python
03 04
Tuples Generators
MODULE MODULE
File Handling Object-Oriented Programming (OOP)
05 06
MODULE Introduction to Apache Spark and MODULE
PySpark Working with Data in PySpark
09 10
File Modes Constructors and Destructors Overview of Apache Spark and PySpark Transformations and Actions
Working with JSON and CSV Files Inheritance Installation and Setup Lazy Evaluation
Exception Handling with Files Polymorphism PySpark Architecture Caching and Persistence
File Handling Best Practices Encapsulation RDDs (Resilient Distributed Datasets) Partitions and Parallelism
Reading and Writing Data Introduction to MLlib Performance Tuning and Optimization Data Pipelines and ETL (Extract, Transform, Load)
Working with Structured Data (CSV, JSON, Parquet) Feature Engineering Clickstream Analysis
Cluster Management
Dictionaries Model Training and Evaluation Data Partitioning Strategies Fraud Detection
List Comprehensions Deployment and Serving Integration with other Big Data Ecosystem Tools Social Network Analysis
Introduction to Spark Streaming Introduction to GraphFrames Overview of Azure Databricks Working with DataFrames
DStreams (Discretized Streams) Creating GraphFrames Creating and Configuring Databricks Workspace ETL (Extract, Transform, Load) Operations
Window Operations Graph Algorithms Databricks Architecture Data Cleaning and Preprocessing
Stateful Streaming Graph Queries Collaborative Notebooks Window Functions and Aggregations
Encapsulation Visualization and Analysis Data Ingestion and Integration Optimization Techniques
MODULE MODULE MODULE
Data Analysis and Visualization Machine Learning with Databricks Introduction to Databases and SQL
19 20 23
Data Visualization Libraries (Matplotlib, Seaborn) Overview of Databases and Database Management Systems
Introduction to MLlib
Exploratory Data Analysis (EDA) Introduction to SQL and its Importance
Feature Engineering
Statistical Analysis Setting Up SQL Environment (e.g., MySQL, PostgreSQL)
Model Training and Evaluation
Interactive Dashboards (Databricks Visualization) Basic SQL Syntax and Statements
Hyperparameter Tuning
SQL Analytics Introduction to Data Types and Constraints
Model Deployment
MODULE
SQL Fundamentals
MODULE Real-time Data Processing with MODULE 24
Advanced Topics in Databricks
21 Structured Streaming 22
Introduction to Structured Streaming Delta Lake and Data Versioning Select Statement and Retrieving Data
Real-time Data Processing Graph Analytics Filtering Data with WHERE Clause
Window Operations Security and Authentication Sorting Data with ORDER BY Clause
Stateful Streaming Best Practices in Databricks Limiting Results with LIMIT and OFFSET Clauses
Integrating with Event Hubs or Kafka Case Studies and Hands-on Projects Using DISTINCT and Aggregate Functions
MODULE Advanced SQL Techniques and MODULE Database Administration and
MODULE Advanced SQL Queries and 27 Performance Optimization 28 Security
25 Subqueries
Overview of Data Visualization Principles Creating Basic Charts (Bar, Line, Pie)
Inserting Data into Tables
Introduction to Tableau and its Importance Formatting and Customizing Visualizations
Updating Existing Data
Installing and Setting Up Tableau Desktop Adding Filters and Parameters
Deleting Data from Tables
Connecting to Data Sources Using Groups and Sets
Managing Transactions with COMMIT and ROLLBACK
Understanding Tableau Interface and Navigation Introduction to Calculated Fields
Controlling Data Integrity with Constraints
MODULE Introduction to Power BI and Data MODULE Data Loading, Transformation, and
MODULE Advanced Visualizations and MODULE Interactive Dashboard Design in
35 Connection 36 Modeling in Power BI
31 Calculations in Tableau 32 Tableau
Using Dual Axes and Combined Axis Designing Interactive Dashboards Overview of Business Intelligence (BI) Concepts Data Loading and Transformation
Working with Trend Lines and Reference Lines Dashboard Layout and Formatting Introduction to Power BI and its Importance Data Cleansing and Manipulation
Using Maps for Geospatial Analysis Creating Actions and Interactivity Installing and Setting Up Power BI Desktop Creating Relationships between Data Tables
Implementing Advanced Calculations Best Practices for Dashboard Design Connecting to Data Sources Data Modeling and DAX (Data Analysis Expressions)
Incorporating Tableau Prep for Data Preparation Storytelling with Data using Tableau Story Points Understanding Power BI Interface and Navigation Introduction to Power Query Editor
Implementing Level of Detail (LOD) Expressions Creating Basic Visualizations (Bar, Line, Pie) Using Calculated Columns and Measures
Publishing to Tableau Server or Tableau Online
Advanced Table Calculations Formatting and Customizing Visualizations Implementing Conditional Formatting
Managing Permissions and Access Control
Forecasting and Trend Analysis Adding Filters and Slicers Working with KPIs (Key Performance Indicators)
Scheduling Data Refreshes
Clustering and Segmentation Introduction to Hierarchies and Drill-down Incorporating Map Visualizations
Collaborating with Tableau Server/Online
Integrating R and Python Scripts Using Custom Visuals from AppSource Utilizing AI Insights (Quick Insights, Q&A)
Introduction to Tableau Mobile and Embedded Analytics
Target Audience
MODULE Advanced Interactive Dashboard The ideal candidates for a data engineering course include
39 Design and Implementation
recent graduates with degrees in computer science,
information technology, or related fields.
Designing Interactive Dashboards
These individuals possess strong analytical and
Dashboard Layout and Formatting programming skills, providing a solid foundation for
Creating Drill-through and Drill-down Reports learning data engineering concepts.
Implementing Cross-filtering and Highlighting They are eager to apply their academic knowledge to
Advanced Interactivity with Bookmarks and Buttons real-world scenarios, focusing on building and managing
scalable data pipelines and infrastructure.
Additionally, professionals with experience in software
development, database management, or IT are prime
MODULE
40
Power BI Service and Collaboration candidates for a data engineering course.
They bring a practical understanding of systems and data
Publishing to Power BI Service management, which allows them to quickly adopt data
engineering practices.
Managing Dashboards and Reports in Power BI Service
These individuals often aim to upskill or transition into roles
Sharing and Collaboration Features
that focus on designing, constructing, and maintaining the
Security and Access Control architectures that enable data analysis and business
Introduction to Power BI Mobile App and Embedded intelligence, thus enhancing their value in data-driven
Analytics industries.
Projects Placement
"EDUKRON in Marathahalli is undoubtedly the go-to "Attending the Data Engineering course at EDUKRON
destination for mastering Data Engineering. The was a game-changer for me. Bharath's lucid
Varun Shruti instructor, Bharath, brings extensive expertise in Data explanations coupled with real-world examples made
Engineering, Python, and Big Data technologies. With complex concepts digestible. The availability of both
hands-on sessions and a comprehensive curriculum, weekday and weekend batches accommodates
EDUKRON ensures every learner grasps the intricacies diverse schedules. Through real-time projects, I gained
Hero MotoCorp Bosch of data engineering. I highly recommend this institute practical experience that propelled my career forward.
for anyone aspiring to excel in the field of Data Thanks to Bharath and EDUKRON, I now feel confident in
13.5LPA 13.5LPA Engineering." my Data Engineering skills."