Tech Launch Program Data science
Tech Launch Program Data science
DATA SCIENCE
Program Overview
✓ This intensive 90-day course provides a comprehensive journey through Data Science, incorporating Deep Learning,
SQL Databases, and Business Intelligence tools.
✓ The curriculum is divided into three phases, each building upon the previous knowledge.
✓ Enterprise-level BI implementation
Enterprise Integration
Phase 2 - Intermediate
– Advanced Data Analysis
– Advanced Business Intelligence
– Machine Learning Foundations
– Deep Learning Basics
Phase 1 – Basic Level
✓ Classes & Objects: Definitions, Methods (Instance, Class, ✓ Build a Mini-Project combining concepts
Static)
✓ Inheritance: Single/Multiple, Overriding
✓ Encapsulation: Private/Protected Attributes, Property Decorators
✓ Polymorphism: Overloading, Duck Typing
Phase 1 – Basic Level
SQL Foundations
Database Concepts
✓ DBMS Overview: Relational Databases, Tables, Rows,
Columns
JOIN Operations
✓ Keys: Primary & Foreign Keys, Basics of Normalization
✓ Types: INNER, LEFT, RIGHT, FULL OUTER, CROSS,
SQL Installation Self JOIN
✓ Install MySQL/PostgreSQL ✓ Handling Multiple Tables
✓ GUI Tools: MySQL Workbench, pgAdmin ✓ Aggregation Functions
✓ Create and Set Up Sample Databases ✓ COUNT(), SUM(), AVG(), MAX(), MIN()
Basic Queries ✓ String Aggregations, Custom Aggregations
✓ SELECT Statements: Column Selection, Aliases GROUP BY & HAVING
✓ WHERE Clause: Comparison, Logical Operators, ✓ Single/Multiple Column Grouping
BETWEEN, IN, LIKE
✓ Aggregate Filtering
✓ ORDER BY: Ascending/Descending, Multiple Columns,
NULL Handling ✓ Complex Grouping Scenarios
Phase 1 – Basic Level
SQL Foundations
Subqueries
✓ Single & Multiple Row Subqueries
✓ Correlated Subqueries, EXISTS Operator
✓ CTEs (Common Table Expressions)
Window Functions
✓ Ranking: ROW_NUMBER(), RANK(), DENSE_RANK()
✓ LAG(), LEAD(), Partitioning, Moving Averages
Views & Stored Procedures
✓ Views & Materialized Views
✓ Stored Procedures & Functions
✓ Triggers Basics
Phase 1 – Basic Level
Data Analysis Tools
NumPy Array Operations Data Visualization
✓ Array Creation, Indexing & Slicing ✓ Matplotlib:
✓ Array Operations & Broadcasting ✓ Basic Plots: Line, Scatter, Bar, Histogram
✓ Plot Customization, Subplots, Saving
✓ Universal Functions
✓ Seaborn:
Pandas DataFrame
✓ Statistical Plots, Heatmaps
✓ Creation: From Files, Lists, Dicts
✓ Categorical & Regression Plots
✓ Operations: Column, Row, Indexing (loc, iloc) ✓ Style and Color
✓ Boolean Indexing & Selection Basic Charts
Data Cleaning ✓ Line, Bar, Pie, Box, Violin
✓ Missing Values Handling ✓ Correlation Matrices
✓ Removing Duplicates
✓ Data Type Conversion
✓ String & Date/Time Operations
✓ Data Validation
Phase 1 – Basic Level
✓ Simple & Multiple Regression (Feature Selection, Multicollinearity) ✓ Feature Selection (Filter, Wrapper, Embedded Methods)
✓ Binary & Multiclass Classification (One-vs-Rest, ROC Curve) ✓ Techniques: K-fold, Stratified K-fold, Leave-One-Out
✓ Model Evaluation: Precision-Recall, Confusion Matrix ✓ Time Series CV: Forward Chaining, Rolling Forecast
Decision Trees & Random Forests: ✓ Implementation: Validation Curves, Model Selection
✓ Hierarchical (Agglomerative, Dendrograms), DBSCAN ✓ Methods: Grid Search, Random Search, Bayesian Optimization
Phase 2 – Intermediate Level
Deep Learning Basics
Deep Learning Frameworks
Neural Network Architecture
TensorFlow/Keras
✓ Components: Neurons, Layers, Weights, Biases
✓ TensorFlow Basics: Tensors, Variables, Operations
✓ Network Types: Feedforward, Convolutional, Recurrent
✓ Keras API: Sequential, Functional, Model Subclassing
Forward & Backward Propagation
✓ Model Building: Layers, Loss Functions, Optimizers
✓ Forward Propagation: Input Processing, Layer
Computation, Output Generation PyTorch Basics
✓ Backward Propagation: Chain Rule, Gradient Descent, ✓ Tensors, Autograd, Neural Network Modules
Learning Rate ✓ Model Development: DataLoader, Training Loops,
Activation Functions Evaluation
✓ Advanced: Leaky ReLU, ELU, SELU ✓ Implementation: Binary & Multiclass Classification,
Regression
✓ Training: Batch Processing, Epochs, Early Stopping
Phase 2 – Intermediate Level
✓ Data Pipeline: ETL, Feature Engineering, Model ✓ Technical Presentations, Model Performance Metrics
Predictions Practical Applications
✓ Visualization: Interactive Dashboards, Real-Time Updates, ✓ Real-World Datasets, Industry Projects
Performance Metrics
✓ Kaggle Competitions, Portfolio Development
Database Integration
✓ Data Storage: Model Artifacts, Predictions, Metrics
✓ API Development: REST Endpoints, Real-Time Scoring,
Monitoring
Learning Resources & Tools Assessment Methods
20
Best Practices
21
Thank you