
Tech Launch Program Data science

The TECH LAUNCH PROGRAM is a 90-day intensive course focused on Data Science, covering topics such as Deep Learning, SQL Databases, and Business Intelligence tools. The curriculum is structured in three phases, enhancing technical skills in Python, SQL, data analysis, machine learning, and enterprise integration. Major outcomes include proficiency in data processing, visualization, and implementation of machine learning algorithms, preparing participants for real-world applications in data science.

TECH LAUNCH PROGRAM

DATA SCIENCE
Program Overview
✓ This intensive 90-day course provides a comprehensive journey through Data Science, incorporating Deep Learning, SQL Databases, and Business Intelligence tools.

✓ The curriculum is divided into three phases, each building on the one before.

Major Course Outcomes

Technical Skills Development:

✓ Proficiency in Python programming with focus on data science libraries

✓ Strong SQL database management capabilities

✓ Advanced data analysis using NumPy, Pandas, and statistical tools

✓ Deep learning and machine learning model development

✓ Expertise in BI tools (Tableau, Power BI)


Data Processing & Analysis

✓ Ability to clean, preprocess, and transform complex datasets

✓ Big data handling using tools like Spark

✓ Time series analysis and forecasting capabilities

Visualization & Reporting

✓ Creation of interactive dashboards

✓ Advanced data visualization techniques

✓ Enterprise-level BI implementation

✓ Custom visualization development


Machine Learning & AI

✓ Implementation of supervised and unsupervised learning algorithms

✓ Neural network architecture design and implementation

✓ CNN and RNN model development

✓ Transfer learning applications

Enterprise Integration

✓ Database optimization and performance tuning

✓ Security implementation and governance

✓ Server deployment and maintenance

✓ Enterprise-level system integration


Levels of Program
Phase 1 – Basic Level
– Python Basics for Data Science
– SQL Foundations
– Data Analysis using Python
– Business Intelligence Foundations

Phase 2 – Intermediate Level
– Advanced Data Analysis
– Advanced Business Intelligence
– Machine Learning Foundations
– Deep Learning Basics
Phase 1 – Basic Level

Python Basics for Data Science


Installation and Setup
✓ Install Python 3.x
✓ IDEs: VS Code / PyCharm
✓ Using Jupyter Notebook
✓ Virtual environments (venv)
✓ Package management with pip
Variables & Data Types
✓ Data Types: Int, Float, String, Boolean
✓ Type Conversion & Checking
✓ Variable Naming & Comments
✓ String Operations: Formatting (f-strings), Methods (split, join), Slicing
✓ Numeric Operations: Arithmetic, Division (/, //), Modulo
Basic Operations & Functions
✓ Operators: Arithmetic, Comparison, Logical
✓ Input/Output: print(), input(), Formatted Output
Data Structures
✓ Lists: Creation, Methods (append, extend), Slicing, Comprehensions
✓ Tuples: Immutable Sequences, Named Tuples
✓ Dictionaries: Key-Value Operations, Methods, Comprehensions
✓ Sets: Unique Elements, Union/Intersection
✓ Arrays: NumPy Basics
Control Structures
✓ Conditionals: if, elif, else
✓ Loops: for, while, break, continue, Nested Loops
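As a small illustration of the numeric, string, and list topics listed above, the following sketch contrasts true division with floor division and modulo, then uses an f-string, a split/join round trip, and a list comprehension. The sample values and variable names are made up for illustration and are not part of the course material.

```python
# Illustrative values only; none of these names come from the course material.
total_minutes = 135

hours = total_minutes // 60        # floor division -> 2
minutes = total_minutes % 60       # modulo (remainder) -> 15
exact_hours = total_minutes / 60   # true division always returns a float -> 2.25

# f-string formatting with a format spec (two decimal places)
summary = f"{hours}h {minutes}m ({exact_hours:.2f} hours)"

# split / join round trip on a string
words = "clean,transform,load".split(",")
pipeline = " -> ".join(words)

# List comprehension with a condition: squares of the even numbers in 0..9
even_squares = [n * n for n in range(10) if n % 2 == 0]

print(summary)       # 2h 15m (2.25 hours)
print(pipeline)      # clean -> transform -> load
print(even_squares)  # [0, 4, 16, 36, 64]
```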


Functions
✓ Definitions & Calls, Default/Keyword Arguments, *args, **kwargs
✓ Lambda Functions
✓ Scope & Lifetime of Variables
Modules
✓ Importing & Creating Modules
✓ Package Structure, Popular Modules
✓ Virtual Environments & pip
Object-Oriented Programming (OOP)
✓ Classes & Objects: Definitions, Methods (Instance, Class, Static)
✓ Inheritance: Single/Multiple, Overriding
✓ Encapsulation: Private/Protected Attributes, Property Decorators
✓ Polymorphism: Overloading, Duck Typing
Error Handling
✓ Try-Except, Exception Types
✓ Raising & Custom Exceptions
✓ Debugging Techniques
Practice & Mini-Project
✓ Algorithms, Data Structures, OOP, Error Handling
✓ Build a Mini-Project combining concepts
SQL Foundations
Database Concepts
✓ DBMS Overview: Relational Databases, Tables, Rows, Columns
✓ Keys: Primary & Foreign Keys, Basics of Normalization
SQL Installation
✓ Install MySQL/PostgreSQL
✓ GUI Tools: MySQL Workbench, pgAdmin
✓ Create and Set Up Sample Databases
Basic Queries
✓ SELECT Statements: Column Selection, Aliases
✓ WHERE Clause: Comparison, Logical Operators, BETWEEN, IN, LIKE
✓ ORDER BY: Ascending/Descending, Multiple Columns, NULL Handling
JOIN Operations
✓ Types: INNER, LEFT, RIGHT, FULL OUTER, CROSS, Self JOIN
✓ Handling Multiple Tables
Aggregation Functions
✓ COUNT(), SUM(), AVG(), MAX(), MIN()
✓ String Aggregations, Custom Aggregations
GROUP BY & HAVING
✓ Single/Multiple Column Grouping
✓ Aggregate Filtering
✓ Complex Grouping Scenarios
Subqueries
✓ Single & Multiple Row Subqueries
✓ Correlated Subqueries, EXISTS Operator
✓ CTEs (Common Table Expressions)
Window Functions
✓ Ranking: ROW_NUMBER(), RANK(), DENSE_RANK()
✓ LAG(), LEAD(), Partitioning, Moving Averages
Views & Stored Procedures
✓ Views & Materialized Views
✓ Stored Procedures & Functions
✓ Triggers Basics
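The difference between the three ranking functions listed under Window Functions is easiest to see on data with ties. The sketch below is one possible illustration, not course material: it assumes an in-memory SQLite database (via Python's standard `sqlite3` module) as a stand-in for MySQL or PostgreSQL, and the `scores` table and its rows are hypothetical.

```python
import sqlite3

# Assumption: in-memory SQLite stands in for MySQL/PostgreSQL.
# Window functions require SQLite 3.25 or newer.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE scores (name TEXT, score INTEGER)")  # hypothetical table
conn.executemany(
    "INSERT INTO scores VALUES (?, ?)",
    [("Asha", 90), ("Ben", 90), ("Chen", 80), ("Dina", 70)],
)

query = """
SELECT name,
       score,
       ROW_NUMBER() OVER (ORDER BY score DESC, name) AS row_num,
       RANK()       OVER (ORDER BY score DESC)       AS rnk,
       DENSE_RANK() OVER (ORDER BY score DESC)       AS dense_rnk
FROM scores
ORDER BY score DESC, name
"""
rows = conn.execute(query).fetchall()
for row in rows:
    print(row)
conn.close()
```

In this sample the two tied rows share rank 1 under both RANK() and DENSE_RANK(). RANK() then skips to 3 for the next row, DENSE_RANK() continues with 2, and ROW_NUMBER() stays unique throughout.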
Data Analysis Tools
NumPy Array Operations
✓ Array Creation, Indexing & Slicing
✓ Array Operations & Broadcasting
✓ Universal Functions
Pandas DataFrame
✓ Creation: From Files, Lists, Dicts
✓ Operations: Column, Row, Indexing (loc, iloc)
✓ Boolean Indexing & Selection
Data Cleaning
✓ Missing Values Handling
✓ Removing Duplicates
✓ Data Type Conversion
✓ String & Date/Time Operations
✓ Data Validation
Data Visualization
Matplotlib:
✓ Basic Plots: Line, Scatter, Bar, Histogram
✓ Plot Customization, Subplots, Saving
Seaborn:
✓ Statistical Plots, Heatmaps
✓ Categorical & Regression Plots
✓ Style and Color
Basic Charts
✓ Line, Bar, Pie, Box, Violin
✓ Correlation Matrices
Descriptive Statistics
✓ Central Tendency: Mean, Median, Mode
✓ Dispersion: Variance, Standard Deviation
✓ Distribution Shapes, Percentiles, Quartiles
Probability Basics
✓ Key Concepts, Random Variables
✓ Common Distributions
✓ Sampling Techniques
Statistical Testing
Hypothesis Testing:
✓ Null vs. Alternative
✓ Tests: t-tests, Chi-square
✓ Interpreting p-values
✓ Confidence Intervals
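The descriptive measures listed above can be tried out with Python's standard `statistics` module before moving to NumPy or Pandas. This is a minimal sketch on a made-up sample; the data and variable names are illustrative assumptions.

```python
import statistics

# Made-up sample, used only to illustrate the measures.
data = [2, 4, 4, 4, 5, 5, 7, 9]

# Central tendency
mean = statistics.mean(data)      # 5
median = statistics.median(data)  # 4.5 (average of the two middle values)
mode = statistics.mode(data)      # 4 (most frequent value)

# Dispersion: population vs. sample versions differ in the divisor (n vs. n - 1)
pop_variance = statistics.pvariance(data)  # 4
pop_std = statistics.pstdev(data)          # 2.0
sample_std = statistics.stdev(data)        # slightly larger than pop_std

# Quartiles: three cut points that split the sorted data into four groups
quartiles = statistics.quantiles(data, n=4)

print(mean, median, mode)
print(pop_variance, pop_std, round(sample_std, 3))
print(quartiles)
```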
Business Intelligence Foundations


✓ Power BI Desktop
✓ Interface Overview: Data, Model, and Report Views
Data Modeling
✓ Star Schema Design
✓ Relationships, Calculated Columns, Measures
DAX Basics
✓ Syntax and Common Functions
✓ Time Intelligence and Filter Context
Report Creation
✓ Visualizations: Charts, Tables, and Cards
✓ Filters, Interactions, and Formatting
✓ Publishing Reports
Phase 1 Project & Assessments


Phase 1 Project
Database Design:
✓ Requirements Analysis, Schema Design, Table Creation
✓ Data Population
Data Analysis:
✓ Exploratory & Statistical Analysis
✓ Data Preparation and Insights Generation
Dashboard Creation:
✓ Planning, Visual Selection, Interactivity, Presentation
Assessment Criteria
✓ Code Quality, Problem-Solving, Documentation
✓ Presentation, Peer Review Feedback
Practical Assignments
✓ Daily Coding Exercises & Weekly Mini-Projects
✓ SQL Query Challenges & Visualization Tasks
✓ Final Phase Project
Learning Resources
✓ Documentation, Practice Datasets, Tutorials, Video Courses, Forums
Phase 2 – Intermediate Level

Advanced Data Analysis


Complex Data Manipulation:
✓ MultiIndex, Pivot Tables, Stack/Unstack
✓ Advanced Grouping: Custom Aggregations, Rolling/Expanding Windows
✓ Memory Optimization: Chunking, Optimized DataTypes
Time Series Analysis:
✓ Basics: DateTime Indexing, Resampling, Rolling Stats
✓ Seasonal Decomposition: Trend Analysis, Moving Averages
✓ Time Zone Handling: Conversions, Localization
Data Preprocessing
✓ Cleaning: Outlier Handling, Feature Engineering
✓ Text Processing: String Operations, Regular Expressions
✓ Encoding: One-hot, Label, Target Encoding
✓ Scaling: StandardScaler, MinMaxScaler, RobustScaler
Database Optimization
✓ Query Optimization: Execution Plans, Query Profiling
✓ Design: Normalization, Denormalization, Partitioning
✓ Memory: Buffer Pool, Query Cache
Indexing
✓ Types: B-Tree, Hash, Full-Text
✓ Strategies: Composite, Covering, Index Hints
✓ Maintenance: Fragmentation, Rebuilding
Performance Tuning
✓ Query: JOIN/Subquery Optimization, WHERE Clause Tuning
✓ Server Config: Resource Allocation, Caching Strategies
✓ Monitoring: Performance Metrics, Slow Query Log
Advanced Business Intelligence
Complex DAX
✓ Advanced Functions: Time Intelligence, Filter, Iterator Functions
✓ Context Manipulation: CALCULATE, CALCULATETABLE, Context Transition
✓ Calculations: Running Totals, YTD, Previous Period Comparison
Custom Visuals
✓ Development: Custom Visual Types, Interactivity, Formatting
✓ Python Integration: Script Visuals, Custom Visual Integration
✓ Advanced Visualizations: Matrix, Decomposition Tree, AI Visuals
Row-Level Security (RLS)
✓ Models: Static RLS, Dynamic RLS, Role Hierarchy
✓ Implementation: DAX Filters, Security Roles, Testing
Deployment
✓ Workspace Management, App Deployment, Gateway Configuration
Administration
✓ Capacity Management, User Management, Usage Monitoring
Sharing & Collaboration
✓ Report & Dashboard Sharing, App Distribution
Machine Learning Foundations


Supervised Learning
Linear Regression:
✓ Simple & Multiple Regression (Feature Selection, Multicollinearity)
✓ Regularization: Ridge, Lasso, Elastic Net
Logistic Regression:
✓ Binary & Multiclass Classification (One-vs-Rest, ROC Curve)
✓ Model Evaluation: Precision-Recall, Confusion Matrix
Decision Trees & Random Forests:
✓ Tree Construction, Pruning, Splitting Criteria
✓ Random Forests: Ensemble Methods, Feature Importance
Unsupervised Learning
Clustering:
✓ K-means (Elbow Method, Silhouette Analysis)
✓ Hierarchical (Agglomerative, Dendrograms), DBSCAN
Dimensionality Reduction & PCA:
✓ PCA (Variance Explained, Component Selection)
✓ Feature Selection (Filter, Wrapper, Embedded Methods)
Model Evaluation
Cross-Validation
✓ Techniques: K-fold, Stratified K-fold, Leave-One-Out
✓ Time Series CV: Forward Chaining, Rolling Forecast
✓ Implementation: Validation Curves, Model Selection
Metrics & Hyperparameter Tuning
Metrics:
✓ Classification: Accuracy, Precision, Recall, AUC-ROC
✓ Regression: MSE, RMSE, R-squared
Hyperparameter Tuning:
✓ Methods: Grid Search, Random Search, Bayesian Optimization
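The regression metrics named above (MSE, RMSE, R-squared) can be computed by hand in a few lines, which helps when interpreting what a library reports. The following is a minimal sketch in plain Python. The function name `regression_metrics` and the sample values are hypothetical, and in practice a library such as scikit-learn would normally be used instead.

```python
import math


def regression_metrics(y_true, y_pred):
    """Return (mse, rmse, r2) for two equal-length sequences of numbers."""
    n = len(y_true)
    # Sum of squared residuals between targets and predictions
    residual_ss = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    # Total sum of squares around the mean of the targets
    mean_true = sum(y_true) / n
    total_ss = sum((t - mean_true) ** 2 for t in y_true)

    mse = residual_ss / n
    rmse = math.sqrt(mse)             # same units as the target variable
    r2 = 1 - residual_ss / total_ss   # undefined when y_true is constant
    return mse, rmse, r2


# Made-up targets and predictions, for illustration only.
y_true = [3.0, 5.0, 7.0, 9.0]
y_pred = [2.0, 5.0, 8.0, 9.0]
mse, rmse, r2 = regression_metrics(y_true, y_pred)
print(mse, round(rmse, 4), r2)
```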
Deep Learning Basics
Neural Network Architecture
✓ Components: Neurons, Layers, Weights, Biases
✓ Network Types: Feedforward, Convolutional, Recurrent
Forward & Backward Propagation
✓ Forward Propagation: Input Processing, Layer Computation, Output Generation
✓ Backward Propagation: Chain Rule, Gradient Descent, Learning Rate
Activation Functions
✓ Common: ReLU, Sigmoid, Tanh
✓ Advanced: Leaky ReLU, ELU, SELU
Deep Learning Frameworks
TensorFlow/Keras
✓ TensorFlow Basics: Tensors, Variables, Operations
✓ Keras API: Sequential, Functional, Model Subclassing
✓ Model Building: Layers, Loss Functions, Optimizers
PyTorch Basics
✓ Tensors, Autograd, Neural Network Modules
✓ Model Development: DataLoader, Training Loops, Evaluation
Simple Neural Networks
✓ Implementation: Binary & Multiclass Classification, Regression
✓ Training: Batch Processing, Epochs, Early Stopping
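Several of the activation functions named above are simple enough to write as scalar functions, which can make their shapes easier to reason about before using a framework. This is a sketch using only the standard `math` module. The `alpha=0.01` slope for Leaky ReLU is a commonly used default and is an assumption here, not a value taken from the course.

```python
import math


def relu(x):
    """Pass positive inputs through, clamp negative inputs to zero."""
    return max(0.0, x)


def leaky_relu(x, alpha=0.01):
    """Like ReLU, but negative inputs keep a small slope (alpha is assumed)."""
    return x if x > 0 else alpha * x


def sigmoid(x):
    """Squash any real input into (0, 1); overflows for very large negative x."""
    return 1.0 / (1.0 + math.exp(-x))


def tanh(x):
    """Squash any real input into (-1, 1), centered at zero."""
    return math.tanh(x)


for value in (-2.0, 0.0, 3.0):
    print(value, relu(value), leaky_relu(value), sigmoid(value), tanh(value))
```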
Phase 2 Project Overview


ML Model Development
✓ Project Planning: Problem Definition, Data Collection, Model Selection
✓ Implementation: Data Preprocessing, Model Training, Evaluation
BI Dashboard Integration
✓ Data Pipeline: ETL, Feature Engineering, Model Predictions
✓ Visualization: Interactive Dashboards, Real-Time Updates, Performance Metrics
Database Integration
✓ Data Storage: Model Artifacts, Predictions, Metrics
✓ API Development: REST Endpoints, Real-Time Scoring, Monitoring
Learning Resources
✓ Technical Documentation, Research Papers, Online Courses
✓ Industry Case Studies, GitHub Repositories
Assessment Methods
✓ Coding Assignments, Project Milestones, Peer Reviews
✓ Technical Presentations, Model Performance Metrics
Practical Applications
✓ Real-World Datasets, Industry Projects
✓ Kaggle Competitions, Portfolio Development
Learning Resources & Tools
✓ Enterprise tools documentation
✓ Cloud platform documentation
✓ Industry best practices
✓ Research papers
✓ Case studies
✓ Online courses
Assessment Methods
✓ Code reviews
✓ Architecture reviews
✓ Performance metrics
✓ Documentation quality
✓ Presentation skills
✓ Project completion

Best Practices

✓ Code quality standards


✓ Security guidelines
✓ Performance optimization
✓ Documentation standards
✓ Testing procedures
✓ Deployment protocols

Thank you
