Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

1) Introduction To Numpy, Pandas and Matplotlib

Download as pdf or txt
Download as pdf or txt
You are on page 1of 11

1) Introduction to NumPy, Pandas and Matplotlib

Learning Objectives: This Module helps you get familiar with basics of statistics,
different types of measures and probability distributions, and the supporting libraries
in Python that assist in these operations. Also, you will learn in detail about data
visualization.

Topics:
 NumPy - arrays
 Operations on arrays
 Indexing slicing and iterating
 Reading and writing arrays on files
 Pandas - data structures & index operations
 Reading and Writing data from Excel/CSV formats into Pandas
 matplotlib library
 Grids, axes, plots
 Markers, colours, fonts and styling
 Types of plots - bar graphs, pie charts, histograms
 Contour plots

Hands On/Demo:
 NumPy library- Creating NumPy array, operations performed on NumPy array
 Pandas library- Creating series and dataframes, Importing and exporting data
 Matplotlib - Using Scatterplot, histogram, bar graph, pie chart to show information,
Styling of Plot

Skills:
 Probability Distributions in Python
 Python for Data Visualization
2) Data Manipulation
Learning Objective: Through this Module, you will understand in detail about Data
Manipulation

Topics:
 Basic Functionalities of a data object
 Merging of Data objects
 Concatenation of data objects
 Types of Joins on data objects
 Exploring a Dataset
 Analysing a dataset

Hands On/Demo:
 Pandas Function- Ndim(), axes(), values(), head(), tail(), sum(), std(), iteritems(),
iterrows(), itertuples()
 GroupBy operations
 Aggregation
 Concatenation
 Merging
 Joining

Skills:
 Python in Data Manipulation
3) Introduction to Machine Learning with Python
Learning Objectives: In this module, you will learn the concept of Machine Learning
and its types.

Topics:
 Python Revision (numpy, Pandas, scikit learn, matplotlib)
 What is Machine Learning?
 Machine Learning Use-Cases
 Machine Learning Process Flow
 Machine Learning Categories
 Linear regression
 Gradient descent

Hands On/Demo:
 Linear Regression – Boston Dataset

Skills:
 Machine Learning concepts
 Machine Learning types
 Linear Regression Implementation
4) Supervised Learning - I
Learning Objectives: In this module, you will learn Supervised Learning Techniques
and their implementation, for example, Decision Trees, Random Forest Classifier
etc.

Topics:
 What are Classification and its use cases?
 What is Decision Tree?
 Algorithm for Decision Tree Induction
 Creating a Perfect Decision Tree
 Confusion Matrix
 What is Random Forest?

Hands On/Demo:
 Implementation of Logistic regression
 Decision tree
 Random forest

Skills:
 Supervised Learning concepts
 Implementing different types of Supervised Learning algorithms
 Evaluating model output
5) Dimensionality Reduction
Learning Objectives: In this module, you will learn about the impact of dimensions
within data. You will be taught to perform factor analysis using PCA and compress
dimensions. Also, you will be developing LDA model.

Topics:
 Introduction to Dimensionality
 Why Dimensionality Reduction
 PCA
 Factor Analysis
 Scaling dimensional model
 LDA

Hands-On/Demo:
 PCA
 Scaling

Skills:
 Implementing Dimensionality Reduction Technique
6) Supervised Learning - II
Learning Objectives: In this module, you will learn Supervised Learning
Techniques and their implementation, for example, Decision Trees, Random Forest
Classifier etc.

Topics:
 What is Naïve Bayes?
 How Naïve Bayes works?
 Implementing Naïve Bayes Classifier
 What is Support Vector Machine?
 Illustrate how Support Vector Machine works?
 Hyperparameter Optimization
 Grid Search vs Random Search
 Implementation of Support Vector Machine for Classification

Hands-On/Demo:
 Implementation of Naïve Bayes, SVM

Skills:
 Supervised Learning concepts
 Implementing different types of Supervised Learning algorithms
 Evaluating model output
7) Unsupervised Learning
Learning Objectives: In this module, you will learn about Unsupervised Learning
and the various types of clustering that can be used to analyze the data.

Topics:
 What is Clustering & its Use Cases?
 What is K-means Clustering?
 How does K-means algorithm work?
 How to do optimal clustering
 What is C-means Clustering?
 What is Hierarchical Clustering?
 How Hierarchical Clustering works?

Hands-On/Demo:
 Implementing K-means Clustering
 Implementing Hierarchical Clustering

Skills:
 Unsupervised Learning
 Implementation of Clustering – various types
8) Association Rules Mining and Recommendation
Systems
Learning Objectives: In this module, you will learn Association rules and their
extension towards recommendation engines with Apriori algorithm.

Topics:
 What are Association Rules?
 Association Rule Parameters
 Calculating Association Rule Parameters
 Recommendation Engines
 How does Recommendation Engines work?
 Collaborative Filtering
 Content-Based Filtering

Hands-On/Demo:
 Apriori Algorithm
 Market Basket Analysis

Skills:
 Data Mining using python
 Recommender Systems using python
9) Reinforcement Learning
Learning Objectives: In this module, you will learn about developing a smart
learning algorithm such that the learning becomes more and more accurate as time
passes by. You will be able to define an optimal solution for an agent based on
agent-environment interaction.

Topics:
 What is Reinforcement Learning
 Why Reinforcement Learning
 Elements of Reinforcement Learning
 Exploration vs Exploitation dilemma
 Epsilon Greedy Algorithm
 Markov Decision Process (MDP)
 Q values and V values
 Q – Learning
 α values

Hands-On/Demo:
 Calculating Reward
 Discounted Reward
 Calculating Optimal quantities
 Implementing Q Learning
 Setting up an Optimal Action

Skills:
 Implement Reinforcement Learning using python
 Developing Q Learning model in python
10) Time Series Analysis
Learning Objectives: In this module, you will learn about Time Series Analysis to
forecast dependent variables based on time. You will be taught different models for
time series modeling such that you analyze a real time-dependent data for
forecasting.

Topics:
 What is Time Series Analysis?
 Importance of TSA
 Components of TSA
 White Noise
 AR model
 MA model
 ARMA model
 ARIMA model
 Stationarity
 ACF & PACF

Hands on/Demo:
 Checking Stationarity
 Converting a non-stationary data to stationary
 Implementing Dickey-Fuller Test
 Plot ACF and PACF
 Generating the ARIMA plot
 TSA Forecasting

Skills:
 TSA in Python
11) Model Selection and Boosting
Learning Objectives: In this module, you will learn about selecting one model over
another. Also, you will learn about Boosting and its importance in Machine Learning.
You will learn on how to convert weaker algorithms into stronger ones.

Topics:
 What is Model Selection?
 The need for Model Selection
 Cross-Validation
 What is Boosting?
 How Boosting Algorithms work?
 Types of Boosting Algorithms
 Adaptive Boosting

Hands on/Demo:
 Cross-Validation
 AdaBoost

Skills:
 Model Selection
 Boosting algorithm using python

You might also like