0% found this document useful (0 votes)

2 views

Lesson 2 - Introduction to ML

Introduction to machine learning

Uploaded by

star

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views

Lesson 2 - Introduction to ML

Introduction to machine learning

Uploaded by

star

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 36

Intro to ML

Beginner Level
What is ML?
Machine
Learning
Types
Machine Learning Key Info #1

Supervised
Learning
vs

Unsupervised
Learning
vs

Reinforcement Learning
Supervised vs Unsupervised

Supervised
Learning MAIN DIFFERENCE :
vs Presence of
Labels
Unsupervised
Learning
What is label?

A label is like the

‘right answer’
Cat
provided to the
model so that it
knows whether it
made the correct
prediction

Dog
Types of ML Problems

Classification (Class based prediction),

Regression (Continuous prediction),

Clustering (Predict similar classes),

Etc…
WE FOCUS ON CLASSIFICATION AND REGRESSION
TODAY!
Machine Learning Pipeline

Data Preparation & Exploratory Data Analysis

Feature Engineering & Feature Selection

Model Selection & Testing

Hyper-parameter Tuning & Overall Evaluation

Deployment
05
Machine Learning Pipeline

Data Preparation & Exploratory Data Analysis

Feature Engineering & Feature Selection

Model Selection & Testing

Model Evaluation
05

What we are doing this week…

Step by Step Process (Oversimplified…)

Preprocessing Model Evaluation

Training
Data Preparation and EDA

IMPORTANT to ensure we understand the

data,
Eg how many columns are there, got NA
values anot, what the columns distribution
looks like, etc…
Preprocessing Principle

Create/Make data understandable to the

Model
Preprocessing Key Takeaway #1: Feature
Engineering
Data Transformation into machine readable
data (NUMBERS)
Preprocessing Key Takeaway #1: Feature
Engineering
Preprocessing Key Takeaway #1: Feature
Engineering
One Hot Encoding - Transform to columns with
1/True for presence of the element and 0/False
otherwise
Ex:
Preprocessing Key Takeaway #1: Feature
Engineering
!BEWARE of Curse of Dimensionality!
Essentially means we don’t want to use columns
with too many distinct values for one hot encoding
Preprocessing Key Takeaway #1: Feature
Engineering
Scaling - Converting numerical features to
similar magnitude
Preprocessing Key Takeaway #1: Feature
Engineering
Scaling - Converting numerical features to
similar magnitude
Preprocessing Key Takeaway #2: Feature
Selection
Preprocessing Key Takeaway #2: Feature
Selection

Intuition/Guess based
on Domain
Knowledge!
Preprocessing Key Takeaway #2: Feature
Selection

Correlation Matrix
Preprocessing Key Takeaway #2: Feature
Selection

Chi-Square Test
(For Categorical
Variables) - Test for
independence of each
feature against
target(what we wanna
predict)
Preprocessing Key Takeaway #2: Feature
Selection

AND many more…., including information

gain, fisher’s score, variance threshold,
etc…

Filter Based Feature selection, (there are

other types of feature selection algorithms
also - Wrapper Based, Embedded Feature
Preprocessing Key Takeaway #3: Train Test
Split
Model Training Principle

Pluck in the Xs (features) and Ys (Label) to

the model, let it run (Use packages! ~
Sklearn, Tensorflow, Pytorch etc…)

Key Note - Always do train test split to

prevent data leakage! (Train models on
training set, use test set for evaluation
(Identifying how accurate+consistent a
model is)
Model Training Key Takeaway #1: Packages

Scikit-Learn https://scikit-learn.org/stable/
Model Training Key Takeaway #2: Linear
Regression Model
A simple model that
learns a best-fit line
from training data. This
line is used to predict
target values for unseen
data.
Model Training Key Takeaway #3: Many
more models!

Naive Bayes, K-Nearest Neighbours,

Support Vector Machines, K-Means
Clustering, Decision Trees, Ensemble
Methods, Neural Networks etc….
Evaluation Principle

Checking how good the model is

performing ~ Usually different problem
types requires a different evaluation
metric
Evaluation Key Takeaway #1: Overfitting
and Underfitting
Evaluation Key Takeaway #1: Overfitting
and Underfitting (Bias vs Variance
Tradeoff)
Bias - How
accurate the
model is?

Variance - How
consistent the
model is?
Evaluation Key Takeaway #2: Classification
Evaluation - Confusion Matrix
Evaluation Key Takeaway #2: Classification
Evaluation - Accuracy

Accuracy - (TP
+ TN) / (TP +
TN + FP + FN)

Essentially how
many correct
predictions a
model made!
Evaluation Key Takeaway #3: Regression
Evaluation - Mean Square Error
Evaluation Key Takeaway #3: Regression
Evaluation - Mean Absolute Error
Exercise

ISO 27001 2022 How To Conduct An ISMS Gap Analysis 1684315308
100% (11)
ISO 27001 2022 How To Conduct An ISMS Gap Analysis 1684315308
23 pages
Unit III - I
No ratings yet
Unit III - I
15 pages
Final ML
No ratings yet
Final ML
2 pages
ML Lectures Summary 2
No ratings yet
ML Lectures Summary 2
52 pages
Machine Learning - course
No ratings yet
Machine Learning - course
6 pages
ML Notes
No ratings yet
ML Notes
79 pages
Python Learning
No ratings yet
Python Learning
21 pages
Lecture Notes 1 2 Intro Python
No ratings yet
Lecture Notes 1 2 Intro Python
13 pages
Machine Learning Lecture1 - 26-27 Aug
No ratings yet
Machine Learning Lecture1 - 26-27 Aug
30 pages
ML-chap-2
No ratings yet
ML-chap-2
60 pages
Week 4 - Intro to ML
No ratings yet
Week 4 - Intro to ML
37 pages
Machine Learning Basics
No ratings yet
Machine Learning Basics
32 pages
Module_-1
No ratings yet
Module_-1
9 pages
Data Science
No ratings yet
Data Science
38 pages
Research Trends in Machine Learning: Muhammad Kashif Hanif
No ratings yet
Research Trends in Machine Learning: Muhammad Kashif Hanif
80 pages
ML 1 2 3
No ratings yet
ML 1 2 3
54 pages
04 Machine Learning Overview
No ratings yet
04 Machine Learning Overview
109 pages
04 Machine Learning Overview
No ratings yet
04 Machine Learning Overview
109 pages
Air quality prediction using machine learning
No ratings yet
Air quality prediction using machine learning
29 pages
ML Final Print Upload
No ratings yet
ML Final Print Upload
10 pages
2021 Machine Learning Intro
No ratings yet
2021 Machine Learning Intro
43 pages
Chapter 02 Overview - 4
No ratings yet
Chapter 02 Overview - 4
43 pages
ML_notion_1
No ratings yet
ML_notion_1
18 pages
Unit-1 ML
No ratings yet
Unit-1 ML
19 pages
OR forecasting tool
No ratings yet
OR forecasting tool
39 pages
Machine - Learning - Unit - 1
No ratings yet
Machine - Learning - Unit - 1
70 pages
module3_DS_ppt
No ratings yet
module3_DS_ppt
68 pages
Lec2 Intro to ML
No ratings yet
Lec2 Intro to ML
35 pages
Workflow of A Machine Learning Project
No ratings yet
Workflow of A Machine Learning Project
12 pages
Module2 ch2
No ratings yet
Module2 ch2
36 pages
dbms-10 marks
No ratings yet
dbms-10 marks
32 pages
ChatGPT - Machine Learning Overview
No ratings yet
ChatGPT - Machine Learning Overview
34 pages
Introduction to Machine Learning
No ratings yet
Introduction to Machine Learning
15 pages
Machine Learning & Data Mining
No ratings yet
Machine Learning & Data Mining
4 pages
Supervised Machine Learning
No ratings yet
Supervised Machine Learning
25 pages
Introduction To ML
No ratings yet
Introduction To ML
55 pages
Types of Machine Learning Algorithms
No ratings yet
Types of Machine Learning Algorithms
14 pages
Unit 1 Machine Learning - PDF Lands
No ratings yet
Unit 1 Machine Learning - PDF Lands
5 pages
What Is Machine Learning
No ratings yet
What Is Machine Learning
4 pages
ML 01
No ratings yet
ML 01
24 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
24 pages
Machine Learning Part: Domain Overview
No ratings yet
Machine Learning Part: Domain Overview
20 pages
EN3150 Pattern Recognition - L02
No ratings yet
EN3150 Pattern Recognition - L02
51 pages
APS1070 Lecture (3) Slides
No ratings yet
APS1070 Lecture (3) Slides
70 pages
Machine Learning
No ratings yet
Machine Learning
51 pages
ML Pipe
No ratings yet
ML Pipe
25 pages
Classification
No ratings yet
Classification
22 pages
ML_DA
No ratings yet
ML_DA
55 pages
Model Evaluation
No ratings yet
Model Evaluation
39 pages
Machine Learning with Python for Everyone (Addison Wesley Data & Analytics Series) 1st Edition, (Ebook PDF) instant download
100% (1)
Machine Learning with Python for Everyone (Addison Wesley Data & Analytics Series) 1st Edition, (Ebook PDF) instant download
38 pages
Data Science Checklist
No ratings yet
Data Science Checklist
22 pages
Lecture 2 - Supervised Learning
No ratings yet
Lecture 2 - Supervised Learning
6 pages
Lecture 17&18 - Introduction To Machine Learning
No ratings yet
Lecture 17&18 - Introduction To Machine Learning
51 pages
Aws ML PDF
No ratings yet
Aws ML PDF
74 pages
Machine Learning
No ratings yet
Machine Learning
54 pages
Machine Learning – I[1]
No ratings yet
Machine Learning – I[1]
126 pages
Chapter 01 Introduction To Machine Learning
No ratings yet
Chapter 01 Introduction To Machine Learning
59 pages
Chapter 4- Machine Learning
No ratings yet
Chapter 4- Machine Learning
81 pages
UNIT 1
No ratings yet
UNIT 1
28 pages
What Is Machine Learning
No ratings yet
What Is Machine Learning
13 pages
Machine Learning with Python: Foundations and Applications: ML, #1
From Everand
Machine Learning with Python: Foundations and Applications: ML, #1
Mohammed Nurudeen
No ratings yet
Architectural Variation, Building Height, and The Restorative Quality of Urban Residential Streetscapes
No ratings yet
Architectural Variation, Building Height, and The Restorative Quality of Urban Residential Streetscapes
11 pages
An Investigation of The Efficacy and Effectiveness of Using Bilingualism As A Medium of Instruction in Different Educational Levels
No ratings yet
An Investigation of The Efficacy and Effectiveness of Using Bilingualism As A Medium of Instruction in Different Educational Levels
25 pages
Dissertation Proposal Political Science
100% (2)
Dissertation Proposal Political Science
4 pages
Introduction To Research
No ratings yet
Introduction To Research
23 pages
Yusuf Hairul Imam (E1d112134)
No ratings yet
Yusuf Hairul Imam (E1d112134)
17 pages
RM For Computer Science
No ratings yet
RM For Computer Science
2 pages
FinQuiz - Curriculum Note, @InsightSquad Study Session 3, Reading 8
No ratings yet
FinQuiz - Curriculum Note, @InsightSquad Study Session 3, Reading 8
11 pages
References: Goddard, A. (1998) - The Language of Advertising. London and New York: Routledge
No ratings yet
References: Goddard, A. (1998) - The Language of Advertising. London and New York: Routledge
3 pages
Understanding Existing Buildings Five Studies To Complete Before Design Work Starts
No ratings yet
Understanding Existing Buildings Five Studies To Complete Before Design Work Starts
4 pages
English Language Teaching in Mechanical Engineering
No ratings yet
English Language Teaching in Mechanical Engineering
13 pages
How To Write Chapter 3 Proposal (Presentation)
No ratings yet
How To Write Chapter 3 Proposal (Presentation)
10 pages
Mba Shoyemi A 2014 PDF
No ratings yet
Mba Shoyemi A 2014 PDF
95 pages
Elliot Richard - Media Amplification of A Brand Crisis
No ratings yet
Elliot Richard - Media Amplification of A Brand Crisis
18 pages
Essential Statistics 4th Edition D.G. Rees (Author) - The latest ebook version is now available for instant access
100% (1)
Essential Statistics 4th Edition D.G. Rees (Author) - The latest ebook version is now available for instant access
60 pages
The Problem and Its Scope Background of The Study
No ratings yet
The Problem and Its Scope Background of The Study
12 pages
Standard 3: Plan For and Implement Effective Teaching and Learning
No ratings yet
Standard 3: Plan For and Implement Effective Teaching and Learning
3 pages
Survey Report Writing Layout
No ratings yet
Survey Report Writing Layout
5 pages
Business Research Methods: William G. Zikmund
No ratings yet
Business Research Methods: William G. Zikmund
18 pages
An Evaluation of The Textbook English 6
No ratings yet
An Evaluation of The Textbook English 6
304 pages
Alvarez-Conrad, Zoellner, Foa (2001) Linguistic Predictors of Trauma Pathology
No ratings yet
Alvarez-Conrad, Zoellner, Foa (2001) Linguistic Predictors of Trauma Pathology
12 pages
Case 1
No ratings yet
Case 1
8 pages
Community Organizing and Social Mobilization
100% (1)
Community Organizing and Social Mobilization
26 pages
Executive Summary Final Report Lakshadweep - 2017 18 - Final Report - 13march2019
No ratings yet
Executive Summary Final Report Lakshadweep - 2017 18 - Final Report - 13march2019
146 pages
Capitulo 1 Ista Reglas Internacionales Semillas
No ratings yet
Capitulo 1 Ista Reglas Internacionales Semillas
22 pages
Scope and Limitations Thesis Writing
100% (3)
Scope and Limitations Thesis Writing
7 pages
Networking On The Network
100% (5)
Networking On The Network
73 pages
Hero Honda
No ratings yet
Hero Honda
7 pages
NORM Monitor-Is - Tracerco
No ratings yet
NORM Monitor-Is - Tracerco
3 pages
Acknowledgement: Survey Camp Report 2015
No ratings yet
Acknowledgement: Survey Camp Report 2015
5 pages