ML Lecture 2 Supervised Learning Setup
Faizad Ullah
Traditional Computer Science
Tasks like:
Play an audio/video file
Display a text file on screen
Perform a mathematical operation on two numbers
Sort an array of numbers using Insertion Sort
Search for a string in a text file
…
Data + Program → Output
Problems that Traditional CS Can’t Handle
Data + Output → Program?
Machine Learning
Regression
Classification
Traditional CS: Data + Program → Output
Machine Learning: Data + Output → Program
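The two paradigms can be sketched in code; a toy illustration (the function and the data are made up for this sketch, not from the lecture):

```python
# Traditional CS: the programmer writes the program (rule) explicitly.
def double(x):
    return 2 * x

# Machine Learning: we are given data and outputs, and recover the
# program (here, the slope w of y = w*x) from examples, via least squares.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 4.0, 6.0, 8.0]  # outputs produced by the unknown rule

w = sum(x * y for x, y in zip(xs, ys)) / sum(x * x for x in xs)
print(w)        # learned "program": multiply by 2.0
print(w * 5.0)  # applying the learned rule to new data: 10.0
```

The learned `w` plays the role of the "program" that traditional CS would have required a human to write.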
What is Machine Learning?
Formally:
A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E. (Tom Mitchell, 1997)
Informally:
Algorithms that improve on some task with experience.
Machine Learning Pipeline
Data – Big, Big,… data!
How do we obtain these massive datasets to train our Machine Learning models?
From real interactions, e.g., call centers
Expert annotators, e.g., hired teams of annotators
Crowdsourcing
reCAPTCHA tagging
Task-Label Relationship
Labels are dictated by the task to be performed.
Example: Speech Technologies
What was said? → Speech Recognition
Challenges of ML – Fairness
AI tends to reflect the biases of society:
Human taggers who mark a recording as misinformation based on accent or gender
Court decisions in countries where a rich person's acquittal is more likely
Automated standardized testing in the US could yield unfavorable results for certain demographic groups
AI plays a decisive role in hiring decisions, with up to 72% of resumes in the US never being viewed by a human (automation bias)
Decisions on immigration, bank loans, credit history checks, criminal profiling
ML in Low-resource settings
Problems where large datasets and tools are not available
Natural Language Processing and Speech
Pakistan has 71 languages
We barely have speech recognition capabilities for Urdu!
Types of Learning
Supervised
Unsupervised
Supervised Learning
What does a classifier see?
• Features
Day vs. Night Classifier: the slide exercise asks for five features each that distinguish day images from night images.
Unsupervised Learning
Supervised Learning Setup
Feature Space: Tabular Data
Features/Dimensions → Label/Class/Category
(But we said a record should be 1D!)
Feature Space: Image Data
A color image is a 3D array (Width × Height × Channels).
A color image has 3 channels, while a grayscale image has 1 channel.
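To use an image as a single record in the tabular feature space above, the 3D array is typically flattened into a 1D feature vector; a minimal sketch (the 4×4 image and its random values are hypothetical):

```python
import numpy as np

# Hypothetical 4x4 RGB image: a 3D array of shape (Width, Height, Channels).
rng = np.random.default_rng(0)
image = rng.integers(0, 256, size=(4, 4, 3), dtype=np.uint8)

# Flatten into a 1D feature vector of length W * H * C = 4 * 4 * 3 = 48.
x = image.reshape(-1)
print(image.shape)  # (4, 4, 3)
print(x.shape)      # (48,)
```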
Feature Space: Text Data
Suppose you are given labeled textual data in an Excel sheet:

Document#   Text                             Class
Training:
1           The Best movie best              Pos
2           The Best best ever               Pos
3           The Best film                    Pos
4           The Worst cast ever              Neg
Testing:
5           The Best best best worst ever    ?
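One common way to turn such text into a numeric feature space is a bag-of-words representation; a minimal sketch over the toy corpus in the table above (this particular featurization is an illustration, not the lecture's prescribed method):

```python
from collections import Counter

# The four training documents and labels from the table above.
docs = [
    "The Best movie best",
    "The Best best ever",
    "The Best film",
    "The Worst cast ever",
]
labels = ["Pos", "Pos", "Pos", "Neg"]

# Build the vocabulary from the training documents (lowercased tokens).
vocab = sorted({w.lower() for d in docs for w in d.split()})

def featurize(text):
    # Count how often each vocabulary word appears in the text.
    counts = Counter(w.lower() for w in text.split())
    return [counts[w] for w in vocab]

X = [featurize(d) for d in docs]
print(vocab)
print(featurize("The Best best best worst ever"))  # test document 5
```

Each document becomes a fixed-length count vector, so the text data now lives in the same kind of tabular feature space as before.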
Learn the mapping between an email and its label using past labelled data
Can be retrained on new emails
Not easy to reverse-engineer and circumvent in all cases
Easier to plug the leaks
Formalizing the Setup
D = {(x1, y1), (x2, y2), …, (xn, yn)} ⊆ X × Y
Each xi is a feature vector; any categorical attribute can be converted to a numerical representation.
Where:
D is the dataset
xi is the input vector of the i-th sample/record/instance
X is the d-dimensional feature space (ℝ^d)
Y is the label space
The data points are drawn from an unknown distribution P:
(xi, yi) ~ P(x, y)
If we don't know the distribution, let's approximate it using the samples we gathered!
We want to learn a function h ∈ H such that, for a new instance (x, y) ~ P,
h(x) = y with high probability, or at least h(x) ≈ y.
The new x also has to come from the same distribution as the xi. In plain words: don't train on dogs and ask for predictions on cats.
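In code, the dataset D is typically stored as an n × d feature matrix plus a length-n label vector; a minimal sketch (the numbers are made up for illustration):

```python
import numpy as np

# Toy dataset D = {(x_i, y_i)} with n = 3 instances and d = 2 features.
X = np.array([[1.0, 2.0],
              [3.0, 4.0],
              [5.0, 6.0]])   # each row is one feature vector x_i in R^d
y = np.array([0, 1, 1])      # one label y_i per instance

n, d = X.shape
print(n, d)  # 3 instances in a 2-dimensional feature space
```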
Training and Testing: Formally
Training: the learning algorithm is given training data x1, x2, …, xn with labels y1, y2, …, yn and produces a hypothesis h.
Testing: on new data x ~ P, the learned h is applied like a traditional program to produce h(x).
h(x) = y (ideal)
h(x) ≈ y (plausible)
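The training/testing separation can be sketched as a simple random split; the 10-example dataset and the 80/20 ratio below are assumptions for illustration, not from the slides:

```python
import random

# Hypothetical dataset of (x, y) pairs.
data = [(i, i % 2) for i in range(10)]

# Shuffle, then hold out 20% of the data for testing, so the learned h
# is evaluated on instances it never saw during training.
random.seed(0)
random.shuffle(data)
split = int(0.8 * len(data))
train, test = data[:split], data[split:]

print(len(train), len(test))  # 8 2
```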
Label Space
Binary (binary classification):
Sentiment: positive / negative
Email: spam / ham
Online transaction fraud: yes / no
Tumor: malignant / benign
y ∈ {0, 1} or y ∈ {−1, 1}
Real-valued (regression):
Temperature, height, age, length, weight, duration, price, …
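The two binary label conventions, y ∈ {0, 1} and y ∈ {−1, 1}, encode the same information; a quick sketch of the affine map y' = 2y − 1 between them:

```python
# Convert {0, 1} labels to {-1, +1} via y' = 2*y - 1, and back.
ys_01 = [0, 1, 1, 0]
ys_pm = [2 * y - 1 for y in ys_01]
print(ys_pm)  # [-1, 1, 1, -1]

# Inverse map: y = (y' + 1) // 2 recovers the original labels.
assert [(yp + 1) // 2 for yp in ys_pm] == ys_01
```

Which convention is used is a matter of algorithmic convenience (e.g., some loss functions are written more naturally with {−1, 1}).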
Hypothesis Space
The hypothesis h is chosen from a hypothesis space H:
h ∈ H, where H ∈ {H_D, H_R, H_SVM, H_DL, …}
So, how do we choose our ℎ?
Randomly?
Exhaustively?
How do we evaluate 𝒉?
How to choose ℎ?
Randomly
May not work well
Like using a random program to solve your sorting problem!
May work if 𝐻 is constrained enough
Exhaustively
Would be very slow!
The space 𝐻 is usually very large (if not infinite)
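The point that random choice "may work if H is constrained enough" can be sketched with a tiny hypothesis space of 1D threshold classifiers (the data below is a toy set, not from the lecture):

```python
import random

# Hypothesis space H: threshold classifiers h_t(x) = 1 if x >= t else 0,
# for t in [0, 1]. A very constrained H, so random sampling can work.
X = [0.1, 0.4, 0.35, 0.8, 0.9, 0.7]
y = [0, 0, 0, 1, 1, 1]

def accuracy(t):
    # Fraction of training points the threshold classifier gets right.
    preds = [1 if x >= t else 0 for x in X]
    return sum(p == yi for p, yi in zip(preds, y)) / len(y)

# "Choose h randomly": sample 100 thresholds and keep the best one.
random.seed(0)
best_t = max((random.uniform(0, 1) for _ in range(100)), key=accuracy)
print(best_t, accuracy(best_t))
```

With a large or infinite H (e.g., all deep networks), neither random nor exhaustive search like this is feasible, which motivates the principled selection methods covered later.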
Book Reading
Murphy – Chapter 1
References
Murphy, Chapter 1
Alpaydin, Chapter 1
TM (Tom Mitchell), Chapter 1
Lectures of Andrew Ng, Dr. Ali Raza, and Kilian Weinberger's "Machine Learning for Intelligent Systems (CS4780/CS5780)".