
Class12-PatternClassification_PerformanceMetric_ReferenceTemplate

The document discusses performance evaluation metrics for classification tasks, focusing on confusion matrices for both binary and multiclass classifications. It explains key terms such as true positives, true negatives, false positives, and false negatives, along with precision, recall, and F-measure. Additionally, it covers supervised machine learning methods like K-Nearest Neighbors and reference templates for classification, including the use of Mahalanobis distance.


Performance Evaluation for Classification
Confusion Matrix – 2-class

• True Positive: Number of test samples correctly predicted as positive class (Class1).
• True Negative: Number of test samples correctly predicted as negative class (Class2).
• False Positive: Number of test samples predicted as positive class (Class1) but actually belonging to negative class (Class2).
• False Negative: Number of test samples predicted as negative class (Class2) but actually belonging to positive class (Class1).
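The four counts can be computed directly from paired label lists; a minimal Python sketch, where the label lists are made up for illustration and 1 stands for the positive class (Class1), 0 for the negative class (Class2):

```python
# Made-up true and predicted labels for eight test samples.
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 0, 1, 0, 1, 0, 1, 0]

# Count each cell of the 2x2 confusion matrix.
tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)  # true positives
tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)  # true negatives
fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)  # false positives
fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)  # false negatives

print(tp, tn, fp, fn)  # 3 3 1 1
```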
Confusion Matrix – 2-class

[Figure: 2×2 confusion matrix; the row sums give the total test samples in class1 and in class2.]
Confusion Matrix – 2-class

[Figure: 2×2 confusion matrix; the column sum gives the total test samples predicted as class1.]

• Biometric authentication system to access an account
  – False Positives (wrongly detecting an impostor as genuine) should be low
  – Some False Negatives (not detecting a genuine person as genuine) are acceptable
  – Precision should be high
Confusion Matrix – 2-class

[Figure: 2×2 confusion matrix; the column sum gives the total test samples predicted as class2.]

• Medical image analysis of microscopic images to detect the presence of cancer
  – False Negatives (detecting a cancerous image as non-cancerous) should be low
  – Some False Positives (detecting a non-cancerous image as cancerous) are acceptable
  – Recall should be high
Accuracy – 2-class

Accuracy = (TP + TN) / (TP + TN + FP + FN)
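Accuracy is a one-line function of the four counts; a small sketch with made-up counts:

```python
def accuracy(tp, tn, fp, fn):
    """Fraction of all test samples that were predicted correctly."""
    return (tp + tn) / (tp + tn + fp + fn)

print(accuracy(3, 3, 1, 1))  # 0.75
```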
Confusion Matrix - Multiclass
Illustration: number of classes is 3; the same idea extends to any number of classes.

• C11: Number of test examples predicted as class1 and actually belonging to class1
• C12: Number of test examples predicted as class1, but actually belonging to class2
• C13: Number of test examples predicted as class1, but actually belonging to class3
• C21, C22, C23, C31, C32 and C33 are interpreted similarly
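A minimal sketch of building such a matrix, following the Cij convention above (row = predicted class, column = actual class); the label lists are made up, with 0, 1, 2 standing for class1, class2, class3:

```python
def confusion_matrix(y_true, y_pred, n_classes):
    """Build C where C[i][j] counts samples predicted as class i
    but actually belonging to class j."""
    C = [[0] * n_classes for _ in range(n_classes)]
    for t, p in zip(y_true, y_pred):
        C[p][t] += 1  # row = predicted, column = actual
    return C

# Made-up labels for eight test samples over three classes.
y_true = [0, 0, 1, 1, 2, 2, 0, 2]
y_pred = [0, 1, 1, 1, 2, 0, 0, 2]
C = confusion_matrix(y_true, y_pred, 3)
print(C)  # [[2, 0, 1], [1, 2, 0], [0, 0, 2]]
```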
Confusion Matrix - Multiclass
With reference to Class1:

• True Positive: Number of test samples correctly predicted as positive class (class1) (C11).
• True Negative: Number of test samples correctly predicted as negative class (class2 and class3) (C22+C33).
• False Positive: Number of test samples predicted as positive class (class1) but actually belonging to negative class (class2 and class3) (C12+C13).
• False Negative: Number of test samples predicted as negative class (class2 and class3) but actually belonging to positive class (class1) (C21+C31).
Confusion Matrix - Multiclass
With reference to Class2:

• True Positive: Number of test samples correctly predicted as positive class (class2) (C22).
• True Negative: Number of test samples correctly predicted as negative class (class1 and class3) (C11+C33).
• False Positive: Number of test samples predicted as positive class (class2) but actually belonging to negative class (class1 and class3) (C21+C23).
• False Negative: Number of test samples predicted as negative class (class1 and class3) but actually belonging to positive class (class2) (C12+C32).
Confusion Matrix - Multiclass
With reference to Class3:

• True Positive: Number of test samples correctly predicted as positive class (class3) (C33).
• True Negative: Number of test samples correctly predicted as negative class (class1 and class2) (C11+C22).
• False Positive: Number of test samples predicted as positive class (class3) but actually belonging to negative class (class1 and class2) (C31+C32).
• False Negative: Number of test samples predicted as negative class (class1 and class2) but actually belonging to positive class (class3) (C13+C23).
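The per-class counts above can be read off any confusion matrix mechanically. The sketch below follows the slides' convention, where TN for a class counts only the diagonal entries of the other classes (e.g. C22+C33 for class1); note that a strict one-vs-rest binarisation would also count confusions among the negative classes (e.g. C23, C32) as true negatives. The example matrix is made up:

```python
def per_class_counts(C, k):
    """TP, FP, FN, TN for class k, given C[i][j] = predicted i, actual j.
    TN follows the slide convention: other classes correctly predicted."""
    n = len(C)
    tp = C[k][k]
    fp = sum(C[k][j] for j in range(n) if j != k)  # predicted k, actually another class
    fn = sum(C[i][k] for i in range(n) if i != k)  # actually k, predicted another class
    tn = sum(C[i][i] for i in range(n) if i != k)  # other classes on the diagonal
    return tp, fp, fn, tn

C = [[2, 0, 1],
     [1, 2, 0],
     [0, 0, 2]]
print(per_class_counts(C, 0))  # (2, 1, 1, 4)
```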
Confusion Matrix - Multiclass
Example: number of classes = 3; the same concept extends to more than 3 classes.

[Figure: example 3×3 confusion matrix; the sum of all entries is the total number of samples used for testing.]
Accuracy of Multiclass Classification
Example: number of classes = 3; the same concept extends to more than 3 classes.

Accuracy = (C11 + C22 + C33) / (sum of all Cij)
Binary (2-class) Classification:
Precision, Recall and F-measure

• Precision:
  – Number of samples correctly classified as positive class, out of all the examples classified as positive class
  – Precision = TP / (TP + FP)
  – It is also called positive predictive value
Binary (2-class) Classification:
Precision, Recall and F-measure

• Recall:
  – Number of samples correctly classified as positive class, out of all the examples belonging to positive class
  – Recall = TP / (TP + FN)
  – It is also called sensitivity or true positive rate (TPR)
Binary (2-class) Classification:
Precision, Recall and F-measure

• F-measure or F-score or F1-score:
  – Combines precision and recall
  – Recall and precision are evenly weighted
  – Harmonic mean of precision and recall
  – F1 = (2 × Precision × Recall) / (Precision + Recall)
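The three measures above can be sketched directly from the TP/FP/FN counts; the sample counts are made up:

```python
def precision(tp, fp):
    """Correct positives out of all samples predicted positive."""
    return tp / (tp + fp)

def recall(tp, fn):
    """Correct positives out of all samples actually positive."""
    return tp / (tp + fn)

def f1_score(tp, fp, fn):
    """Harmonic mean of precision and recall."""
    p, r = precision(tp, fp), recall(tp, fn)
    return 2 * p * r / (p + r)

print(precision(3, 1), recall(3, 1), f1_score(3, 1, 1))  # 0.75 0.75 0.75
```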
Supervised Machine Learning:
Pattern Classification
K-Nearest Neighbor, Reference Template Method

K-Nearest Neighbours (K-NN) Method
• Consider the class labels of the K training examples nearest to the test example
• Step 1: Compute the Euclidean distance of a test example x to every training example, x1, x2, …, xn, …, xN
• Step 2: Sort the examples in the training set in ascending order of their distance to x
• Step 3: Choose the first K examples in the sorted list
  – K is the number of neighbours for the test example
• Step 4: The test example is assigned the most common class among its K neighbours

[Figure: training examples in the x1–x2 feature space, with the K nearest neighbours of the test example highlighted.]
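The four steps above can be sketched directly; the toy 2-D training set and the choice K = 3 are made up for illustration:

```python
from collections import Counter
from math import dist

def knn_classify(x, train_X, train_y, K):
    # Step 1: Euclidean distance from x to every training example.
    distances = [(dist(x, xn), yn) for xn, yn in zip(train_X, train_y)]
    # Step 2: sort in ascending order of distance.
    distances.sort(key=lambda d: d[0])
    # Step 3: take the first K examples in the sorted list.
    neighbours = [yn for _, yn in distances[:K]]
    # Step 4: assign the most common class among the K neighbours.
    return Counter(neighbours).most_common(1)[0][0]

# Made-up 2-D training data: one cluster per class.
train_X = [(1, 1), (2, 1), (1, 2), (8, 8), (9, 8), (8, 9)]
train_y = ["A", "A", "A", "B", "B", "B"]
print(knn_classify((2, 2), train_X, train_y, 3))  # A
```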
Reference Templates Method

• Each class is represented by its reference template
  – Mean of the data points of each class as the reference template
  – Let the data of class i be {x1, x2, …, xNi}
    • Ni: Number of examples (data points) in class i
  – Mean of the data points of class i, μi, is given as:
    μi = (1/Ni) Σn xn,  n = 1, …, Ni
Reference Templates Method
• Each class is represented by its reference template
  – Mean of the data points of each class as the reference template
• For a test example, compute the Euclidean distance to the reference template of each class, ED(x, μi)
  – μi: Mean vector of class i
• The class of the nearest reference template (mean) is assigned to the test pattern

Class label for x = argmin ED(x, μi),  i = 1, 2, …, M;  M = Number of classes
                       i

[Figure: class means and the test example in the x1–x2 feature space.]
Reference Templates Method
• Each class is represented by its reference template
  – Mean of the data points of each class as the reference template
• For a test example, compute the Euclidean distance to the reference template of each class, ED(x, μi)
  – μi: Mean vector of class i
• The class of the nearest reference template (mean) is assigned to the test pattern
• Learning: Estimating first-order statistics (mean) from the data of each class
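A minimal sketch of both phases: learning the per-class means, then assigning a test example to the class of the nearest mean. The toy data is made up, loosely echoing the height–weight setting of the following slides:

```python
from math import dist

def class_means(train_X, train_y):
    """Learning: estimate the mean vector (reference template) of each class."""
    sums, counts = {}, {}
    for xn, yn in zip(train_X, train_y):
        s = sums.setdefault(yn, [0.0] * len(xn))
        for j, v in enumerate(xn):
            s[j] += v
        counts[yn] = counts.get(yn, 0) + 1
    return {c: tuple(v / counts[c] for v in s) for c, s in sums.items()}

def classify(x, means):
    """Test phase: argmin over classes of ED(x, mu_i)."""
    return min(means, key=lambda c: dist(x, means[c]))

# Made-up (height, weight) training data: class 0 = Child, class 1 = Adult.
train_X = [(100, 30), (105, 32), (160, 65), (170, 70)]
train_y = [0, 0, 1, 1]
means = class_means(train_X, train_y)
print(classify((165, 68), means))  # 1
```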
Illustration of Reference Templates Method:
Adult(1)-Child(0) Classification

• Training Phase:
  – Compute the sample mean vector from the training data of class 0 (Child):
    μ0 = [103.60 30.66]
Illustration of Reference Templates Method:
Adult(1)-Child(0) Classification

• Training Phase:
  – Compute the sample mean vector from the training data of class 0 (Child):
    μ0 = [103.60 30.66]
  – Compute the sample mean vector from the training data of class 1 (Adult):
    μ1 = [166.00 67.12]
Illustration of Reference Templates Method:
Adult(1)-Child(0) Classification
• Test Phase - Classification:

         Height   Weight   Class
    μ0   103.60   30.66    0 (Child)
    μ1   166.00   67.12    1 (Adult)

[Figure: test example x plotted against the class means; Height in cm on the x-axis, Weight in Kg on the y-axis.]

• Compute the Euclidean distance of the test sample x to the mean vector of class 0 (Child), μ0: ED(x, μ0) = 50.50
• Compute the Euclidean distance of the test sample x to the mean vector of class 1 (Adult), μ1: ED(x, μ1) = 23.00
• Class label of x = Adult
Modified Reference Templates Method
• Each class is represented by its reference templates
  – Mean and variance (covariance) of the data points of each class as the reference template
  – Let the data of class i be {x1, x2, …, xNi}
    • Ni: Number of examples (data points) in class i
  – Mean of the data points of class i, μi, is given as:
    μi = (1/Ni) Σn xn,  n = 1, …, Ni
  – Covariance matrix of the data points of class i, Σi, is given as:
    Σi = (1/Ni) Σn (xn − μi)(xn − μi)ᵀ,  n = 1, …, Ni
    σjk: Covariance of the jth and kth attributes
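The learning step, estimating μi and Σi from 2-D data, can be sketched as follows. The data points are made up, and the 1/Ni normalisation is an assumption matching the formula above (a sample covariance may instead use 1/(Ni − 1)):

```python
def mean_and_cov_2d(points):
    """Estimate the mean vector and 2x2 covariance matrix (1/N normalisation)."""
    n = len(points)
    mx = sum(p[0] for p in points) / n
    my = sum(p[1] for p in points) / n
    sxx = sum((p[0] - mx) ** 2 for p in points) / n           # var of attribute 1
    syy = sum((p[1] - my) ** 2 for p in points) / n           # var of attribute 2
    sxy = sum((p[0] - mx) * (p[1] - my) for p in points) / n  # covariance
    return (mx, my), ((sxx, sxy), (sxy, syy))

# Made-up (height, weight) samples for one class.
child = [(100, 28), (104, 31), (107, 33)]
mu0, Sigma0 = mean_and_cov_2d(child)
print(mu0)
```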
Modified Reference Templates Method
• Each class is represented by one or more reference templates
  – Mean and variance (covariance) of the data points of each class as the reference template
• For a test example, compute the Mahalanobis distance to the reference template of each class, MD(x, μi, Σi)
  – μi & Σi: Mean vector and covariance matrix of class i
• The Mahalanobis distance is a measure of the distance between a point and a distribution

[Figure: class distributions in the x1–x2 feature space.]
Modified Reference Templates Method
• Each class is represented by one or more reference templates
  – Mean and variance (covariance) of the data points of each class as the reference template
• For a test example, compute the Mahalanobis distance to the reference template of each class, MD(x, μi, Σi)
  – μi & Σi: Mean vector and covariance matrix of class i
• The class of the nearest reference template is assigned to the test pattern

Class label for x = argmin MD(x, μi, Σi),  i = 1, 2, …, M;  M = Number of classes
                       i
Modified Reference Templates Method
• Each class is represented by one or more reference templates
  – Mean and variance (covariance) of the data points of each class as the reference template
• For a test example, compute the Mahalanobis distance to the reference template of each class, MD(x, μi, Σi)
  – μi & Σi: Mean vector and covariance matrix of class i
• The class of the nearest reference template is assigned to the test pattern
• Learning: Estimating
  – first-order statistics (mean) and
  – second-order statistics (variance and covariance) from the data of each class
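A sketch of the Mahalanobis distance for 2-D features, MD(x, μ, Σ) = sqrt((x − μ)ᵀ Σ⁻¹ (x − μ)), with the 2×2 matrix inverse written out explicitly. The test point is made up; μ and Σ echo the Child-class values of the following slides:

```python
from math import sqrt

def mahalanobis_2d(x, mu, Sigma):
    """MD(x, mu, Sigma) = sqrt((x - mu)^T Sigma^-1 (x - mu)) for 2-D features."""
    dx, dy = x[0] - mu[0], x[1] - mu[1]
    (a, b), (c, d) = Sigma
    det = a * d - b * c
    # Inverse of a 2x2 matrix: (1/det) * [[d, -b], [-c, a]];
    # expand the quadratic form (x - mu)^T Sigma^-1 (x - mu) directly.
    q = (d * dx * dx - (b + c) * dx * dy + a * dy * dy) / det
    return sqrt(q)

mu = (103.60, 30.66)
Sigma = ((109.38, 61.35), (61.35, 43.54))
print(mahalanobis_2d((120.0, 40.0), mu, Sigma))
```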
Illustration of Reference Templates Method:
Adult(1)-Child(0) Classification

• Training Phase:
  – Compute the sample mean vector from the training data of class 0 (Child):
    μ0 = [103.60 30.66]
  – Compute the sample covariance matrix from the training data of class 0 (Child):
    Σ0 = [109.38  61.35
           61.35  43.54]
Illustration of Reference Templates Method:
Adult(1)-Child(0) Classification
• Training Phase:
  – Compute the sample mean vector from the training data of class 0 (Child):
    μ0 = [103.60 30.66]
  – Compute the sample covariance matrix from the training data of class 0 (Child):
    Σ0 = [109.38  61.35
           61.35  43.54]
  – Compute the sample mean vector from the training data of class 1 (Adult):
    μ1 = [166.00 67.12]
  – Compute the sample covariance matrix from the training data of class 1 (Adult):
    Σ1 = [110.67  160.53
          160.53  255.49]
Illustration of Reference Templates Method:
Adult(1)-Child(0) Classification
• Test Phase - Classification:

  Class 0 (Child):  μ0 = [103.60 30.66],  Σ0 = [109.38  61.35
                                                 61.35  43.54]
  Class 1 (Adult):  μ1 = [166.00 67.12],  Σ1 = [110.67  160.53
                                                160.53  255.49]

[Figure: test example x plotted against the class distributions; Height in cm on the x-axis, Weight in Kg on the y-axis.]

• Compute the Mahalanobis distance of the test sample x with the mean vector and covariance matrix of class 0 (Child): MD(x, μ0, Σ0) = 4.87
• Compute the Mahalanobis distance of the test sample x with the mean vector and covariance matrix of class 1 (Adult): MD(x, μ1, Σ1) = 2.07
• Class label of x = Adult
Classification using Reference
Template Methods
• For a test example, a distance measure is computed with the reference template of each class
• The class of the reference template with the least distance is assigned to the test pattern
• When the Mahalanobis distance is used, the distance measure is effectively computed between a test example and the distribution (density) of a class
  – Distribution (density) of a class: all the training examples of the class are drawn from that distribution
  – The density here is a normal (Gaussian) density
• In other words, we are interested in estimating the probability of a class, P(Ci | x)
  – Given the test example x, what is the probability that it belongs to the ith class (Ci)?
• Solution: Bayes classifier
Text Books

1. J. Han and M. Kamber, Data Mining: Concepts and Techniques, Third Edition, Morgan Kaufmann Publishers, 2011.
2. S. Theodoridis and K. Koutroumbas, Pattern Recognition, Academic Press, 2009.
3. C. M. Bishop, Pattern Recognition and Machine Learning, Springer, 2006.
