
Evaluating Machine Learning Methods

These slides are adapted from materials by David Page: http://pages.cs.wisc.edu/~dpage/cs760/


Test Sets

• How can we get an unbiased estimate of the accuracy of a learned model?


• When learning a model, you should pretend that you don’t have the
test data yet. (In some applications, it is reasonable to assume that
you have access to the feature vector x but not the y part of each test
instance).
• If the test set labels influence the learned model in any way, accuracy
estimates will be biased.
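As a minimal sketch of this discipline (not part of the original slides; the data set and model are placeholders), scikit-learn's train_test_split can set aside a test set whose labels are consulted only once, after learning is finished:

```python
# Sketch: hold out a test set and score on it exactly once, at the end.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, random_state=0)  # placeholder data

# y_test is set aside and never influences model fitting.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0)

model = DecisionTreeClassifier(random_state=0).fit(X_train, y_train)
print("unbiased accuracy estimate:", model.score(X_test, y_test))
```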
Learning Curves

• How does the accuracy of a learning method change as a function of the training-set size?
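One way to draw such a curve (a sketch, not taken from the slides) is scikit-learn's learning_curve helper, which re-trains on increasingly large subsets of the training data and records the accuracy at each size:

```python
# Sketch: accuracy as a function of training-set size.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import learning_curve
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=1000, random_state=0)  # placeholder data

sizes, train_scores, test_scores = learning_curve(
    DecisionTreeClassifier(random_state=0), X, y,
    train_sizes=np.linspace(0.1, 1.0, 5), cv=5, scoring="accuracy")

for n, acc in zip(sizes, test_scores.mean(axis=1)):
    print(f"{n} training instances -> mean held-out accuracy {acc:.3f}")
```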
Validation (Tuning) Sets
• Suppose we want unbiased estimates of accuracy during the learning process. Partition the training data into separate training and validation sets.
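A minimal sketch of that partitioning (details assumed, not from the slides): split the training data a second time, tune on the validation part, and leave the test set untouched.

```python
# Sketch: carve a validation (tuning) set out of the training data.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

X, y = make_classification(n_samples=600, random_state=0)  # placeholder data
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0)
# Second split: the validation set is used for tuning, never the test set.
X_tr, X_val, y_tr, y_val = train_test_split(
    X_train, y_train, test_size=0.25, random_state=0)

best_k, best_acc = None, -1.0
for k in (1, 3, 5, 7):
    acc = KNeighborsClassifier(n_neighbors=k).fit(X_tr, y_tr).score(X_val, y_val)
    if acc > best_acc:
        best_k, best_acc = k, acc
print("k chosen on the validation set:", best_k)
```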
Limitations of using a single training/test partition
Random Resampling
• A single partition gives an estimate that depends on which instances happened to land in the test set; we can address this by repeatedly and randomly re-partitioning the available data into training and test sets.
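A sketch of that idea (implementation details assumed): scikit-learn's ShuffleSplit draws several random train/test partitions, and the resulting accuracies are averaged.

```python
# Sketch: average accuracy over repeated random train/test partitions.
from sklearn.datasets import make_classification
from sklearn.model_selection import ShuffleSplit, cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, random_state=0)  # placeholder data

splitter = ShuffleSplit(n_splits=10, test_size=0.25, random_state=0)
scores = cross_val_score(DecisionTreeClassifier(random_state=0), X, y, cv=splitter)
print("mean accuracy over 10 random partitions:", scores.mean())
```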
Stratified Sampling
• When randomly selecting training or validation sets, we may want to
ensure that class proportions are maintained in each selected set.
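For example (a sketch with assumed details), passing stratify=y to scikit-learn's train_test_split keeps the class proportions of an imbalanced data set roughly equal in the training and test sets:

```python
# Sketch: preserve class proportions when splitting an imbalanced data set.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split

# Placeholder data: roughly 90% negatives, 10% positives.
X, y = make_classification(n_samples=1000, weights=[0.9, 0.1], random_state=0)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0)

print("positive fraction overall:    ", y.mean())
print("positive fraction in test set:", y_test.mean())
```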
Cross Validation
Cross Validation Example
• Suppose we have 100 instances, and we want to estimate accuracy with
cross validation.
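A sketch of that example (data and model are placeholders): with 10-fold cross-validation, each of the 100 instances is used exactly once for testing and nine times for training, and the fold accuracies are averaged.

```python
# Sketch: 10-fold cross-validation on 100 instances.
from sklearn.datasets import make_classification
from sklearn.model_selection import StratifiedKFold, cross_val_score
from sklearn.neighbors import KNeighborsClassifier

X, y = make_classification(n_samples=100, random_state=0)  # placeholder data

cv = StratifiedKFold(n_splits=10, shuffle=True, random_state=0)  # 10 folds of 10
scores = cross_val_score(KNeighborsClassifier(n_neighbors=3), X, y, cv=cv)
print("per-fold accuracies:", scores)
print("cross-validated accuracy estimate:", scores.mean())
```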
Internal Cross Validation
• Instead of a single validation set, we can use cross-validation within a
training set to select a model.
Example: Using Internal Cross Validation to Select k in k-NN
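A sketch of what that example could look like in code (assumed details, not the slides' exact setup): an inner cross-validation over the training data chooses k, and the held-out test set is scored only once with the chosen model.

```python
# Sketch: internal (inner) cross-validation to select k for k-NN.
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.neighbors import KNeighborsClassifier

X, y = make_classification(n_samples=400, random_state=0)  # placeholder data
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0)

# 5-fold cross-validation *within* the training set selects k.
search = GridSearchCV(KNeighborsClassifier(),
                      param_grid={"n_neighbors": [1, 3, 5, 7, 9]}, cv=5)
search.fit(X_train, y_train)

print("k selected by internal cross-validation:", search.best_params_["n_neighbors"])
print("accuracy on the held-out test set:", search.score(X_test, y_test))
```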
Confusion Matrices
• How can we understand what types of mistakes a learned model makes?

Figure from vision.jhu.edu


Confusion Matrix for 2-class Problems
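A minimal sketch (labels and predictions invented for illustration) of reading the four cells of a 2-class confusion matrix:

```python
# Sketch: confusion matrix for a 2-class problem.
from sklearn.metrics import confusion_matrix

y_true = [1, 1, 1, 0, 0, 0, 0, 0]   # placeholder labels (1 = positive)
y_pred = [1, 1, 0, 0, 0, 0, 1, 0]   # placeholder predictions

# For binary labels, ravel() yields TN, FP, FN, TP in that order.
tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
print(f"TP={tp}  FP={fp}  FN={fn}  TN={tn}")
```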
Is Accuracy an adequate measure of predictive performance?
Other Accuracy Metrics
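As a sketch (the counts are placeholders and the formulas are the standard definitions, not quoted from the slides), the metrics most often reported alongside accuracy come straight from the confusion-matrix counts:

```python
# Sketch: metrics beyond plain accuracy, from confusion-matrix counts.
tp, fp, fn, tn = 2, 1, 1, 4   # placeholder counts

accuracy  = (tp + tn) / (tp + fp + fn + tn)
tp_rate   = tp / (tp + fn)    # also called recall or sensitivity
fp_rate   = fp / (fp + tn)
precision = tp / (tp + fp)
print(accuracy, tp_rate, fp_rate, precision)
```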
ROC Curves
• A Receiver Operating Characteristic (ROC) curve plots the TP-rate vs. the
FP-rate as a threshold on the confidence of an instance being positive is
varied.
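A sketch of tracing out such a curve (scores and labels invented for illustration): give each instance a confidence of being positive and let scikit-learn's roc_curve sweep the threshold.

```python
# Sketch: ROC curve obtained by sweeping a confidence threshold.
from sklearn.metrics import roc_curve, auc

y_true   = [1, 1, 1, 0, 1, 0, 0, 0, 0, 0]   # placeholder labels
y_scores = [0.95, 0.9, 0.8, 0.7, 0.6, 0.5,
            0.4, 0.3, 0.2, 0.1]              # placeholder confidences

fpr, tpr, thresholds = roc_curve(y_true, y_scores)
for f, t, th in zip(fpr, tpr, thresholds):
    print(f"threshold {th:.2f}: TP-rate {t:.2f}, FP-rate {f:.2f}")
print("area under the ROC curve:", auc(fpr, tpr))
```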
ROC Curve Example
ROC Curves and Misclassification Costs
Algorithm for Creating an ROC Curve
Plotting an ROC Curve
• Does a low false-positive rate indicate that most positive predictions (i.e.
predictions with confidence > some threshold) are correct?
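The intended answer is presumably no when positives are rare; a small hypothetical calculation (all numbers invented for illustration) shows how a low FP-rate can coexist with mostly-wrong positive predictions:

```python
# Sketch: low FP-rate, yet low precision, when the positive class is rare.
n_pos, n_neg = 10, 1000        # hypothetical class counts
tp_rate, fp_rate = 0.8, 0.02   # hypothetical operating point

tp = tp_rate * n_pos           # 8 true positives
fp = fp_rate * n_neg           # 20 false positives
precision = tp / (tp + fp)
print(f"precision = {precision:.2f}")   # about 0.29 despite a 2% FP-rate
```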
Other Accuracy Metrics
Precision/Recall Curves
• A precision/recall curve plots the precision vs. recall (TP-rate) as a
threshold on the confidence of an instance being positive is varied.
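A sketch mirroring the ROC example above (same invented scores): scikit-learn's precision_recall_curve sweeps the confidence threshold but reports precision and recall instead.

```python
# Sketch: precision/recall curve from confidence scores.
from sklearn.metrics import precision_recall_curve

y_true   = [1, 1, 1, 0, 1, 0, 0, 0, 0, 0]   # placeholder labels
y_scores = [0.95, 0.9, 0.8, 0.7, 0.6, 0.5,
            0.4, 0.3, 0.2, 0.1]              # placeholder confidences

precision, recall, thresholds = precision_recall_curve(y_true, y_scores)
for p, r in zip(precision, recall):
    print(f"recall {r:.2f} -> precision {p:.2f}")
```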
How Do We Get One ROC/PR Curve When We Do Cross Validation?
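One common answer (an assumption here, not necessarily the approach the slides adopt) is to pool the out-of-fold confidence scores from all folds and draw a single curve from the pooled predictions:

```python
# Sketch: pool out-of-fold confidence scores, then draw one ROC curve.
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_curve, auc
from sklearn.model_selection import cross_val_predict

X, y = make_classification(n_samples=300, random_state=0)  # placeholder data

# Each instance is scored by the model from the fold in which it was held out.
scores = cross_val_predict(LogisticRegression(max_iter=1000), X, y,
                           cv=10, method="predict_proba")[:, 1]
fpr, tpr, _ = roc_curve(y, scores)
print("pooled cross-validated AUC:", auc(fpr, tpr))
```

An alternative is to compute one curve per fold and average the curves; the two choices can give somewhat different pictures.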
Comments on ROC and PR Curves
To Avoid Cross-Validation Pitfalls, Ask:
• Is my held-aside test data really representative of going out to collect new data?
• Did I repeat my entire data processing procedure on every fold of
cross-validation, using only the training data for that fold?
• On each fold of cross-validation, did I ever access in any way the label of a test
case?
• Any preprocessing done over the entire data set (feature selection, parameter tuning, threshold selection) must not use the labels (see the pipeline sketch after this list)

• Have I modified my algorithm so many times, or tried so many approaches, on this same data set that I (the human) am overfitting it?
• Have I continually modified my preprocessing or learning algorithm until I got
some improvement on this data set?
• If so, I really need to get some additional data now to at least test on.
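One way to honor the "repeat the whole procedure on every fold" rule (a sketch with assumed preprocessing steps, not something prescribed by the slides) is to wrap preprocessing and learning in a single pipeline, so scaling and feature selection are re-fit from scratch inside each training fold:

```python
# Sketch: keep preprocessing inside each cross-validation fold to avoid leakage.
from sklearn.datasets import make_classification
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

X, y = make_classification(n_samples=200, n_features=50, random_state=0)  # placeholder

pipe = Pipeline([
    ("scale", StandardScaler()),               # fit on the training fold only
    ("select", SelectKBest(f_classif, k=10)),  # feature selection re-done per fold
    ("clf", LogisticRegression(max_iter=1000)),
])
# cross_val_score re-fits the whole pipeline on each training fold, so the
# test fold's labels never influence preprocessing or model selection.
print(cross_val_score(pipe, X, y, cv=10).mean())
```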
