1.4 Intro To Need of Estimation and Validation PDF

The document discusses several key points regarding model evaluation and validation in data science: 1) It explains the importance of selecting optimal hyperparameters and of evaluating model performance on unseen data to estimate the generalization error. 2) Learning curves are discussed as a tool for understanding bias and variance in models and for seeing how more data can help address overfitting. 3) Training, validation, and test sets should come from the same distribution, e.g. via stratified sampling; when data is limited, a two-way train/test split and data augmentation are suggested instead of a three-way split. K-fold cross-validation is also introduced.

Uploaded by

Dhairya Thakkar

Introduction To Need Of Estimation And Validation For Added Value Due To Data Science
Vaibhav P. Vasani

Assistant Professor
Department of Computer Engineering
K. J. Somaiya College of Engineering
Somaiya Vidyavihar University

2/17/2022 vaibhav.vasani@gmail.com 1
Model selection at different scales

• Hyperparameters are the parameters of the learning method itself,
which have to be specified a priori, i.e., before model fitting.
• In contrast, model parameters are parameters which arise
as a result of the fit.

• Example
o In a logistic regression model, the regularization strength
(as well as the regularization type, if any) is a hyperparameter
which has to be specified prior to fitting, while the coefficients
of the fitted model are model parameters.
o Finding the right hyperparameters for a model can be crucial for
the model's performance on given data.
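
The distinction can be illustrated with a minimal sketch. To keep it closed-form and dependency-free, it swaps in a one-feature ridge regression for the logistic regression named above; the function name `fit_ridge_1d`, the variable `reg_strength`, and the toy data are assumptions made for illustration only.

```python
# Hypothetical one-feature ridge regression (swapped in for logistic
# regression because its solution can be written in closed form).

def fit_ridge_1d(xs, ys, reg_strength):
    """Return the slope w minimising sum((y - w*x)^2) + reg_strength * w^2."""
    sxy = sum(x * y for x, y in zip(xs, ys))
    sxx = sum(x * x for x in xs)
    return sxy / (sxx + reg_strength)

# Toy data, made up for illustration.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.1, 3.9, 6.2, 7.8]

for reg_strength in (0.0, 1.0, 10.0):        # hyperparameter: chosen before fitting
    w = fit_ridge_1d(xs, ys, reg_strength)   # model parameter: produced by the fit
    print(f"reg_strength={reg_strength:5.1f} -> fitted slope w={w:.3f}")
```

The regularization strength is fixed before each call, while the slope `w` only exists after fitting; a stronger penalty shrinks the fitted slope towards zero.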

• At another scale, model selection means choosing the best learning
method (and its corresponding "optimal" hyperparameters) from a set
of eligible machine learning methods.
o This is referred to as algorithm selection.

Model evaluation

• Model evaluation aims at estimating the generalization error of the
selected model, i.e., how well the selected model performs on unseen data.

Fitting of curve slide

If data is not an issue

• The recommended strategy for model selection depends on the amount
of data available. If plenty of data is available, we can split the
data into several parts, each serving a special purpose.
• For instance, for hyperparameter tuning we may split
the data into three sets: train / validation / test.
• There is no general rule as to how the data should be
split. A typical split is, e.g., 50% / 25% / 25%.
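
A minimal sketch of how such a three-way split might be used: fit on the training set, pick the hyperparameter on the validation set, and report the error once on the test set. The helper names (`split_three_way`, `fit_ridge_1d`, `mse`), the candidate values, and the synthetic data are all assumptions for illustration, and a one-feature ridge regression stands in for whatever model is actually being tuned.

```python
import random

def split_three_way(data, fractions=(0.5, 0.25, 0.25), seed=0):
    """Shuffle a copy of data and cut it into train / validation / test parts."""
    rows = list(data)
    random.Random(seed).shuffle(rows)
    n_train = int(fractions[0] * len(rows))
    n_val = int(fractions[1] * len(rows))
    return rows[:n_train], rows[n_train:n_train + n_val], rows[n_train + n_val:]

def fit_ridge_1d(rows, reg_strength):
    """Slope of a one-feature ridge regression without intercept."""
    sxy = sum(x * y for x, y in rows)
    sxx = sum(x * x for x, _ in rows)
    return sxy / (sxx + reg_strength)

def mse(rows, w):
    """Mean squared error of the model y = w * x on the given rows."""
    return sum((y - w * x) ** 2 for x, y in rows) / len(rows)

# Synthetic data, made up for illustration: y = 2x + Gaussian noise.
rng = random.Random(42)
data = []
for _ in range(200):
    x = rng.uniform(-3.0, 3.0)
    data.append((x, 2.0 * x + rng.gauss(0.0, 0.5)))

train, val, test = split_three_way(data)

# Hyperparameter tuning: fit on train, compare candidates on validation only.
candidates = [0.0, 0.1, 1.0, 10.0, 100.0]
best = min(candidates, key=lambda lam: mse(val, fit_ridge_1d(train, lam)))

# Generalization error estimate: the test set is touched exactly once.
w = fit_ridge_1d(train, best)
print("sizes:", len(train), len(val), len(test))
print("selected reg_strength:", best)
print("test MSE:", mse(test, w))
```

Keeping the test set out of the tuning loop is what makes the final number an estimate of performance on unseen data rather than on data the selection procedure has already seen.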

Learning curves, and why they are useful
• In a learning curve, the performance of a model both
on the training and validation set is plotted as a
function of the training set size.

• The training score (performance on the training set) decreases with
increasing training set size while the validation score increases at the same
time.
• High training score and low validation score at the same time indicates that
the model has overfit the data, i.e., has adapted too well to the specific
training set samples.
• As the training set size increases, overfitting decreases, and the validation
score increases.
• Especially for data-hungry machine learning models, the learning curve
might not yet have reached a plateau at the given training set size, which
means the generalization error might still decrease when providing more
data to the model.
• Hence, it seems reasonable to increase the training set (by adding the
validation set) before estimating the generalization error on the test set, and
to further take advantage of the test set data for model fitting before
shipping the model. Whether or not this strategy is needed depends strongly
on the slope of the learning curve at the initial training set size.
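
The points above can be sketched as a small computation of a learning curve. This is only an illustrative sketch: it uses a straight-line least-squares fit on synthetic data, and it reports mean squared error rather than a score, so the directions are flipped (training error rises and validation error tends to fall as the training set grows). The names `fit_line`, `mse`, and `learning_curve` are hypothetical.

```python
import random

def fit_line(rows):
    """Ordinary least squares for y = a*x + b on a list of (x, y) pairs."""
    n = len(rows)
    mean_x = sum(x for x, _ in rows) / n
    mean_y = sum(y for _, y in rows) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in rows)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in rows)
    a = sxy / sxx
    return a, mean_y - a * mean_x

def mse(rows, model):
    """Mean squared error of the fitted line on the given rows."""
    a, b = model
    return sum((y - (a * x + b)) ** 2 for x, y in rows) / len(rows)

def learning_curve(train, val, sizes):
    """Return (n, training error, validation error) for each training set size n."""
    curve = []
    for n in sizes:
        subset = train[:n]
        model = fit_line(subset)
        curve.append((n, mse(subset, model), mse(val, model)))
    return curve

# Synthetic data, made up for illustration: y = 1.5x + 0.5 + Gaussian noise.
rng = random.Random(1)

def make_rows(n):
    rows = []
    for _ in range(n):
        x = rng.uniform(-3.0, 3.0)
        rows.append((x, 1.5 * x + 0.5 + rng.gauss(0.0, 1.0)))
    return rows

train, val = make_rows(100), make_rows(100)
sizes = [2, 5, 10, 20, 50, 100]
curve = learning_curve(train, val, sizes)
for n, train_err, val_err in curve:
    print(f"n={n:3d}  training MSE={train_err:.3f}  validation MSE={val_err:.3f}")
```

With only two training points the line passes through both, so the training error is essentially zero while the validation error is not: the overfitting situation described above. If the validation error is still dropping at the largest size, more data is likely to help.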

• Learning curves also make it easy to illustrate the concept of
(statistical) bias and variance. Bias in this context refers to
erroneous (e.g. simplifying) model assumptions, which can cause the
model to underfit the data. A high-bias model does not adequately
capture the structure present in the data. Variance, on the other hand,
quantifies how much the model varies as we change the training
data. A high-variance model is very sensitive to small fluctuations in
the training data, which can cause the model to overfit. The amount
of bias and variance can be estimated using learning curves: a model
exhibits high variance but low bias if the training score plateaus at a
high level while the validation score plateaus at a low level, i.e., if
there is a large gap between training and validation score. A model with
low variance but high bias, in contrast, is a model where both training
and validation score are low, but similar. Very simple models are
high-bias, low-variance, while with increasing model complexity they
become low-bias, high-variance.
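
For squared-error loss, the trade-off described above is often summarised by the bias-variance decomposition. The notation is an assumption not used in the slides: the data follow y = f(x) + ε with zero-mean noise of variance σ², f̂ is the model fitted on a random training set, and the expectation is taken over training sets and noise.

```latex
\mathbb{E}\big[(y - \hat{f}(x))^2\big]
  = \underbrace{\big(\mathbb{E}[\hat{f}(x)] - f(x)\big)^2}_{\text{bias}^2}
  + \underbrace{\mathbb{E}\big[(\hat{f}(x) - \mathbb{E}[\hat{f}(x)])^2\big]}_{\text{variance}}
  + \underbrace{\sigma^2}_{\text{irreducible noise}}
```

Very simple models tend to make the first term large and the second small; very flexible models tend to do the opposite.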

• Bias
o Bias is the error resulting from the difference between the
expected prediction of a model, averaged over multiple
iterations, and the actual (or "correct") value we want to predict.
In terms of the scientific concepts of accuracy and precision,
bias is very similar to accuracy.
• Variance
o Variance is the error resulting from the variability between the
predictions a model makes for the same data point.
For variance, the correct value does not matter as much as
the spread of the predictions.
Variance becomes visible when we run multiple
model creation trials, e.g. on different training sets.
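
Both quantities can be estimated empirically by repeating the "model creation trial" many times on freshly drawn training sets. The following is a minimal simulation sketch under stated assumptions: the true function, the noise level, the query point `x0`, and the two toy models (predicting the training mean versus copying the nearest neighbour) are all made up for illustration.

```python
import random

def true_f(x):
    """Assumed ground-truth function (known only because the data is simulated)."""
    return x * x

def sample_training_set(rng, n=20, noise=0.3):
    """Draw n noisy observations of true_f on [-1, 1]."""
    rows = []
    for _ in range(n):
        x = rng.uniform(-1.0, 1.0)
        rows.append((x, true_f(x) + rng.gauss(0.0, noise)))
    return rows

def predict_mean(rows, x0):
    """Very simple model: ignore x0 and predict the average training target."""
    return sum(y for _, y in rows) / len(rows)

def predict_nearest(rows, x0):
    """Very flexible model: copy the target of the closest training point."""
    return min(rows, key=lambda row: abs(row[0] - x0))[1]

def bias2_and_variance(predict, x0, trials=500, seed=0):
    """Estimate squared bias and variance of predict at x0 over many training sets."""
    rng = random.Random(seed)
    preds = [predict(sample_training_set(rng), x0) for _ in range(trials)]
    mean_pred = sum(preds) / trials
    bias2 = (mean_pred - true_f(x0)) ** 2
    variance = sum((p - mean_pred) ** 2 for p in preds) / trials
    return bias2, variance

x0 = 0.9
for name, predict in (("mean model", predict_mean),
                      ("1-nearest neighbour", predict_nearest)):
    bias2, variance = bias2_and_variance(predict, x0)
    print(f"{name:20s} bias^2={bias2:.3f}  variance={variance:.3f}")
```

In this setup the mean model should show the larger squared bias and the nearest-neighbour model the larger variance, matching the high-bias/low-variance versus low-bias/high-variance picture.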

Divide and conquer, but do it carefully
• Training, validation, and test sets must be sampled from
the same distribution.
• Ensure before model building that the distribution of
the data is not affected by partitioning your data.
o Example: use stratified sampling, so that each partition
keeps the class proportions of the full data set.
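
A minimal sketch of stratified sampling for a train/test split, assuming a classification task with one label per sample. The function name `stratified_split`, the 25% test fraction, and the toy labels are assumptions for illustration.

```python
import random
from collections import defaultdict

def stratified_split(labels, test_fraction=0.25, seed=0):
    """Return (train_idx, test_idx) with every class split in the same ratio."""
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    rng = random.Random(seed)
    train_idx, test_idx = [], []
    for label in sorted(by_class):
        idxs = by_class[label]
        rng.shuffle(idxs)                      # shuffle within the class only
        n_test = round(test_fraction * len(idxs))
        test_idx.extend(idxs[:n_test])
        train_idx.extend(idxs[n_test:])
    return train_idx, test_idx

# Imbalanced toy labels: 20% positive, 80% negative.
labels = ["pos"] * 20 + ["neg"] * 80
train_idx, test_idx = stratified_split(labels)

def share_of_pos(idxs):
    return sum(labels[i] == "pos" for i in idxs) / len(idxs)

print("positive share in train:", share_of_pos(train_idx))
print("positive share in test: ", share_of_pos(test_idx))
```

Because the split is made class by class, both partitions keep the 20% positive share of the full data set, which a purely random split would only match approximately.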

If all you have is small data

• Split the data into only two sets, a training set and a test set.
• Augment the data.

K-Fold Cross Validation

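• K-fold cross-validation shuffles the data and splits it into k folds
(groups); each fold serves once as the validation set while the
remaining k - 1 folds are used for training.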
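
The shuffling and splitting step can be sketched as a small index generator. This is an illustrative sketch, not a reference implementation; the function name `k_fold_indices` and the choice of 10 samples and 3 folds are assumptions.

```python
import random

def k_fold_indices(n_samples, k, seed=0):
    """Yield (train_idx, val_idx) pairs for k-fold cross-validation."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)       # shuffle once, up front

    # Spread the remainder so that fold sizes differ by at most one.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0)
                  for i in range(k)]

    start = 0
    for size in fold_sizes:
        val_idx = indices[start:start + size]                  # held-out fold
        train_idx = indices[:start] + indices[start + size:]   # other k-1 folds
        yield train_idx, val_idx
        start += size

for fold, (train_idx, val_idx) in enumerate(k_fold_indices(10, 3)):
    print(f"fold {fold}: validate on {sorted(val_idx)}, "
          f"train on {len(train_idx)} samples")
```

A model would be fitted on `train_idx` and scored on `val_idx` in each round, and the k scores averaged; every sample is used for validation exactly once.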
