
DEPARTMENT OF APPLIED MATHEMATICS, COMPUTER SCIENCE AND STATISTICS

HYPERPARAMETER OPTIMIZATION
Big Data Science (Master in Statistical Data Analysis)
PARAMETER OPTIMIZATION
̶ So far, we have talked about parameter optimization:
̶ Our model contains trainable parameters
̶ We define a loss function
̶ An optimization algorithm searches for the parameters that minimize the loss:
‒ Analytic solutions
‒ Newton-Raphson
‒ (Stochastic) gradient descent
‒ ...

2
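
For concreteness, below is a minimal sketch (not from the slides) of one such algorithm, plain batch gradient descent for a linear model; the toy data, learning rate, and iteration count are arbitrary choices for illustration. Note that the learning rate is itself a hyperparameter, which foreshadows the next slide.

import numpy as np

# Toy data: y = 2x + 1 plus noise (arbitrary example data)
rng = np.random.default_rng(0)
X = rng.uniform(-1, 1, size=(100, 1))
y = 2 * X[:, 0] + 1 + 0.1 * rng.normal(size=100)

# Trainable parameters: weight w and bias b
w, b = 0.0, 0.0
lr = 0.1                  # learning rate: fixed before training, i.e. a hyperparameter
for _ in range(500):
    pred = w * X[:, 0] + b
    grad_w = 2 * np.mean((pred - y) * X[:, 0])   # d(MSE)/dw
    grad_b = 2 * np.mean(pred - y)               # d(MSE)/db
    w -= lr * grad_w
    b -= lr * grad_b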
HYPERPARAMETER OPTIMIZATION
̶ Most models also have hyperparameters:
̶ Fixed before training the model
̶ Involve assumptions of the model
̶ Not taken into account by the gradient of the loss function that is optimized

3
EXAMPLES OF HYPERPARAMETERS
Linear models:
• Regularization constant

Random Forest:
• Number of trees
• Maximum depth
• Minimum leaf size
• Criterion for split
• Number of features per split
• ...

SVM:
• Kernel
• Margin
• Kernel parameters:
  • Polynomial degree
  • Gaussian kernel width
  • ...

Neural networks:
• Architecture
• Number of layers
• Size of each layer
• Activation function
• Dropout
• Regularization
• ...

KNN:
• 𝐾
• Distance metric
• Parameters of approximate structures

4
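
As a sketch of where such hyperparameters appear in practice, assuming scikit-learn (the specific values below are arbitrary examples; they are all fixed before calling fit):

from sklearn.ensemble import RandomForestClassifier
from sklearn.svm import SVC
from sklearn.neighbors import KNeighborsClassifier

rf = RandomForestClassifier(
    n_estimators=200,      # number of trees
    max_depth=8,           # maximum depth
    min_samples_leaf=5,    # minimum leaf size
    criterion="gini",      # criterion for split
    max_features="sqrt",   # number of features per split
)
svm = SVC(C=1.0, kernel="rbf", gamma=0.1)                      # margin constant, kernel, kernel width
knn = KNeighborsClassifier(n_neighbors=7, metric="euclidean")  # K, distance metric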
CHOOSING HYPERPARAMETERS
̶ Manual search
̶ Grid search
̶ Random search
̶ Automated methods:
̶ Bayesian optimization
̶ Evolutionary optimization

5
MANUAL TUNING
̶ Using assumptions or knowledge to select the hyperparameters

̶ Pros:
̶ Computationally efficient

̶ Cons:
̶ Requires manual effort
̶ Prone to bias
̶ Only a limited number of combinations is tested

6
GRID SEARCH
̶ For each hyperparameter, define a subset of values to be tested
̶ Exhaustively test all combinations of those values

̶ Pros:
̶ The individual effect of parameters can be studied
̶ Cons:
̶ The number of combinations can become very high
̶ Few values are tested for every parameter
̶ The combined effect of parameters is not completely modeled

7
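
A minimal grid search sketch, assuming scikit-learn's GridSearchCV and its iris toy dataset; the grid values are arbitrary examples:

from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC
from sklearn.datasets import load_iris

X, y = load_iris(return_X_y=True)
param_grid = {
    "C": [0.1, 1, 10],          # 3 values
    "gamma": [0.01, 0.1, 1],    # 3 values -> 9 combinations in total
}
search = GridSearchCV(SVC(kernel="rbf"), param_grid, cv=5)
search.fit(X, y)
print(search.best_params_, search.best_score_)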
RANDOM SEARCH
̶ A probability distribution is specified for each parameter
̶ Samples are drawn and tested

̶ Pros:
̶ The combined effect of parameters is somewhat modeled
̶ More values per parameter can be considered
̶ Cons:
̶ The search is not guided
̶ The individual effect of parameters is not clear

8
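
A minimal random search sketch, assuming scikit-learn's RandomizedSearchCV and SciPy's loguniform distribution; the distributions and the budget of 30 samples are arbitrary examples:

from scipy.stats import loguniform
from sklearn.model_selection import RandomizedSearchCV
from sklearn.svm import SVC
from sklearn.datasets import load_iris

X, y = load_iris(return_X_y=True)
param_distributions = {
    "C": loguniform(1e-2, 1e2),      # log-uniform over four orders of magnitude
    "gamma": loguniform(1e-3, 1e1),
}
search = RandomizedSearchCV(SVC(kernel="rbf"), param_distributions,
                            n_iter=30, cv=5, random_state=0)
search.fit(X, y)
print(search.best_params_, search.best_score_)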
GRID VS RANDOM

[Figure: comparison of grid and random sampling of hyperparameter values]

J. Bergstra and Y. Bengio, "Random Search for Hyper-Parameter Optimization", Journal of Machine Learning Research 13 (2012), pp. 281-305

9
HYPERPARAMETER OPTIMIZATION AS AN OPTIMIZATION PROBLEM
10
AUTOMATED HYPERPARAMETER OPTIMIZATION
̶ Why not solve hyperparameter optimization in the
same way as parameter optimization?

̶ Main approaches:
̶ Bayesian optimization
̶ Evolutionary algorithms

11
SEQUENTIAL MODEL-BASED BAYESIAN OPTIMIZATION (SMBO)
1. Query the function 𝑓 at 𝑡 values and record the resulting pairs S = {(𝜽ᵢ, 𝑓(𝜽ᵢ)) : i = 1, …, 𝑡}

2. For a fixed number of iterations:


1. Fit a probabilistic model ℳ to the pairs in S
2. Apply an acquisition function 𝑎(𝜽, ℳ) to select a promising input 𝜽 to evaluate next
3. Evaluate 𝑓(𝜽) and add the pair (𝜽, 𝑓(𝜽)) to S

12
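
A self-contained sketch of the SMBO loop above, assuming scikit-learn's Gaussian process regressor as the probabilistic model ℳ and a simple lower-confidence-bound acquisition function over a grid of candidates; the 1-D objective f is a toy stand-in for a validation error:

import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor

def f(theta):                         # toy objective to minimize
    return (theta - 0.3) ** 2 + 0.05 * np.sin(20 * theta)

candidates = np.linspace(0, 1, 200).reshape(-1, 1)

# 1. Query f at t initial values and record the pairs S
rng = np.random.default_rng(0)
thetas = rng.uniform(0, 1, size=(5, 1))
values = np.array([f(t[0]) for t in thetas])

# 2. For a fixed number of iterations
for _ in range(20):
    model = GaussianProcessRegressor().fit(thetas, values)    # 2.1 fit M to the pairs in S
    mu, sigma = model.predict(candidates, return_std=True)
    acquisition = mu - 1.96 * sigma                            # 2.2 lower confidence bound
    theta_next = candidates[np.argmin(acquisition)]
    thetas = np.vstack([thetas, theta_next])                   # 2.3 evaluate f and add to S
    values = np.append(values, f(theta_next[0]))

best_theta = thetas[np.argmin(values)]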
GENETIC ALGORITHMS
̶ Applying the principles of natural selection to optimization
̶ Solutions are encoded as "chromosomes"
̶ A crossover operator combines two chromosomes into new ones
̶ A mutation operator introduces random mutations

1. Generate an initial population of solutions


2. For a number of generations:
1. Apply the crossover operator to increase the population size
2. Apply the mutation operator
3. Evaluate the new solutions
4. Discard some "bad" solutions to maintain a "good" population

13
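
A minimal sketch of this loop for two numeric hyperparameters; the chromosome encoding, operators, and toy fitness function are arbitrary illustrations rather than a prescribed implementation:

import random

def fitness(chrom):                          # higher is better (toy stand-in for CV performance)
    c, gamma = chrom
    return -((c - 1.0) ** 2 + (gamma - 0.1) ** 2)

def crossover(a, b):                         # combine two chromosomes into two new ones
    return (a[0], b[1]), (b[0], a[1])

def mutate(chrom, rate=0.2):                 # random perturbation of each gene
    return tuple(g * random.uniform(0.8, 1.2) if random.random() < rate else g
                 for g in chrom)

# 1. Generate an initial population of solutions
population = [(random.uniform(0.01, 10), random.uniform(0.001, 1)) for _ in range(10)]

# 2. For a number of generations
for _ in range(30):
    random.shuffle(population)
    children = []
    for a, b in zip(population[::2], population[1::2]):
        children.extend(crossover(a, b))             # 2.1 crossover to grow the population
    children = [mutate(c) for c in children]         # 2.2 mutation
    population = sorted(population + children,       # 2.3 evaluate, 2.4 keep the best
                        key=fitness, reverse=True)[:10]

best = population[0]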
PARTITIONING
14
PARTITIONING FOR HYPERPARAMETER OPTIMIZATION
̶ Remember: NEVER TRAIN ON THE TEST SET

̶ This also applies when optimizing hyperparameters

15
TEST SET + CROSS VALIDATION
[Diagram: the data is split into a training portion and a held-out test set; cross-validation rotates the validation fold within the training portion]
16
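
A sketch of this partitioning in code, assuming scikit-learn: hyperparameters are selected by cross-validation on the training portion only, and the held-out test set is touched exactly once at the end:

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split, GridSearchCV
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

search = GridSearchCV(SVC(), {"C": [0.1, 1, 10]}, cv=5)   # cross-validation on the training portion only
search.fit(X_train, y_train)

final_score = search.score(X_test, y_test)                # the test set is used exactly once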


NESTED CROSS VALIDATION

[Diagram: nested cross-validation; an outer loop rotates the test fold, and within each outer training set an inner loop rotates the validation fold]


17
NESTED CROSS VALIDATION: EXAMPLE
̶ 5 folds
̶ 3 classifiers: Logistic Regression, Random Forest, SVM

̶ We want to know which classifier is best suited to our problem


̶ We also want to optimize the hyperparameters of each classifier
̶ 3 inner folds for hyperparameter optimization
̶ The ultimate goal is to have a system in production making real predictions

18
NESTED CROSS VALIDATION: EXAMPLE
1. For each outer fold i in [1...5]:
   1. Validation set: fold i
   2. Training set: folds {1,2,3,4,5}\{i}
   3. Split the training set into 3 inner folds
   4. For each classifier 𝐶 in {LR, RF, SVM}:
      1. For each combination of hyperparameters 𝜃𝑐 for 𝐶:
         1. For each inner fold j in [1...3]:
            1. (Inner) validation set: fold j
            2. (Inner) training set: folds {1,2,3}\{j}
            3. Train classifier 𝐶(𝜃𝑐) on the inner training set
            4. Evaluate 𝐶(𝜃𝑐) on the inner validation set
         2. Calculate the average performance of 𝐶(𝜃𝑐) across the 3 inner folds
      2. Select the best-performing parameters 𝜃𝑐*(𝑖) for classifier 𝐶
      3. Evaluate 𝐶(𝜃𝑐*(𝑖)) on the (outer) validation set
2. Calculate the average performance of each 𝐶(𝜃𝑐*(𝑖)) across all outer validation folds
3. Select the best classifier 𝐶*
4. Select 𝜃*, the optimal hyperparameters for 𝐶*
5. Train 𝐶*(𝜃*) on the entire dataset

̶ Note that the best parameters 𝜃𝑐*(𝑖) for each classifier depend on the outer fold that was used for training

19
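
A sketch of the procedure above, assuming scikit-learn: GridSearchCV supplies the 3 inner folds and cross_val_score the 5 outer folds; the hyperparameter grids are arbitrary examples rather than the ones from the lecture:

from sklearn.datasets import load_iris
from sklearn.model_selection import GridSearchCV, cross_val_score
from sklearn.linear_model import LogisticRegression
from sklearn.ensemble import RandomForestClassifier
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)

models = {
    "LR":  (LogisticRegression(max_iter=1000), {"C": [0.1, 1, 10]}),
    "RF":  (RandomForestClassifier(), {"n_estimators": [100, 300], "max_depth": [None, 5]}),
    "SVM": (SVC(), {"C": [0.1, 1, 10], "gamma": [0.01, 0.1]}),
}

outer_scores = {}
for name, (estimator, grid) in models.items():
    inner = GridSearchCV(estimator, grid, cv=3)                      # inner 3-fold hyperparameter search
    outer_scores[name] = cross_val_score(inner, X, y, cv=5).mean()   # outer 5-fold performance estimate

best_name = max(outer_scores, key=outer_scores.get)                  # select the best classifier
# Final model: rerun the hyperparameter search for the winner on the entire dataset
final_model = GridSearchCV(*models[best_name], cv=3).fit(X, y)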
