Random Forest
Algorithm class: Non-parametric
Mechanism: Average predictions of many trees (de-correlated)
Applicable: Both classification and regression problems
Random Forest is a generalization of bagging, and typically achieves much
better performance
• Essentially, it improves on bagging with one small tweak: de-correlating the trees
• This further reduces the variance when we average the trees
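The variance-reduction claim can be made concrete with the standard formula for the variance of an average of B identically distributed trees with pairwise correlation ρ (a base-R sketch; the values of rho, sigma2 and B below are purely illustrative):

```r
# Variance of an average of B trees, each with variance sigma2 and
# pairwise correlation rho:  rho*sigma2 + (1 - rho)*sigma2 / B.
# Averaging shrinks only the second term; de-correlating the trees
# (lowering rho) shrinks the first term, which averaging alone cannot.
avg_var <- function(rho, sigma2, B) rho * sigma2 + (1 - rho) * sigma2 / B

avg_var(rho = 0.8, sigma2 = 1, B = 500)   # bagging-like: 0.8004
avg_var(rho = 0.2, sigma2 = 1, B = 500)   # RF-like:      0.2016
```

Even with many trees, highly correlated trees leave most of the variance in place; this is why split-variable randomization helps.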
Idea
Split-variable randomization
• Follow a similar bagging process, but …
• each time a split is to be performed, only a random subset of m out of the p predictors is considered as split candidates
  - regression trees: m = p/3
  - classification trees: m = √p
  - m is commonly referred to as mtry
[Figures: trees produced by bagging vs. trees produced by RF]
Essentially
• Bagging introduces randomness into rows of the data
• Random forest introduces randomness into the columns (split variables) as well
• This provides a more diverse set of trees that almost always lowers the
prediction error
Out of bag (OOB) Performance
• For large enough N, on average 63% of the original records end up in any
bootstrap sample
• i.e. 37% of the observations are not used in the construction of a particular tree
• These observations are considered OOB and can be used for efficient assessment
of model performance (unstructured, but free, cross validation)
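The 63%/37% figures follow from the bootstrap: the chance a given record is never drawn in N draws with replacement is (1 − 1/N)^N, which converges to e⁻¹ ≈ 0.368. A quick base-R check:

```r
# Probability a given record is NOT drawn in N draws with replacement:
# (1 - 1/N)^N -> exp(-1) ≈ 0.368 as N grows, so ~63% of records appear
# in a bootstrap sample and ~37% are out-of-bag (OOB).
N <- 10000
p_oob <- (1 - 1/N)^N
p_oob        # ≈ 0.368
exp(-1)      # the limiting value, ≈ 0.368

# Simulation: fraction of unique records appearing in one bootstrap sample
set.seed(123)
in_bag <- mean(seq_len(N) %in% sample(N, N, replace = TRUE))
in_bag       # ≈ 0.632
```

The ~37% OOB records act as a held-out set for each tree, which is what makes the "free" performance assessment possible.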
• RF typically has the least variability in prediction accuracy when tuning
• Let’s now look at how to implement RF
Implementation of Random Forest
• Simple way: ranger, full grid search
• More advanced: h2o, random grid search & early stopping rules
Ames Housing Example (RF), with ranger package
Direct implementation of RF, no tuning
…
For regression tree
Baseline RF model, RMSE ≈ 25,500
Next, we will look at how to tune hyperparameters to improve the model
Tuning Hyperparameters
Random forests provide good "out-of-the-box" performance, but there are a few hyperparameters
we can tune to increase performance.
• # Trees and mtry: typically have the largest impact on predictive accuracy
• Min node size / max depth (tree complexity) and sampling scheme: some impact on predictive accuracy, but tuning them can also increase computational efficiency
Tuning Hyperparameters: # Trees
• Needs to be sufficiently large to stabilize the error rate
• Rule of thumb: start with 10 × p trees and adjust as necessary
• More trees provide more robust and stable error estimates and variable-importance measures
• Computation time increases linearly with the number of trees
Tuning Hyperparameters: mtry (# split vars)
• Balance low tree correlation against reasonable predictive strength
• Rule-of-thumb defaults:
  - regression: p/3
  - classification: √p
• Start with 5 values evenly spaced from 2 to p, including the default rule-of-thumb value
• Few relevant predictors: should we increase or decrease mtry? (with many irrelevant features, a larger mtry tends to perform better)
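The "5 evenly spaced values from 2 to p, plus the default" rule can be sketched in base R (p = 80 is an illustrative feature count, not taken from the slides):

```r
# Candidate mtry values: 5 evenly spaced values from 2 to p, combined
# with the rule-of-thumb default for regression (floor(p/3)).
p <- 80  # assumed number of features, for illustration only
mtry_grid <- unique(sort(c(round(seq(2, p, length.out = 5)), floor(p / 3))))
mtry_grid  # evenly spaced candidates including the default of 26
```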
Tuning Hyperparameters: Min node size/Max depth (Tree Complexity)
• Controls the complexity of individual trees
• Rule of thumb:
  - regression default: 5
  - classification default: 1
  - start with 3 values (1, 5, 10)
• A study (Segal, 2004) has shown: with few relevant (i.e. many noisy) predictors, increasing node size can help
• Very large data sets: increase node size to reduce run time
[Figure: impact of node size on error growth and run-time reduction]
• If run time is a concern, it can be reduced substantially by increasing node size
Tuning Hyperparameters: Sampling scheme
1. Sample size (default: 100%)
2. Sampling with replacement / without replacement (default: with replacement)
Rationale:
• Decreasing the sample size lowers between-tree correlation
• Sampling without replacement produces trees that are less biased
  - ensures observations with low-frequency categories are more likely to be selected
  - especially important when the data has imbalanced categories
Rule of thumb:
• Try 3-4 sample sizes ranging from 25% to 100%
• Try sampling both with and without replacement
Ames Housing Example (RF), with ranger package (cont’d)
Tuning Strategy Illustration: build a grid over mtry, min node size, and sampling scheme
Note: [Link] returns a dataframe with columns mtry,
[Link], replace, [Link], rmse (values to be filled)
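The exact helper used on the slide is not shown, but `expand.grid` is the standard base-R way to build such a dataframe, and it matches the column names shown above (the p = 80 feature count and the specific candidate values below are illustrative assumptions):

```r
# Hedged sketch of the hyperparameter grid; column names follow ranger's
# argument names, candidate values are illustrative.
hyper_grid <- expand.grid(
  mtry            = floor(80 * c(.05, .15, .25, .333, .4)),  # p = 80 assumed
  min.node.size   = c(1, 5, 10),
  replace         = c(TRUE, FALSE),
  sample.fraction = c(.5, .63, .8),
  rmse            = NA                                       # to be filled in
)
nrow(hyper_grid)  # 5 * 3 * 2 * 3 = 90 combinations
```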
Tuning Strategy Illustration: loop over the grid (# trees, mtry, node size, sampling scheme)
Fills in the rmse column of hyper_grid
(created by [Link])
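A sketch of what this tuning loop typically looks like with ranger: fit one forest per grid row and record the OOB RMSE. It assumes `ames_train` from the earlier slides and a grid like the one above (the candidate values here are illustrative), and is guarded so it degrades gracefully when ranger or the data is unavailable:

```r
# Illustrative grid (values are assumptions, not the slide's exact grid)
hyper_grid <- expand.grid(
  mtry            = c(20, 26, 32),
  min.node.size   = c(1, 5, 10),
  replace         = c(TRUE, FALSE),
  sample.fraction = c(.5, .8),
  rmse            = NA
)

# Fit one RF per grid row; skip if ranger / ames_train are unavailable
if (requireNamespace("ranger", quietly = TRUE) && exists("ames_train")) {
  for (i in seq_len(nrow(hyper_grid))) {
    fit <- ranger::ranger(
      formula         = Sale_Price ~ .,
      data            = ames_train,
      num.trees       = 800,                        # ~10 * p, p = 80 assumed
      mtry            = hyper_grid$mtry[i],
      min.node.size   = hyper_grid$min.node.size[i],
      replace         = hyper_grid$replace[i],
      sample.fraction = hyper_grid$sample.fraction[i],
      seed            = 123,
      verbose         = FALSE
    )
    # For regression, ranger stores the OOB MSE in prediction.error
    hyper_grid$rmse[i] <- sqrt(fit$prediction.error)
  }
}
```

Because the RMSE comes from the OOB records, no separate cross-validation loop is needed.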
Tuning Strategy Illustration: % improvement of RMSE w.r.t. the baseline model
• RMSE improves slightly over the baseline model
Observations
1. The default mtry = 26 (# features / 3) is nearly sufficient
2. Smaller node sizes (deeper trees) perform better
3. Sampling < 100% and sampling without replacement consistently perform better
   • probably because the data has many high-cardinality and imbalanced categorical features
Ames Housing Example (RF), with h2o package
Benefits of the h2o package:
• Random grid search
  - a full Cartesian hyperparameter search can be computationally expensive
  - instead, randomly jump from one hyperparameter combination to another
• Can specify early stopping rules
  - e.g. stop once # models trained ≥ a threshold, or after a certain runtime elapses
Baseline h2o RF
• Syntax and result are very similar to the baseline ranger RF
h2o RF with Random Grid Search + Early Stopping Rule (Optional)
Recall that in ranger, we built the hyperparameter grid using the following syntax
In h2o, we instead specify the hyperparameter grid as a list
Random grid-search strategy: "RandomDiscrete"
• Randomly jumps from one hyperparameter combination to another
Early stopping criteria for the grid search
• Stop if the last 10 RF models do NOT improve RMSE by at least 0.1%
• Stop if run time exceeds 5 minutes
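The two lists above can be sketched as follows. The list-entry names (`mtries`, `min_rows`, `sample_rate`, and the `search_criteria` fields) follow h2o's interface for its random forest and grid search; the candidate hyperparameter values are illustrative assumptions, while the stopping settings encode the rules stated above:

```r
# Hyperparameter grid as an h2o-style list (h2o uses min_rows for
# min node size and sample_rate for the sampling fraction)
hyper_params <- list(
  mtries      = c(20, 26, 32),
  min_rows    = c(1, 5, 10),
  sample_rate = c(.5, .63, .8)
)

# Random-search strategy and early stopping for the grid search
search_criteria <- list(
  strategy           = "RandomDiscrete",
  stopping_metric    = "RMSE",
  stopping_tolerance = 0.001,    # last models must improve RMSE by 0.1%
  stopping_rounds    = 10,       # ... over the last 10 models
  max_runtime_secs   = 60 * 5    # stop after 5 minutes
)

# These lists would then be passed to h2o.grid("randomForest", ...,
# hyper_params = hyper_params, search_criteria = search_criteria)
```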
Early stopping criteria for building one RF
• Stop if the last 10 trees added do NOT improve RMSE by 0.5%
Note: with early stopping, results may NOT be
reproducible (# models searched will differ across
machines of different speeds)
• Assessed 66 models; best CV RMSE = 24,670
• This is near-optimal, and the random grid search is more efficient than a full Cartesian search
Feature Interpretation
For RF: two approaches to variable importance
(At this point you do not need to know the details; just know there are two measures)
Impurity (same as CART)
• Based on the average total reduction in MSE from splits on the feature
Permutation (applicable to all ML models; discussed in more detail later)
• Randomly permute a feature's values and see how much the MSE worsens
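The permutation idea is model-agnostic, so it can be illustrated with base R alone. This toy sketch uses a linear model on the built-in mtcars data rather than an RF (purely to keep it self-contained): shuffle one feature, leave the fitted model untouched, and measure how much the MSE worsens:

```r
# Toy permutation importance: break one feature's link to the response
# and measure the resulting increase in MSE (larger increase = more
# important feature). The model and data here are illustrative only.
set.seed(42)
fit  <- lm(mpg ~ wt + hp, data = mtcars)
mse  <- function(model, d) mean((d$mpg - predict(model, d))^2)
base <- mse(fit, mtcars)

perm_importance <- function(feature) {
  d <- mtcars
  d[[feature]] <- sample(d[[feature]])   # permute the feature's values
  mse(fit, d) - base                     # increase in MSE
}

perm_importance("wt")   # typically a large increase: wt matters for mpg
perm_importance("hp")
```

An RF analogue simply swaps the `lm` fit for the forest; ranger can also compute this directly via its `importance = "permutation"` option.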
E.g. using ranger
Typically, similar variables appear at the top under both approaches
• We can conclude the top 3 important variables are Overall_Qual, Gr_Liv_Area, Neighborhood
Summary
Method: CART
• Hyperparameters: tree depth, node size, cp
• Unique features: simple to interpret
• RMSE: -
• Packages demonstrated: rpart; caret (method = "rpart")

Method: Random Forest
• Hyperparameters: # trees (~10p); mtry (# split vars: p/3 or √p); node size (tree complexity); sampling scheme (sample size, with/without replacement)
• Unique features: subsamples rows/columns; early stopping (in adding trees)
• RMSE: ~24,000
• Packages demonstrated: ranger; h2o (algorithm = "randomForest")
End