Support Vector, Decision Tree and Random Forest Regression
Support Vector Regression (SVR) is a type of machine learning algorithm used for regression analysis. The goal of SVR is
to find a function that approximates the relationship between the input variables and a continuous target variable, while
minimizing the prediction error.
SVR seeks to find a hyperplane that best fits the data points in a continuous space. This is achieved by mapping the input
variables to a high-dimensional feature space and finding the hyperplane that maximizes the margin (distance) between the
hyperplane and the closest data points, while also minimizing the prediction error.
SVR can handle non-linear relationships between the input variables and the target variable by using a kernel function to
map the data to a higher-dimensional space.
•Kernel: A kernel helps us find a hyperplane in a higher-dimensional space without increasing the computational cost.
Usually, the computational cost increases as the dimension of the data increases. This increase in dimension is needed
when we cannot find a separating hyperplane in the given dimension and have to move to a higher one.
•Hyperplane: In SVM classification, this is the separating line between two data classes. In Support Vector Regression,
it is the line used to predict the continuous output.
•Decision Boundary: A decision boundary can be thought of as a demarcation line (for simplification) on one side of
which lie the positive examples and on the other side the negative examples. Examples that fall exactly on this line may
be classified as either positive or negative.
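To make these ideas concrete, here is a minimal sketch of fitting an SVR model with scikit-learn; the RBF kernel, C, and epsilon values below are illustrative choices, not recommendations.

import numpy as np
from sklearn.svm import SVR

# Toy 1-D regression data: a sine curve with a little noise.
rng = np.random.RandomState(0)
X = np.sort(5 * rng.rand(80, 1), axis=0)
y = np.sin(X).ravel() + 0.1 * rng.randn(80)

# The RBF kernel implicitly maps the inputs to a higher-dimensional space;
# epsilon sets the half-width of the tube around the fitted hyperplane.
model = SVR(kernel="rbf", C=10.0, epsilon=0.1)
model.fit(X, y)

print(model.predict([[2.5]]))  # prediction for a new input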
Support Vector Regression
Consider these two red lines as the decision boundary and the green line as the hyperplane. Our objective in SVR is to
consider only the points that lie within the decision boundary lines. The best-fit line is the hyperplane that contains the
maximum number of points within this boundary.
The first thing to understand is the decision boundary (the red lines mentioned above). Consider these lines as being at
some distance, say 'a', from the hyperplane: they are the lines drawn at distance '+a' and '-a' from it. This 'a' is
referred to as epsilon.
Assuming that the equation of the hyperplane is:
y = wx + b (equation of the hyperplane)
then the equations of the decision boundaries become:
wx + b = +a
wx + b = -a
Thus, any hyperplane that satisfies our SVR should satisfy:
-a < y - (wx + b) < +a
Our main aim here is to choose a decision boundary at distance 'a' from the original hyperplane such that the data points
closest to the hyperplane, i.e. the support vectors, lie within that boundary.
Hence, we take only those points that are within the decision boundary and have the least error, i.e. lie within the
margin of tolerance. This gives us a better-fitting model.
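As a hedged illustration of the condition -a < y - (wx + b) < +a, the short sketch below checks which points of a toy dataset fall inside the epsilon tube of an assumed linear hyperplane; the weights w, b and the value of a are made up for the example.

import numpy as np

# Assumed (hypothetical) hyperplane y = wx + b and tube half-width a (epsilon).
w, b, a = 2.0, 1.0, 0.5

x = np.array([0.0, 1.0, 2.0, 3.0])
y = np.array([1.1, 3.4, 4.9, 7.8])

residual = y - (w * x + b)          # y - (wx + b)
inside_tube = np.abs(residual) < a  # points satisfying -a < y - (wx + b) < +a

print(residual)     # [ 0.1  0.4 -0.1  0.8]
print(inside_tube)  # [ True  True  True False]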
DECISION TREE Regression
•Root Node: It is the topmost node in the tree, which represents the complete dataset. It is the starting point of the decision-
making process.
•Leaf/Terminal Node: A node without any child nodes that indicates a class label or a numerical value.
•Splitting: The process of splitting a node into two or more sub-nodes using a split criterion and a selected feature.
•Branch/Sub-Tree: A subsection of the decision tree that starts at an internal node and ends at the leaf nodes.
•Parent Node: The node that divides into one or more child nodes.
•Child Node: The nodes that emerge when a parent node is split.
•Pruning: The process of removing branches from the tree that do not provide any additional information or lead to
overfitting.
DECISION TREE Regression
Decision tree regression works by partitioning the feature space into regions and predicting the target variable as the average (or median) value of the
training samples in each region.
Building a Decision Tree
1.Root Node Selection: The feature that provides the best split (according to the chosen criterion) is selected as the root node.
2.Splitting: The dataset is split into subsets based on the value of the selected feature.
3.Recursive Splitting: The splitting process is repeated recursively for each subset until a stopping criterion is met. This criterion could be a maximum tree
depth, minimum samples per leaf, or other hyperparameters.
4.Leaf Node Prediction: When a stopping criterion is reached, the average (or median) value of the target variable in each leaf node is used as the prediction
for new instances falling into that leaf.
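A minimal scikit-learn sketch of these four steps is given below; the max_depth and min_samples_leaf values are example stopping criteria, not recommendations.

import numpy as np
from sklearn.tree import DecisionTreeRegressor

# Toy data with two input features and a continuous target.
rng = np.random.RandomState(0)
X = rng.rand(200, 2)
y = 3 * X[:, 0] + np.sin(6 * X[:, 1])

# Stopping criteria (step 3): maximum depth and minimum samples per leaf.
tree = DecisionTreeRegressor(max_depth=4, min_samples_leaf=10)
tree.fit(X, y)

# Each leaf predicts the mean target value of the training samples that fall into it (step 4).
print(tree.predict([[0.5, 0.5]]))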
VARIANCE
In decision tree regression, variance refers to the variability or spread of the target variable within each node of the tree. It can be used as a measure of
impurity when deciding how to split the data at each node. The most common measure of the variance within a node is the
mean squared error (MSE) of the samples around the node's mean.
DECISION TREE Regression
Variance Reduction
Variance reduction measures how much the variance of the target variable decreases as a result of splitting the data on a
particular feature at a particular node in the tree. It is computed by comparing the variance of the target variable before and
after the split: variance reduction = Var(parent) - Σ (n_child / n_parent) · Var(child), summed over the child nodes.
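As a small worked sketch with made-up numbers, the snippet below computes the variance of a parent node, the weighted variance of the two child nodes produced by a candidate split, and the resulting variance reduction.

import numpy as np

# Hypothetical target values reaching a node, split on some feature threshold.
parent = np.array([2.0, 3.0, 3.5, 8.0, 9.0, 10.0])
left   = np.array([2.0, 3.0, 3.5])    # samples with feature <= threshold
right  = np.array([8.0, 9.0, 10.0])   # samples with feature > threshold

var_parent = parent.var()
weighted_child_var = (len(left) / len(parent)) * left.var() \
                   + (len(right) / len(parent)) * right.var()

variance_reduction = var_parent - weighted_child_var
print(var_parent, weighted_child_var, variance_reduction)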
Random Forest Regression
A Random Forest is like a group decision-making team in machine learning. It combines the opinions of many “trees”
(individual models) to make better predictions, creating a more robust and accurate overall model. In other words, it
builds multiple decision trees during training. Each decision tree is constructed from a random subset of features
and a random subset of the training data points. This randomness helps to ensure that the individual trees are diverse and
not overly correlated with each other. Three common ways of combining models are listed below; Random Forests use the
first of them, bagging.
1.Bagging (Bootstrap Aggregating): This method involves
training multiple models on random subsets of the training
data. The predictions from the individual models are then
combined, typically by averaging.
2.Boosting: This method involves training a sequence of
models, where each subsequent model focuses on the errors
made by the previous model. The predictions are combined
using a weighted voting scheme.
3.Stacking: This method involves using the predictions from
one set of models as input features for another model. The
final prediction is made by the second-level model.
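A minimal scikit-learn sketch of random forest regression (bagging applied to decision trees) is shown below; the n_estimators and max_features settings are illustrative, not recommendations.

import numpy as np
from sklearn.ensemble import RandomForestRegressor

# Toy data with three input features and a noisy continuous target.
rng = np.random.RandomState(0)
X = rng.rand(300, 3)
y = 2 * X[:, 0] - X[:, 1] + 0.1 * rng.randn(300)

# Each tree is trained on a bootstrap sample of the rows and considers a
# random subset of features at each split; the trees' predictions are averaged.
forest = RandomForestRegressor(n_estimators=100, max_features="sqrt", random_state=0)
forest.fit(X, y)

print(forest.predict([[0.5, 0.5, 0.5]]))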