0% found this document useful (0 votes)

9 views

Decision Tree Algorithm

notes for Decision Tree Algorithm

Uploaded by

Aatish

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views

Decision Tree Algorithm

notes for Decision Tree Algorithm

Uploaded by

Aatish

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 12

Decision Tree Algorithm With

Hands-On Example

The decision tree is one of the most important machine learning

algorithms. It is used for both classification and regression problems.
In this article, we will go through the classification part.

What is a decision tree?

A decision tree is a classification and prediction tool having a tree-like

structure, where each internal node denotes a test on an attribute,
each branch represents an outcome of the test, and each leaf node
(terminal node) holds a class label.
Above we have a small decision tree. An important advantage of the
decision tree is that it is highly interpretable. Here If Height > 180cm
or if height < 180cm and weight > 80kg person is male.Otherwise
female. Did you ever think about how we came up with this decision
tree? I will try to explain it using the weather dataset.

Before going to it further I will explain some important terms related

to decision trees.

Entropy

In machine learning, entropy is a measure of the randomness in the

information being processed. The higher the entropy, the harder it is
to draw any conclusions from that information.
Information Gain

Information gain can be defined as the amount of information gained

about a random variable or signal from observing another random
variable.It can be considered as the difference between the entropy of
parent node and weighted average entropy of child nodes.

Gini Impurity

Gini impurity is a measure of how often a randomly chosen element

from the set would be incorrectly labeled if it was randomly labeled
according to the distribution of labels in the subset.

Gini impurity is lower bounded by 0, with 0 occurring if the data set

contains only one class.
There are many algorithms there to build a decision tree. They are

1. CART (Classification and Regression Trees) — This makes use

of Gini impurity as the metric.

2. ID3 (Iterative Dichotomiser 3) — This uses entropy and

information gain as metric.

In this article, I will go through ID3. Once you got it it is easy to

implement the same using CART.

Classification using the ID3 algorithm

Consider whether a dataset based on which we will determine whether
to play football or not.
Here There are for independent variables to determine the dependent
variable. The independent variables are Outlook, Temperature,
Humidity, and Wind. The dependent variable is whether to play
football or not.

As the first step, we have to find the parent node for our decision tree.
For that follow the steps:

Find the entropy of the class variable.

E(S) = -[(9/14)log(9/14) + (5/14)log(5/14)] = 0.94

note: Here typically we will take log to base 2.Here total there are 14
yes/no. Out of which 9 yes and 5 no.Based on it we calculated
probability above.

From the above data for outlook we can arrive at the following table
easily
Now we have to calculate average weighted entropy. ie, we have
found the total of weights of each feature multiplied by probabilities.

E(S, outlook) = (5/14)E(3,2) + (4/14)E(4,0) + (5/14)*E(2,3) = (5/14)(-

(3/5)log(3/5)-(2/5)log(2/5))+ (4/14)(0) + (5/14)((2/5)log(2/5)-
(3/5)log(3/5)) = 0.693

The next step is to find the information gain. It is the difference

between parent entropy and average weighted entropy we found
above.

IG(S, outlook) = 0.94 - 0.693 = 0.247

Similarly find Information gain for Temperature, Humidity, and Windy.

IG(S, Temperature) = 0.940 - 0.911 = 0.029

IG(S, Humidity) = 0.940 - 0.788 = 0.152

IG(S, Windy) = 0.940 - 0.8932 = 0.048

Now select the feature having the largest entropy gain. Here it is
Outlook. So it forms the first node(root node) of our decision tree.
Now our data look as follows

Since overcast contains only examples of class ‘Yes’ we can set it as

yes. That means If outlook is overcast football will be played. Now our
decision tree looks as follows.

The next step is to find the next node in our decision tree. Now we will
find one under sunny. We have to determine which of the following
Temperature, Humidity or Wind has higher information gain.
Calculate parent entropy E(sunny)

E(sunny) = (-(3/5)log(3/5)-(2/5)log(2/5)) = 0.971.

Now Calculate the information gain of Temperature. IG(sunny,

Temperature)

E(sunny, Temperature) = (2/5)E(0,2) + (2/5)E(1,1) +

(1/5)*E(1,0)=2/5=0.4

Now calculate information gain.

IG(sunny, Temperature) = 0.971–0.4 =0.571

Similarly we get

IG(sunny, Humidity) = 0.971

IG(sunny, Windy) = 0.020

Here IG(sunny, Humidity) is the largest value. So Humidity is the node
that comes under sunny.

For humidity from the above table, we can say that play will occur if
humidity is normal and will not occur if it is high. Similarly, find the
nodes under rainy.

Note: A branch with entropy more than 0 needs further

splitting.

Finally, our decision tree will look as below:

Classification using CART algorithm

Classification using CART is similar to it. But instead of entropy, we
use Gini impurity.

So as the first step we will find the root node of our decision
tree. For that Calculate the Gini index of the class variable

Gini(S) = 1 - [(9/14)² + (5/14)²] = 0.4591

As the next step, we will calculate the Gini gain. For that first, we
will find the average weighted Gini impurity of Outlook, Temperature,
Humidity, and Windy.

First, consider case of Outlook

Gini(S, outlook) = (5/14)gini(3,2) + (4/14)gini(4,0)+ (5/14)gini(2,3) =

(5/14)(1 - (3/5)² - (2/5)²) + (4/14)*0 + (5/14)(1 - (2/5)² - (3/5)²)=
0.171+0+0.171 = 0.342

Gini gain (S, outlook) = 0.459 - 0.342 = 0.117

Gini gain(S, Temperature) = 0.459 - 0.4405 = 0.0185

Gini gain(S, Humidity) = 0.459 - 0.3674 = 0.0916

Gini gain(S, windy) = 0.459 - 0.4286 = 0.0304

Choose one that has a higher Gini gain. Gini gain is higher for outlook.
So we can choose it as our root node.

Now you have got an idea of how to proceed further. Repeat the same
steps we used in the ID3 algorithm.

Advantages and disadvantages of decision trees

Advantages:

1. Decision trees are super interpretable

2. Require little data preprocessing

3. Suitable for low latency applications

Disadvantages:

1. More likely to overfit noisy data. The probability of overfitting

on noise increases as a tree gets deeper. A solution for it
is pruning. You can read more about pruning from my Kaggle
notebook. Another way to avoid overfitting is to use bagging
techniques like Random Forest. You can read more about
Random Forest from an article from neptune.ai.

References:
 https://www.saedsayad.com/decision_tree.htm

 Applied-ai course

Leetcode Python Solutions
86% (7)
Leetcode Python Solutions
226 pages
Examples
No ratings yet
Examples
8 pages
MODULE 4-Dr - GM
No ratings yet
MODULE 4-Dr - GM
23 pages
Decision Tree
No ratings yet
Decision Tree
14 pages
Unit 4 - Decision Tree ID3
No ratings yet
Unit 4 - Decision Tree ID3
5 pages
ML Unit-3 ppt
No ratings yet
ML Unit-3 ppt
92 pages
Classification and Clustering
No ratings yet
Classification and Clustering
59 pages
Decision Trees
No ratings yet
Decision Trees
15 pages
Decision Tree
No ratings yet
Decision Tree
100 pages
Ml Unit 2 Final_iii Yr
No ratings yet
Ml Unit 2 Final_iii Yr
72 pages
ML Unit-2 Material WORD
No ratings yet
ML Unit-2 Material WORD
25 pages
Decision Trees Iterative Dichotomiser 3 (ID3) For Classification: An ML Algorithm
No ratings yet
Decision Trees Iterative Dichotomiser 3 (ID3) For Classification: An ML Algorithm
7 pages
MLT UNIT-3 notes
No ratings yet
MLT UNIT-3 notes
35 pages
Practice Q Machine Learning Ans
No ratings yet
Practice Q Machine Learning Ans
54 pages
FALLSEM2023-24 CSE4020 ELA VL2023240104096 2023-08-19 Reference-Material-I
No ratings yet
FALLSEM2023-24 CSE4020 ELA VL2023240104096 2023-08-19 Reference-Material-I
11 pages
Decision Tree
No ratings yet
Decision Tree
36 pages
ML Classification Tree
No ratings yet
ML Classification Tree
36 pages
Decision Trees - Neha Chowdhary PPT
No ratings yet
Decision Trees - Neha Chowdhary PPT
20 pages
3. Tree Models
No ratings yet
3. Tree Models
42 pages
AIML Lect5 Decision Tree
No ratings yet
AIML Lect5 Decision Tree
33 pages
ML_Unit-2_Material
No ratings yet
ML_Unit-2_Material
20 pages
Decision Trees
No ratings yet
Decision Trees
19 pages
Decision Tree
100% (4)
Decision Tree
66 pages
Decision Tree For Classification (ID3 Information Gain Entropy)
No ratings yet
Decision Tree For Classification (ID3 Information Gain Entropy)
3 pages
Day48 Decision Trees
No ratings yet
Day48 Decision Trees
5 pages
decisiontrees (1)
No ratings yet
decisiontrees (1)
28 pages
Experiment No-2
No ratings yet
Experiment No-2
4 pages
Day 5 Supervised Technique-Decision Tree For Classification PDF
100% (1)
Day 5 Supervised Technique-Decision Tree For Classification PDF
58 pages
DECISION TREES-jb
No ratings yet
DECISION TREES-jb
8 pages
Naïve Bayes-DecisionTrees-RandomForest-SVM
No ratings yet
Naïve Bayes-DecisionTrees-RandomForest-SVM
26 pages
1.decision Trees Concepts
No ratings yet
1.decision Trees Concepts
70 pages
2.decision Tree
No ratings yet
2.decision Tree
56 pages
Decision Trees For Classification - A Machine Learning Algorithm - Xoriant Blog
No ratings yet
Decision Trees For Classification - A Machine Learning Algorithm - Xoriant Blog
17 pages
IS4834 Week 8
No ratings yet
IS4834 Week 8
42 pages
Lecture2 DT
No ratings yet
Lecture2 DT
75 pages
Classification
No ratings yet
Classification
148 pages
Unit 4a Decision Tree
No ratings yet
Unit 4a Decision Tree
90 pages
Classification With Decision Trees: Instructor: Qiang Yang
100% (1)
Classification With Decision Trees: Instructor: Qiang Yang
62 pages
L5 - Decision Tree - B
No ratings yet
L5 - Decision Tree - B
51 pages
FALLSEM2024-25 BCSE209L TH VL2024250101598 2024-08-05 Reference-Material-I
No ratings yet
FALLSEM2024-25 BCSE209L TH VL2024250101598 2024-08-05 Reference-Material-I
31 pages
Chapter 4classification and Prediction
No ratings yet
Chapter 4classification and Prediction
19 pages
Decision Trees_ a Complete Introduction With Examples _ by Shubham Koli _ Medium
No ratings yet
Decision Trees_ a Complete Introduction With Examples _ by Shubham Koli _ Medium
22 pages
DM-Lecture Decision Trees (A)
No ratings yet
DM-Lecture Decision Trees (A)
161 pages
Decision Tree
No ratings yet
Decision Tree
31 pages
Learning Decision Trees
No ratings yet
Learning Decision Trees
13 pages
Decision Tree & Random Forest
No ratings yet
Decision Tree & Random Forest
28 pages
Module 3 Chap 3 Decision Tree Learning
No ratings yet
Module 3 Chap 3 Decision Tree Learning
79 pages
Geometric Intuition of Decision Tree: Axis Parallel Hyperplanes
No ratings yet
Geometric Intuition of Decision Tree: Axis Parallel Hyperplanes
7 pages
Lecture 7.1 - Decision Tree Classification
No ratings yet
Lecture 7.1 - Decision Tree Classification
15 pages
DM UNIT III (1)
No ratings yet
DM UNIT III (1)
87 pages
Trinh Khanh Ly 20213676
No ratings yet
Trinh Khanh Ly 20213676
13 pages
2.3 Decision-Tree-Algorithm
No ratings yet
2.3 Decision-Tree-Algorithm
61 pages
Chapter 3
No ratings yet
Chapter 3
88 pages
Dec Tree
No ratings yet
Dec Tree
17 pages
07 - ML - Decision Tree
No ratings yet
07 - ML - Decision Tree
37 pages
Decision Tree Algorithm, Explained-1-22
No ratings yet
Decision Tree Algorithm, Explained-1-22
22 pages
16-Decision Tree Classification Algorithm Advantages With Examples (Iterative Dichotomiser 3-ID3) - 22-03-2024
No ratings yet
16-Decision Tree Classification Algorithm Advantages With Examples (Iterative Dichotomiser 3-ID3) - 22-03-2024
83 pages
Data Mining Notes Unit 4
No ratings yet
Data Mining Notes Unit 4
30 pages
Entropy and Information Gain Explained
No ratings yet
Entropy and Information Gain Explained
10 pages
Decision Tree (Class 37-38) 169692509554958626652505a71d481
No ratings yet
Decision Tree (Class 37-38) 169692509554958626652505a71d481
45 pages
Nest Learning Thermostat
From Everand
Nest Learning Thermostat
Arthur Tech
No ratings yet
Algo 2
No ratings yet
Algo 2
9 pages
Heap Data Structure - Notes
No ratings yet
Heap Data Structure - Notes
4 pages
Decision Tree Classification Algorithm
No ratings yet
Decision Tree Classification Algorithm
10 pages
2nd Largest Element in A Binary Search Tree
No ratings yet
2nd Largest Element in A Binary Search Tree
7 pages
What Is A Tree: - Organization Charts - File Systems - Programming Environments
No ratings yet
What Is A Tree: - Organization Charts - File Systems - Programming Environments
59 pages
(Slides) 6 - Trees and Traversals
No ratings yet
(Slides) 6 - Trees and Traversals
15 pages
DSA Sheet
No ratings yet
DSA Sheet
5 pages
CH 11
No ratings yet
CH 11
37 pages
TAD Btree: Estructura de Datos II Página: 1
No ratings yet
TAD Btree: Estructura de Datos II Página: 1
4 pages
Heap Structure
No ratings yet
Heap Structure
28 pages
Lec 15 Heap
No ratings yet
Lec 15 Heap
84 pages
CSC 204
No ratings yet
CSC 204
22 pages
Chapter 8. Tree
No ratings yet
Chapter 8. Tree
104 pages
Graph Traversal: Text Depth-First Search Breadth-First Search
No ratings yet
Graph Traversal: Text Depth-First Search Breadth-First Search
41 pages
Unit 2 - Analysis Design of Algorithm - WWW - Rgpvnotes.in
No ratings yet
Unit 2 - Analysis Design of Algorithm - WWW - Rgpvnotes.in
11 pages
Or in Education - Prim's and Kruskal's Algorithm Answer Sheet
No ratings yet
Or in Education - Prim's and Kruskal's Algorithm Answer Sheet
5 pages
Expression Tree
No ratings yet
Expression Tree
4 pages
B Trees
No ratings yet
B Trees
31 pages
UNIT 5 Linked List, Trees and Graphs
0% (1)
UNIT 5 Linked List, Trees and Graphs
47 pages
CS301 QUIZ #1 2022 File by VU
No ratings yet
CS301 QUIZ #1 2022 File by VU
9 pages
Blind 75 LeetCode Questions - LeetCode Discuss
No ratings yet
Blind 75 LeetCode Questions - LeetCode Discuss
7 pages
The MX-Quadtree: Child XLB XUB YLB YUB
No ratings yet
The MX-Quadtree: Child XLB XUB YLB YUB
6 pages
Efficient Binary Trees
No ratings yet
Efficient Binary Trees
11 pages
Binarytree
No ratings yet
Binarytree
5 pages
Revision DSA
No ratings yet
Revision DSA
6 pages
Algorithm Gym - Data Structures - Codeforces
No ratings yet
Algorithm Gym - Data Structures - Codeforces
23 pages
D1, L5 Kruskal's and Prim's Algorithms
No ratings yet
D1, L5 Kruskal's and Prim's Algorithms
18 pages
Heaps, Heap Sort, and Priority Queues
No ratings yet
Heaps, Heap Sort, and Priority Queues
35 pages
Decision Trees
No ratings yet
Decision Trees
25 pages