Decision Trees Iterative Dichotomiser 3 (ID3) For Classification: An ML Algorithm
INTRODUCTION
Decision trees are a type of Supervised Machine Learning (that is, the training data contains both the inputs and their corresponding outputs) in which the data is repeatedly split according to a certain parameter.
A (Decision) Tree
The tree can be explained by two entities, namely decision nodes and leaves.
An example of a decision tree can be illustrated with a simple binary tree. Let's say you want to
predict whether a person is fit given information such as their age, eating habits, and physical activity.
The decision nodes are the questions asked about these features (for example, "Is the person younger
than 30?", "Do they exercise?"), and the leaves are the outcomes: either 'fit' or 'unfit'.
In this case, it is a binary classification problem (a yes/no type of problem).
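To make the structure concrete, here is one plausible shape of such a tree written as nested conditionals in Python. The specific questions (the age threshold, the eating and exercise checks) are illustrative assumptions, not a model learned from data; each `if` plays the role of a decision node and each `return` is a leaf.

```python
def predict_fitness(age, eats_a_lot_of_junk_food, exercises_regularly):
    """A toy decision tree: every `if` is a decision node, every return a leaf."""
    if age < 30:
        if eats_a_lot_of_junk_food:
            return "unfit"
        return "fit"
    if exercises_regularly:
        return "fit"
    return "unfit"

print(predict_fitness(25, eats_a_lot_of_junk_food=False, exercises_regularly=True))  # -> fit
```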
DEFINITIONS
Before discussing the ID3 algorithm, we’ll go through a few definitions.
Entropy
Entropy, also called Shannon entropy and denoted by H(S) for a finite set S, is a measure of the amount
of uncertainty or randomness in the data. It is defined as H(S) = − Σ_x p(x) · log₂ p(x), where p(x) is the
proportion of examples in S that belong to class x and the sum runs over the classes.
Example:
Consider a coin toss whose probability of heads is 0.5 and the probability of tails is 0.5. Here the
entropy is the highest possible since there’s no way of determining what the outcome might be.
Now consider a coin that has heads on both sides. The outcome of such a toss can be predicted
perfectly since we know beforehand that it will always be heads. In other words, this event has no
randomness, hence its entropy is zero.
In general, lower entropy values imply less uncertainty while higher values imply more uncertainty.
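As a quick sanity check, the two coin scenarios can be computed directly from the entropy formula; the helper below is a minimal sketch written for this example, not a library call.

```python
import math

def shannon_entropy(probabilities):
    """H = -sum(p * log2(p)), skipping outcomes with probability 0 or 1
    (their contribution to the sum is zero)."""
    return sum(-p * math.log2(p) for p in probabilities if 0 < p < 1)

print(shannon_entropy([0.5, 0.5]))  # fair coin       -> 1.0 (maximum uncertainty)
print(shannon_entropy([1.0, 0.0]))  # two-headed coin -> 0   (no uncertainty)
```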
Information Gain
Information gain, also referred to as Kullback-Leibler divergence and denoted by IG(S,A) for a set S, is the
effective change in entropy after deciding on a particular attribute A. It measures the relative change in
entropy with respect to the independent variables:

IG(S, A) = H(S) − H(S | A)

Alternatively,

IG(S, A) = H(S) − Σ_{x ∈ values(A)} P(x) · H(x)

where IG(S,A) is the information gain from applying feature A, H(S) is the entropy of the entire set, and
the second term is the entropy that remains after applying the feature A, with P(x) being the probability of
event x.
WORKING
Now that we know what a Decision Tree is, we'll see how it works internally. There are many algorithms
that construct Decision Trees, but one of the best known is ID3, which stands for
Iterative Dichotomiser 3.
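Before working through the example, here is a minimal, self-contained sketch of the idea in Python: compute the information gain of every remaining attribute, split on the best one, and recurse until a branch is pure or no attributes are left. The representation (rows as dicts of attribute → value, the tree as nested dicts) and all names are illustrative choices for this sketch, not part of any standard library.

```python
import math
from collections import Counter

def entropy_of_labels(labels):
    """H(S) = -sum p(x) * log2 p(x), where p(x) is the fraction of each class in S."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def information_gain(rows, labels, attr):
    """IG(S, A) = H(S) - sum over each value x of A of P(x) * H(S_x)."""
    n = len(labels)
    remainder = 0.0
    for value in {row[attr] for row in rows}:
        subset = [lab for row, lab in zip(rows, labels) if row[attr] == value]
        remainder += (len(subset) / n) * entropy_of_labels(subset)
    return entropy_of_labels(labels) - remainder

def id3(rows, labels, attributes):
    """Build a tree as nested dicts: {attribute: {value: subtree_or_leaf}}."""
    if len(set(labels)) == 1:          # every example has the same class -> leaf
        return labels[0]
    if not attributes:                 # no attributes left -> majority-class leaf
        return Counter(labels).most_common(1)[0][0]
    best = max(attributes, key=lambda a: information_gain(rows, labels, a))
    tree = {best: {}}
    for value in {row[best] for row in rows}:
        keep = [i for i, row in enumerate(rows) if row[best] == value]
        tree[best][value] = id3([rows[i] for i in keep],
                                [labels[i] for i in keep],
                                [a for a in attributes if a != best])
    return tree
```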
Example 1
Consider a piece of data collected over the course of 14 days, where the features are Outlook,
Temperature, Humidity, and Wind, and the outcome variable is whether Golf was played on the day.
Our job is to build a predictive model that takes in these four features and predicts whether Golf
will be played on the day. We'll build a decision tree to do that using the ID3 algorithm.
Remember that the entropy is 0 if all members belong to the same class, and 1 when half of them
belong to one class and the other half to the other, which is perfect randomness. Here it is
0.94, which means the distribution is fairly random.
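For reference, the 0.94 comes straight from the entropy formula: summing the per-branch counts given below, the 14 examples contain 9 'Yes' and 5 'No' outcomes for Play Golf, so

H(S) = −(9/14) log₂(9/14) − (5/14) log₂(5/14) ≈ 0.940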
The next step is to find the attribute that gives us the highest possible Information Gain; that
attribute becomes the root node.
We start with the attribute 'Wind':

IG(S, Wind) = H(S) − Σ_{x ∈ values(Wind)} P(x) · H(x)

where 'x' ranges over the possible values of the attribute. Here, the attribute 'Wind' takes two possible
values in the sample data, hence x = {Weak, Strong}.
Amongst all 14 examples, we have 8 where the wind is Weak and 6 where the wind is Strong.
Wind = Weak    Wind = Strong    Total
     8               6            14
Now, out of the 8 Weak examples, 6 were 'Yes' for Play Golf and 2 were 'No'. So we have

H(S_weak) = −(6/8) log₂(6/8) − (2/8) log₂(2/8) ≈ 0.811
Similarly, out of the 6 Strong examples, we have 3 where the outcome was 'Yes' for Play Golf
and 3 where it was 'No':

H(S_strong) = −(3/6) log₂(3/6) − (3/6) log₂(3/6) = 1.000

Remember, here half the items belong to one class while the other half belong to the other; hence we
have perfect randomness.
Now we have all the pieces required to calculate the Information Gain:

IG(S, Wind) = H(S) − P(Weak) · H(S_weak) − P(Strong) · H(S_strong)
            = 0.940 − (8/14)(0.811) − (6/14)(1.000)
            ≈ 0.048

So considering 'Wind' as the feature gives us an information gain of 0.048. We now calculate the
Information Gain for all the remaining features in the same way.
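As a quick numerical check, the calculation above can be reproduced in a few lines of Python; the helper `H` below is just a local convenience for two-class entropy, not a library function, and the counts are the ones quoted in the walkthrough.

```python
import math

def H(pos, neg):
    """Two-class entropy from raw counts: -sum (c/n) * log2(c/n)."""
    n = pos + neg
    return -sum((c / n) * math.log2(c / n) for c in (pos, neg) if c)

h_all    = H(9, 5)   # whole set: 9 'Yes', 5 'No'      -> ~0.940
h_weak   = H(6, 2)   # Wind = Weak: 6 'Yes', 2 'No'    -> ~0.811
h_strong = H(3, 3)   # Wind = Strong: 3 'Yes', 3 'No'  ->  1.0

ig_wind = h_all - (8 / 14) * h_weak - (6 / 14) * h_strong
print(round(ig_wind, 3))  # -> 0.048
```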
We can clearly see that IG(S,Outlook) has the highest information gain of 0.246, so we choose the
Outlook attribute as the root node. At this point, the decision tree has Outlook at its root, with one
branch for each of its three values: Sunny, Overcast, and Rain.
Here we observe that whenever the outlook is Overcast, Play Golf is always 'Yes'. This is no
coincidence: the tree turns out this simple precisely because the attribute Outlook gives the highest
information gain.
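In the nested-dict shape produced by the id3() sketch earlier, the partial tree at this stage would look roughly like this (the Sunny and Rain entries are placeholders still to be computed):

```python
partial_tree = {
    "Outlook": {
        "Overcast": "Yes",             # pure branch -> leaf
        "Sunny": "<subtree to grow>",  # computed recursively below
        "Rain": "<subtree to grow>",
    }
}
```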
Now how do we proceed from this point? We can simply apply recursion; you might want to look at
the algorithm steps described earlier.
Now that we've used Outlook, three attributes remain: Humidity, Temperature, and Wind. And Outlook
had three possible values: Sunny, Overcast, and Rain. The Overcast branch already ends in the leaf
'Yes', so we're left with two subtrees to compute: those for Sunny and Rain.
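A sketch of that recursive step, under the assumption that `rows` holds the 14 examples as dicts of attribute → value (the table itself is not reproduced here) and `labels` holds the corresponding Play Golf outcomes:

```python
def outlook_subset(rows, labels, value):
    """Keep only the examples whose Outlook equals `value` (e.g. 'Sunny' or 'Rain')."""
    keep = [i for i, row in enumerate(rows) if row["Outlook"] == value]
    return [rows[i] for i in keep], [labels[i] for i in keep]

# The Overcast branch is already a pure 'Yes' leaf; the Sunny and Rain subsets
# are fed back into id3() with the remaining attributes
# (Temperature, Humidity, Wind) to grow their own subtrees.
```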