
Basic Decision Tree Learning Algorithms
A Comprehensive Overview
Er. Zubair
Overview of ID3 Algorithm
• Some common Decision Tree Learning algorithms are:
 ID3 (Iterative Dichotomiser 3)
 C4.5
 CART (Classification and Regression Trees)

1. ID3 (Iterative Dichotomiser 3)
• ID3 is an early and well-known decision tree algorithm for classification problems.
• The algorithm splits the dataset recursively based on features, ultimately creating a model for classification.
• Key feature: The algorithm uses Information Gain to decide which feature to split on at each node.
Understanding Information Gain
• Information Gain (IG) measures how well a feature separates the dataset into distinct classes.
• Information Gain is based on Entropy, which measures the uncertainty or impurity of a dataset.
• The formula for Entropy: Entropy(S) = -∑ p(i) log2 p(i)
 where p(i) is the proportion of class i in the dataset.
• Information Gain is the difference in entropy before and after a split based on a feature; a small code sketch follows.
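A minimal Python sketch of the Entropy formula above; the example labels (9 positive, 5 negative) are illustrative, not taken from the slides:

import math
from collections import Counter

def entropy(labels):
    # Entropy(S) = -sum over classes i of p(i) * log2(p(i))
    total = len(labels)
    return -sum((count / total) * math.log2(count / total)
                for count in Counter(labels).values())

# Illustrative: 9 positive and 5 negative examples.
print(round(entropy(["Yes"] * 9 + ["No"] * 5), 3))  # 0.94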
How ID3 Uses Information Gain
• At each node, ID3 calculates the Information Gain for each feature.
• The feature with the highest Information Gain is chosen to split the dataset.
• By doing this, ID3 aims to reduce uncertainty and create purer nodes at each step.
• Example: splitting the data based on Outlook when classifying whether to play tennis; a sketch of this selection step follows.
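A sketch of this selection step in Python, reusing the entropy function above; the function and parameter names are illustrative choices, not from the slides:

def information_gain(rows, labels, attribute):
    # IG = Entropy(S) minus the weighted entropy of the subsets
    # produced by splitting on the attribute.
    total = len(labels)
    after_split = 0.0
    for value in set(row[attribute] for row in rows):
        subset = [lab for row, lab in zip(rows, labels) if row[attribute] == value]
        after_split += (len(subset) / total) * entropy(subset)
    return entropy(labels) - after_split

def best_feature(rows, labels, attributes):
    # ID3 picks the attribute with the highest Information Gain.
    return max(attributes, key=lambda a: information_gain(rows, labels, a))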
Advantages of ID3
• Simple to understand: Based on a clear metric, Information Gain.
• Efficient: Computationally efficient for small to medium datasets.
• Transparent: The tree structure is easy to visualize and interpret.
Limitations of ID3
• Overfitting: Tends to create deep trees that may overfit the training data.
• Bias towards features with many categories: May favor features with many values.
• No support for continuous data: Continuous attributes must be discretized or otherwise transformed.
Improvement: C4.5
C4.5 is an extension of ID3 that addresses some limitations:
• Handles both continuous and categorical features.
• Uses Gain Ratio instead of Information Gain to reduce bias towards features with many categories.
• Includes pruning to remove unnecessary branches and reduce overfitting.
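For reference, Gain Ratio divides Information Gain by the attribute's "Split Info" (the entropy of the attribute's own value distribution), which penalizes many-valued features. A minimal sketch reusing the helpers above; the names are illustrative:

def split_info(rows, attribute):
    # Entropy of the attribute's value distribution (intrinsic information).
    total = len(rows)
    return -sum((count / total) * math.log2(count / total)
                for count in Counter(row[attribute] for row in rows).values())

def gain_ratio(rows, labels, attribute):
    si = split_info(rows, attribute)
    return information_gain(rows, labels, attribute) / si if si > 0 else 0.0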
Example of ID3
• Given a dataset of weather conditions, ID3 calculates the Information Gain for each feature.
• Entropy of the whole dataset (using the formula): Entropy(S) = -p+ log2 p+ - p- log2 p-
 where p+ and p- are the proportions of the positive and negative classes, respectively.
• Step 1: Entropy of the Entire Dataset
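Assuming the example follows the classic 14-day play dataset (9 positive and 5 negative examples, consistent with the day numbers in the tables below), this works out to:
Entropy(S) = -(9/14) log2(9/14) - (5/14) log2(5/14) ≈ 0.940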
Example of ID3
[The step-by-step Information Gain calculations for each feature appeared here as slide images.]
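For the classic dataset assumed above, those calculations come out to approximately Gain(Weather) ≈ 0.246, Gain(Humidity) ≈ 0.151, Gain(Wind) ≈ 0.048, and Gain(Temperature) ≈ 0.029.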
• With these values, you can build the decision tree: Weather provides the highest Information Gain, so it is used as the root node, followed by other attributes according to their Information Gain values.
• The possible values for Weather are Sunny, Cloudy, and Rain. We split the dataset on these values and compute the Information Gain of the remaining attributes (Temperature, Humidity, and Wind) within each subset; the sketch below shows this recursion.
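This recursive procedure is the whole of ID3. A minimal sketch in Python, reusing entropy and information_gain from the earlier snippets; the dictionary-based tree representation is an illustrative choice, not from the slides:

def id3(rows, labels, attributes):
    # Pure node, or no attributes left: return the majority class as a leaf.
    if len(set(labels)) == 1 or not attributes:
        return Counter(labels).most_common(1)[0][0]
    # Split on the attribute with the highest Information Gain.
    best = max(attributes, key=lambda a: information_gain(rows, labels, a))
    tree = {best: {}}
    for value in set(row[best] for row in rows):
        sub_rows = [r for r, lab in zip(rows, labels) if r[best] == value]
        sub_labels = [lab for r, lab in zip(rows, labels) if r[best] == value]
        remaining = [a for a in attributes if a != best]
        tree[best][value] = id3(sub_rows, sub_labels, remaining)
    return tree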
Example of ID3

Day     Weather  Temperature  Humidity  Wind    Play Football?
Day 1   Sunny    Hot          High      Weak    No
Day 2   Sunny    Hot          High      Strong  No
Day 8   Sunny    Mild         High      Weak    Yes
Day 9   Rain     Cool         Normal    Weak    Yes
Day 11  Sunny    Mild         Normal    Strong  Yes

Example of ID3

Day     Weather  Temperature  Humidity  Wind    Play Football?
Day 4   Rain     Mild         High      Weak    Yes
Day 5   Rain     Cool         Normal    Weak    Yes
Day 6   Rain     Cool         Normal    Strong  Yes
Day 9   Rain     Cool         Normal    Weak    Yes
Day 10  Rain     Mild         Normal    Weak    Yes
Day 14  Rain     Hot          High      Strong  No
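As a toy usage of the id3 sketch above, rows from these tables can be encoded as dictionaries (the values are copied from the table; the encoding itself is illustrative):

rows = [
    {"Weather": "Rain", "Temperature": "Mild", "Humidity": "High",   "Wind": "Weak"},
    {"Weather": "Rain", "Temperature": "Cool", "Humidity": "Normal", "Wind": "Weak"},
    {"Weather": "Rain", "Temperature": "Hot",  "Humidity": "High",   "Wind": "Strong"},
]
labels = ["Yes", "Yes", "No"]
tree = id3(rows, labels, ["Weather", "Temperature", "Humidity", "Wind"])
print(tree)  # nested dicts with "Yes"/"No" leaves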
Conclusion
• ID3 is a simple and powerful decision tree algorithm that uses Information Gain for decision-making.
• While effective for many problems, improvements like C4.5 address its limitations, such as handling continuous data and reducing bias.
• ID3 and C4.5 laid the foundation for modern decision tree algorithms.
