Decision Tree Example
Sudeshna Sarkar
IIT Kharagpur
Top-Down Induction of Decision Trees ID3
Principled Criterion
• Selection of an attribute to test at each node:
choose the attribute that is most useful for classifying
the examples.
• information gain
– measures how well a given attribute separates the training
examples according to their target classification
– This measure is used to select among the candidate
attributes at each step while growing the tree
– Gain is a measure of how much we can reduce
uncertainty (its value lies between 0 and 1)
Entropy
• A measure for
– uncertainty
– purity
– information content
• Information theory: an optimal-length code assigns (-log2 p) bits to a
message having probability p
• S is a sample of training examples
– p+ is the proportion of positive examples in S
– p- is the proportion of negative examples in S
• Entropy of S: the average optimal number of bits needed to encode the
class (positive or negative) of an example drawn from S
Entropy(S) = -p+ log2(p+) - p- log2(p-)
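A minimal Python sketch of this computation, assuming a two-class sample described by its positive/negative counts (the function name `entropy` and its arguments are illustrative, not code from the lecture):

```python
import math

def entropy(pos, neg):
    """Entropy of a sample with `pos` positive and `neg` negative examples,
    using the convention that 0 * log2(0) = 0."""
    total = pos + neg
    return -sum(c / total * math.log2(c / total) for c in (pos, neg) if c)

# The full training sample S = [9+, 5-] used on the following slides:
print(f"{entropy(9, 5):.3f}")   # 0.940
```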
Entropy
[Figure: candidate split of S on Outlook, with leaves labelled No and Yes]
Gain(S,Outlook) = 0.940 - (5/14)*0.971 - (4/14)*0.0 - (5/14)*0.971
= 0.247
Selecting the Next Attribute
The information gain values for the 4 attributes are:
• Gain(S,Outlook) =0.247
• Gain(S,Humidity) =0.151
• Gain(S,Wind) =0.048
• Gain(S,Temperature) =0.029
Note: 0 log2 0 = 0
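These gains can be reproduced from the per-value class counts implied by the fractions and entropies above. The sketch below is self-contained; the Temperature counts (Hot, Mild, Cool) are the standard PlayTennis figures and are not stated on the slide, so they are an assumption here:

```python
import math

def entropy(pos, neg):
    """Two-class entropy, with 0 * log2(0) taken as 0 (as in the earlier sketch)."""
    total = pos + neg
    return -sum(c / total * math.log2(c / total) for c in (pos, neg) if c)

def information_gain(parent, splits):
    """Gain(S,A) = Entropy(S) - sum_v |S_v|/|S| * Entropy(S_v), counts as (pos, neg)."""
    total = sum(parent)
    return entropy(*parent) - sum(
        (p + n) / total * entropy(p, n) for p, n in splits)

S = (9, 5)   # the 14 training examples: [9+, 5-]

# Per-value class counts. Outlook/Humidity/Wind follow from the fractions and
# entropies on the slides; Temperature's are the standard PlayTennis counts,
# which the slide does not list -- an assumption for illustration.
attributes = {
    "Outlook":     [(2, 3), (4, 0), (3, 2)],   # Sunny, Overcast, Rain
    "Humidity":    [(3, 4), (6, 1)],           # High, Normal
    "Wind":        [(6, 2), (3, 3)],           # Weak, Strong
    "Temperature": [(2, 2), (4, 2), (3, 1)],   # Hot, Mild, Cool
}
for name, splits in attributes.items():
    print(f"Gain(S,{name}) = {information_gain(S, splits):.3f}")
# Prints 0.247, 0.152, 0.048, 0.029 -- Humidity shows 0.152 here because the
# slide rounds the subset entropies (0.985, 0.592) before combining them.
```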
ID3 Algorithm
[Figure: partially grown tree over S = [D1,D2,…,D14] = [9+,5-], with Outlook at the root; the Overcast branch is already a Yes leaf, and the remaining branches still need a test at this node]
[Figure: the candidate splits of S on Humidity and on Wind]
Gain(S,Humidity) = 0.940 - (7/14)*0.985 - (7/14)*0.592 = 0.151
Gain(S,Wind) = 0.940 - (8/14)*0.811 - (6/14)*1.0 = 0.048
Humidity provides greater information gain than Wind w.r.t. the target classification.
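The full top-down loop can be sketched as a short recursion. The representation below (examples as dicts, a `PlayTennis` target, the function names) is assumed for illustration, not the lecture's code:

```python
import math
from collections import Counter

def set_entropy(examples, target):
    """Entropy of the target-label distribution in `examples`."""
    counts = Counter(ex[target] for ex in examples).values()
    total = sum(counts)
    return -sum(c / total * math.log2(c / total) for c in counts)

def id3(examples, attributes, target="PlayTennis"):
    """Grow a decision tree top-down, picking the highest-gain attribute at each node.

    `examples` is a list of dicts mapping attribute names (and `target`) to values.
    Returns a class label (leaf) or a nested dict {attribute: {value: subtree}}.
    """
    if not examples:
        raise ValueError("id3 needs at least one training example")
    labels = [ex[target] for ex in examples]
    if len(set(labels)) == 1:                       # pure node -> leaf
        return labels[0]
    if not attributes:                              # nothing left to test -> majority label
        return Counter(labels).most_common(1)[0][0]

    def gain(attr):                                 # information gain of splitting on attr
        g = set_entropy(examples, target)
        for value in {ex[attr] for ex in examples}:
            subset = [ex for ex in examples if ex[attr] == value]
            g -= len(subset) / len(examples) * set_entropy(subset, target)
        return g

    best = max(attributes, key=gain)                # e.g. Outlook at the root
    remaining = [a for a in attributes if a != best]
    return {best: {value: id3([ex for ex in examples if ex[best] == value],
                              remaining, target)
                   for value in {ex[best] for ex in examples}}}
```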
Splitting Rule: GINI Index
• GINI Index
– Measure of node impurity
GINI(N) = 1 - Σ_j p(j|N)^2, summed over the c classes j at node N
GINI_split(A) = Σ_{v ∈ Values(A)} (|S_v| / |S|) * GINI(N_v)
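A small sketch of both formulas, assuming each node is summarised by its per-class counts (function names are illustrative):

```python
def gini(counts):
    """GINI(N) = 1 - sum_j p(j|N)^2 over the classes present at node N."""
    total = sum(counts)
    return 1.0 - sum((c / total) ** 2 for c in counts)

def gini_split(splits):
    """GINI_split(A) = sum_v |S_v|/|S| * GINI(N_v), given per-value class counts."""
    total = sum(sum(counts) for counts in splits)
    return sum(sum(counts) / total * gini(counts) for counts in splits)

# Outlook split from the running example: Sunny [2+,3-], Overcast [4+,0-], Rain [3+,2-]
print(f"{gini_split([(2, 3), (4, 0), (3, 2)]):.3f}")   # 0.343
```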
Splitting Based on Continuous Attributes
[Figure: a binary split "Taxable Income > 80K?" with Yes/No branches, and a multi-way split "Taxable Income?" into ranges such as < 10K, …, > 80K]
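One common way to realise the binary split is to scan candidate thresholds (midpoints between consecutive sorted values) and keep the one with the highest information gain. The sketch below assumes 1/0 (Yes/No) labels and illustrative income figures; names and data are not from the lecture:

```python
import math

def _entropy(pos, neg):
    """Two-class entropy with 0 * log2(0) = 0."""
    total = pos + neg
    return -sum(c / total * math.log2(c / total) for c in (pos, neg) if c)

def best_threshold(values, labels):
    """Scan midpoints between sorted attribute values (e.g. Taxable Income) and
    return the binary split `value > t` with the highest information gain."""
    pos = sum(labels)                        # labels are 1 (Yes) / 0 (No)
    parent = _entropy(pos, len(labels) - pos)

    best_t, best_gain = None, -1.0
    ordered = sorted(set(values))
    for lo, hi in zip(ordered, ordered[1:]):
        t = (lo + hi) / 2                    # candidate threshold between two values
        gain = parent
        for side in (True, False):           # the "> t" branch and the "<= t" branch
            branch = [y for v, y in zip(values, labels) if (v > t) == side]
            p = sum(branch)
            gain -= len(branch) / len(labels) * _entropy(p, len(branch) - p)
        if gain > best_gain:
            best_t, best_gain = t, gain
    return best_t, best_gain

# Toy usage (illustrative incomes in thousands, 1 = Yes, 0 = No):
print(best_threshold([60, 70, 75, 85, 90, 95, 120], [0, 0, 0, 1, 1, 1, 0]))
# -> (80.0, ~0.52), i.e. the "> 80K?" split
```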
• Missing Values
• Costs of Classification
Hypothesis Space Search in Decision Trees
• ID3 conducts a search through the space of decision trees, a hypothesis
space that can represent any discrete-valued function of the instance attributes.
Bias and Occam’s Razor
Prefer short hypotheses.
Argument in favor:
– Fewer short hypotheses than long hypotheses
– A short hypothesis that fits the data is unlikely to
be a coincidence
– A long hypothesis that fits the data might be a
coincidence