Classification Using Decision Tree: CSE-454: Data Warehousing and Data Mining Sessional
Classification Using Decision Tree: CSE-454: Data Warehousing and Data Mining Sessional
Classification Using Decision Tree: CSE-454: Data Warehousing and Data Mining Sessional
Decision Tree
MD. JAKARIA
LECTURER
DEPT. OF CSE, MIST
Classification
Definition: In machine learning, classification refers to a
predictive modeling problem where a class label is predicted for a
given example of input data.
Examples of classification problems include:
• Given an example, classify if it is spam or not.
• Given a handwritten character, classify it as one of the known characters.
• Given recent user behavior, classify as churn or not.
There are perhaps four main types of classification tasks that you may
encounter; they are:
• Binary Classification
• Multi-Class Classification
• Multi-Label Classification
• Imbalanced Classification
𝐼𝑛𝑓𝑜𝑟𝑚𝑎𝑡𝑖𝑜𝑛
𝐺𝑎𝑖𝑛=𝐸𝑛𝑡𝑟𝑜 𝑝𝑦 ( 𝑝𝑎𝑟𝑒𝑛𝑡 ) – [ 𝐴 𝑣𝑒𝑟𝑎𝑔𝑒 𝐸 𝑛𝑡𝑟𝑜𝑝𝑦 (𝑐h𝑖𝑙𝑑𝑟𝑒𝑛)]
Entropy =
Entropy =
Entropy =
= 0.996 - 0.615 = 0.38
Split over whether Balance exceeds 50K Split over whether applicant is employed
Fig: The decision tree induced from the 12-example training set.