classification basic concept.data mining

The document provides an overview of data mining, focusing on machine learning and classification techniques. It explains the types of learning, including supervised and unsupervised learning, and details the classification process, its applications, and various techniques used. Additionally, it discusses the advantages and disadvantages of classification in data mining.

Uploaded by

ssri62439

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views

classification basic concept.data mining

Uploaded by

ssri62439

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 20

CLASSIFICATION

BASIC CONCEPTS
IN
DATA MINING
PRESENTED BY
JAYASRI.A
VAISHNAVI.E
FELIX RAJ.A
BALACHANDAR.R
Machine learning

 Machine learning is a subfield of artificial intelligence, which is broadly

defined as the capability of a machine to imitate intelligent human
behavior. Artificial intelligence systems are used to perform complex
tasks in a way that is similar to how humans solve problems.
 TYPES OF LEARNING:
 SUPERVISED LEARNING

 UNSUPERVISED LEARNING
Supervised learning

o It is defined by its use of labeled datasets to train algorithms that to

classify data or predict outcomes accurately.
o Supervised learning uses a training set to teach models to yield the
desired output.
o This training dataset includes inputs and correct outputs, which allow
the model to learn over time.
o The algorithm measures its accuracy through the loss function, adjusting
until the error has been sufficiently minimized.
Unsupervised learning

o Unsupervised learning, also known as unsupervised machine

learning, uses machine learning algorithms to analyze and cluster
unlabeled datasets.
o These algorithms discover hidden patterns or data groupings without
the need for human intervention.
o Its ability to discover similarities and differences in information make it
the ideal solution for exploratory data analysis, cross-selling strategies,
customer segmentation, and image recognition.
Types of supervised learning

 Classification
 Classification is a technique that aims to
reproduce class assignments. It can predict the
response value and the data is separated into
“classes”.
 Regression
 Regression is related to continuous data (value
functions). The predicted output values are real
numbers. It deals with problems such as predicting
the price of a house or the trend in the stock price at
a given time, etc.
Data analytics

 Data analytics focuses on processing and performing statistical analysis

of existing datasets. Analysts concentrate on creating methods to
capture, process, and organize data to uncover actionable insights for
current problems, and establishing the best way to present this data.
 TYPES:
 Descriptive
 Diagnostic
 Predictive
 Prescriptive.
CLASSIFICATION

 Classification is a data mining function that assigns items in a collection

to target categories or classes. The goal of classification is to accurately
predict the target class for each case in the data.
 Classification is a widely used technique in data mining and is applied in
a variety of domains, such as email filtering, sentiment analysis, and
medical diagnosis.
 Classification is the problem of identifying to which of a set of categories
(subpopulations), a new observation belongs to, on the basis of a
training set of data containing observations and whose categories
membership is known.
Examples of Classification

 Classifying credit card transactions

as legitimate or fraudulent

 Classifying secondary structures of protein

as alpha-helix, beta-sheet, or random
coil
 Categorizing news stories as finance,
weather, entertainment, sports, etc
Example :
Raw mango & ripen mango
Feature-1: Weight

Weight
Feature-2: Weight and color
color weight lable
[22-140-10] 300 Raw
[10-240-10] 330 Ripen
x
[12-235-10] 310 Ripen
[250-130- 307 Raw
10]
[80-220-20] 333 Ripen
Color

y
Weight
RAW DATASET CLASSIFICATION

Raw dataset

Features/Label

Learning Algorithm

Model
Classification Techniques

 Decision Tree Based Method

 Rule- based method
 Memory-base reasoning
 Neural networks
 Navive Bayes and Bayesian Networks
 Support Vector Machines
 Linear Regression
Steps of classification

1. Learning Step (Training Phase): Construction of Classification Model

Different Algorithms are used to build a classifier by making the model
learn using the training set available. The model has to be trained for
the prediction of accurate results.
2. Classification Step: Model used to predict class labels and testing the
constructed model on test data and hence estimate the accuracy of the
classification rules.
Training and Testing

 The goal is to produce a trained (fitted) model that generalizes well to

new, unknown data. The fitted model is evaluated using “new”
examples from the held-out datasets (validation and test datasets) to
estimate the model's accuracy in classifying new data.
 Example:
 Suppose there is a person who is sitting under a fan and the fan starts
falling on him, he should get aside in order not to get hurt. So, this is his
training part to move away. While Testing if the person sees any heavy
object coming towards him or falling on him and moves aside then the
system is tested positively and if the person does not move aside then
the system is negatively tested.
ADVANTAGES

• Mining Based Methods are cost-effective and efficient

• Helps in identifying criminal suspects
• Helps in predicting the risk of diseases
• Helps Banks and Financial Institutions to identify defaulters so
that they may approve Cards, Loan, etc.
DISADVANTAGES

 Privacy: When the data is either are chances that a company may give
some information about their customers to other vendors or use this
information for their profit.
 Accuracy Problem: Selection of Accurate model must be there in order
to get the best accuracy and result.
THANK YOU

Component Diagnostics: E1 - Engine Control Unit: ECU Pin and Plug
No ratings yet
Component Diagnostics: E1 - Engine Control Unit: ECU Pin and Plug
9 pages
Unit Iii Classification
No ratings yet
Unit Iii Classification
57 pages
Classification in Data Mining 12
No ratings yet
Classification in Data Mining 12
7 pages
Classification in Data Mining
No ratings yet
Classification in Data Mining
14 pages
Classification: Unit-III
No ratings yet
Classification: Unit-III
90 pages
Classification
No ratings yet
Classification
15 pages
4 - Data Analytics Using DM and ML Algorithms - 1
No ratings yet
4 - Data Analytics Using DM and ML Algorithms - 1
71 pages
Classification
No ratings yet
Classification
50 pages
DM Chapter 4
No ratings yet
DM Chapter 4
47 pages
Unit 3 (DWDM)
No ratings yet
Unit 3 (DWDM)
23 pages
Data Mining 4th Is
No ratings yet
Data Mining 4th Is
24 pages
3 DM Classification
No ratings yet
3 DM Classification
55 pages
Chapter 4 Classification
No ratings yet
Chapter 4 Classification
78 pages
DATA MINING JNTUH CSE R18
No ratings yet
DATA MINING JNTUH CSE R18
20 pages
Big Data Analytics - Unit 3
No ratings yet
Big Data Analytics - Unit 3
55 pages
08 - Classification - Decision Trees
No ratings yet
08 - Classification - Decision Trees
116 pages
ABP DWDM UNIT 4 Classification 1
No ratings yet
ABP DWDM UNIT 4 Classification 1
51 pages
Lecture 9
No ratings yet
Lecture 9
27 pages
Week 4 Part 1 Classification
No ratings yet
Week 4 Part 1 Classification
71 pages
Unit-4 AML (1. Basics and K-NN)
No ratings yet
Unit-4 AML (1. Basics and K-NN)
25 pages
DM Ch6 (Classification and Prediction)
No ratings yet
DM Ch6 (Classification and Prediction)
39 pages
Data Mining: Concepts and Techniques: - Chapter 6
No ratings yet
Data Mining: Concepts and Techniques: - Chapter 6
115 pages
Chapter 5. Classification and Prediction
No ratings yet
Chapter 5. Classification and Prediction
122 pages
Basic Concept of Classification (Data Mining)
No ratings yet
Basic Concept of Classification (Data Mining)
11 pages
18mca52c U3
No ratings yet
18mca52c U3
8 pages
202396123846584_26076Classification - Data Mining
No ratings yet
202396123846584_26076Classification - Data Mining
4 pages
Classification (Part II)
No ratings yet
Classification (Part II)
162 pages
Classification & Prediction
No ratings yet
Classification & Prediction
19 pages
Data Mining
No ratings yet
Data Mining
73 pages
Module 3_classification
No ratings yet
Module 3_classification
9 pages
Classification and Prediction
No ratings yet
Classification and Prediction
126 pages
Down 4
No ratings yet
Down 4
83 pages
Classification and Prediction Lecture-22,23,24,25,26,27, 28: Dr. Sudhir Sharma Manipal University Jaipur
No ratings yet
Classification and Prediction Lecture-22,23,24,25,26,27, 28: Dr. Sudhir Sharma Manipal University Jaipur
43 pages
Bia Unit-3 Part-2
No ratings yet
Bia Unit-3 Part-2
43 pages
08 Class Basic
No ratings yet
08 Class Basic
141 pages
Lect 1
No ratings yet
Lect 1
38 pages
Classification Analysis
No ratings yet
Classification Analysis
4 pages
Unit4-Classification and Prediction
No ratings yet
Unit4-Classification and Prediction
129 pages
Fundamentals of Data Science Unit 4
100% (1)
Fundamentals of Data Science Unit 4
31 pages
DM Classification 1 3
No ratings yet
DM Classification 1 3
19 pages
Unit 3
No ratings yet
Unit 3
33 pages
Classification and Prediction
No ratings yet
Classification and Prediction
40 pages
overview_basics
No ratings yet
overview_basics
16 pages
Data Mining: Concepts and Techniques
100% (2)
Data Mining: Concepts and Techniques
139 pages
DM_UNIT-1_FUNDAMENTALS OF DATA MINING (1)
No ratings yet
DM_UNIT-1_FUNDAMENTALS OF DATA MINING (1)
43 pages
DATA MINING MODULE 3
No ratings yet
DATA MINING MODULE 3
27 pages
Lecture 12
No ratings yet
Lecture 12
31 pages
Data Mining Classification and Prediction
No ratings yet
Data Mining Classification and Prediction
17 pages
Class Basic
No ratings yet
Class Basic
67 pages
DM Unit-3
No ratings yet
DM Unit-3
46 pages
DAMI 011114a
No ratings yet
DAMI 011114a
48 pages
Classifiers (Support Vector Machines, Decision Trees, Nearest Neighbor Classification)
No ratings yet
Classifiers (Support Vector Machines, Decision Trees, Nearest Neighbor Classification)
16 pages
Unit IV
No ratings yet
Unit IV
43 pages
Data Mining-Unit-3
No ratings yet
Data Mining-Unit-3
16 pages
Classification Unit3
No ratings yet
Classification Unit3
15 pages
Data Mining: Concepts and Techniques: - Chapter 6
No ratings yet
Data Mining: Concepts and Techniques: - Chapter 6
129 pages
Data Mining With Clustering AND Classification
No ratings yet
Data Mining With Clustering AND Classification
16 pages
DWM Unit 3 Final Notes
No ratings yet
DWM Unit 3 Final Notes
47 pages
Classification & Prediction
No ratings yet
Classification & Prediction
24 pages
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet
The Secret Of Machine Learning
From Everand
The Secret Of Machine Learning
Mhd Arjunanta
No ratings yet
Shyu 2011
No ratings yet
Shyu 2011
18 pages
US9226410 PCB MultiLayer
No ratings yet
US9226410 PCB MultiLayer
15 pages
Composite Fabrication by Filament Winding
No ratings yet
Composite Fabrication by Filament Winding
26 pages
Blavatsky's Diagram of Meditation and The Process of Spiritual Transformation
No ratings yet
Blavatsky's Diagram of Meditation and The Process of Spiritual Transformation
7 pages
Science Experiment1
No ratings yet
Science Experiment1
4 pages
10 - 1 - Republic V Orfinada
No ratings yet
10 - 1 - Republic V Orfinada
8 pages
USCP Graded Recitation
No ratings yet
USCP Graded Recitation
4 pages
Output Log
No ratings yet
Output Log
390 pages
Aesthetic Experience: and Literary Hermeneutics
No ratings yet
Aesthetic Experience: and Literary Hermeneutics
389 pages
As 5
No ratings yet
As 5
28 pages
Industrial Engineering Mec 422 2 Unit Course Note WK1-3
No ratings yet
Industrial Engineering Mec 422 2 Unit Course Note WK1-3
8 pages
DATATHON PROGRAMMING COMPETITION 2022 Rules and Regulations 1
No ratings yet
DATATHON PROGRAMMING COMPETITION 2022 Rules and Regulations 1
6 pages
IOM Presnetation GCT, Mianwali 30-01-2019 2
100% (2)
IOM Presnetation GCT, Mianwali 30-01-2019 2
42 pages
Group-6 a338 Accresm
No ratings yet
Group-6 a338 Accresm
9 pages
222 Ways To Avoid Very
100% (1)
222 Ways To Avoid Very
21 pages
Bennett Enterprises Limited Profile
No ratings yet
Bennett Enterprises Limited Profile
15 pages
K CROSS & ISLM Presentation
No ratings yet
K CROSS & ISLM Presentation
43 pages
Sample Exercises Biometric
No ratings yet
Sample Exercises Biometric
3 pages
Assignment Question PE & PLC
No ratings yet
Assignment Question PE & PLC
5 pages
Tendernotice 1
No ratings yet
Tendernotice 1
8 pages
CS 1000/CS 3000 HIS Operation: IM 33S02C10-01E 13th Edition
No ratings yet
CS 1000/CS 3000 HIS Operation: IM 33S02C10-01E 13th Edition
84 pages
Introduction To Statistical Learning
No ratings yet
Introduction To Statistical Learning
16 pages
Pritam7th Sem
No ratings yet
Pritam7th Sem
18 pages
Unit-2 Self Management Skills
No ratings yet
Unit-2 Self Management Skills
7 pages
Matter & Its Various States: of Solids
No ratings yet
Matter & Its Various States: of Solids
37 pages
Metal Forming Lecture 3 Stress Strain Analyses
No ratings yet
Metal Forming Lecture 3 Stress Strain Analyses
30 pages
Military Civil Engineering
100% (1)
Military Civil Engineering
8 pages
AP Biology Final Exam Review
No ratings yet
AP Biology Final Exam Review
1 page
Theories of Corporate Personality
No ratings yet
Theories of Corporate Personality
4 pages