Introduction To Pattern Recognition
Reference book:
• “Pattern Classification” by Richard O. Duda,
Peter E. Hart and David G. Stork
Some Pattern Recognition tasks performed by humans in day-to-day life
• Recognize a face
• Understand spoken words
• Read handwritten characters
• Decide if an apple is ripe based on its smell
• Identify presence of car keys in our pocket by feel
What is Machine Learning?
Definition:
– Machine Learning is a field that strives to incorporate the learning ability into machines. [Machine Learning, Tom Mitchell]
– Machine Learning is a field that is concerned with building machines that can learn and improve automatically with experience.
Comment:
“We do not know yet how to make machines learn as well as humans do, but algorithms have been invented that are effective for certain types of machine-learning tasks.”
Some Pattern Recognition tasks we
would like machines to perform
• Speech Recognition
• Fingerprint Recognition
• Face Recognition
• Optical Character Recognition
• Object Identification/Classification
• DNA sequence identification
Classification vs Clustering
[Figures: example patterns grouped into Category “A” and Category “B”]
What is a Pattern?
• An object or event.
• Represented by a vector x of values corresponding to various features:
x = (x1, x2, ..., xn)
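For concreteness, a minimal sketch of a pattern as a feature vector (assuming Python/NumPy; the two fish features anticipate the sorting example later in these notes, and the numbers are made up):

```python
import numpy as np

# A pattern is just a vector of feature values: here a fish described
# by two hypothetical measurements.
x = np.array([4.2, 1.7])  # x1 = lightness, x2 = width (illustrative values)
n = x.shape[0]            # dimensionality of the pattern (n = 2)
```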
What is a Pattern? (cont’d)
• Loan/Credit card applications
– Income, # of dependents, mortgage amount → credit-worthiness classification
• Dating services
– Age, hobbies, income → “desirability” classification
• Web documents
– Key-word based descriptions (e.g., documents containing “football”, “NFL”) → document classification
What is a Class?
• A collection of “similar” objects.
Main Objectives
(1) Separate data belonging to different classes.
[Figure: gender classification example]
Main Approaches
x: input vector (pattern)
ω: class label (class)
• Generative
– Models the joint probability p(x, ω).
– Makes predictions by using Bayes’ rule to calculate P(ω|x).
– Picks the most likely class label ω.
• Discriminative
– Does not model p(x, ω).
– Estimates P(ω|x) by “learning” a direct mapping from x to ω (i.e., estimates the decision boundary).
– Picks the most likely class label ω.
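A minimal sketch of the generative route (assuming Python/SciPy, a single feature, and made-up Gaussian class-conditionals and priors; not a prescription from the book): Bayes’ rule turns p(x|ω) and P(ω) into the posterior P(ω|x).

```python
import numpy as np
from scipy.stats import norm

# Hypothetical 1-D generative model: Gaussian class-conditionals p(x|w)
# and prior class probabilities P(w) for two classes w1, w2.
priors = {"w1": 0.6, "w2": 0.4}
likelihoods = {"w1": norm(loc=0.0, scale=1.0), "w2": norm(loc=3.0, scale=1.0)}

def posterior(x):
    """P(w|x) = p(x|w) P(w) / p(x), where p(x) is the normalizer."""
    joint = {w: likelihoods[w].pdf(x) * priors[w] for w in priors}  # p(x, w)
    evidence = sum(joint.values())                                  # p(x)
    return {w: j / evidence for w, j in joint.items()}

post = posterior(1.2)
print(post, "->", max(post, key=post.get))  # pick the most likely class
```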
How do we model p(x, ω)?
• Typically, using a statistical model.
– probability density function (e.g., Gaussian)
[Figure: gender classification example with class-conditional densities for “male” and “female”]
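A minimal sketch of what “modeling p(x|ω) with a Gaussian” amounts to in practice (assuming Python/NumPy; the feature and sample values are synthetic and purely illustrative):

```python
import numpy as np

# Synthetic 1-D feature samples observed for one class w.
samples = np.array([22.0, 25.5, 19.8, 27.1, 23.4, 21.0])

# Maximum-likelihood Gaussian fit: sample mean and standard deviation.
mu, sigma = samples.mean(), samples.std()

def gaussian_pdf(x):
    return np.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * np.sqrt(2 * np.pi))

print(f"p(x=24 | w) ≈ {gaussian_pdf(24.0):.4f}")
```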
Data Variability
• Intra-class variability
• Inter-class variability
Handwriting Recognition
License Plate Recognition
Biometric Recognition
Face Detection/Recognition
[Figures: face detection, matching, and recognition examples]
Fingerprint Classification
Important step for speeding up identification
Autonomous Systems
Obstacle detection and avoidance
Object recognition
Medical Applications
Skin Cancer Detection Breast Cancer Detection
Land Cover Classification
(using aerial or satellite images)
More Applications
• Recommendation systems
– e.g., Amazon, Netflix
• Email spam filters
• Malicious website detection
• Loan/Credit Card Applications
Main Phases in Pattern Recognition
[Figure: block diagram of the training and testing phases]
Complexity of PR – An Example
Problem: Sorting incoming fish on a conveyor belt.
Assumption: Two kinds of fish:
(1) sea bass
(2) salmon
[Figure: camera mounted over the conveyor belt]
Sensors
• Sensing:
– Use some kind of a sensor (e.g., camera, weight scale) for data capture.
– The PR system’s overall performance depends on the bandwidth, resolution, sensitivity, and distortion of the sensor being used.
Preprocessing
A critical step for reliable feature extraction!
Examples:
• Noise removal
• Image enhancement
• Separate touching or occluding fish
• Extract boundary of each fish
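A minimal sketch of two of these steps, noise removal and foreground separation (assuming Python with NumPy/SciPy, a grayscale image as a 2-D array, and an arbitrary global threshold; real preprocessing would be considerably more careful):

```python
import numpy as np
from scipy import ndimage

def preprocess(image):
    """Denoise a grayscale image and label candidate fish regions."""
    denoised = ndimage.median_filter(image, size=3)  # simple noise removal
    foreground = denoised > 0.5                      # assumed global threshold
    labels, count = ndimage.label(foreground)        # connected foreground regions
    return denoised, labels, count

# Toy usage on a random array standing in for a camera frame.
image = np.random.rand(64, 64)
_, labels, count = preprocess(image)
print("candidate regions:", count)
```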
Training/Test data
• How do we know that we have collected an adequately large and representative set of examples for training/testing the system?
Training Set?
Test Set?
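One common partial answer is to hold out part of the collected data for testing; a minimal sketch (assuming Python/NumPy and an 80/20 split, which is a conventional but arbitrary choice):

```python
import numpy as np

def train_test_split(X, y, test_fraction=0.2, seed=0):
    """Randomly split patterns X and labels y into training and test sets."""
    rng = np.random.default_rng(seed)
    idx = rng.permutation(len(X))
    n_test = int(len(X) * test_fraction)
    return X[idx[n_test:]], y[idx[n_test:]], X[idx[:n_test]], y[idx[:n_test]]

X = np.arange(20).reshape(10, 2)  # ten toy patterns with two features each
y = np.arange(10)
X_tr, y_tr, X_te, y_te = train_test_split(X, y)
```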
Feature Extraction
• How to choose a good set of features?
– Discriminative features
[Figures: feature histograms with decision thresholds l* and x*; scatter plot of x1 (lightness) vs. x2 (width)]
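A minimal sketch of classifying with one discriminative feature and a threshold (assuming Python/NumPy; the threshold value x* and which class lies above it are illustrative assumptions, since in practice both come from the training histograms):

```python
import numpy as np

X_STAR = 5.0  # assumed lightness threshold x* (illustrative value)

def classify_by_lightness(lightness):
    """One-feature decision rule: compare lightness against x*."""
    return np.where(lightness > X_STAR, "sea bass", "salmon")  # assumed ordering

print(classify_by_lightness(np.array([3.1, 6.4, 5.2])))
```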
Multiple Features (cont’d)
• Does adding more features always help?
– It might be difficult and computationally expensive to extract more features.
– Correlated features might not improve performance (may cause redundancy).
• What are correlated features? When features that are meant to measure different characteristics are influenced by some common mechanism and tend to vary together, they are called correlated features (see the sketch below).
– Adding too many features can, paradoxically, lead to a worsening of performance (i.e., the “curse” of dimensionality).
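A minimal sketch of detecting such redundancy (assuming Python/NumPy; the two synthetic features are deliberately driven by one common underlying quantity):

```python
import numpy as np

rng = np.random.default_rng(0)
size = rng.uniform(1.0, 10.0, 200)  # common mechanism: overall fish size

length = size + rng.normal(0.0, 0.2, 200)       # feature 1, driven by size
width = 0.4 * size + rng.normal(0.0, 0.2, 200)  # feature 2, also driven by size

# A correlation coefficient near +/-1 signals redundancy: the second
# feature carries little information beyond the first.
print(np.corrcoef(length, width)[0, 1])
```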
Curse of Dimensionality
• One of the recurring problems encountered in applying statistical techniques to pattern recognition has been called the “curse of dimensionality.”
• The curse of dimensionality refers to a set of problems that arise when working with high-dimensional data.
• The dimension of a dataset is the number of attributes/features it contains; a dataset with a large number of attributes is referred to as high-dimensional.
• For example, methods that are analytically or computationally manageable in low-dimensional spaces can become completely impractical in a space of 50 or 100 dimensions.
• A model’s training time grows with a large number of features.
• An algorithm’s running time may increase (sometimes exponentially) with the number of features.
• In some cases, classification error may actually increase as features are added, making the exercise counterproductive.
• Dimensionality reduction is required to overcome this.
Example: Curse of Dimensionality
• The amount of training data needed can depend exponentially on the number of features.
• Example:
– Divide each of the input features into M intervals, so that the value of a feature can be specified approximately by saying in which interval it lies. With n features, the input space is divided into M^n cells, and each cell needs training examples to be represented.
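A minimal sketch of that exponential growth (assuming Python; M and the feature counts are illustrative):

```python
M = 10  # assumed number of intervals per feature

# With n features, the input space is divided into M**n cells, and each
# cell needs training examples to be represented.
for n in (1, 2, 3, 10, 50):
    print(f"{n:>2} features -> {M**n:.3e} cells")
```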
Missing Features
• Certain features might be missing (e.g., due to occlusion).
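A minimal sketch of one simple workaround, per-feature mean imputation (assuming Python/NumPy with NaN marking a missing value; more principled options exist, such as marginalizing over the missing feature):

```python
import numpy as np

def impute_mean(X):
    """Replace NaN entries with the per-feature mean."""
    col_means = np.nanmean(X, axis=0)  # feature means, ignoring NaNs
    return np.where(np.isnan(X), col_means, X)

X = np.array([[4.2, 1.7],
              [np.nan, 1.9],   # lightness missing (e.g., due to occlusion)
              [3.8, np.nan]])  # width missing
print(impute_mean(X))
```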
Decision Boundary
• How should we assign a given pattern to a class (i.e., “salmon” or “sea bass”)?
• Classifiers use the training data to partition the feature space into different regions (i.e., find the decision boundary).
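A minimal sketch of one simple way to induce such a partition, a nearest-class-mean rule (assuming Python/NumPy and synthetic 2-D data; under this rule the decision boundary is the perpendicular bisector of the segment joining the two class means):

```python
import numpy as np

rng = np.random.default_rng(1)
salmon = rng.normal([2.0, 4.0], 0.5, size=(50, 2))    # synthetic (x1, x2) patterns
sea_bass = rng.normal([5.0, 6.0], 0.5, size=(50, 2))

means = {"salmon": salmon.mean(axis=0), "sea bass": sea_bass.mean(axis=0)}

def classify(x):
    """Assign x to the class whose mean is nearest (Euclidean distance)."""
    return min(means, key=lambda w: np.linalg.norm(x - means[w]))

print(classify(np.array([3.0, 4.5])))
```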
Overfitting
[Figures: overfitting examples]
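A minimal sketch of the phenomenon in one dimension (assuming Python/NumPy; polynomial regression stands in for a classifier, and all data are synthetic):

```python
import numpy as np

rng = np.random.default_rng(2)
x_train = np.linspace(0, 1, 10)
y_train = np.sin(2 * np.pi * x_train) + rng.normal(0, 0.2, 10)  # noisy samples
x_test = np.linspace(0, 1, 100)
y_test = np.sin(2 * np.pi * x_test)                             # true function

for degree in (1, 3, 9):
    coeffs = np.polyfit(x_train, y_train, degree)
    train_err = np.mean((np.polyval(coeffs, x_train) - y_train) ** 2)
    test_err = np.mean((np.polyval(coeffs, x_test) - y_test) ** 2)
    # As the degree grows, training error keeps shrinking while test error rises.
    print(f"degree {degree}: train MSE {train_err:.4f}, test MSE {test_err:.4f}")
```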
Computational Complexity
• How does an algorithm scale with the number of:
– features
– training data
– categories
Would it be possible to build a
“general purpose” PR system?
To do
• Read Chapter 1 from “Pattern Classification”
by Richard O. Duda, Peter E. Hart and David G.
Stork