Pattern Recognition Assignments

The document discusses performing sentiment analysis on movie reviews from the IMDB dataset. It describes sentiment analysis and different approaches like rule-based, machine learning and neural networks. The objectives are to learn and implement sentiment analysis on the IMDB dataset.

Uploaded by

Jahan Chaware

Assignment Number 2
Title : Face Recognition Using PCA and multiclass LDA
Problem Statement: Face Recognition Using PCA and multiclass LDA.
Objectives: To learn and understand Face recognition using PCA and LDA.
Outcomes : We will be able to implement Face Recognition Using PCA and multiclass LDA.
S/W & H/W requirements:
Jupyter Notebook, Python, 64-bit open-source LINUX.
Theory :
PCA:
Principal Component Analysis (PCA) is an unsupervised learning algorithm used for
dimensionality reduction in machine learning. It is a statistical procedure that converts
observations of correlated features into a set of linearly uncorrelated features by means of an
orthogonal transformation. These new transformed features are called the principal components.
PCA ranks directions in the data by their variance, on the assumption that high-variance
directions carry most of the information, and reduces dimensionality by keeping only those
directions. Because PCA ignores class labels, high variance does not by itself guarantee a good
split between classes. Some real-world applications of PCA are image processing, movie
recommendation systems, and optimizing power allocation across communication channels. It is a
feature extraction technique: it retains the most important components and drops the least
important ones.
Steps in PCA:
1. Getting the dataset
2. Standardizing the data to obtain the matrix Z
3. Calculating the covariance matrix of Z
4. Calculating the eigenvalues and eigenvectors
5. Sorting the eigenvectors by decreasing eigenvalue
6. Calculating the new features, or principal components
7. Removing the less important features from the new dataset
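The steps above can be sketched in a few lines of NumPy. This is a minimal illustration rather than the code used in the assignment: the function name pca, the synthetic dataset, and the choice of two components are all assumptions made for the example.

```python
import numpy as np

def pca(X, n_components):
    """Project X (n_samples x n_features) onto its top principal components."""
    # Step 2: standardize each feature to zero mean and unit variance.
    mean = X.mean(axis=0)
    std = X.std(axis=0)
    std[std == 0] = 1.0  # avoid division by zero for constant features
    Z = (X - mean) / std

    # Step 3: covariance matrix of the standardized data Z.
    cov = np.cov(Z, rowvar=False)

    # Step 4: eigenvalues and eigenvectors (eigh is meant for symmetric matrices).
    eigvals, eigvecs = np.linalg.eigh(cov)

    # Step 5: sort eigenvectors by decreasing eigenvalue.
    order = np.argsort(eigvals)[::-1]
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]

    # Steps 6-7: keep the leading components and project the data onto them.
    components = eigvecs[:, :n_components]
    return Z @ components, eigvals

# Step 1: a small synthetic dataset (hypothetical, for illustration only).
# The first two features are strongly correlated; the third is independent.
rng = np.random.default_rng(0)
base = rng.normal(size=(200, 1))
X = np.hstack([base,
               2 * base + 0.1 * rng.normal(size=(200, 1)),
               rng.normal(size=(200, 1))])

projected, eigvals = pca(X, n_components=2)
explained = eigvals / eigvals.sum()  # fraction of variance per component
print(projected.shape)
print(explained)
```

The explained-variance ratios give one way to decide how many components to keep in step 7: components with a small ratio contribute little and can be dropped.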

LDA:

Linear Discriminant Analysis (LDA) is one of the most popular dimensionality reduction techniques
used for supervised classification problems in machine learning. It is also used as a
preprocessing step in machine learning and pattern classification applications.
Whenever two or more classes with multiple features need to be separated efficiently, LDA is a
common choice. For example, suppose we have two classes with multiple features and need to
separate them efficiently. If we classify them using a single feature, the classes may overlap.

To overcome this overlap, more features must be taken into account. LDA then projects the
multi-feature data onto a lower-dimensional axis chosen to maximize the separation between
classes.
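The overlap problem described above can be illustrated with a small NumPy sketch of multiclass LDA based on within-class and between-class scatter matrices. It is an illustration under stated assumptions, not the assignment's code: the helper names lda_fit and nearest_mean_accuracy and the synthetic three-class data are hypothetical, and the nearest-class-mean rule is used only as a simple way to measure separation.

```python
import numpy as np

def lda_fit(X, y, n_components):
    """Return a projection matrix that maximizes between-class over within-class scatter."""
    classes = np.unique(y)
    n_features = X.shape[1]
    overall_mean = X.mean(axis=0)
    S_w = np.zeros((n_features, n_features))  # within-class scatter
    S_b = np.zeros((n_features, n_features))  # between-class scatter
    for c in classes:
        Xc = X[y == c]
        mean_c = Xc.mean(axis=0)
        centered = Xc - mean_c
        S_w += centered.T @ centered
        diff = (mean_c - overall_mean).reshape(-1, 1)
        S_b += len(Xc) * (diff @ diff.T)
    # Solve S_w^{-1} S_b w = lambda w; pinv guards against a singular S_w.
    eigvals, eigvecs = np.linalg.eig(np.linalg.pinv(S_w) @ S_b)
    order = np.argsort(eigvals.real)[::-1]
    return eigvecs.real[:, order[:n_components]]

def nearest_mean_accuracy(values, y):
    """Classify 1-D values by the closest class mean and return the accuracy."""
    classes = np.unique(y)
    means = np.array([values[y == c].mean() for c in classes])
    predicted = classes[np.argmin(np.abs(values[:, None] - means[None, :]), axis=1)]
    return float((predicted == y).mean())

# Synthetic data (hypothetical): three classes that share a large spread along
# one diagonal and differ only along the other, so each single feature overlaps.
rng = np.random.default_rng(0)
along = np.array([1.0, 1.0]) / np.sqrt(2)    # shared direction of large spread
across = np.array([1.0, -1.0]) / np.sqrt(2)  # direction that separates the classes
X_parts, y_parts = [], []
for label, offset in enumerate([-2.0, 0.0, 2.0]):
    t = rng.normal(scale=3.0, size=(100, 1))
    s = offset + rng.normal(scale=0.3, size=(100, 1))
    X_parts.append(t * along + s * across)
    y_parts.append(np.full(100, label))
X = np.vstack(X_parts)
y = np.concatenate(y_parts)

W = lda_fit(X, y, n_components=1)
acc_single = nearest_mean_accuracy(X[:, 0], y)      # first feature only
acc_lda = nearest_mean_accuracy((X @ W)[:, 0], y)   # LDA projection
print(acc_single, acc_lda)
```

On data of this kind the single-feature accuracy stays low because the classes overlap, while the one-dimensional LDA projection separates them almost perfectly.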

Difference Between PCA and LDA:

Below are some basic differences between LDA and PCA:

- PCA is an unsupervised algorithm that does not use classes or labels and only aims to find
the principal components that maximize the variance in the given dataset. LDA, by contrast,
is a supervised algorithm that aims to find the linear discriminants, that is, the axes that
maximize the separation between different classes of data.
- LDA is generally more suitable than PCA for multi-class classification tasks. However, PCA
can perform comparably or better when the sample size is comparatively small.
- Both LDA and PCA are used as dimensionality reduction techniques. When they are combined,
PCA is applied first, followed by LDA.

Conclusion:
Hence we have successfully implemented face recognition using PCA and LDA.

Code: https://github.com/bellatrix007/Face-Recognition/tree/master
Assignment Number 3
Title: Fruit shape recognition using Eigen Faces and Fisher Faces
Problem Statement: Fruit shape recognition using Eigen Faces and Fisher Faces
Objectives: To learn and understand fruit shape recognition using Eigen Faces and Fisher Faces.
Outcomes: We will be able to implement Fruit shape recognition using Eigen Faces and Fisher
Faces
S/W & H/W requirements:
Jupyter Notebook, Python, 64-bit open-source LINUX.
Theory :
Eigen Faces:
Eigenfaces is a representation learning method in computer vision focusing on facial images. The
goal of the method is to represent an image of a person's face as a linear combination of a set
of basis images called eigenfaces. Suppose all H x W images representing a human face lie on a
manifold in R^(H x W). If we can find the optimal eigenfaces, we can represent any facial image
as a linear combination of them.
Steps Involved in Eigenfaces Face Recognition:
1. Collect a Training Dataset
2. Preprocess Images
3. Vectorize Images
4. Compute the Mean Face
5. Compute Eigenfaces
6. Select Top Eigenfaces
7. Project Faces onto Eigenspace
8. Represent Faces with Eigenface Coefficients
9. Face Recognition
10. Classification
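The steps listed above can be sketched with NumPy as follows. This is a minimal illustration, not the implementation in the linked repository: the function names train_eigenfaces and recognize are hypothetical, random 16 x 16 arrays stand in for real face images, preprocessing (step 2) is skipped, and nearest-neighbor matching is assumed for the final recognition step.

```python
import numpy as np

def train_eigenfaces(images, n_eigenfaces):
    """images: array (n_images, H, W). Returns mean face, eigenfaces, coefficients."""
    n_images = images.shape[0]
    vectors = images.reshape(n_images, -1).astype(float)  # step 3: vectorize
    mean_face = vectors.mean(axis=0)                       # step 4: mean face
    centered = vectors - mean_face
    # Step 5: the rows of Vt are the principal directions, i.e. the eigenfaces.
    _, _, Vt = np.linalg.svd(centered, full_matrices=False)
    eigenfaces = Vt[:n_eigenfaces]                         # step 6: keep the top ones
    coefficients = centered @ eigenfaces.T                 # steps 7-8: project
    return mean_face, eigenfaces, coefficients

def recognize(image, mean_face, eigenfaces, coefficients, labels):
    """Steps 9-10: project a new image and return the label of the nearest training face."""
    query = (image.reshape(-1).astype(float) - mean_face) @ eigenfaces.T
    distances = np.linalg.norm(coefficients - query, axis=1)
    return labels[int(np.argmin(distances))]

# Step 1: synthetic stand-in for a training set (hypothetical data):
# three "people", each with five noisy copies of a random 16 x 16 template.
rng = np.random.default_rng(0)
templates = rng.normal(size=(3, 16, 16))
labels = np.repeat(np.arange(3), 5)
train = templates[labels] + 0.1 * rng.normal(size=(15, 16, 16))

mean_face, eigenfaces, coefficients = train_eigenfaces(train, n_eigenfaces=2)

# An unseen noisy image of person 1 should be matched to label 1.
probe = templates[1] + 0.1 * rng.normal(size=(16, 16))
predicted = recognize(probe, mean_face, eigenfaces, coefficients, labels)
print(predicted)
```

Using the SVD of the centered data avoids forming the full (H*W) x (H*W) covariance matrix, which matters for realistic image sizes.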
FisherFaces Face Recognizer:
This algorithm is an improved version of the Eigenfaces face recognizer. The Eigenfaces
recognizer looks at all the training faces of all the persons at once and finds principal
components from all of them combined. By capturing principal components from the combined data,
it does not focus on the features that discriminate one person from another but on the features
that represent all the people in the training data as a whole. This approach has drawbacks. For
example, images with sharp changes (such as lighting changes, which are not a useful feature at
all) may dominate the rest of the images, so the principal components end up representing an
external source like lighting rather than the actual facial features, and are not useful for
discrimination. The Fisherfaces algorithm, instead of extracting features that represent all the
faces of all the persons, extracts features that discriminate one person from the others. This
way the features of one person do not dominate over the others.
Aspect | Eigenfaces | Fisherfaces
Dimensionality reduction technique | Principal Component Analysis (PCA) | Linear Discriminant Analysis (LDA)
Focus | Global structure | Discriminative power (global and local structure)
Handling variations | Might not handle variations well (lighting, pose, expressions) | Effective at handling variations due to supervised learning and discriminative focus
Learning type | Unsupervised | Supervised
Discriminative power | Less discriminative due to global focus | Maximizes discriminative power between classes
Representation | Linear transformation | Also a linear transformation (kernel variants exist for non-linear discrimination)
Computation | Simpler and computationally efficient | Might be more computationally intensive, especially for large datasets

Conclusion:
Hence we have successfully implemented fruit shape recognition using Eigen Faces and Fisher
Faces.
Code: https://github.com/informramiz/opencv-face-recognition-python/tree/master
Assignment Number 4
Title: Perform sentiment analysis on the IMDB movie reviews dataset
Problem Statement: Perform sentiment analysis on the IMDB movie reviews dataset.
Objectives: To learn and understand the process of sentiment analysis.
Outcomes: We will be able to implement Sentiment Analysis on IMDB dataset.
S/W & H/W requirements:
Jupyter Notebook, Python, 64-bit open-source LINUX.
Theory :
Sentiment analysis is the process of classifying whether a block of text is positive, negative, or
neutral. The goal of sentiment mining is to analyze people’s opinions in a way that can help
businesses expand. It focuses not only on polarity (positive, negative, and neutral) but also
on emotions (happy, sad, angry, etc.). It uses various Natural Language Processing approaches
such as rule-based, automatic, and hybrid.
Consider a scenario in which we want to know whether a product satisfies customer requirements,
or whether there is a need for the product in the market. We can use sentiment analysis to
monitor that product’s reviews. Sentiment analysis is also useful when there is a large set of
unstructured data that we want to classify by tagging it automatically. Net Promoter Score (NPS)
surveys are used extensively to learn how a customer perceives a product or service. Sentiment
analysis has also gained popularity because it can process large volumes of NPS responses and
produce consistent results quickly.
Approaches to Sentiment Analysis
There are four main approaches used:
1. Rule-based : This approach relies on a lexicon together with tokenization and parsing. It
counts the number of positive and negative words in the given text. If the number of
positive words is greater than the number of negative words, the sentiment is positive;
otherwise it is negative.
2. Machine Learning : This approach relies on machine learning techniques. Features (words)
are first extracted from the text, and a model is then trained on a labeled dataset and
used for prediction. Techniques such as Naive Bayes, support vector machines, hidden
Markov models, and conditional random fields are used.
3. Neural Network : In the last few years neural networks have evolved at a very fast rate.
This approach uses artificial neural networks, which are inspired by the structure of the
human brain, to classify text into positive, negative, or neutral sentiments. Recurrent
neural networks, long short-term memory networks, gated recurrent units, and similar
architectures are used to process sequential data like text.
4. Hybrid Approach : This is a combination of two or more approaches, for example rule-based
and machine learning. The advantage is that accuracy is typically higher than with
either approach alone.
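As an illustration of the rule-based approach described in item 1, the sketch below counts lexicon words in a review. The word lists and the function name rule_based_sentiment are assumptions made for this example; real lexicons are far larger. Unlike the strict positive/negative rule above, this sketch returns "neutral" on a tie, which is also an assumption.

```python
import re

# Tiny illustrative lexicons (assumed for this sketch, not taken from a real resource).
POSITIVE_WORDS = {"good", "great", "excellent", "wonderful", "enjoyable", "love", "loved"}
NEGATIVE_WORDS = {"bad", "boring", "terrible", "awful", "dull", "hate", "hated"}

def rule_based_sentiment(text):
    """Count positive and negative words and return the label with the higher count."""
    tokens = re.findall(r"[a-z']+", text.lower())  # simple tokenization
    positives = sum(token in POSITIVE_WORDS for token in tokens)
    negatives = sum(token in NEGATIVE_WORDS for token in tokens)
    if positives > negatives:
        return "positive"
    if negatives > positives:
        return "negative"
    return "neutral"  # tie, or no sentiment words found

print(rule_based_sentiment("A great film with an excellent cast, I loved it."))  # positive
print(rule_based_sentiment("Boring plot and terrible acting."))                  # negative
print(rule_based_sentiment("The movie was released in 2010."))                   # neutral
```

A counter like this ignores negation ("not good") and sarcasm, which is one reason the machine learning and neural network approaches are usually preferred for review data.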
Conclusion:
Hence we have performed Sentiment analysis on IMDB movie review dataset.
Code: https://github.com/Ankit152/IMDB-sentiment-analysis/blob/master/imdbSentimentAnalysis.ipynb
Assignment Number 6
Title: Perform image segmentation on the Berkeley Segmentation dataset.
Problem Statement: Perform image segmentation on the Berkeley Segmentation dataset.
Objectives: To learn and understand image segmentation.
Outcomes: We will be able to implement image segmentation on the Berkeley Segmentation
dataset.
S/W & H/W requirements:
Jupyter Notebook, Python, 64-bit open-source LINUX.
Theory :
Image Segmentation:
Image segmentation is a computer vision technique that partitions a digital image into discrete
groups of pixels—image segments—to inform object detection and related tasks. By parsing an
image’s complex visual data into specifically shaped segments, image segmentation enables
faster, more advanced image processing. Image segmentation techniques range from simple,
intuitive heuristic analysis to the cutting-edge implementation of deep learning. Conventional
image segmentation algorithms process low-level visual features of each pixel, like color or
brightness, to identify object boundaries and background regions. Machine learning, leveraging
annotated datasets, is used to train models to accurately classify the specific types of objects and
regions an image contains. Being a highly versatile and practical method of computer vision,
image segmentation has a wide variety of artificial intelligence use cases, from aiding diagnosis
in medical imaging to automating locomotion for robotics and self-driving cars to identifying
objects of interest in satellite images.
Image segmentation techniques include:
1. Similarity approach: Detects similarities between pixels to form a segment based on a
given threshold.
2. Discontinuity approach: Relies on the discontinuity of pixel intensity values of the image.
3. Edge-based segmentation: Identifies the edges of various objects in a given image.
4. Threshold-based segmentation: Divides pixels based on their intensity relative to a given
value or threshold.
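Two of the techniques listed above, threshold-based and edge-based segmentation, can be sketched with NumPy on a tiny synthetic image. The function names threshold_segment and edge_map, the threshold values, and the test image are all assumptions made for illustration; the gradient-magnitude test is only one simple way to find edges.

```python
import numpy as np

def threshold_segment(image, threshold):
    """Label each pixel 1 if its intensity exceeds the threshold, else 0."""
    return (image > threshold).astype(np.uint8)

def edge_map(image, min_gradient):
    """Mark pixels whose intensity gradient magnitude exceeds min_gradient."""
    # Convert to float first so differences of uint8 values cannot wrap around.
    gy, gx = np.gradient(image.astype(float))
    return (np.hypot(gx, gy) > min_gradient).astype(np.uint8)

# Synthetic grayscale image (hypothetical): a bright 4x4 square on a dark background.
image = np.full((8, 8), 20, dtype=np.uint8)
image[2:6, 2:6] = 200

mask = threshold_segment(image, threshold=100)   # foreground = the bright square
edges = edge_map(image, min_gradient=50)         # boundary of the square

print(mask)
print(edges)
```

The threshold mask recovers the whole square as one segment, while the edge map marks only the pixels around its boundary and leaves the uniform interior and background unmarked.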
Dataset Description :
The Berkeley Segmentation Dataset (BSDS500) consists of 500 natural images, ground-truth human
annotations, and benchmarking code. The data is explicitly separated into disjoint train,
validation, and test subsets. The dataset is an extension of BSDS300: the original 300 images
are used for training and validation, and 200 fresh images, together with human annotations,
are added for testing. Each image was segmented by five different subjects on average.
Conclusion:
Hence we have performed image segmentation on the Berkeley Segmentation dataset.
Code: https://github.com/alyswidan/Image_Segmentation
