Electrical Engineering Technical Seminar Report
Electrical Engineering Technical Seminar Report
Electrical Engineering Technical Seminar Report
On
Detection of Leaf Diseases and Classification using
Digital Image Processing
Under the guidance of
Vijayashri V Belgoankar
Assisstant professor
DEPARTMENT OF ELECTRONICS & COMMUNICATIONS ENGINEERING
2
INTRODUCTION
5
Image Pre-Processing
6
Image Segmentation
Image segmentation is the process used to simplify the representation of an image into meaningful form, such as to
highlight object of interest from background.
The K-means clustering algorithm performs segmentation by minimizing the sum of squares of distances between the
image intensities andthe cluster centroids.
K-means clustering algorithm, or Lloyd's algorithm, is an iterative algorithm that partitions the data and assigns n
observations to precisely one of k clusters defined by centroids.
The steps in the algorithm are given below.
1. Choose k initial cluster centers (centroid).
2. Compute point-to-cluster-centroid distances of all observations to each centroid.
3. Assign each observation to the cluster with the closest centroid.
4. Compute the mean of the observations in each cluster to obtain k new centroid locations.
5. Repeat steps 2 through 4 until there is no change in the cluster assignments or the maximum number of iterations is
reached.
7
Feature Extraction
After segmentation, the GLCM features are extracted from the image.
Gray-Level Co-Occurrence Matrix (GLCM) is the statistical method of investigating
texture which considers the spatial relationship of pixels [15].
The GLCM functions characterize the texture of images by computing the spatial
relationship among the pixels in the images.
The statistical measures are extracted from this matrix.
In the creation of GLCMs, an array of offsets which describe pixel relationships of
varying direction and distance have to be specified.
In the proposed method, four features are extracted which include
contrast, energy, homogeneity and correlation.
Let Pij represents the (i, j)th entry in the normalized Gray-Level Co- Occurrence
Matrix.
N represents the number of distinct gray levels in the quantized image.
The different features extracted are defined as follows.
8
a) Contrast
Contrast measures intensity contrast of a pixel and its neighbor pixel over the entire image.
If the image is constant, contrast is equal to 0.
b) Energy
Energy is a measure of uniformity with squared elements summation in the GLCM. Range is in between 0 and 1.
Energy is 1 for a constant image.
c) Homogeneity
Homogeneity measures the similarity among the pixels.
Its range is between 0 and 1. Homogeneity is 1 for a diagonal GLCM.
d) Correlation
Correlation measures how correlated a pixel is to its neighborhood.
Its range is in between -1 and 1.
9
GLCM MATRIX EQUATIONS
10
Classification using Support Vector Machine (SVM)
Support Vector Machine is kernel-based supervised learning algorithm used as a
classification tool.
The resulting decision function depends only on the training data called support
vectors, which are closest to the decision boundary as shown in Next Figure .
SVM transforms data from input space into a high-dimensional feature space using
kernel function.
The idea of support vector machine is to create a hyper plane in between data sets to
indicate which class it belongs to.
The feature vector is given as input to the classifier.
The feature vectors of the database images are divided into training
and testing vectors.
The classifier trains on the training set and applies it to classify the testing set.
The performance of the classifier is measured by comparing the predicted labels and
actual values.
11
Support Vector Machine Classifier
12
EXPERIMENTAL RESULTS
13
Figures in the Previous slide – Segmentation of diseased leaves using K-Means
algorithm after color space conversion
The segmentation of the diseased leaves results in identifying the diseased parts
of the leaves.
The classification of the leaves into diseased or not is done by classification using
SVM.
The input image is resized to 256 x 256 and converted to gray scale image.
The GLCM texture features – Contrast, Correlation, Energy and Homogeneity are
extracted and stored for all the images in the database.
The four feature vectors of all the images of database are given as input to the
classifier.
The database is divided randomly into training vectors and testing vectors.
14
The graph obtained for SVM classification using two feature vectors
15
ADVANTAGES AND APPLICATIONS
Advantages
Improves productivity in agricultural field
Leads to economic growth in the country
Reduces large work of monitoring in big firms
Reduces chance of diseases occurring in plants
Applications
Used for protection and crop production.
Used to detect pathogen which is induced non-invasively
Used in identification and quantification of chlorophyll
Used to detect impact of external factors
16
CONCLUSION
A method for detection and classification of leaf diseases is implemented.
The segmentation of the diseased part is done using K-Means segmentation.
Then, GLCM texture features are extracted and classification is done using
SVM.
The method is tested for detection of diseases in citrus leaves.
Future work is to be carried out for classification of diseases in different
plant species and to improve the classification accuracy.
17
REFERENCES
[1] R.Meena Prakash, G.P.Saraswathy, G.Ramalakshmi,” Detection of Leaf
Diseases and Classification using Digital Image Processing”.
[2] Jaskaran Singh, Harpreet Kaur, “A Review on: Various Techniques of Plant
Leaf Disease Detection”
[3] Bharat Mishra, Sumit Nema, Mamta Lambert, Swapnil Nema,” Recent
Technologies of Leaf Disease Detection using Image Processing Approach – A
Review”.
[4]Pooja.V, Rahul Das, Kanchana.V, ”Identification Of Plant Leaf Diseases
Using Image Processing Techniques”.
[5] Chaitali G, K.H. Wanjale,“ A Modern Approach for Plant Leaf Disease
Classification which Depends on Leaf Image Processing”
18
19