Segmentation and Object Recognition Using Edge Detection Techniques
Segmentation and Object Recognition Using Edge Detection Techniques
ABSTRACT
Image segmentation is to partition an image into meaningful regions with respect to a particular
application. Object recognition is the task of finding a given object in an image or video sequence. In this
paper, interaction between image segmentation (using different edge detection methods) and object
recognition are discussed. Edge detection methods such as Sobel, Prewitt, Roberts, Canny, Laplacian of
Guassian(LoG) are used for segmenting the image. Expectation-Maximization (EM) algorithm, OSTU
and Genetic algorithms were used to demonstrate the synergy between the segmented images and object
recognition.
KEYWORDS
EM algorithm, OSTU, Genetic Algorithm, Image Segmentation, Object Recognition.
1. INTRODUCTION
Image segmentation is the foundation of object recognition and computer vision. In general,
image noise should be eliminated through image preprocessing. And there is some specifically-
given work (such as region extraction and image marking) to do after the main operation of
image segmentation for the sake of getting better visual effect. Two major computer vision
problems, image segmentation and object recognition, have been traditionally dealt with using a
strict, bottom-up ordering.
Image segmentation is the process of partitioning/subdividing a digital image into multiple
meaningful regions or sets of pixels regions with respect to a particular application. The
segmentation is based on measurements taken from the image and might be grey level, colour,
texture, depth or motion. The result of image segmentation is a set of segments that collectively
cover the entire image. All the pixels in region are similar with respect to some characteristic or
computed property, such as color, intensity, or texture. Adjacent regions differ with respect to
same characteristics. Edge detection is one of the frequently used techniques in digital image
processing.
Object recognition is the task of finding a given object in an image or video sequence. For any
object in an image, there are many 'features' which are interesting points on the object that can
be extracted to provide a "feature" description of the object. This description extracted from a
training image can then be used to identify the object when attempting to locate the object in a
test image containing many other objects[3][4].
Image segmentation is done using various edge detection techniques such as Sobel, Prewitt,
Roberts, Canny, LoG, EM algorithm, OSTU and Genetic Algorithm [1][2]. Implementation
DOI : 10.5121/ijcsit.2010.2614 153
International Journal of Computer Science & Information Technology (IJCSIT), Vol 2, No 6, December 2010
between Segmentation and Object recognition using the said techniques is studied [1].
Experimentation is performed on gray scale image using MATLAB 7.9. In this paper, Section 2
discusses about segmentation, and comparison of various edge detection techniques. Section 3
discusses about image segmentation and object recognition using other techniques. The
implementation and results are shown in section 4. The paper is concluded in section 5.
2. SEGMENTATION
Segmentation is the process of partitioning a digital image into its constituent parts or objects or
regions. These regions share common characteristics based on color, intensity, texture, etc. The
first step in image analysis is to segment an image based on discontinuity detection technique
(Edge-based) or similarity detection technique (Region-based). In discontinuity detection
technique, one approach is to partition an image based on abrupt changes in intensity near the
edges and it is known as Edge-based segmentation. In Similarity detection technique, region-
based segmentation partitions an image into regions that are similar according to a set of
predefined criteria. Region-based segmentation looks for uniformity within a sub-region, based
on a desired property, e.g., intensity, color, and texture as shown figure 1. The difference
between Region-based segmentation and edge-based segmentation is shown in Table 1.
Active contours are popular technique for image segmentation. An advantage of active contours
as image segmentation methods is that they partition an image into sub-regions with continuous
boundaries. There are two kinds of active contour models: Edge-based active contours use an
edge detector, usually based on the image gradient, to find the boundaries of sub-regions and to
attract the contours to the detected boundaries. Region-based active contours use the statistical
information of image intensity within each subset instead of searching geometrical boundaries
154
International Journal of Computer Science & Information Technology (IJCSIT), Vol 2, No 6, December 2010
a. Filtering: Images are corrupted by noise such as salt and pepper noise, impulse noise and
Gaussian noise. As there is a trade-off between edge strength and noise reduction, filtering
is done.
b. Enhancement: It emphasizes pixels where there is a significant change in local intensity
values and is usually performed by computing the gradient magnitude.
c. Detection: Many points in an image have a nonzero value for the gradient, and not all of
these points are edges for a particular application. Thresholding is used for the detection of
edge points.
The different edge detection methods used are Sobel, Prewitt, Roberts, Canny, LoG, EM
algorithm, OSTU algorithm and Genetic Algorithm.
2.1.1. Sobel Operator: It performs 2-D spatial gradient measurement on an image and so
emphasizes regions of high spatial frequency that correspond to edges. The convolution mask
of Sobel operator are as shown Figure 2, which are used to obtain the gradient magnitude of
the image from the original
1 2 1 -1 0 1
0 0 0 -2 0 2
-1 -2 -1 -1 0 1
Figure 2. Sobel Mask
2.1.2. Prewitt Operator: The prewitt operator is an approximate way to estimate the
magnitude and orientation of the edge. The convolution mask of prewitt operator is shown in
figure 3.
1 1 1 -1 0 1
0 0 0 -1 0 1
-1 -1 -1 -1 0 1
Figure 3. Prewitt Mask
-1 0 0 1
0 -1 -1 0
Figure 4. Roberts Mask
0 -1 0
-1 4 -1
0 -1 0
Figure 5. Laplacian of Guassian(LoG) Operator
155
International Journal of Computer Science & Information Technology (IJCSIT), Vol 2, No 6, December 2010
2.1.5. Canny Operator: It is a method to find edges by isolating noise from the image without
affecting the features of the edges in the image and then applying the tendency to find the
edges and the critical value for threshold.
156
International Journal of Computer Science & Information Technology (IJCSIT), Vol 2, No 6, December 2010
mathematically proven that minimizing the within class variance is same as maximizing the
between class variance.
The different thresholding techniques are
The major problem with thresholding is that we consider only the intensity, not any
relationships between the pixels. There is no guarantee that the pixels identified by the
thresholding process are contiguous.
157
International Journal of Computer Science & Information Technology (IJCSIT), Vol 2, No 6, December 2010
1. Select two bit strings (chromosomes), or in case of the genetic programming: select a
branch of each parent.
2. Cut the chromosome (or branch) at a particular location.
3. Swap the bits/branches of the two parents.
Sobel
Prewitt
158
International Journal of Computer Science & Information Technology (IJCSIT), Vol 2, No 6, December 2010
Roberts
Laplacian of Gaussian
(LoG)
Canny
EM Algorithm
159
International Journal of Computer Science & Information Technology (IJCSIT), Vol 2, No 6, December 2010
OSTU Algorithm
Genetic Algorithm
Canny operator performed better than Sobel, Prewitt, Roberts and LoG. The EM algorithm
produced stable segmentation effect on different types of images. OTSU showed good and
stable segmentation effect. Genetic algorithm exhibited normal segmentation effect on all types
of images.
5. CONCLUSIONS
This paper focuses mainly on the Image segmentation using edge operators. The interaction
between image segmentation and object recognition in the framework of the Sobel, Prewitt,
Roberts, Canny, LoG, Expectation-Maximization (EM) algorithm, OSTU Algorithm and
Genetic Algorithm are studied. MATLAB 7.9. was used for experimentation image.
Expectation-Maximization algorithm and OTSU algorithm exhibited stable segmentation effect.
6. REFERENCES
[1] Iasonas Kokkinos, and Petros Maragos (2009),”Synergy between Object Recognition and image
segmentation using Expectation and Maximization Algorithm”., IEEE Trans. on Pattern
Analysis and Machine Intelligence (PAMI), Vol. 31(8), pp. 1486-1501, 2009.
[2] Wen-Xiong Kang, Qing-Qiang Yang, Run-Peng Liang (2009), “The Comparative Research
on Image Segmentation Algorithms,” First International Workshop on Education Technology
and Computer Science.
[3] V. Ferrari, T. Tuytelaars, and L.V. Gool(2004), “Simultaneous Object Recognition and
Segmentation by Image Exploration,” Proc. Eighth European Conf. Computer Vision, 2004.
[4] B. Leibe, A. Leonardis, and B. Schiele(2004), “Combined Object Categorization and
Segmentation with an Implicit Shape Model,” Proc. ECCV Workshop Statistical Learning in
Computer Vision, 2004.
[5] Y.Ramadevi, B.Kalyani, T.Sridevi(2010), “ Synergy between Object Recognition and Image
Segmentation”, International Journal on Computer Science and Engineering, Vol. 02, No. 08,
2010, 2767-2772.
[6] N.Senthilkumarn, R.Rajesh(2009), ”Edge Detection Techniques for Image Segmentation- A
Survey of Soft Computing Approaches”, IJRTE, vol1,No2, 2009 250-254.
160
International Journal of Computer Science & Information Technology (IJCSIT), Vol 2, No 6, December 2010
Y Rama Devi received B.E. from Osmania University in 1991 and M.Tech (CSE)
degree from JNT University in 1997. She received his Ph.D. degree Central University,
Hyderabad in 2009. She is Professor, Chaitanya Bharathi Institute of Technology,
Hyderabad. Her research interests include Image Processing, Soft Computing, Data
Mining, and Bio-Informatics. She is a member for IEEE, ISTE, IETE, and IE. She has
published more than 25 research publications in various National, Inter-national
conferences, proceedings and Journals.
T.Sridevi received B E from Osmania University in 1992 and M.Tech (CSE) degree from
JNT University in 2002. She is pursuing her Ph D from Osmania University in Computer
Science under the guidance of Dr V. Vijaya Kumar. She has 14 years of teaching/industry
experience. She joined as Assistant Professor in Chaitanya Bharathi Institute of
Technology, Hyderabad, India in 2002. Presently she is an Associate Professor, Chaitanya
Bharathi Institute of Technology, Hyderabad. Her research areas include Water Marking,
Image Processing and Soft Computing. She is a life member of IETE. She has published
more than 5 research publications in various National, Inter-national conferences,
proceedings and Journals.
B.Poornima received her B.Tech from J.N.T.U in 2005. She is pursuing her M.Tech from
CBIT under Osmania University. Her area of interest is image processing.
161