Image Segmentation and Semantic Labeling using Machine Learning
All content following this page was uploaded by Abhishek Thakur on 25 July 2020.
I. INTRODUCTION
The objective of this paper is to find objects in an image or video with
the help of a segmentation algorithm. Object detection is one of the most
difficult problems in image processing. In image segmentation it is not
necessary to know in advance which visual objects are present in the image
or video. Object segmentation differs from image classification: an ideal
image segmentation algorithm also segments unknown objects, whereas image
classification handles only known categories [1]. Segmentation has many
applications [2], one of which is image forgery detection. In image forgery
detection, segmentation is performed on each image and the result is added
to a database. In copy-move forgery detection, patches of similarly colored
pixels are extracted and matched with the help of color segmentation. If a
forged-region query is passed to the algorithm, it suggests the expected
regions of the forgery in the database. Another application is unattended
baggage detection. In this task the same kind of query is issued by a
security officer at a bus stand, and the proposed algorithm segments
objects on the basis of that query. When a new image is given to the
segmentation algorithm, it should assign each pixel of the image to one of
these categories. For example, in Fig. 1 the input image is a natural image
of plants. In Fig. 2, the segmentation of the input image consists of
semantic objects: the pixels of the plant are clustered and colored green,
and the background grass and rocks are colored dark green and light green.
Segmenting an image requires a deep semantic understanding of the world and
of which things are parts of a whole.

Fig. 1 Image segmentation (right column) of the input image (left column).

Fig. 2 Image semantic labeling (bottom image) of the input image (top image).
Published By: Blue Eyes Intelligence Engineering & Sciences Publication
Retrieval Number: ES2045017519/19©BEIESP
We will mainly focus on image segmentation and semantic labeling, i.e.,
per-pixel class segmentation. To understand the concepts behind deep image
segmentation and semantic segmentation systems, we will discuss some common
networks, methods, and designs. In addition, we will discuss data
pre-processing and learning techniques.

[...] graph and over-segmentation, 2) extract superpixels and set the
parameters for mean shift, 3) extract regions with red boundaries and get
the overall number of superpixels, 4) build the superpixel graph and
compute Ncut, eigenvectors, and k-means clustering, 5) save the results.
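
Step 4 of the procedure above (superpixel graph, Ncut, eigenvectors) can be sketched as follows. This is a minimal illustration under stated assumptions, not the implementation used in this paper: the superpixels are assumed to be already extracted and summarized by their mean color, the graph is fully connected with a Gaussian affinity, and the split into two groups is made by thresholding the second eigenvector at zero instead of running k-means. The names `ncut_bipartition` and `sigma`, and the sample colors, are hypothetical.

```python
import numpy as np

def ncut_bipartition(features, sigma=0.3):
    """Two-way normalized cut over a fully connected graph of superpixels.

    features: (n, d) array, one row per superpixel (e.g. its mean color).
    Returns a boolean array of length n marking one side of the cut.
    """
    features = np.asarray(features, dtype=float)
    # Gaussian affinity between every pair of superpixels.
    sq_dists = ((features[:, None, :] - features[None, :, :]) ** 2).sum(axis=-1)
    W = np.exp(-sq_dists / (2.0 * sigma ** 2))
    d = W.sum(axis=1)
    # Symmetric normalized Laplacian: I - D^{-1/2} W D^{-1/2}.
    d_inv_sqrt = 1.0 / np.sqrt(d)
    L_sym = np.eye(len(d)) - d_inv_sqrt[:, None] * W * d_inv_sqrt[None, :]
    # eigh returns eigenvalues in ascending order, so column 1 belongs to
    # the second-smallest eigenvalue (the relaxed Ncut indicator).
    _, eigvecs = np.linalg.eigh(L_sym)
    indicator = d_inv_sqrt * eigvecs[:, 1]
    return indicator > 0

# Hypothetical mean RGB colors (scaled to [0, 1]) of six superpixels.
superpixel_colors = np.array([
    [0.10, 0.60, 0.10], [0.12, 0.58, 0.09], [0.09, 0.62, 0.11],  # greenish
    [0.55, 0.50, 0.45], [0.57, 0.52, 0.44], [0.54, 0.49, 0.46],  # grayish
])
side = ncut_bipartition(superpixel_colors)
```

Which group ends up marked `True` is arbitrary, because the sign of an eigenvector is not fixed; only the grouping itself is meaningful. For more than two segments, the leading eigenvectors would be clustered with k-means, as the step list indicates.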
1.1 Common Deep Network Architectures
International Journal of Recent Technology and Engineering (IJRTE)
ISSN: 2277-3878, Volume-7 Issue-5S2, January 2019
3.3 Define Label IDs
Label IDs are defined as: (128 128 128) as sky, (000 128 064) as bridge,
(128 000 000) as building, (064 192 000) as wall, (064 000 064) as tunnel,
(192 000 128) as archway, (192 192 128) as column pole, (000 000 064) as
traffic cone, (128 064 128) as road, (128 000 192) as lane mkgs driv,
(192 000 064) as lane mkgs non driv Pavement, (000 000 192) as side walk,
(064 192 128) as parking block, (128 128 192) as road shoulder,
(128 128 000) as tree, (192 192 000) as vegetation misc, (192 128 128) as
sign symbol, (128 128 064) as misc text, (000 064 064) as traffic light,
(064 064 128) as fence, (064 000 128) as car, (064 128 192) as SUV pickup
truck, (192 128 192) as truck bus, (192 064 128) as train, (128 064 064) as
other moving, (064 064 000) as pedestrian, (192 128 064) as child,
(064 000 192) as cart luggage pram, (064 128 064) as animal, (000 128 192)
as bicyclist and (192 000 192) as motor cycle scooter.

Fig. 6 Image segmentation (right column) of the input image (left column).

Fig. 7 shows the semantic segmentation of the input image. All the input
images from the different datasets are segmented and stored on disk as
labels. Fig. 8 shows the semantic output labels of the original input and
the segmented label image. These semantic labels are generated after
training a deep neural network with the defined color maps. The color maps
show the category labels.
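
The label IDs of Section 3.3 amount to a lookup from RGB triples to class names. A minimal sketch of applying such a lookup to a label image is given here, assuming the label image is an (H, W, 3) NumPy array. Only a subset of the IDs is reproduced for brevity, and the names `LABEL_IDS`, `rgb_to_class_index`, and the `unknown` fill value are illustrative and not taken from the paper.

```python
import numpy as np

# A subset of the RGB label IDs listed in Section 3.3.
LABEL_IDS = {
    (128, 128, 128): "sky",
    (128, 0, 0): "building",
    (128, 64, 128): "road",
    (128, 128, 0): "tree",
    (64, 0, 128): "car",
    (64, 64, 0): "pedestrian",
    (0, 128, 192): "bicyclist",
}
CLASS_NAMES = list(LABEL_IDS.values())

def rgb_to_class_index(label_image, unknown=-1):
    """Convert an (H, W, 3) RGB label image to an (H, W) array of class indices.

    Pixels whose color is not in LABEL_IDS receive the value `unknown`.
    """
    label_image = np.asarray(label_image)
    index_map = np.full(label_image.shape[:2], unknown, dtype=np.int64)
    for idx, rgb in enumerate(LABEL_IDS):
        # True where all three channels match this label color.
        mask = np.all(label_image == np.array(rgb), axis=-1)
        index_map[mask] = idx
    return index_map

# Hypothetical 2x2 label image: sky, sky / road, car.
labels = np.array([
    [[128, 128, 128], [128, 128, 128]],
    [[128, 64, 128], [64, 0, 128]],
], dtype=np.uint8)
index_map = rgb_to_class_index(labels)
```

The resulting integer map is the form in which per-pixel labels are usually passed to a network as training targets.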
3.4 Define the color map
Add a color bar to the current axis. The color bar is formatted to display
the class names with their colors. Define the color map used by the CamVid
dataset as (128 128 128) Sky, (128 0 0) Building, (192 192 192) Pole,
(128 64 128) Road, (60 40 222) Pavement, (128 128 0) Tree, (192 128 128)
Sign Symbol, (64 64 128) Fence, (64 0 128) Car, (64 64 0) Pedestrian and
(0 128 192) Bicyclist.

Fig. 7 Image segmentation (right column) of the input image (left column).
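
The color map of Section 3.4 can be applied to a map of predicted class indices by array indexing. The following is a sketch under assumptions: class indices are taken to follow the order in which the colors are listed, the names `COLOR_MAP` and `colorize` are hypothetical, and the color bar plotting step is omitted.

```python
import numpy as np

# Color map from Section 3.4: one RGB row per class, in the order listed.
CLASS_NAMES = ["Sky", "Building", "Pole", "Road", "Pavement", "Tree",
               "Sign Symbol", "Fence", "Car", "Pedestrian", "Bicyclist"]
COLOR_MAP = np.array([
    [128, 128, 128], [128, 0, 0], [192, 192, 192], [128, 64, 128],
    [60, 40, 222], [128, 128, 0], [192, 128, 128], [64, 64, 128],
    [64, 0, 128], [64, 64, 0], [0, 128, 192],
], dtype=np.uint8)

def colorize(index_map):
    """Turn an (H, W) array of class indices into an (H, W, 3) RGB image."""
    return COLOR_MAP[np.asarray(index_map)]

# Hypothetical 2x2 prediction: Sky, Road / Tree, Car.
predicted = np.array([[0, 3], [5, 8]])
rgb = colorize(predicted)
```

Indexing the color table with the whole index map avoids an explicit loop over pixels and yields an image that can be displayed next to the input.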
III. RESULT
This section illustrates the results of segmentation in Fig. 6 and of
semantic labeling in Fig. 7.

Fig. 8 Training accuracy (left) and training loss (right).
Fig. 8 shows the training accuracy and training loss curves. The training
accuracy was 60% initially, but after some iterations it approaches 99%.
The training loss was 2% initially, but after some iterations it reduced to
0.1%. If the deep neural network is trained for a long time, its accuracy
reaches 99.99% and its loss decreases to 0.001%.
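
For reference, the two quantities plotted in Fig. 8 can be computed per pixel as sketched below. The paper does not state which loss function was used, so cross-entropy is an assumption here, as are the function names and the toy data.

```python
import numpy as np

def pixel_accuracy(pred, target):
    """Fraction of pixels whose predicted class equals the target class."""
    return float(np.mean(np.asarray(pred) == np.asarray(target)))

def cross_entropy(probs, target, eps=1e-12):
    """Mean per-pixel cross-entropy.

    probs:  (H, W, C) array of class probabilities per pixel.
    target: (H, W) array of integer class indices.
    """
    probs = np.asarray(probs, dtype=float)
    target = np.asarray(target)
    h, w = target.shape
    # Pick, for every pixel, the probability assigned to its true class.
    picked = probs[np.arange(h)[:, None], np.arange(w)[None, :], target]
    return float(-np.mean(np.log(picked + eps)))

# Hypothetical 2x2 image with two classes.
target = np.array([[0, 1], [1, 0]])
probs = np.array([
    [[0.9, 0.1], [0.2, 0.8]],
    [[0.4, 0.6], [0.3, 0.7]],
])
pred = probs.argmax(axis=-1)
accuracy = pixel_accuracy(pred, target)   # 3 of 4 pixels are correct
loss = cross_entropy(probs, target)
```

Both values here are measured on the data the network was trained on, as in Fig. 8; they do not by themselves indicate performance on unseen images.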
IV. CONCLUSION
Many segmentation techniques are available nowadays. Color segmentation is
an important technique that is used for image segmentation and also for
semantic labeling. We conclude that color segmentation for semantic
segmentation gives good results in differentiating each object. In semantic
labeling, the deep neural network predicts each object label correctly with
an accuracy of 99.99% in the training phase. So it is the best color
segmentation technique in comparison to the others.
REFERENCES
1. Badrinarayanan, Vijay, Alex Kendall, and Roberto Cipolla. "SegNet:
A deep convolutional encoder-decoder architecture for image
segmentation." arXiv preprint arXiv:1511.00561 (2015).
2. Brostow, Gabriel J., Julien Fauqueur, and Roberto Cipolla. "Semantic
object classes in video: A high-definition ground truth database."
Pattern Recognition Letters 30.2 (2009): 88-97.
3. Li, Zhenguo, Xiao-Ming Wu, and Shih-Fu Chang. "Segmentation
using superpixels: A bipartite graph partitioning approach." Computer
Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on.
IEEE, 2012.
4. Krizhevsky, Alex, Ilya Sutskever, and Geoffrey E. Hinton. "ImageNet
classification with deep convolutional neural networks." Advances in
neural information processing systems. 2012.
5. Simonyan, Karen, and Andrew Zisserman. "Very deep convolutional
networks for large-scale image recognition." arXiv preprint
arXiv:1409.1556 (2014).
6. Szegedy, Christian, et al. "Going deeper with convolutions."
Proceedings of the IEEE conference on computer vision and pattern
recognition. 2015.
7. Visin, Francesco, et al. "ReNet: A recurrent neural network based
alternative to convolutional networks." arXiv preprint
arXiv:1505.00393 (2015).
8. Yosinski, Jason, et al. "How transferable are features in deep neural
networks?" Advances in neural information processing systems.
2014.
9. Garcia-Garcia, Alberto, et al. "A survey on deep learning techniques
for image and video semantic segmentation." Applied Soft Computing
70 (2018): 41-65.
10. Guo, Yanming, et al. "A review of semantic segmentation using deep
neural networks." International Journal of Multimedia Information
Retrieval 7.2 (2018): 87-93.
11. Thakur, Abhishek, and Neeru Jindal. "Image forensics using color
illumination, block and key point based approach." Multimedia Tools
and Applications (2018): 1-21.
12. Kim, Tae Hoon, Kyoung Mu Lee, and Sang Uk Lee. "Learning full
pairwise affinities for spectral segmentation." IEEE transactions on
pattern analysis and machine intelligence 35.7 (2013): 1690-1703.
13. Cour, Timothee, Florence Benezit, and Jianbo Shi. "Spectral
segmentation with multiscale graph decomposition." Computer Vision
and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society
Conference on. Vol. 2. IEEE, 2005.