research-article

Research on Scene Semantic Segmentation Based on Deep Learning

Authors:

Xiaoming ShiAuthors Info & Claims

CIPAE 2020: Proceedings of the 2020 International Conference on Computers, Information Processing and Advanced Education

Pages 1 - 5

https://doi.org/10.1145/3419635.3419636

Published: 18 September 2020 Publication History

Abstract

Because of the problem of the low accuracy and slow speed of the traditional semantic segmentation model, making it difficult to actually use. In response to this problem, this paper focuses on the method to improve the precision and speed of the algorithm. According to this theory, based on the convolution neural network, we have designed the PSPNet and ICNet models. Meanwhile, a scene semantic segmentation network based on deep learning was presented. The network effectively improves the accuracy of semantic segmentation of convolutional neural networks by merging multi-level depth and network features. The test results on the LISA traffic sign data set show that the proposed semantic segmentation network has outstanding performance compared with other state of the art semantic segmentation network structures.

References

[1]

He P., Huang W., H, T., Zhu Q, Qiao, Y., Li, X.: Single shot text detector with regional attention. In: IEEE International Conference on Computer Vision, pp. 3066--3074 (2017).

[2]

Ren Deng, D., Liu, H., Li, X., Cai, D.: PixelLink: detecting scene text via instance segmentation (2018).

[3]

He, W., Zhang, X. Y., Yin, F., Liu, C. L.: Deep direct regression for multi-oriented scene text detection, pp. 745--753 (2017).

[4]

Liu, W., et al.: SSD: single shot MultiBox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21--37. Springer, Cham (2016).

[5]

Liao, M., Zhu, Z., Shi, B., Xia, G., Bai, X.: Rotation-sensitive regression for oriented scene text detection (2018).

[6]

Zhou, X., et al.: EAST: an efficient and accurate scene text detector, pp. 2642--2651 (2017).

[7]

Shi, B., Bai, X., Belongie, S.: Detecting oriented text in natural images by linking segments, pp. 3482--3490 (2017).

[8]

Zhou, X., et al.: EAST: an efficient and accurate scene text detector, pp. 2642--2651 (2017).

[9]

Karatzas, D., et al.: ICDAR 2015 competition on robust reading. In: International Conference on Document Analysis and Recognition, pp. 1156--1160 (2015).

Digital Library

[10]

X., Bo, L., Fox, D.: RGB-(D) scene labeling: features and algorithms. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2759--2766 (2012).

[11]

Gupta, S., Girshick, R., Arbeláez, P., Malik, J.: Learning rich features from RGB-D images for object detection and segmentation. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014 Part VII. LNCS, vol. 8695, pp. 345--360. Springer, Cham (2014).

[12]

Syu, J. H., Wang, S. J., Wang, L. C.: Hierarchical image segmentation based on iterative contraction and merging. IEEE Trans. Image Process. 26(5), 2246--2260 (2017).

Digital Library

[13]

Truong BT, Venkatesh S, Dorai C (2003) Scene extraction in motion picture. IEEE Trans Circuits Syst Video Technol 13(1):5--15.

[14]

Sundaram H, Chang SF (2000) Video scene segmentation using video and audio features. IEEE proceeding on International Conference on Multimedia and Expo, 1145--1148.

[15]

Cernekova Z, Kotropoulos C, Pitas I (2003) Video shot segmentation using singular value decomposition. IEEE proceeding on International Conference on Multimedia and Expo, 301--302.

[16]

Adams B, Dorai C, Venkatesh S (2000). Towards automatic extraction of expressive elements from motion pictures: tempo. IEEE proceeding on International Conference on Image Processing, 641--644.

Index Terms

Research on Scene Semantic Segmentation Based on Deep Learning
1. Information systems
  1. Information systems applications
    1. Process control systems

Recommendations

Deep learning with multiresolution handcrafted features for brain MRI segmentation
Abstract
The segmentation of magnetic resonance (MR) images is a crucial task for creating pseudo computed tomography (CT) images which are used to achieve positron emission tomography (PET) attenuation correction. One of the main challenges of ...
Highlights
- NSCT and NSST coefficients can effectively enrich CNN’s features.
- The entropy ...
Synergy between Semantic Segmentation and Image Denoising via Alternate Boosting
The capability of image semantic segmentation may be deteriorated due to the noisy input image, where image denoising prior to segmentation may help. Both image denoising and semantic segmentation have been developed significantly with the advance of deep ...
A survey on deep learning-based fine-grained object classification and semantic segmentation

The deep learning technology has shown impressive performance in various vision tasks such as image classification, object detection and semantic segmentation. In particular, recent advances of deep learning techniques bring encouraging performance to ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

CIPAE 2020: Proceedings of the 2020 International Conference on Computers, Information Processing and Advanced Education

October 2020

527 pages

ISBN:9781450387729

DOI:10.1145/3419635

Copyright © 2020 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 18 September 2020

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Conference

CIPAE 2020

CIPAE 2020: 2020 International Conference on Computers, Information Processing and Advanced Education

October 16 - 18, 2020

ON, Ottawa, Canada

Acceptance Rates

CIPAE 2020 Paper Acceptance Rate 101 of 216 submissions, 47%;

Overall Acceptance Rate 101 of 216 submissions, 47%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
55
Total Downloads

Downloads (Last 12 months)2
Downloads (Last 6 weeks)0

Reflects downloads up to 13 Jan 2025

Other Metrics

View Author Metrics

Citations

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents