Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3419635.3419636acmotherconferencesArticle/Chapter ViewAbstractPublication PagescipaeConference Proceedingsconference-collections
research-article

Research on Scene Semantic Segmentation Based on Deep Learning

Published: 18 September 2020 Publication History

Abstract

Because of the problem of the low accuracy and slow speed of the traditional semantic segmentation model, making it difficult to actually use. In response to this problem, this paper focuses on the method to improve the precision and speed of the algorithm. According to this theory, based on the convolution neural network, we have designed the PSPNet and ICNet models. Meanwhile, a scene semantic segmentation network based on deep learning was presented. The network effectively improves the accuracy of semantic segmentation of convolutional neural networks by merging multi-level depth and network features. The test results on the LISA traffic sign data set show that the proposed semantic segmentation network has outstanding performance compared with other state of the art semantic segmentation network structures.

References

[1]
He P., Huang W., H, T., Zhu Q, Qiao, Y., Li, X.: Single shot text detector with regional attention. In: IEEE International Conference on Computer Vision, pp. 3066--3074 (2017).
[2]
Ren Deng, D., Liu, H., Li, X., Cai, D.: PixelLink: detecting scene text via instance segmentation (2018).
[3]
He, W., Zhang, X. Y., Yin, F., Liu, C. L.: Deep direct regression for multi-oriented scene text detection, pp. 745--753 (2017).
[4]
Liu, W., et al.: SSD: single shot MultiBox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21--37. Springer, Cham (2016).
[5]
Liao, M., Zhu, Z., Shi, B., Xia, G., Bai, X.: Rotation-sensitive regression for oriented scene text detection (2018).
[6]
Zhou, X., et al.: EAST: an efficient and accurate scene text detector, pp. 2642--2651 (2017).
[7]
Shi, B., Bai, X., Belongie, S.: Detecting oriented text in natural images by linking segments, pp. 3482--3490 (2017).
[8]
Zhou, X., et al.: EAST: an efficient and accurate scene text detector, pp. 2642--2651 (2017).
[9]
Karatzas, D., et al.: ICDAR 2015 competition on robust reading. In: International Conference on Document Analysis and Recognition, pp. 1156--1160 (2015).
[10]
X., Bo, L., Fox, D.: RGB-(D) scene labeling: features and algorithms. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2759--2766 (2012).
[11]
Gupta, S., Girshick, R., Arbeláez, P., Malik, J.: Learning rich features from RGB-D images for object detection and segmentation. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014 Part VII. LNCS, vol. 8695, pp. 345--360. Springer, Cham (2014).
[12]
Syu, J. H., Wang, S. J., Wang, L. C.: Hierarchical image segmentation based on iterative contraction and merging. IEEE Trans. Image Process. 26(5), 2246--2260 (2017).
[13]
Truong BT, Venkatesh S, Dorai C (2003) Scene extraction in motion picture. IEEE Trans Circuits Syst Video Technol 13(1):5--15.
[14]
Sundaram H, Chang SF (2000) Video scene segmentation using video and audio features. IEEE proceeding on International Conference on Multimedia and Expo, 1145--1148.
[15]
Cernekova Z, Kotropoulos C, Pitas I (2003) Video shot segmentation using singular value decomposition. IEEE proceeding on International Conference on Multimedia and Expo, 301--302.
[16]
Adams B, Dorai C, Venkatesh S (2000). Towards automatic extraction of expressive elements from motion pictures: tempo. IEEE proceeding on International Conference on Image Processing, 641--644.

Index Terms

  1. Research on Scene Semantic Segmentation Based on Deep Learning

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    CIPAE 2020: Proceedings of the 2020 International Conference on Computers, Information Processing and Advanced Education
    October 2020
    527 pages
    ISBN:9781450387729
    DOI:10.1145/3419635
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 18 September 2020

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. CNN
    2. Deep Learning
    3. LISA Traffic Sign Dataset
    4. Semantic Segmentation
    5. Target Detection

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    CIPAE 2020

    Acceptance Rates

    CIPAE 2020 Paper Acceptance Rate 101 of 216 submissions, 47%;
    Overall Acceptance Rate 101 of 216 submissions, 47%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 55
      Total Downloads
    • Downloads (Last 12 months)2
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 13 Jan 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media