DPNet: A Dual Path Network for Road Scene Semantic Segmentation

Ye, Lu; Zhu, Jiayi; Zhou, Wujie; Duan, Ting; Sugianto, Sugianto; Agordzo, George Kofi; Yeboah, Derrick; Kevin, Mukonde Tonderayi

doi:10.1007/978-3-662-61510-2_6

Lu Ye^12,13,
Jiayi Zhu¹³,
Wujie Zhou^12,13,
Ting Duan¹³,
Sugianto Sugianto¹²,
George Kofi Agordzo¹²,
Derrick Yeboah¹² &
…
Mukonde Tonderayi Kevin¹²

Part of the book series: Lecture Notes in Computer Science ((TEDUTAIN,volume 11782))

1036 Accesses

Abstract

Road scene segmentation has always been regarded as a pixel-wise task in computer vision studies. In this paper, we introduce a practical and new features fusion structure named “Dual Path Network” for road semantic segmentation. This form aims to reduce the gap between low-level and high-level information, thereby improving features fusion. The Dual Path consists of two subpaths: Context Path and Spatial Path. In the Context Path, we select a pre-trained ResNet-101 model as the backbone and use multi-scale convolution blocks comprise the Spatial Path. Then, we create a fusion residual block and channel attention model to further optimize the network. The results of the experiment confirm a state-of-the-art mean intersection-over-union of 68.5% using the CamVid dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Research on Lightweight Road Semantic Segmentation Algorithm Based on DeepLabv3+

LFFNet: lightweight feature-enhanced fusion network for real-time semantic segmentation of road scenes

Article 05 March 2024

Real-time segmentation algorithm of unstructured road scenes based on improved BiSeNet

Article 12 May 2024

References

Long, J., Shelhamer, E., Darrell, T.: Intelligence, fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3431–3440 (2015)
Google Scholar
Chen, L.-C., Papandreou, G., Schroff, F., Adam, H.: Rethinking atrous convolution for semantic image segmentation (2017). arXiv:1706.05587
Lin, G., Milan, A., Shen, C., Reid, I.: RefineNet: multi-path refinement networks for high-resolution semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1925–1934 (2017)
Google Scholar
Guo, X., et al.: GAN-based virtual-to-real image translation for urban scene semantic segmentation. Neurocomputing. (2019). https://doi.org/10.1016/j.neucom.2019.01.115
Article Google Scholar
Yu, H., et al.: Methods and datasets on semantic segmentation: a review. Neurocomputing 304, 82–103 (2018)
Article Google Scholar
Ronneberger, O., Fischer, P., Brox, T.: U-net: convolutional networks for biomedical image segmentation. In: Proceedings of the International Conference on Medical Image Computing and Computer-assisted Intervention, pp. 234–241 (2015)
Google Scholar
Quan, T.M., Hildebrand, D.G., Jeong, W.K.: Fusionnet: a deep fully residual convolutional neural network for image segmentation in connectomics (2016). arXiv:1612.05360
Wang, P., et al.: Understanding convolution for semantic segmentation. In: Proceedings of the IEEE Winter Conference on Applications of Computer Vision, pp. 1451–1460 (2018)
Google Scholar
Chen, L.C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.L.: DeepLab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs. IEEE Trans. Pattern Anal. Mach. Intell. 40, 834–848 (2017)
Article Google Scholar
Yu, F., Koltun, V.: Multi-scale context aggregation by dilated convolutions (2015). arXiv:1511.07122
Zhang, R., Tang, S., Zhang, Y., Li, J., Yan, S.: Scale-adaptive convolutions for scene parsing. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2031–2039 (2017)
Google Scholar
Yu, C., Wang, J., Peng, C., Gao, C., Yu, G., Sang, N.: Learning a discriminative feature network for semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1857–1866 (2018)
Google Scholar
Chen, L., Zhang, H., Xiao, J., Nie, L., Shao, J., Liu, W.: SCA-CNN: spatial and channel-wise attention in convolutional networks for image captioning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5659–5667 (2017)
Google Scholar
Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7132–7141 (2018)
Google Scholar
Mnih, V., Heess, N., Graves, A.: Recurrent models of visual attention. In: Proceedings of the Advances in Neural Information Processing Systems, pp. 2204–2212 (2014)
Google Scholar
Wang, F., Jiang, M., Qian, C., Yang, S., Li, C., Zhang, H.: Residual attention network for image classification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3156–3164 (2017)
Google Scholar
Chen, L.-C., Yang, Y., Wang, J., Xu, W., Yuille, A.L.: Attention to scale: scale-aware semantic image segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3640–3649 (2016)
Google Scholar
Ghiasi, G., Fowlkes, C.C.: Laplacian pyramid reconstruction and refinement for semantic segmentation. In: Proceedings of the European Conference on Computer Vision, pp. 519–534 (2016)
Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Google Scholar
Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S.: Imagenet large scale visual recognition challenge. Int. J. Comput. Vision 115, 211–252 (2015)
Article MathSciNet Google Scholar
Ibtehaz, N., Rahman, M.S.: MultiResUNet: Rethinking the U-Net Architecture for Multimodal Biomedical Image Segmentation (2019). arXiv:1902.04049
Berman, M., Triki, A.R., Blaschko, M.B.: The Lovász-Softmax loss: a tractable surrogate for the optimization of the intersection-over-union measure in neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4413–4421 (2018)
Google Scholar
Jaccard, P.: Étude comparative de la distribution florale dans une portion des Alpes et des Jura. Bull Soc Vaudoise Sci Nat. 37, 547–579 (1901)
Google Scholar
Bach, F.: Learning with submodular functions: A convex optimization perspective. Found. Trends® Mach. Learn. 6, 145–373 (2013)
Google Scholar
Kendall, A., Badrinarayanan, V., Cipolla, R.: Bayesian SegNet: model uncertainty in deep convolutional encoder-decoder architectures for scene understanding (2015). arXiv:1511.02680
Paszke, A., Chaurasia, A., Kim, S., Culurciello, E.: ENet: a deep neural network architecture for real-time semantic segmentation (2016). arXiv:1606.02147
Chaurasia, A., Culurciello, E.: LinkNet: exploiting encoder representations for efficient semantic segmentation. In: Proceedings of the IEEE Visual Communications and Image Processing, pp. 1–4 (2017)
Google Scholar
Iglovikov, V., Shvets, A.: TernausNet: U-net with VGG11 encoder pre-trained on imagenet for image segmentation (2018). arXiv:1801.05746
Visin, F., et al.: ReSeg: a recurrent neural network-based model for semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 41–48 (2016)
Google Scholar
Li, H., Xiong, P., Fan, H., Sun, J.: DFANet: deep feature aggregation for real-time semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 9522–9531 (2019)
Google Scholar
Yu, C., Wang, J., Peng, C., Gao, C., Yu, G., Sang, N.: Learning a discriminative feature network for semantic segmentation (2018). arXiv:1804.09337

Download references

Author information

Authors and Affiliations

School of Information and Electronic Engineering, Zhejiang University of Science and Technology, Hangzhou, 310023, China
Lu Ye, Wujie Zhou, Sugianto Sugianto, George Kofi Agordzo, Derrick Yeboah & Mukonde Tonderayi Kevin
School of Mechanical and Energy Engineering, Zhejiang University of Science and Technology, Hangzhou, 310023, China
Lu Ye, Jiayi Zhu, Wujie Zhou & Ting Duan

Authors

Lu Ye
View author publications
You can also search for this author in PubMed Google Scholar
Jiayi Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Wujie Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Ting Duan
View author publications
You can also search for this author in PubMed Google Scholar
Sugianto Sugianto
View author publications
You can also search for this author in PubMed Google Scholar
George Kofi Agordzo
View author publications
You can also search for this author in PubMed Google Scholar
Derrick Yeboah
View author publications
You can also search for this author in PubMed Google Scholar
Mukonde Tonderayi Kevin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Lu Ye .

Editor information

Editors and Affiliations

Hangzhou Normal University, Hangzhou, China
Zhigeng Pan
Imagineering Institute, Nusajaya, Malaysia
Adrian David Cheok
University of Education, Weingarten, Germany
Wolfgang Müller
Zhejiang University, Hangzhou, China
Mingmin Zhang

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Ye, L. et al. (2020). DPNet: A Dual Path Network for Road Scene Semantic Segmentation. In: Pan, Z., Cheok, A., Müller, W., Zhang, M. (eds) Transactions on Edutainment XVI. Lecture Notes in Computer Science(), vol 11782. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-61510-2_6

Download citation

DOI: https://doi.org/10.1007/978-3-662-61510-2_6
Published: 12 April 2020
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-61509-6
Online ISBN: 978-3-662-61510-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

DPNet: A Dual Path Network for Road Scene Semantic Segmentation

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Research on Lightweight Road Semantic Segmentation Algorithm Based on DeepLabv3+

LFFNet: lightweight feature-enhanced fusion network for real-time semantic segmentation of road scenes

Real-time segmentation algorithm of unstructured road scenes based on improved BiSeNet

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

DPNet: A Dual Path Network for Road Scene Semantic Segmentation

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Research on Lightweight Road Semantic Segmentation Algorithm Based on DeepLabv3+

LFFNet: lightweight feature-enhanced fusion network for real-time semantic segmentation of road scenes

Real-time segmentation algorithm of unstructured road scenes based on improved BiSeNet

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation