Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3503161.3548241acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

SGINet: Toward Sufficient Interaction Between Single Image Deraining and Semantic Segmentation

Published: 10 October 2022 Publication History
  • Get Citation Alerts
  • Abstract

    Data-driven single image deraining (SID) models have achieved greater progress by simulations, but there is still a large gap between current deraining performance and practical high-level applications, since high-level semantic information is usually neglected in current studies. Although few studies jointly considered high-level tasks (e.g., segmentation) to enable the model to learn more high-level information, there are two obvious shortcomings. First, they require the segmentation labels for training, limiting their operations on other datasets without high-level labels. Second, high- and low-level information are not fully interacted, hence having limited improvement in both deraining and segmentation tasks. In this paper, we propose a Semantic Guided Interactive Network (SGINet), which considers the sufficient interaction between SID and semantic segmentation using a three-stage deraining manner, i.e., coarse deraining, semantic information extraction, and semantics guided deraining. Specifically, a Full Resolution Module (FRM) without down-/up-sampling is proposed to predict the coarse deraining images without context damage. Then, a Segmentation Extracting Module (SEM) is designed to extract accurate semantic information. We also develop a novel contrastive semantic discovery (CSD) loss, which can instruct the process of semantic segmentation without real semantic segmentation labels. Finally, a triple-direction U-net-based Semantic Interaction Module (SIM) takes advantage of the coarse deraining images and semantic information for fully interacting low-level with high-level tasks. Extensive simulations on the newly-constructed complex datasets Cityscapes_syn and Cityscapes_real demonstrated that our model could obtain more promising results. Overall, our SGINet achieved SOTA deraining and segmentation performance in both simulation and real-scenario data, compared with other representative SID methods.

    Supplementary Material

    MP4 File (MM22-mmfp2037.mp4)
    This paper has improved the rain removal task by exploring the sufficient interaction problem between single image deraining and semantic segmentation. A new Semantic Guided Interactive Network (SGINet) with a three-stage deraining process was proposed. Three new modules are designed for different stages, including predicting coarse derained images without context damage, extracting accurate semantic information from previous products, taking advantage of the coarse derained images and semantic information for full interaction, and a novel contrastive semantic discovery loss to discover semantic information without real segmentation labels. Two new datasets Cityscapes_syn and Cityscapes_real were also constructed for joint deraining and segmentation. Extensive experiments demonstrate that SGINet can obtain more pleasant derained images and better segmentation results, with significant improvement. Results also demonstrated the strong robustness of our SGINet in real-scenario.

    References

    [1]
    Chenghao Chen and Hao Li. 2021. Robust representation learning with feedback for single image deraining. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 7742--7751.
    [2]
    Yi-Lei Chen and Chiou-Ting Hsu. 2013. A generalized low-rank appearance model for spatio-temporally correlated rain streaks. In Proceedings of the IEEE International Conference on Computer Vision. 1968--1975.
    [3]
    Marius Cordts, Mohamed Omran, Sebastian Ramos, Timo Rehfeld, Markus Enzweiler, Rodrigo Benenson, Uwe Franke, Stefan Roth, and Bernt Schiele. 2016. The cityscapes dataset for semantic urban scene understanding. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3213--3223.
    [4]
    Ze Cui, Jing Wang, Shangyin Gao, Tiansheng Guo, Yihui Feng, and Bo Bai. 2021. Asymmetric gained deep image compression with continuous rate adaptation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 10532--10541.
    [5]
    Xueyang Fu, Jiabin Huang, Delu Zeng, Yue Huang, Xinghao Ding, and John Paisley. 2017. Removing rain from single images via a deep detail network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3855--3863.
    [6]
    Shirsendu Sukanta Halder, Jean-François Lalonde, and Raoul de Charette. 2019. Physics-based rendering for improving robustness to rain. In Proceedings of the IEEE International Conference on Computer Vision. 10203--10212.
    [7]
    Dailan He, Yaoyan Zheng, Baocheng Sun, Yan Wang, and Hongwei Qin. 2021. Checkerboard context model for efficient learned image compression. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 14771--14780.
    [8]
    Xiaowei Hu, Chi-Wing Fu, Lei Zhu, and Pheng-Ann Heng. 2019. Depthattentional features for single-image rain removal. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 8022--8031.
    [9]
    Quan Huynh-Thu and Mohammed Ghanbari. 2008. Scope of validity of PSNR in image/video quality assessment. Electronics Letters 44, 13 (2008), 800--801.
    [10]
    Li-Wei Kang, Chia-Wen Lin, and Yu-Hsiang Fu. 2011. Automatic single-image based rain streaks removal via image decomposition. IEEE Transactions on Image Processing 21, 4 (2011), 1742--1755.
    [11]
    Guanbin Li, Yuan Xie, Liang Lin, and Yizhou Yu. 2017. Instance-level salient object segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2386--2395.
    [12]
    Ruoteng Li, Loong-Fah Cheong, and Robby T Tan. 2019. Heavy rain image restoration: integrating physics model and conditional adversarial learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1633--1642.
    [13]
    Siyuan Li, Iago Breno Araujo, Wenqi Ren, Zhangyang Wang, Eric K Tokuda, Roberto Hirata Junior, Roberto Cesar-Junior, Jiawan Zhang, Xiaojie Guo, and Xiaochun Cao. 2019. Single image deraining: a comprehensive benchmark analysis. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3838--3847.
    [14]
    Xia Li, JianlongWu, Zhouchen Lin, Hong Liu, and Hongbin Zha. 2018. Recurrent squeeze-and-excitation context aggregation net for single image deraining. In Proceedings of the European Conference on Computer Vision. 254--269.
    [15]
    Yi Li, Yi Chang, Changfeng Yu, and Luxin Yan. 2022. Close the Loop: a unified bottom-up and top-down paradigm for joint image deraining and segmentation. In Proceedings of the AAAI Conference on Artificial Intelligence. 1--9.
    [16]
    Yu Li, Robby T Tan, Xiaojie Guo, Jiangbo Lu, and Michael S Brown. 2016. Rain streak removal using layer priors. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2736--2744.
    [17]
    Dong Liang, Ling Li, MingqiangWei, Shuo Yang, Liyan Zhang,Wenhan Yang, Yun Du, and Huiyu Zhou. 2022. Semantically contrastive learning for low-light image enhancement. In Proceedings of the AAAI Conference on Artificial Intelligence. 1--9.
    [18]
    Ding Liu, BihanWen, Jianbo Jiao, Xianming Liu, ZhangyangWang, and Thomas S Huang. 2020. Connecting image denoising and high-level vision tasks via deep learning. IEEE Transactions on Image Processing 29 (2020), 3695--3706.
    [19]
    Yu Luo, Yong Xu, and Hui Ji. 2015. Removing rain from a single image via discriminative sparse coding. In Proceedings of the IEEE International Conference on Computer Vision. 3397--3405.
    [20]
    Rui Qian, Robby T Tan, Wenhan Yang, Jiajun Su, and Jiaying Liu. 2018. Attentive generative adversarial network for raindrop removal from a single image. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2482--2491.
    [21]
    Dongwei Ren, Wangmeng Zuo, Qinghua Hu, Pengfei Zhu, and Deyu Meng. 2019. Progressive image deraining networks: a better and simpler baseline. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3937--3946.
    [22]
    Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. 2015. Faster r-cnn: towards real-time object detection with region proposal networks. In Proceedings of the Advances in Neural Information Processing Systems. 91--99.
    [23]
    Wenqi Ren, Jingang Zhang, Xiangyu Xu, Lin Ma, Xiaochun Cao, Gaofeng Meng, and Wei Liu. 2018. Deep video dehazing with semantic segmentation. IEEE Transactions on Image Processing 28, 4 (2018), 1895--1908.
    [24]
    Ziyi Shen, Wei-Sheng Lai, Tingfa Xu, Jan Kautz, and Ming-Hsuan Yang. 2020. Exploiting semantics for face image deblurring. International Journal of Computer Vision 128, 7 (2020), 1829--1846.
    [25]
    K. Simonyan and A. Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. Computer Science (2014), 1--14.
    [26]
    HongWang, Qi Xie, Qian Zhao, and Deyu Meng. 2020. A model-driven deep neural network for single image rain removal. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3103--3112.
    [27]
    Jingdong Wang, Ke Sun, Tianheng Cheng, Borui Jiang, Chaorui Deng, Yang Zhao, Dong Liu, Yadong Mu, Mingkui Tan, Xinggang Wang, et al. 2020. Deep high-resolution representation learning for visual recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 43, 10 (2020), 3349--3364.
    [28]
    Tianyu Wang, Xin Yang, Ke Xu, Shaozhe Chen, Qiang Zhang, and Rynson WH Lau. 2019. Spatial attentive single-image deraining with a high quality real rain dataset. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 12270--12279.
    [29]
    Zhou Wang, Alan C Bovik, Hamid R Sheikh, and Eero P Simoncelli. 2004. Image quality assessment: from error visibility to structural similarity. IEEE Transactions on Image Processing 13, 4 (2004), 600--612.
    [30]
    Wei Wei, Deyu Meng, Qian Zhao, Zongben Xu, and Ying Wu. 2019. Semisupervised transfer learning for image rain removal. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3877--3886.
    [31]
    Yanyan Wei, Zhao Zhang, Yang Wang, Mingliang Xu, Yi Yang, Shuicheng Yan, and Meng Wang. 2021. Deraincyclegan: rain attentive cyclegan for single image deraining and rainmaking. IEEE Transactions on Image Processing 30 (2021), 4788--4801.
    [32]
    Yanyan Wei, Zhao Zhang, Yang Wang, Haijun Zhang, Mingbo Zhao, Mingliang Xu, and MengWang. 2021. Semi-deraingan: A new semi-supervised single image deraining. In Proceedings of the IEEE International Conference on Multimedia and Expo. 1--6.
    [33]
    YanyanWei, Zhao Zhang, Haijun Zhang, Richang Hong, and MengWang. 2019. A coarse-to-fine multi-stream hybrid deraining network for single image deraining. In Proceedings of the IEEE International Conference on Data Mining. 628--637.
    [34]
    Wenhan Yang, Robby T Tan, Jiashi Feng, Zongming Guo, Shuicheng Yan, and Jiaying Liu. 2019. Joint rain detection and removal from a single image with contextualized deep networks. IEEE Transactions on Pattern Analysis and Machine Intelligence 42, 6 (2019), 1377--1393.
    [35]
    Wenhan Yang, Robby T Tan, Jiashi Feng, Jiaying Liu, Zongming Guo, and Shuicheng Yan. 2017. Deep joint rain detection and removal from a single image. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1357--1366.
    [36]
    He Zhang and Vishal M Patel. 2018. Density-aware single image de-raining using a multi-stream dense network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 695--704.
    [37]
    He Zhang, Vishwanath Sindagi, and Vishal M Patel. 2019. Image de-raining using a conditional generative adversarial network. IEEE Transactions on Circuits and Systems for Video Technology 30, 11 (2019), 3943--3956.
    [38]
    Zhao Zhang, Yanyan Wei, Haijun Zhang, Yi Yang, Shuicheng Yan, and Meng Wang. 2021. Data-driven single image deraining: a comprehensive review and new perspectives. TechRxiv (2021), 1--20.
    [39]
    Hengshuang Zhao, Jianping Shi, Xiaojuan Qi, Xiaogang Wang, and Jiaya Jia. 2017. Pyramid scene parsing network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2881--2890.
    [40]
    Lei Zhu, Chi-Wing Fu, Dani Lischinski, and Pheng-Ann Heng. 2017. Joint bi-layer optimization for single-image rain streak removal. In Proceedings of the IEEE International Conference on Computer Vision. 2526--2534.

    Cited By

    View all
    • (2024)RCDNet: An Interpretable Rain Convolutional Dictionary Network for Single Image DerainingIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2022.323145335:6(8668-8682)Online publication date: Jul-2024
    • (2024)A Prior Guided Wavelet-Spatial Dual Attention Transformer Framework for Heavy Rain Image RestorationIEEE Transactions on Multimedia10.1109/TMM.2024.335948026(7043-7057)Online publication date: 2024
    • (2024)FRC-Net: A Simple Yet Effective Architecture for Low-Light Image EnhancementIEEE Transactions on Consumer Electronics10.1109/TCE.2023.328046770:1(3332-3340)Online publication date: Mar-2024
    • Show More Cited By

    Index Terms

    1. SGINet: Toward Sufficient Interaction Between Single Image Deraining and Semantic Segmentation

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      MM '22: Proceedings of the 30th ACM International Conference on Multimedia
      October 2022
      7537 pages
      ISBN:9781450392037
      DOI:10.1145/3503161
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 10 October 2022

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. contrastive semantic discovery loss
      2. semantic guided interactive network
      3. semantic interaction module
      4. single image deraining

      Qualifiers

      • Research-article

      Funding Sources

      Conference

      MM '22
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 995 of 4,171 submissions, 24%

      Upcoming Conference

      MM '24
      The 32nd ACM International Conference on Multimedia
      October 28 - November 1, 2024
      Melbourne , VIC , Australia

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)114
      • Downloads (Last 6 weeks)10
      Reflects downloads up to 26 Jul 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)RCDNet: An Interpretable Rain Convolutional Dictionary Network for Single Image DerainingIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2022.323145335:6(8668-8682)Online publication date: Jul-2024
      • (2024)A Prior Guided Wavelet-Spatial Dual Attention Transformer Framework for Heavy Rain Image RestorationIEEE Transactions on Multimedia10.1109/TMM.2024.335948026(7043-7057)Online publication date: 2024
      • (2024)FRC-Net: A Simple Yet Effective Architecture for Low-Light Image EnhancementIEEE Transactions on Consumer Electronics10.1109/TCE.2023.328046770:1(3332-3340)Online publication date: Mar-2024
      • (2024)Integrating Cross-Domain Feature Representation and Semantic Guidance for Underwater Image EnhancementIEEE Signal Processing Letters10.1109/LSP.2024.340590931(1511-1515)Online publication date: 2024
      • (2024)Recovering a clean backgroundPattern Recognition Letters10.1016/j.patrec.2024.01.006178:C(153-159)Online publication date: 17-Apr-2024
      • (2024)From heavy rain removal to detail restorationPattern Recognition10.1016/j.patcog.2023.110205148:COnline publication date: 17-Apr-2024
      • (2024)Cycle contrastive adversarial learning with structural consistency for unsupervised high-quality image deraining transformerNeural Networks10.1016/j.neunet.2024.106428178(106428)Online publication date: Oct-2024
      • (2024)Semantic-aware enhancementEngineering Applications of Artificial Intelligence10.1016/j.engappai.2023.107793130:COnline publication date: 2-Jul-2024
      • (2023)Real Rainy Scene Analysis: A Dual-Module Benchmark for Image Deraining and Segmentation2023 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)10.1109/ICMEW59549.2023.00018(69-74)Online publication date: Jul-2023
      • (2023)A Clearer Image: Improving Object Detection in Real Rainy Conditions with Two-Stage Processing2023 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)10.1109/ICMEW59549.2023.00016(57-62)Online publication date: Jul-2023
      • Show More Cited By

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media