research-article

Local Bidirection Recurrent Network for Efficient Video Deblurring with the Fused Temporal Merge Module

Published: 07 June 2023

Abstract

Video deblurring methods exploit the correlation between consecutive blurry inputs to generate sharp frames. However, designing a method that is both effective and efficient remains challenging. To guarantee effectiveness and further improve deblurring performance, we adopt a recurrent-based method as the baseline and reconsider both the recurrent mechanism and the temporal feature alignment used in state-of-the-art methods. For the recurrent mechanism, we add a local backward connection to the global forward recurrent backbone to effectively exploit accurate future information. For the temporal alignment, we adopt a fused temporal merge module that combines the strengths of flow-based and kernel-based methods with progressive correlation volume estimation. We evaluate our method on both synthetic datasets (GoPro, DVD) and a realistic dataset (BSD). The experimental results demonstrate that our method achieves a significant performance improvement over state-of-the-art video deblurring methods at a slight increase in computational cost. Extended ablation studies verify the effectiveness of our model.
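
The recurrent mechanism described in the abstract (a global forward recurrence augmented with a local backward connection) can be illustrated with a toy sketch. This is not the authors' network: scalars stand in for frame features, and `cell`, `local_backward`, `local_bidirectional`, the `window` parameter, and the averaging used to fuse the two states are all hypothetical choices made only for illustration.

```python
from typing import List


def cell(x: float, state: float) -> float:
    """Toy recurrent cell (hypothetical): blend the input with a carried state."""
    return 0.5 * x + 0.5 * state


def local_backward(frames: List[float], t: int, window: int) -> float:
    """Summarize up to `window` future frames by recursing from t+window back to t+1."""
    last = min(t + window, len(frames) - 1)
    state = 0.0
    for j in range(last, t, -1):
        state = cell(frames[j], state)
    return state


def local_bidirectional(frames: List[float], window: int = 2) -> List[float]:
    """Global forward recurrence; each step also sees a local backward summary."""
    forward = 0.0
    outputs = []
    for t, x in enumerate(frames):
        # Assumption: the backward pass only looks `window` frames ahead.
        future = local_backward(frames, t, window)
        # Assumption: forward and backward states are fused by simple averaging.
        forward = cell(x, 0.5 * (forward + future))
        outputs.append(forward)
    return outputs
```

In this sketch, the output for frame `t` depends on all past frames through the forward state but only on the next `window` frames through the backward pass, so frames further in the future never affect it. That bounded look-ahead is what distinguishes a local backward connection from a full bidirectional pass over the whole sequence.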

Supplementary Material

tomm-2022-0600-File003 (tomm-2022-0600-file003.zip)
Supplementary material

      Published In

      ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 19, Issue 5s
      October 2023
      280 pages
      ISSN:1551-6857
      EISSN:1551-6865
      DOI:10.1145/3599694
      • Editor: Abdulmotaleb El Saddik

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 07 June 2023
      Online AM: 13 March 2023
      Accepted: 07 March 2023
      Revised: 06 February 2023
      Received: 08 October 2022
      Published in TOMM Volume 19, Issue 5s

      Author Tags

      1. Video deblurring
      2. local bidirection
      3. fused temporal merge

      Qualifiers

      • Research-article

      Funding Sources

      • Fundamental Research Funds for the Central Universities, 111 Project, China
      • Shanghai Key Laboratory of Digital Media Processing and Transmissions, China

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months): 104
      • Downloads (Last 6 weeks): 13
      Reflects downloads up to 04 Oct 2024

      Citations

      Cited By

      • (2024) Continuous Space-Time Video Super-Resolution with Multi-Stage Motion Information Reorganization. ACM Transactions on Multimedia Computing, Communications, and Applications 20:9 (1-23). DOI: 10.1145/3665646. Online publication date: 21-May-2024
      • (2024) SEMScene: Semantic-Consistency Enhanced Multi-Level Scene Graph Matching for Image-Text Retrieval. ACM Transactions on Multimedia Computing, Communications, and Applications. DOI: 10.1145/3664816. Online publication date: 11-May-2024
      • (2024) Universal Relocalizer for Weakly Supervised Referring Expression Grounding. ACM Transactions on Multimedia Computing, Communications, and Applications 20:7 (1-23). DOI: 10.1145/3656045. Online publication date: 16-May-2024
      • (2024) Context-detail-aware United Network for Single Image Deraining. ACM Transactions on Multimedia Computing, Communications, and Applications 20:5 (1-18). DOI: 10.1145/3639407. Online publication date: 22-Jan-2024
      • (2024) Emotional Video Captioning With Vision-Based Emotion Interpretation Network. IEEE Transactions on Image Processing 33 (1122-1135). DOI: 10.1109/TIP.2024.3359045. Online publication date: 1-Feb-2024
      • (2024) Optimised Propainter for Video Diminished Reality Inpainting. 2024 IEEE International Symposium on Biomedical Imaging (ISBI) (1-5). DOI: 10.1109/ISBI56570.2024.10635380. Online publication date: 27-May-2024
      • (2024) Leveraging chaos for enhancing encryption and compression in large cloud data transfers. The Journal of Supercomputing 80:9 (11923-11957). DOI: 10.1007/s11227-024-05906-3. Online publication date: 4-Feb-2024
      • (2024) Dependability of Network Services in the Context of NFV: A Taxonomy and State of the Art Classification. Journal of Network and Systems Management 32:2. DOI: 10.1007/s10922-024-09810-2. Online publication date: 26-Mar-2024
      • (2024) Memory-based gradient-guided progressive propagation network for video deblurring. The Visual Computer. DOI: 10.1007/s00371-024-03306-5. Online publication date: 6-Mar-2024
      • (2023) Context-Aware Magnetic MIMO Wireless Charging with Parallel In-Band Communication. ACM Transactions on Sensor Networks 19:4 (1-24). DOI: 10.1145/3582692. Online publication date: 16-May-2023
