research-article

Local Bidirection Recurrent Network for Efficient Video Deblurring with the Fused Temporal Merge Module

Authors:

Wenjun ZhangAuthors Info & Claims

ACM Transactions on Multimedia Computing, Communications and Applications, Volume 19, Issue 5s

Article No.: 170, Pages 1 - 18

https://doi.org/10.1145/3587468

Published: 07 June 2023 Publication History

Abstract

Video deblurring methods exploit the correlation between consecutive blurry inputs to generate sharp frames. However, designing an effective and efficient method is a challenging problem for video deblurring. To guarantee the effectiveness and further improve the deblurring performance, we adopt the recurrent-based method as the baseline and reconsider the recurrent mechanism as well as the temporal feature alignment in the state-of-the-art methods. For the recurrent mechanism, we add the local backward connection to the global forward recurrent backbone to effectively exploit accurate future information. For the temporal alignment, we adopt a fused temporal merge module that exploits the superiority of flow-based and kernel-based methods with progressive correlation volumes estimation. In addition, we evaluate our method with both synthetic datasets (GoPro, DVD) and a realistic dataset (BSD). The experimental results demonstrate that our method achieves significant performance improvement with a slight computational cost increase against the state-of-the-art video deblurring methods. The extended ablation studies verify the effectiveness of our model.

Supplementary Material

tomm-2022-0600-File003 (tomm-2022-0600-file003.zip)

Supplementary material

Download
13.11 MB

References

[1]

Wenbo Bao, Wei-Sheng Lai, Chao Ma, Xiaoyun Zhang, Zhiyong Gao, and Ming-Hsuan Yang. 2019. Depth-aware video frame interpolation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 3703–3712.

[2]

Kelvin C. K. Chan, Xintao Wang, Ke Yu, Chao Dong, and Chen Change Loy. 2021. BasicVSR: The search for essential components in video super-resolution and beyond. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 4947–4956.

[3]

Kelvin C. K. Chan, Shangchen Zhou, Xiangyu Xu, and Chen Change Loy. 2022. BasicVSR++: Improving video super-resolution with enhanced propagation and alignment. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 5972–5981.

[4]

Sunghyun Cho, Jue Wang, and Seungyong Lee. 2012. Video deblurring for hand-held cameras using patch-based synthesis. ACM Transactions on Graphics 31, 4 (2012), 1–9.

Digital Library

[5]

Mauricio Delbracio and Guillermo Sapiro. 2015. Hand-held video deblurring via efficient fourier aggregation. IEEE Transactions on Computational Imaging 1, 4 (2015), 270–283.

[6]

Jiale He, Gaobo Yang, Xin Liu, and Xiangling Ding. 2020. Spatio-temporal saliency-based motion vector refinement for frame rate up-conversion. ACM Transactions on Multimedia Computing, Communications, and Applications 16, 2 (2020), 1–18.

Digital Library

[7]

Mengshun Hu, Kui Jiang, Zhixiang Nie, and Zheng Wang. 2022. You only align once: Bidirectional interaction for spatial-temporal video super-resolution. In Proceedings of the 30th ACM International Conference on Multimedia. 847–855.

Digital Library

[8]

Tae Hyun Kim, Byeongjoo Ahn, and Kyoung Mu Lee. 2013. Dynamic scene deblurring. In Proceedings of the IEEE International Conference on Computer Vision. 3160–3167.

Digital Library

[9]

Tae Hyun Kim and Kyoung Mu Lee. 2014. Segmentation-free dynamic scene deblurring. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2766–2773.

Digital Library

[10]

Tae Hyun Kim and Kyoung Mu Lee. 2015. Generalized video deblurring for dynamic scenes. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 5426–5434.

[11]

Tae Hyun Kim, Kyoung Mu Lee, Bernhard Scholkopf, and Michael Hirsch. 2017. Online video deblurring via dynamic temporal blending network. In Proceedings of the IEEE International Conference on Computer Vision. 4038–4047.

[12]

Tae Hyun Kim, Seungjun Nah, and Kyoung Mu Lee. 2017. Dynamic video deblurring using a locally adaptive blur model. IEEE Transactions on Pattern Analysis and Machine Intelligence 40, 10 (2017), 2374–2387.

Digital Library

[13]

Diederik P. Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).

[14]

Chen Li, Li Song, Rong Xie, and Wenjun Zhang. 2022. L0 structure-prior assisted blur-intensity aware efficient video deblurring. Neurocomputing 483 (2022), 195–209.

Digital Library

[15]

Dongxu Li, Chenchen Xu, Kaihao Zhang, Xin Yu, Yiran Zhong, Wenqi Ren, Hanna Suominen, and Hongdong Li. 2021. Arvo: Learning all-range volumetric correspondence for video deblurring. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 7721–7731.

[16]

Xianguo Li, Yemei Sun, Yanli Yang, and Changyun Miao. 2019. Symmetrical residual connections for single image super-resolution. ACM Transactions on Multimedia Computing, Communications, and Applications 15, 1 (2019), 1–10.

Digital Library

[17]

Yunpeng Li, Sing Bing Kang, Neel Joshi, Steve M. Seitz, and Daniel P. Huttenlocher. 2010. Generating sharp panoramas from motion-blurred videos. In Proceedings of the 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. IEEE, Los Alamitos, CA, 2424–2431.

[18]

Jing Lin, Yuanhao Cai, Xiaowan Hu, Haoqian Wang, Youliang Yan, Xueyi Zou, Henghui Ding, Yulun Zhang, Radu Timofte, and Luc Van Gool. 2022. Flow-guided sparse transformer for video deblurring. In Proceedings of the International Conference on Machine Learning. 13334–13343.

[19]

Yasuyuki Matsushita, Eyal Ofek, Weina Ge, Xiaoou Tang, and Heung-Yeung Shum. 2006. Full-frame video stabilization with motion inpainting. IEEE Transactions on Pattern Analysis and Machine Intelligence7 (2006), 1150–1163.

Digital Library

[20]

Seungjun Nah, Sungyong Baik, Seokil Hong, Gyeongsik Moon, Sanghyun Son, Radu Timofte, and Kyoung Mu Lee. 2019. NTIRE 2019 challenge on video deblurring and super-resolution: Dataset and study. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops.

[21]

Seungjun Nah, Tae Hyun Kim, and Kyoung Mu Lee. 2017. Deep multi-scale convolutional neural network for dynamic scene deblurring. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 3883–3891.

[22]

Seungjun Nah, Sanghyun Son, and Kyoung Mu Lee. 2019. Recurrent neural networks with intra-frame iterations for video deblurring. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 8102–8111.

[23]

Jinshan Pan, Haoran Bai, and Jinhui Tang. 2020. Cascaded deep video deblurring using temporal sharpness prior. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 3043–3051.

[24]

Wenqi Ren, Jinshan Pan, Xiaochun Cao, and Ming-Hsuan Yang. 2017. Video deblurring via semantic segmentation and pixel-wise non-linear kernel. In Proceedings of the IEEE International Conference on Computer Vision. 1077–1085.

[25]

Hyeongseok Son, Junyong Lee, Jonghyeop Lee, Sunghyun Cho, and Seungyong Lee. 2021. Recurrent video deblurring with blur-invariant motion estimation and pixel volumes. ACM Transactions on Graphics 40, 5 (2021), 1–18.

Digital Library

[26]

Shuochen Su, Mauricio Delbracio, Jue Wang, Guillermo Sapiro, Wolfgang Heidrich, and Oliver Wang. 2017. Deep video deblurring for hand-held cameras. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1279–1288.

[27]

Deqing Sun, Xiaodong Yang, Ming-Yu Liu, and Jan Kautz. 2018. PWC-Net: CNNs for optical flow using pyramid, warping, and cost volume. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 8934–8943.

[28]

Jian Sun, Wenfei Cao, Zongben Xu, and Jean Ponce. 2015. Learning a convolutional neural network for non-uniform motion blur removal. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 769–777.

[29]

Xintao Wang, Kelvin C. K. Chan, Ke Yu, Chao Dong, and Chen Change Loy. 2019. EDVR: Video restoration with enhanced deformable convolutional networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops.

[30]

Patrick Wieschollek, Michael Hirsch, Bernhard Scholkopf, and Hendrik Lensch. 2017. Learning blind motion deblurring. In Proceedings of the IEEE International Conference on Computer Vision. 231–240.

[31]

Jonas Wulff and Michael Julian Black. 2014. Modeling blurred video with layers. In Proceedings of the European Conference on Computer Vision. 236–252.

[32]

Peng Yi, Zhongyuan Wang, Kui Jiang, Junjun Jiang, Tao Lu, Xin Tian, and Jiayi Ma. 2021. Omniscient video super-resolution. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 4429–4438.

[33]

Haichao Zhang, David Wipf, and Yanning Zhang. 2014. Multi-observation blind deconvolution with an adaptive sparse prior. IEEE Transactions on Pattern Analysis and Machine Intelligence 36, 8 (2014), 1628–1643.

Digital Library

[34]

Huicong Zhang, Haozhe Xie, and Hongxun Yao. 2022. Spatio-temporal deformable attention network for video deblurring. In Computer Vision—ECCV 2022. Lecture Notes in Computer Science, Vol. 13676. Springer, 581–596.

Digital Library

[35]

Zhihang Zhong, Ye Gao, Yinqiang Zheng, and Bo Zheng. 2020. Efficient spatio-temporal recurrent neural network for video deblurring. In Proceedings of the European Conference on Computer Vision. 191–207.

Digital Library

[36]

Shangchen Zhou, Jiawei Zhang, Jinshan Pan, Haozhe Xie, Wangmeng Zuo, and Jimmy Ren. 2019. Spatio-temporal filter adaptive network for video deblurring. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 2482–2491.

Cited By

Zhang YYang DChen ZDing W(2024)Continuous Space-Time Video Super-Resolution with Multi-Stage Motion Information ReorganizationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/366564620:9(1-23)Online publication date: 21-May-2024
https://dl.acm.org/doi/10.1145/3665646
Liu YYuan XLi HTan ZHuang JXiao JLi WMo T(2024)SEMScene: Semantic-Consistency Enhanced Multi-Level Scene Graph Matching for Image-Text RetrievalACM Transactions on Multimedia Computing, Communications, and Applications10.1145/3664816Online publication date: 11-May-2024
https://dl.acm.org/doi/10.1145/3664816
Zhang PLiu MSong XCao DGao ZNie L(2024)Universal Relocalizer for Weakly Supervised Referring Expression GroundingACM Transactions on Multimedia Computing, Communications, and Applications10.1145/365604520:7(1-23)Online publication date: 16-May-2024
https://dl.acm.org/doi/10.1145/3656045
Show More Cited By

Index Terms

Local Bidirection Recurrent Network for Efficient Video Deblurring with the Fused Temporal Merge Module
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
        Reconstruction
  2. Computer graphics
    1. Image manipulation
      1. Image processing

Recommendations

Learn from the past – sequentially one-to-one video deblurring network
Highlights
- A spatio-temporal recurrent network to utilize prior frames to benefit de-blurring.
- A novel spatio-temporal attention module for informative feature maps.
- Outperforming the state-of-the-art in various real-world video deblurring ...
Abstract
With the growing availability of hand-held cameras in recent years, more and more images and videos are taken at any time and any place. However, they usually suffer from undesirable blur due to camera shake or object motion in the scene. In ...
Spatio-Temporal Deformable Attention Network for Video Deblurring
Computer Vision – ECCV 2022
Abstract
The key success factor of the video deblurring methods is to compensate for the blurry pixels of the mid-frame with the sharp pixels of the adjacent video frames. Therefore, mainstream methods align the adjacent frames based on the estimated ...
Real-World Video Deblurring: A Benchmark Dataset and an Efficient Recurrent Neural Network
Abstract
Real-world video deblurring in real time still remains a challenging task due to the complexity of spatially and temporally varying blur itself and the requirement of low computational cost. To improve the network efficiency, we adopt residual ...

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Multimedia Computing, Communications, and Applications

ACM Transactions on Multimedia Computing, Communications, and Applications Volume 19, Issue 5s

October 2023

280 pages

ISSN:1551-6857

EISSN:1551-6865

DOI:10.1145/3599694

Editor:
Abdulmotaleb El Saddik
Mohamed Bin Zayed University of Artificial Intelligence, UAE and University of Ottawa, Canada

Issue’s Table of Contents

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 June 2023

Online AM: 13 March 2023

Accepted: 07 March 2023

Revised: 06 February 2023

Received: 08 October 2022

Published in TOMM Volume 19, Issue 5s

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

Fundamental Research Funds for the Central Universities, 111 Project, China
Shanghai Key Laboratory of Digital Media Processing and Transmissions, China

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

16
Total Citations
View Citations
262
Total Downloads

Downloads (Last 12 months)104
Downloads (Last 6 weeks)13

Reflects downloads up to 04 Oct 2024

Other Metrics

View Author Metrics

Citations

Cited By

Zhang YYang DChen ZDing W(2024)Continuous Space-Time Video Super-Resolution with Multi-Stage Motion Information ReorganizationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/366564620:9(1-23)Online publication date: 21-May-2024
https://dl.acm.org/doi/10.1145/3665646
Liu YYuan XLi HTan ZHuang JXiao JLi WMo T(2024)SEMScene: Semantic-Consistency Enhanced Multi-Level Scene Graph Matching for Image-Text RetrievalACM Transactions on Multimedia Computing, Communications, and Applications10.1145/3664816Online publication date: 11-May-2024
https://dl.acm.org/doi/10.1145/3664816
Zhang PLiu MSong XCao DGao ZNie L(2024)Universal Relocalizer for Weakly Supervised Referring Expression GroundingACM Transactions on Multimedia Computing, Communications, and Applications10.1145/365604520:7(1-23)Online publication date: 16-May-2024
https://dl.acm.org/doi/10.1145/3656045
Hsu WLin H(2024)Context-detail-aware United Network for Single Image DerainingACM Transactions on Multimedia Computing, Communications, and Applications10.1145/363940720:5(1-18)Online publication date: 22-Jan-2024
https://dl.acm.org/doi/10.1145/3639407
Song PGuo DYang XTang SWang M(2024)Emotional Video Captioning With Vision-Based Emotion Interpretation NetworkIEEE Transactions on Image Processing10.1109/TIP.2024.335904533(1122-1135)Online publication date: 1-Feb-2024
https://dl.acm.org/doi/10.1109/TIP.2024.3359045
Li PLiu LSchönlieb CAviles-Rivero A(2024)Optimised Propainter for Video Diminished Reality Inpainting2024 IEEE International Symposium on Biomedical Imaging (ISBI)10.1109/ISBI56570.2024.10635380(1-5)Online publication date: 27-May-2024
https://doi.org/10.1109/ISBI56570.2024.10635380
Bhattacharjee SSharma HChoudhury TAbdelmoniem A(2024)Leveraging chaos for enhancing encryption and compression in large cloud data transfersThe Journal of Supercomputing10.1007/s11227-024-05906-380:9(11923-11957)Online publication date: 4-Feb-2024
https://dl.acm.org/doi/10.1007/s11227-024-05906-3
Azadiabad SKhendek F(2024)Dependability of Network Services in the Context of NFV: A Taxonomy and State of the Art ClassificationJournal of Network and Systems Management10.1007/s10922-024-09810-232:2Online publication date: 26-Mar-2024
https://dl.acm.org/doi/10.1007/s10922-024-09810-2
Song GGai SDa F(2024)Memory-based gradient-guided progressive propagation network for video deblurringThe Visual Computer10.1007/s00371-024-03306-5Online publication date: 6-Mar-2024
https://doi.org/10.1007/s00371-024-03306-5
Zhou WZhou HWang ZTan HLi X(2023)Context-Aware Magnetic MIMO Wireless Charging with Parallel In-Band CommunicationACM Transactions on Sensor Networks10.1145/358269219:4(1-24)Online publication date: 16-May-2023
https://dl.acm.org/doi/10.1145/3582692
Show More Cited By

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Full Text

View this article in Full Text.

Media

Figures

Other

Tables

View full text|Download PDF

View Issue’s Table of Contents