Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3488933.3489010acmotherconferencesArticle/Chapter ViewAbstractPublication PagesaiprConference Proceedingsconference-collections
research-article

Fast Super-Resolution Algorithm for Real-Time Communication

Published: 25 February 2022 Publication History
  • Get Citation Alerts
  • Abstract

    Video super-resolution aims to restore a high-resolution video frame from multiple low-resolution frames which can effectively improve the perceived quality of the video and enhance the user's visual experience in real-time video communication. Current video super-resolution algorithms pay more attention to super-resolution performance rather than the inference speed. Most of them adopt computationally expensive alignment and fusion module, which leads to high inference time cost and hinders the real-world deployment. Therefore, it is necessary to achieve a balance between inference speed and super-resolution performance. In this paper, we propose a fast video super-resolution network which is achieved through three lightweight alignment methods and implement it on the video restoration algorithm with enhanced deformable convolutional networks (EDVR). We trained the model through the Vimeo-90K training dataset, and tested the algorithm through the Vid4 and Vimeo-90K-T test datasets. The experimental results show that the inference time of the network with our alignment methods can be nearly 38% shorter than original EDVR.

    References

    [1]
    Wang, Z., Chen, J. and Hoi, S.C.H. 2020. Deep Learning for Image Super-resolution: A Survey. arXiv:1902.06068 [cs]. (Feb. 2020).
    [2]
    Li, S., He, F., Du, B., Zhang, L., Xu, Y. and Tao, D. 2019. Fast Spatio-Temporal Residual Network for Video Super-Resolution. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (Long Beach, CA, USA, Jun. 2019), 10514–10523.
    [3]
    Yi, P., Wang, Z., Jiang, K., Jiang, J. and Ma, J. 2019. Progressive Fusion Video Super-Resolution Network via Exploiting Non-Local Spatio-Temporal Correlations. 2019 IEEE/CVF International Conference on Computer Vision (ICCV) (Seoul, Korea (South), Oct. 2019), 3106–3115.
    [4]
    Ahn, N., Kang, B. and Sohn, K.-A. 2018. Fast, Accurate, and Lightweight Super-Resolution with Cascading Residual Network. arXiv:1803.08664 [cs]. (Oct. 2018).
    [5]
    Sajjadi, M.S.M., Vemulapalli, R. and Brown, M. 2018. Frame-Recurrent Video Super-Resolution. arXiv:1801.04590 [cs, stat]. (Mar. 2018).
    [6]
    Haris, M., Shakhnarovich, G. and Ukita, N. 2019. Recurrent Back-Projection Network for Video Super-Resolution. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (Long Beach, CA, USA, Jun. 2019), 3892–3901.
    [7]
    Drulea, M. and Nedevschi, S. 2011. Total variation regularization of local-global optical flow. 2011 14th International IEEE Conference on Intelligent Transportation Systems (ITSC) (Oct. 2011), 318–323.
    [8]
    Kappeler, A., Yoo, S., Dai, Q. and Katsaggelos, A.K. 2016. Video Super-Resolution With Convolutional Neural Networks. IEEE Transactions on Computational Imaging. 2, 2 (Jun. 2016), 109–122.
    [9]
    Liao, R., Tao, X., Li, R., Ma, Z. and Jia, J. 2015. Video Super-Resolution via Deep Draft-Ensemble Learning. (2015), 531–539.
    [10]
    Liu, D., Wang, Z., Fan, Y., Liu, X., Wang, Z., Chang, S. and Huang, T. 2017. Robust Video Super-Resolution With Learned Temporal Dynamics. (2017), 2507–2515.
    [11]
    Caballero, J., Ledig, C., Aitken, A., Acosta, A., Totz, J., Wang, Z. and Shi, W. 2017. Real-Time Video Super-Resolution with Spatio-Temporal Networks and Motion Compensation. arXiv:1611.05250 [cs]. (Apr. 2017).
    [12]
    Tao, X., Gao, H., Liao, R., Wang, J. and Jia, J. 2017. Detail-Revealing Deep Video Super-Resolution. (2017), 4472–4480.
    [13]
    Wang, X., Chan, K.C.K., Yu, K., Dong, C. and Loy, C.C. 2019. EDVR: Video Restoration with Enhanced Deformable Convolutional Networks. arXiv:1905.02716 [cs]. (May 2019).
    [14]
    Tian, Y., Zhang, Y., Fu, Y. and Xu, C. 2020. TDAN: Temporally-Deformable Alignment Network for Video Super-Resolution. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (Seattle, WA, USA, Jun. 2020), 3357–3366.
    [15]
    Xue, T., Chen, B., Wu, J., Wei, D. and Freeman, W.T. 2019. Video Enhancement with Task-Oriented Flow. International Journal of Computer Vision. 127, 8 (Aug. 2019), 1106–1125.
    [16]
    Liu, C. and Sun, D. 2014. On Bayesian Adaptive Video Super Resolution. IEEE Transactions on Pattern Analysis and Machine Intelligence. 36, 2 (Feb. 2014), 346–360.
    [17]
    Lai, W.-S., Huang, J.-B., Ahuja, N. and Yang, M.-H. 2017. Deep Laplacian Pyramid Networks for Fast and Accurate Super-Resolution. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (Honolulu, HI, Jul. 2017), 5835–5843.
    [18]
    Kingma, D.P. and Ba, J. 2017. Adam: A Method for Stochastic Optimization. arXiv:1412.6980 [cs]. (Jan. 2017).

    Index Terms

    1. Fast Super-Resolution Algorithm for Real-Time Communication
            Index terms have been assigned to the content through auto-classification.

            Recommendations

            Comments

            Information & Contributors

            Information

            Published In

            cover image ACM Other conferences
            AIPR '21: Proceedings of the 2021 4th International Conference on Artificial Intelligence and Pattern Recognition
            September 2021
            715 pages
            ISBN:9781450384087
            DOI:10.1145/3488933
            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            Published: 25 February 2022

            Permissions

            Request permissions for this article.

            Check for updates

            Author Tags

            1. Alignment
            2. Deformable Convolution
            3. Lightweight Network
            4. Temporal Convolution
            5. Video Super-Resolution

            Qualifiers

            • Research-article
            • Research
            • Refereed limited

            Conference

            AIPR 2021

            Contributors

            Other Metrics

            Bibliometrics & Citations

            Bibliometrics

            Article Metrics

            • 0
              Total Citations
            • 32
              Total Downloads
            • Downloads (Last 12 months)8
            • Downloads (Last 6 weeks)0

            Other Metrics

            Citations

            View Options

            Get Access

            Login options

            View options

            PDF

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader

            HTML Format

            View this article in HTML Format.

            HTML Format

            Media

            Figures

            Other

            Tables

            Share

            Share

            Share this Publication link

            Share on social media