Feature distillation network for efficient super-resolution with vast receptive field

Zhang, Yanfeng; Tan, Wenan; Mao, Wenyi

doi:10.1007/s11760-024-03750-9

Original Paper
Published: 06 January 2025

Signal, Image and Video Processing, Volume 19, article number 191 (2025)

Yanfeng Zhang¹, Wenan Tan¹ & Wenyi Mao¹

Abstract

In recent years, convolutional neural networks have advanced rapidly, and numerous lightweight image super-resolution techniques have been proposed for deployment on edge devices. This paper examines the information distillation mechanism and the vast-receptive-field attention mechanism used in lightweight super-resolution. It then introduces a new network structure, the vast-receptive-field feature distillation network (VFDN), which improves inference speed and reduces GPU memory consumption. The receptive field of the attention block is expanded, and large dense convolution kernels are replaced with depth-wise separable convolutions. In addition, we modify the reconstruction block to obtain better reconstruction quality and introduce a Fourier transform-based loss function that emphasizes the frequency-domain information of the input image. Experiments show that VFDN achieves results comparable to RFDN with only 307K parameters (55.81% of RFDN), which is advantageous for deployment on edge devices.
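
To make the parameter saving from depth-wise separable convolutions concrete, the following is a minimal sketch of the weight counts involved. The channel width (48) and kernel size (7) are assumptions chosen for illustration and are not taken from the paper; the function names are likewise hypothetical. The last lines only rearrange the abstract's own figures (307K, 55.81%) to show the baseline size they imply.

```python
# Illustrative parameter counts. The channel width and kernel size below are
# assumptions chosen for the example, not values taken from the paper.

def dense_conv_params(c_in: int, c_out: int, k: int) -> int:
    """Weights in a standard (dense) k x k convolution, bias omitted."""
    return c_in * c_out * k * k


def depthwise_separable_params(c_in: int, c_out: int, k: int) -> int:
    """Weights in a k x k depth-wise convolution plus a 1 x 1 point-wise one."""
    depthwise = c_in * k * k   # one k x k filter per input channel
    pointwise = c_in * c_out   # 1 x 1 convolution that mixes channels
    return depthwise + pointwise


channels, kernel = 48, 7       # hypothetical attention-block settings
dense = dense_conv_params(channels, channels, kernel)
separable = depthwise_separable_params(channels, channels, kernel)
print(dense, separable, round(separable / dense, 4))

# Baseline size implied by the abstract: 307K parameters is 55.81% of RFDN.
implied_rfdn_k = 307 / 0.5581
print(round(implied_rfdn_k))
```

Under these assumed settings the separable layer needs roughly 4% of the dense layer's weights, which is why a large kernel becomes affordable in a lightweight network. The abstract's figures imply an RFDN baseline of about 550K parameters.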

Data availability

The data used to support the findings of this study are available from the corresponding author upon request.

References

Dong, C., Loy, C.C., He, K., Tang, X.: Image super-resolution using deep convolutional networks. IEEE Trans. Pattern Anal. Mach. Intell. 38(2), 295–307 (2015)
Kim, J., Lee, J.K., Lee, K.M.: Accurate image super-resolution using very deep convolutional networks. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1646–1654 (2016)
Lim, B., Son, S., Kim, H., Nah, S., Mu Lee, K.: Enhanced deep residual networks for single image super-resolution. In Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 136–144 (2017)
Kim, J., Lee, J.K., Lee, K.M.: Deeply-recursive convolutional network for image super-resolution. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1637–1645 (2016)
Tai, Y., Yang, J., Liu, X.: Image super-resolution via deep recursive residual network. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3147–3155 (2017)
Ahn, N., Kang, B., Sohn, K.-A.: Fast, accurate, and lightweight super-resolution with cascading residual network. In Proceedings of the European conference on computer vision (ECCV), pp 252–268 (2018)
Hui, Z., Gao, X., Yang, Y., Wang, X.: Lightweight image super-resolution with information multi-distillation network. In Proceedings of the 27th ACM international conference on multimedia, pp 2024–2032 (2019)
Hui, Z., Wang, X., Gao, X.: Fast and accurate single image super-resolution via information distillation network. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp 723–731 (2018)
Ding, X., Guo, Y., Ding, G., Han, J.: Acnet: Strengthening the kernel skeletons for powerful cnn via asymmetric convolution blocks. In Proceedings of the IEEE/CVF international conference on computer vision, pp 1911–1920 (2019)
Ding, X., Zhang, X., Han, J., Ding, G.: Diverse branch block: Building a convolution as an inception-like unit. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 10886–10895 (2021)
Zhang, X., Zeng, H., Zhang, L.: Edge-oriented convolution block for real-time super resolution on mobile devices. In Proceedings of the 29th ACM international conference on multimedia, pp 4034–4043 (2021)
Zhao, H., Kong, X., He, J., Qiao, Y., Dong, C.: Efficient image super-resolution using pixel attention. In Computer vision–ECCV 2020 workshops: Glasgow, UK, August 23–28, 2020, Proceedings, Part III 16, pp 56–72 (2020). Springer
Liu, J., Tang, J., Wu, G.: Residual feature distillation network for lightweight image super-resolution. In Computer vision–ECCV 2020 workshops: Glasgow, UK, August 23–28, 2020, Proceedings, Part III 16, pp 41–55 (2020). Springer
Li, Y., Gu, S., Zhang, K., Van Gool, L., Timofte, R.: Dhp: Differentiable meta pruning via hypernetworks. In Computer vision–ECCV 2020: 16th European conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part VIII 16, pp 608–624 (2020). Springer
Li, Y., Li, W., Danelljan, M., Zhang, K., Gu, S., Van Gool, L., Timofte, R.: The heterogeneity hypothesis: Finding layer-wise differentiated network architectures. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 2144–2153 (2021)
Kong, F., Li, M., Liu, S., Liu, D., He, J., Bai, Y., Chen, F., Fu, L.: Residual local feature network for efficient super-resolution. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 766–776 (2022)
Chu, X., Zhang, B., Ma, H., Xu, R., Li, Q.: Fast, accurate and lightweight super-resolution with neural architecture search. In 2020 25th international conference on pattern recognition (ICPR), pp 59–64 (2021). IEEE
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778 (2016)
Zeyde, R., Elad, M., Protter, M.: On single image scale-up using sparse-representations. In Curves and surfaces: 7th international conference, Avignon, France, June 24-30, 2010, Revised Selected Papers 7, pp 711–730 (2012). Springer
Li, Z., Liu, Y., Chen, X., Cai, H., Gu, J., Qiao, Y., Dong, C.: Blueprint separable residual network for efficient image super-resolution. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 833–843 (2022)
Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7132–7141 (2018)
Roy, A.G., Navab, N., Wachinger, C.: Concurrent spatial and channel ‘squeeze & excitation’ in fully convolutional networks. In Medical image computing and computer assisted intervention–MICCAI 2018: 21st international conference, Granada, Spain, September 16-20, 2018, Proceedings, Part I, pp 421–429 (2018). Springer
Zhang, Y., Li, K., Li, K., Wang, L., Zhong, B., Fu, Y.: Image super-resolution using very deep residual channel attention networks. In Proceedings of the European conference on computer vision (ECCV), pp 286–301 (2018)
Dai, T., Cai, J., Zhang, Y., Xia, S.-T., Zhang, L.: Second-order attention network for single image super-resolution. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 11065–11074 (2019)
Liang, J., Cao, J., Sun, G., Zhang, K., Van Gool, L., Timofte, R.: Swinir: Image restoration using swin transformer. In Proceedings of the IEEE/CVF international conference on computer vision, pp 1833–1844 (2021)
Chen, X., Wang, X., Zhou, J., Qiao, Y., Dong, C.: Activating more pixels in image super-resolution transformer. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 22367–22377 (2023)
Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., Lin, S., Guo, B.: Swin transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF international conference on computer vision, pp 10012–10022 (2021)
Deng, W., Yuan, H., Deng, L., Lu, Z.: Reparameterized residual feature network for lightweight image super-resolution. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 1712–1721 (2023)
Chen, H., Wang, Y., Guo, T., Xu, C., Deng, Y., Liu, Z., Ma, S., Xu, C., Xu, C., Gao, W.: Pre-trained image processing transformer. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 12299–12310 (2021)
Zhang, H., Hu, W., Wang, X.: Parc-net: Position aware circular convolution with merits from convnets and transformer. In European conference on computer vision, pp 613–630 (2022). Springer
Ding, X., Zhang, X., Han, J., Ding, G.: Scaling up your kernels to 31x31: Revisiting large kernel design in cnns. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 11963–11975 (2022)
Lai, W.-S., Huang, J.-B., Ahuja, N., Yang, M.-H.: Deep laplacian pyramid networks for fast and accurate super-resolution. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp 624–632 (2017)
Zhao, H., Gallo, O., Frosio, I., Kautz, J.: Loss functions for image restoration with neural networks. IEEE Trans. Comput. Imag. 3(1), 47–57 (2016)
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)
Dong, C., Loy, C.C., Tang, X.: Accelerating the super-resolution convolutional neural network. In Computer Vision–ECCV 2016: 14th European conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part II 14, pp 391–407 (2016). Springer
Lai, W.-S., Huang, J.-B., Ahuja, N., Yang, M.-H.: Fast and accurate image super-resolution with deep laplacian pyramid networks. IEEE Trans. Pattern Anal. Mach. Intell. 41(11), 2599–2613 (2018)
Tai, Y., Yang, J., Liu, X., Xu, C.: Memnet: a persistent memory network for image restoration. In Proceedings of the IEEE international conference on computer vision, pp 4539–4547 (2017)

Author information

Wenan Tan and Wenyi Mao contributed equally to this work.

Authors and Affiliations

School of Computer and Information Engineering, Shanghai Polytechnic University, Jinhai Road, Shanghai, 200000, China
Yanfeng Zhang, Wenan Tan & Wenyi Mao

Contributions

Yanfeng Zhang is responsible for the conception of the model and the implementation of the experiment, and is the author of the paper. Wenan Tan is responsible for the quality control of the whole process and acts as the corresponding author of the paper. Wenyi Mao is responsible for the verification and sorting of the experimental data.

Corresponding author

Correspondence to Wenan Tan.

Ethics declarations

Conflict of interest

All the authors declare that they have no competing financial interests or personal relationships that could influence the work reported in this paper.

Ethical approval

This article does not contain studies with human participants or animals. Statement of informed consent is not applicable since the manuscript does not contain any patient data.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Zhang, Y., Tan, W. & Mao, W. Feature distillation network for efficient super-resolution with vast receptive field. SIViP 19, 191 (2025). https://doi.org/10.1007/s11760-024-03750-9

Received: 01 August 2024
Revised: 17 September 2024
Accepted: 25 November 2024
Published: 06 January 2025
DOI: https://doi.org/10.1007/s11760-024-03750-9
