Abstract
In recent years, convolutional neural networks have seen rapid advancements, leading to the proposal of numerous lightweight image super-resolution techniques tailored for deployment on edge devices. This paper examines the information distillation mechanism and the vast-receptive-field attention mechanism utilized in lightweight super-resolution. Additionally, it introduces a new network structure named the vast-receptive-field feature distillation network, named VFDN, which effectively enhances inference speed and reduces GPU memory consumption. The receptive field of the attention block is expanded, and the utilization of large dense convolution kernels is substituted with depth-wise separable convolutions. Meanwhile, we modify the reconstruction block to obtain better reconstruction quality and introduce a Fourier transform-based loss function that emphasizes the frequency domain information of the input image. Experiments show that the designed VFDN achieves comparable results to RFDN, but the parameters are only 307K(55.81\(\%\) of RFDN), which is advantageous for deployment on edge devices.
Similar content being viewed by others
Data availability
The data used to support the findings of this study is available from the corresponding author upon request.
References
Dong, C., Loy, C.C., He, K., Tang, X.: Image super-resolution using deep convolutional networks. IEEE Trans. Pattern Anal. Mach. Intell. 38(2), 295–307 (2015)
Kim, J., Lee, J.K., Lee, K.M.: Accurate image super-resolution using very deep convolutional networks. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1646–1654 (2016)
Lim, B., Son, S., Kim, H., Nah, S., Mu Lee, K.: Enhanced deep residual networks for single image super-resolution. In Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 136–144 (2017)
Kim, J., Lee, J.K., Lee, K.M.: Deeply-recursive convolutional network for image super-resolution. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1637–1645 (2016)
Tai, Y., Yang, J., Liu, X.: Image super-resolution via deep recursive residual network. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3147–3155 (2017)
Ahn, N., Kang, B., Sohn, K.-A.: Fast, accurate, and lightweight super-resolution with cascading residual network. In Proceedings of the European conference on computer vision (ECCV), pp 252–268 (2018)
Hui, Z., Gao, X., Yang, Y., Wang, X.: Lightweight image super-resolution with information multi-distillation network. In Proceedings of the 27th Acm international conference on multimedia, pp 2024–2032 (2019)
Hui, Z., Wang, X., Gao, X.: Fast and accurate single image super-resolution via information distillation network. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp 723–731 (2018)
Ding, X., Guo, Y., Ding, G., Han, J.: Acnet: Strengthening the kernel skeletons for powerful cnn via asymmetric convolution blocks. In Proceedings of the IEEE/CVF international conference on computer vision, pp 1911–1920 (2019)
Ding, X., Zhang, X., Han, J., Ding, G.: Diverse branch block: Building a convolution as an inception-like unit. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 10886–10895 (2021)
Zhang, X., Zeng, H., Zhang, L.: Edge-oriented convolution block for real-time super resolution on mobile devices. In Proceedings of the 29th ACM international conference on multimedia, pp 4034–4043 (2021)
Zhao, H., Kong, X., He, J., Qiao, Y., Dong, C.: Efficient image super-resolution using pixel attention. In Computer vision–ECCV 2020 workshops: Glasgow, UK, August 23–28, 2020, Proceedings, Part III 16, pp 56–72 (2020). Springer
Liu, J., Tang, J., Wu, G.: Residual feature distillation network for lightweight image super-resolution. In Computer vision–ECCV 2020 workshops: Glasgow, UK, August 23–28, 2020, Proceedings, Part III 16, pp 41–55 (2020). Springer
Li, Y., Gu, S., Zhang, K., Van Gool, L., Timofte, R.: Dhp: Differentiable meta pruning via hypernetworks. In Computer vision–ECCV 2020: 16th European conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part VIII 16, pp 608–624 (2020). Springer
Li, Y., Li, W., Danelljan, M., Zhang, K., Gu, S., Van Gool, L., Timofte, R.: The heterogeneity hypothesis: Finding layer-wise differentiated network architectures. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 2144–2153 (2021)
Kong, F., Li, M., Liu, S., Liu, D., He, J., Bai, Y., Chen, F., Fu, L.: Residual local feature network for efficient super-resolution. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 766–776 (2022)
Chu, X., Zhang, B., Ma, H., Xu, R., Li, Q.: Fast, accurate and lightweight super-resolution with neural architecture search. In 2020 25th international conference on pattern recognition (ICPR), pp 59–64 (2021). IEEE
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778 (2016)
Zeyde, R., Elad, M., Protter, M.: On single image scale-up using sparse-representations. In Curves and surfaces: 7th international conference, Avignon, France, June 24-30, 2010, Revised Selected Papers 7, pp 711–730 (2012). Springer
Li, Z., Liu, Y., Chen, X., Cai, H., Gu, J., Qiao, Y., Dong, C.: Blueprint separable residual network for efficient image super-resolution. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 833–843 (2022)
Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7132–7141 (2018)
Roy, A.G., Navab, N., Wachinger, C.: Concurrent spatial and channel ‘squeeze & excitation’in fully convolutional networks. In Medical image computing and computer assisted intervention–MICCAI 2018: 21st international conference, Granada, Spain, September 16-20, 2018, Proceedings, Part I, pp 421–429 (2018). Springer
Zhang, Y., Li, K., Li, K., Wang, L., Zhong, B., Fu, Y.: Image super-resolution using very deep residual channel attention networks. In Proceedings of the European conference on computer vision (ECCV), pp 286–301 (2018)
Dai, T., Cai, J., Zhang, Y., Xia, S.-T., Zhang, L.: Second-order attention network for single image super-resolution. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 11065–11074 (2019)
Liang, J., Cao, J., Sun, G., Zhang, K., Van Gool, L., Timofte, R.: Swinir: Image restoration using swin transformer. In Proceedings of the IEEE/CVF international conference on computer vision, pp 1833–1844 (2021)
Chen, X., Wang, X., Zhou, J., Qiao, Y., Dong, C.: Activating more pixels in image super-resolution transformer. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 22367–22377 (2023)
Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., Lin, S., Guo, B.: Swin transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF international conference on computer vision, pp 10012–10022 (2021)
Deng, W., Yuan, H., Deng, L., Lu, Z.: Reparameterized residual feature network for lightweight image super-resolution. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 1712–1721 (2023)
Chen, H., Wang, Y., Guo, T., Xu, C., Deng, Y., Liu, Z., Ma, S., Xu, C., Xu, C., Gao, W.: Pre-trained image processing transformer. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 12299–12310 (2021)
Liang, J., Cao, J., Sun, G., Zhang, K., Van Gool, L., Timofte, R.: Swinir: Image restoration using swin transformer. In Proceedings of the IEEE/CVF international conference on computer vision, pp 1833–1844 (2021)
Zhang, H., Hu, W., Wang, X.: Parc-net: Position aware circular convolution with merits from convnets and transformer. In European conference on computer vision, pp 613–630 (2022). Springer
Ding, X., Zhang, X., Han, J., Ding, G.: Scaling up your kernels to 31x31: Revisiting large kernel design in cnns. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 11963–11975 (2022)
Lai, W.-S., Huang, J.-B., Ahuja, N., Yang, M.-H.: Deep laplacian pyramid networks for fast and accurate super-resolution. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp 624–632 (2017)
Zhao, H., Gallo, O., Frosio, I., Kautz, J.: Loss functions for image restoration with neural networks. IEEE Trans. Comput. Imag. 3(1), 47–57 (2016)
Lai, W.-S., Huang, J.-B., Ahuja, N., Yang, M.-H.: Deep laplacian pyramid networks for fast and accurate super-resolution. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp 624–632 (2017)
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)
Dong, C., Loy, C.C., Tang, X.: Accelerating the super-resolution convolutional neural network. In Computer Vision–ECCV 2016: 14th European conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part II 14, pp 391–407 (2016). Springer
Lai, W.-S., Huang, J.-B., Ahuja, N., Yang, M.-H.: Fast and accurate image super-resolution with deep laplacian pyramid networks. IEEE Trans. Pattern Anal. Mach. Intell. 41(11), 2599–2613 (2018)
Tai, Y., Yang, J., Liu, X., Xu, C.: Memnet: a persistent memory network for image restoration. In Proceedings of the IEEE international conference on computer vision, pp 4539–4547 (2017)
Author information
Authors and Affiliations
Contributions
Yanfeng Zhang is responsible for the conception of the model and the implementation of the experiment, and is the author of the paper. Wenan Tan is responsible for the quality control of the whole process and acts as the corresponding author of the paper. Wenyi Mao is responsible for the verification and sorting of the experimental data.
Corresponding author
Ethics declarations
Conflict of interest
All the authors declare that they have no competing financial interests or personal relationships that could influence the work reported in this paper.
Ethical approval
This article does not contain studies with human participants or animals. Statement of informed consent is not applicable since the manuscript does not contain any patient data.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Zhang, Y., Tan, W. & Mao, W. Feature distillation network for efficient super-resolution with vast receptive field. SIViP 19, 191 (2025). https://doi.org/10.1007/s11760-024-03750-9
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11760-024-03750-9