N-Gram in Swin Transformers for Efficient Lightweight Image Super-Resolution

Choi, Haram; Lee, Jeongmin; Yang, Jihoon

Computer Science > Computer Vision and Pattern Recognition

arXiv:2211.11436 (cs)

[Submitted on 21 Nov 2022 (v1), last revised 20 Mar 2023 (this version, v3)]

Title:N-Gram in Swin Transformers for Efficient Lightweight Image Super-Resolution

Authors:Haram Choi, Jeongmin Lee, Jihoon Yang

View PDF

Abstract:While some studies have proven that Swin Transformer (Swin) with window self-attention (WSA) is suitable for single image super-resolution (SR), the plain WSA ignores the broad regions when reconstructing high-resolution images due to a limited receptive field. In addition, many deep learning SR methods suffer from intensive computations. To address these problems, we introduce the N-Gram context to the low-level vision with Transformers for the first time. We define N-Gram as neighboring local windows in Swin, which differs from text analysis that views N-Gram as consecutive characters or words. N-Grams interact with each other by sliding-WSA, expanding the regions seen to restore degraded pixels. Using the N-Gram context, we propose NGswin, an efficient SR network with SCDP bottleneck taking multi-scale outputs of the hierarchical encoder. Experimental results show that NGswin achieves competitive performance while maintaining an efficient structure when compared with previous leading methods. Moreover, we also improve other Swin-based SR methods with the N-Gram context, thereby building an enhanced model: SwinIR-NG. Our improved SwinIR-NG outperforms the current best lightweight SR approaches and establishes state-of-the-art results. Codes are available at this https URL.

Comments:	CVPR 2023 camera-ready. Codes are available at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2211.11436 [cs.CV]
	(or arXiv:2211.11436v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2211.11436

Submission history

From: Haram Choi [view email]
[v1] Mon, 21 Nov 2022 13:23:52 UTC (5,721 KB)
[v2] Thu, 2 Mar 2023 12:56:46 UTC (5,553 KB)
[v3] Mon, 20 Mar 2023 12:48:37 UTC (5,553 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:N-Gram in Swin Transformers for Efficient Lightweight Image Super-Resolution

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:N-Gram in Swin Transformers for Efficient Lightweight Image Super-Resolution

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators