Uformer: A General U-Shaped Transformer for Image Restoration

Wang, Zhendong; Cun, Xiaodong; Bao, Jianmin; Zhou, Wengang; Liu, Jianzhuang; Li, Houqiang

arXiv:2106.03106 (cs)

[Submitted on 6 Jun 2021 (v1), last revised 25 Nov 2021 (this version, v2)]

Abstract: In this paper, we present Uformer, an effective and efficient Transformer-based architecture for image restoration, in which we build a hierarchical encoder-decoder network using the Transformer block. Uformer has two core designs. First, we introduce a novel locally-enhanced window (LeWin) Transformer block, which performs non-overlapping window-based self-attention instead of global self-attention. This significantly reduces the computational complexity on high-resolution feature maps while capturing local context. Second, we propose a learnable multi-scale restoration modulator, in the form of a multi-scale spatial bias, to adjust features in multiple layers of the Uformer decoder. The modulator demonstrates superior capability for restoring details across various image restoration tasks while introducing only marginal extra parameters and computational cost. Powered by these two designs, Uformer has a high capability for capturing both local and global dependencies for image restoration. To evaluate our approach, we conduct extensive experiments on several image restoration tasks, including image denoising, motion deblurring, defocus deblurring, and deraining. Without bells and whistles, Uformer achieves superior or comparable performance compared with state-of-the-art algorithms. The code and models are available at this https URL.
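
As a rough illustration of the two ideas named in the abstract, the following NumPy sketch partitions a feature map into non-overlapping windows, runs single-head self-attention inside each window, and optionally adds a shared spatial bias to every window. It is a minimal sketch under stated assumptions, not the released Uformer code: the function names, the single-head formulation, the projection matrices `wq`, `wk`, `wv`, and the choice to model the modulator as a `(win*win, C)` bias added to each window's tokens before attention are all illustrative assumptions.

```python
import numpy as np

def window_partition(x, win):
    """Split an (H, W, C) feature map into non-overlapping win x win windows.

    Returns an array of shape (num_windows, win*win, C).
    """
    H, W, C = x.shape
    assert H % win == 0 and W % win == 0, "H and W must be multiples of win"
    x = x.reshape(H // win, win, W // win, win, C)
    x = x.transpose(0, 2, 1, 3, 4)          # (H/win, W/win, win, win, C)
    return x.reshape(-1, win * win, C)

def window_reverse(windows, win, H, W):
    """Inverse of window_partition: rebuild the (H, W, C) feature map."""
    C = windows.shape[-1]
    x = windows.reshape(H // win, W // win, win, win, C)
    x = x.transpose(0, 2, 1, 3, 4)          # (H/win, win, W/win, win, C)
    return x.reshape(H, W, C)

def softmax(a, axis=-1):
    a = a - a.max(axis=axis, keepdims=True)  # subtract max for stability
    e = np.exp(a)
    return e / e.sum(axis=axis, keepdims=True)

def window_self_attention(x, win, wq, wk, wv, modulator=None):
    """Single-head self-attention restricted to each window (hypothetical helper).

    modulator: optional (win*win, C) bias shared by all windows; this shape
    is an assumption made for the sketch.
    """
    H, W, C = x.shape
    tokens = window_partition(x, win)        # (nW, win*win, C)
    if modulator is not None:
        tokens = tokens + modulator          # broadcast over all windows
    q, k, v = tokens @ wq, tokens @ wk, tokens @ wv
    attn = softmax(q @ k.transpose(0, 2, 1) / np.sqrt(C))  # (nW, N, N)
    return window_reverse(attn @ v, win, H, W)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    feat = rng.standard_normal((8, 8, 4))
    wq, wk, wv = (rng.standard_normal((4, 4)) for _ in range(3))
    bias = np.zeros((16, 4))                 # stand-in for a learned modulator
    out = window_self_attention(feat, 4, wq, wk, wv, modulator=bias)
    print(out.shape)
```

Because each token attends only to the `win*win` tokens in its own window, the attention cost grows linearly with the number of pixels (roughly `H*W*win*win`) rather than quadratically (`(H*W)**2`) as in global self-attention, which is the complexity reduction the abstract refers to.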

Comments:	17 pages, 13 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2106.03106 [cs.CV]
	(or arXiv:2106.03106v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2106.03106

Submission history

From: Zhendong Wang
[v1] Sun, 6 Jun 2021 12:33:22 UTC (19,206 KB)
[v2] Thu, 25 Nov 2021 10:19:05 UTC (32,995 KB)
