Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectives

Li, Jun; Chen, Junyu; Tang, Yucheng; Wang, Ce; Landman, Bennett A.; Zhou, S. Kevin

Computer Science > Computer Vision and Pattern Recognition

arXiv:2206.01136 (cs)

[Submitted on 2 Jun 2022 (v1), last revised 21 Nov 2022 (this version, v3)]

Title:Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectives

Authors:Jun Li, Junyu Chen, Yucheng Tang, Ce Wang, Bennett A. Landman, S. Kevin Zhou

View PDF

Abstract:Transformer, the latest technological advance of deep learning, has gained prevalence in natural language processing or computer vision. Since medical imaging bear some resemblance to computer vision, it is natural to inquire about the status quo of Transformers in medical imaging and ask the question: can the Transformer models transform medical imaging? In this paper, we attempt to make a response to the inquiry. After a brief introduction of the fundamentals of Transformers, especially in comparison with convolutional neural networks (CNNs), and highlighting key defining properties that characterize the Transformers, we offer a comprehensive review of the state-of-the-art Transformer-based approaches for medical imaging and exhibit current research progresses made in the areas of medical image segmentation, recognition, detection, registration, reconstruction, enhancement, etc. In particular, what distinguishes our review lies in its organization based on the Transformer's key defining properties, which are mostly derived from comparing the Transformer and CNN, and its type of architecture, which specifies the manner in which the Transformer and CNN are combined, all helping the readers to best understand the rationale behind the reviewed approaches. We conclude with discussions of future perspectives.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2206.01136 [cs.CV]
	(or arXiv:2206.01136v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2206.01136

Submission history

From: Jun Li [view email]
[v1] Thu, 2 Jun 2022 16:38:31 UTC (12,430 KB)
[v2] Fri, 3 Jun 2022 17:41:59 UTC (12,430 KB)
[v3] Mon, 21 Nov 2022 18:16:35 UTC (14,485 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectives

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectives

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators