Recent Progress in Transformer-based Medical Image Analysis

Liu, Zhaoshan; Lv, Qiujie; Yang, Ziduo; Li, Yifan; Lee, Chau Hung; Shen, Lei

doi:10.1016/j.compbiomed.2023.107268

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2208.06643 (eess)

[Submitted on 13 Aug 2022 (v1), last revised 25 Jul 2023 (this version, v4)]

Title:Recent Progress in Transformer-based Medical Image Analysis

Authors:Zhaoshan Liu, Qiujie Lv, Ziduo Yang, Yifan Li, Chau Hung Lee, Lei Shen

View PDF

Abstract:The transformer is primarily used in the field of natural language processing. Recently, it has been adopted and shows promise in the computer vision (CV) field. Medical image analysis (MIA), as a critical branch of CV, also greatly benefits from this state-of-the-art technique. In this review, we first recap the core component of the transformer, the attention mechanism, and the detailed structures of the transformer. After that, we depict the recent progress of the transformer in the field of MIA. We organize the applications in a sequence of different tasks, including classification, segmentation, captioning, registration, detection, enhancement, localization, and synthesis. The mainstream classification and segmentation tasks are further divided into eleven medical image modalities. A large number of experiments studied in this review illustrate that the transformer-based method outperforms existing methods through comparisons with multiple evaluation metrics. Finally, we discuss the open challenges and future opportunities in this field. This task-modality review with the latest contents, detailed information, and comprehensive comparison may greatly benefit the broad MIA community.

Comments:	Computers in Biology and Medicine Accepted
Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
MSC classes:	I.2.m, I.4.9, I.5.4, J.0
Cite as:	arXiv:2208.06643 [eess.IV]
	(or arXiv:2208.06643v4 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2208.06643
Related DOI:	https://doi.org/10.1016/j.compbiomed.2023.107268

Submission history

From: Zhaoshan Liu [view email]
[v1] Sat, 13 Aug 2022 13:13:41 UTC (3,182 KB)
[v2] Tue, 6 Sep 2022 07:39:12 UTC (4,267 KB)
[v3] Tue, 21 Mar 2023 12:54:36 UTC (2,360 KB)
[v4] Tue, 25 Jul 2023 08:53:16 UTC (2,357 KB)

✅2024-10-01: arxiv.org is back to normal.✅

Electrical Engineering and Systems Science > Image and Video Processing

Title:Recent Progress in Transformer-based Medical Image Analysis

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

✅2024-10-01: arxiv.org is back to normal.✅

Electrical Engineering and Systems Science > Image and Video Processing

Title:Recent Progress in Transformer-based Medical Image Analysis

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators