Towards a vision foundation model for comprehensive assessment of Cardiac MRI

Jacob, Athira J; Borgohain, Indraneel; Chitiboi, Teodora; Sharma, Puneet; Comaniciu, Dorin; Rueckert, Daniel

arXiv:2410.01665 (eess)

[Submitted on 2 Oct 2024 (v1), last revised 6 Oct 2024 (this version, v2)]

Abstract: Cardiac magnetic resonance imaging (CMR), considered the gold standard for noninvasive cardiac assessment, is a diverse and complex modality requiring a wide variety of image processing tasks for comprehensive assessment of cardiac morphology and function. Advances in deep learning have enabled the development of state-of-the-art (SoTA) models for these tasks. However, model training is challenging due to data and label scarcity, especially for the less common imaging sequences. Moreover, each model is often trained for a specific task, with no connection between related tasks. In this work, we introduce a vision foundation model for CMR assessment, trained in a self-supervised fashion on 36 million CMR images. We then finetune the model in a supervised way for 9 clinical tasks typical of a CMR workflow, spanning classification, segmentation, landmark localization, and pathology detection. We demonstrate improved accuracy and robustness across all tasks, over a range of available labeled dataset sizes. We also demonstrate improved few-shot learning with fewer labeled samples, a common challenge in medical image analysis. We achieve out-of-the-box performance comparable to SoTA for most clinical tasks. The proposed method thus presents a resource-efficient, unified framework for CMR assessment, with the potential to accelerate the development of deep learning-based solutions for image analysis tasks, even when little annotated data is available.

Comments: 11 pages, 3 figures, 4 tables
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2410.01665 [eess.IV] (or arXiv:2410.01665v2 [eess.IV] for this version)
DOI: https://doi.org/10.48550/arXiv.2410.01665

Submission history

From: Athira Jacob
[v1] Wed, 2 Oct 2024 15:32:01 UTC (682 KB)
[v2] Sun, 6 Oct 2024 22:28:20 UTC (712 KB)
