Feature Re-Embedding: Towards Foundation Model-Level Performance in Computational Pathology

Tang, Wenhao; Zhou, Fengtao; Huang, Sheng; Zhu, Xiang; Zhang, Yi; Liu, Bo

arXiv:2402.17228 (cs)

[Submitted on 27 Feb 2024 (v1), last revised 25 Jul 2024 (this version, v4)]

Abstract: Multiple instance learning (MIL) is the most widely used framework in computational pathology, encompassing sub-typing, diagnosis, prognosis, and more. However, the existing MIL paradigm typically requires an offline instance feature extractor, such as a pre-trained ResNet or a foundation model. This approach cannot fine-tune features for the specific downstream task, which limits its adaptability and performance. To address this issue, we propose a Re-embedded Regional Transformer (R$^2$T) for re-embedding the instance features online, which captures fine-grained local features and establishes connections across different regions. Unlike existing works that focus on pre-training a powerful feature extractor or designing a sophisticated instance aggregator, R$^2$T is tailored to re-embed instance features online. It serves as a portable module that can be seamlessly integrated into mainstream MIL models. Extensive experimental results on common computational pathology tasks validate that: 1) feature re-embedding improves the performance of MIL models based on ResNet-50 features to the level of foundation model features, and further enhances the performance of foundation model features; 2) R$^2$T can introduce more significant performance improvements to various MIL models; 3) R$^2$T-MIL, as an R$^2$T-enhanced AB-MIL, outperforms other latest methods by a large margin. The code is available at: this https URL.
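
To make the idea of online re-embedding concrete, the following is a minimal NumPy sketch, not the authors' R$^2$T implementation. It assumes that pre-extracted instance features are split into fixed-size 1D chunks (a simplification of spatial regions), re-embedded with single-head self-attention inside each chunk plus a residual connection, and then aggregated with attention-based pooling in the style of AB-MIL. The function names `regional_re_embed` and `attention_pool`, the weight matrices, and all sizes are hypothetical and chosen only for illustration.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def regional_re_embed(feats, region_size, w_q, w_k, w_v):
    """Hypothetical stand-in for a re-embedding module.

    Applies single-head self-attention independently inside each
    fixed-size chunk of instances and adds the result to the input
    features (residual connection).
    """
    n, d = feats.shape
    pad = (-n) % region_size  # zero-pad so n is divisible by region_size
    padded = np.concatenate([feats, np.zeros((pad, d))], axis=0)
    valid = np.concatenate([np.ones(n, dtype=bool), np.zeros(pad, dtype=bool)])
    regions = padded.reshape(-1, region_size, d)
    valid = valid.reshape(-1, region_size)
    q, k, v = regions @ w_q, regions @ w_k, regions @ w_v
    scores = q @ k.transpose(0, 2, 1) / np.sqrt(d)
    # Padded instances must not be attended to.
    scores = np.where(valid[:, None, :], scores, -1e9)
    out = regions + softmax(scores, axis=-1) @ v
    return out.reshape(-1, d)[:n]  # drop the padded rows again

def attention_pool(feats, v, w):
    """Attention-based pooling: weights = softmax(w^T tanh(V^T h_i))."""
    scores = np.tanh(feats @ v) @ w
    weights = softmax(scores, axis=0)
    return weights @ feats, weights

# Toy bag: 10 instances with 8-dimensional offline features (random here).
rng = np.random.default_rng(0)
n, d, hidden = 10, 8, 4
feats = rng.normal(size=(n, d))
w_q, w_k, w_v = (rng.normal(scale=0.1, size=(d, d)) for _ in range(3))
v_att = rng.normal(size=(d, hidden))
w_att = rng.normal(size=hidden)

re_embedded = regional_re_embed(feats, 4, w_q, w_k, w_v)
bag, weights = attention_pool(re_embedded, v_att, w_att)
```

In a trainable version, the projection weights of the re-embedding step would be optimized jointly with the aggregator on the downstream task, which is what distinguishes online re-embedding from a frozen offline extractor.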

Comments:	Accepted by CVPR2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2402.17228 [cs.CV]
	(or arXiv:2402.17228v4 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2402.17228

Submission history

From: Sheng Huang
[v1] Tue, 27 Feb 2024 05:42:38 UTC (8,718 KB)
[v2] Sun, 7 Apr 2024 02:43:54 UTC (8,719 KB)
[v3] Tue, 9 Apr 2024 01:10:15 UTC (8,719 KB)
[v4] Thu, 25 Jul 2024 01:20:23 UTC (8,720 KB)
