CECT: Controllable Ensemble CNN and Transformer for COVID-19 image classification by capturing both local and global image features

Liu, Zhaoshan; Shen, Lei

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2302.02314v1 (eess)

COVID-19 e-print

Important: e-prints posted on arXiv are not peer-reviewed by arXiv; they should not be relied upon without context to guide clinical practice or health-related behavior and should not be reported in news media as established information without consulting multiple experts in the field.

[Submitted on 5 Feb 2023 (this version), latest version 31 Mar 2024 (v4)]

Title:CECT: Controllable Ensemble CNN and Transformer for COVID-19 image classification by capturing both local and global image features

Authors:Zhaoshan Liu, Lei Shen

View PDF

Abstract:Purpose: Most computer vision models are developed based on either convolutional neural network (CNN) or transformer, while the former (latter) method captures local (global) features. To relieve model performance limitations due to the lack of global (local) features, we develop a novel classification network named CECT by controllable ensemble CNN and transformer. Methods: The proposed CECT is composed of a CNN-based encoder block, a deconvolution-ensemble decoder block, and a transformer-based classification block. Different from conventional CNN- or transformer-based methods, our CECT can capture features at both multi-local and global scales, and the contribution of local features at different scales can be controlled with the proposed ensemble coefficients. Results: We evaluate CECT on two public COVID-19 datasets and it outperforms other state-of-the-art methods on all evaluation metrics. Conclusion: With remarkable feature capture ability, we believe CECT can also be used in other medical image classification scenarios to assist the diagnosis.

Comments:	20 pages, 5 figures
Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2302.02314 [eess.IV]
	(or arXiv:2302.02314v1 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2302.02314

Submission history

From: Zhaoshan Liu [view email]
[v1] Sun, 5 Feb 2023 06:27:45 UTC (3,491 KB)
[v2] Tue, 14 Mar 2023 12:12:00 UTC (3,406 KB)
[v3] Mon, 31 Jul 2023 15:56:24 UTC (3,461 KB)
[v4] Sun, 31 Mar 2024 11:58:28 UTC (3,577 KB)

Electrical Engineering and Systems Science > Image and Video Processing

Title:CECT: Controllable Ensemble CNN and Transformer for COVID-19 image classification by capturing both local and global image features

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Image and Video Processing

Title:CECT: Controllable Ensemble CNN and Transformer for COVID-19 image classification by capturing both local and global image features

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators