Abstract
This study presents a novel approach to diagnosing the highly contagious COVID-19 respiratory disease. Traditional diagnosis methods, such as polymerase chain reaction (PCR) and rapid antigen test (RAT), have been found to be resource-intensive and expensive, prompting the need for alternative diagnostic methods. Existing machine learning-based diagnosis approaches, such as X-rays and CT scans, suffer from suboptimal performance, primarily due to data imbalance and data paucity. To this end, this study proposes C3-PO, Cough sounds on Convolutional neural network (CNN) for COVID-19 PredictiOn. The framework utilises data augmentation and segmentation techniques to increase the volume of data to more than three times the original size. It includes an ensemble method to further mitigate the impacts of data paucity and data imbalances. Our CNN model was tested on the crowdsourced Coswara dataset and validated by the Russian dataset. It achieved an accuracy rate of 92.7% and an area under the receiver operating characteristics curve (AUC-ROC) of 98.1% on the Russian dataset, exceeding the existing works by 22% in terms of accuracy. On the Coswara dataset, the method achieved an accuracy rate of 72.3% and an AUC-ROC of 80.0%. Codes and evaluations are publicly available at https://github.com/ZakirANU/C3-PO-CovidCough-CNN.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Andreu-Perez, J., et al.: A generic deep learning based cough analysis system from clinically validated samples for point-of-need Covid-19 test and severity levels. IEEE Trans. Serv. Comput. 15(3), 1220–1232 (2021). https://doi.org/10.1109/TSC.2021.3061402
Ashby, A.E., Meister, J.A., Soldar, G., Nguyen, K.A.: A novel cough audio segmentation framework for covid-19 detection. In: Proceedings of the Symposium on Open Data and Knowledge for a Post-Pandemic Era ODAK22, UK, pp. 1–8 (2022). https://doi.org/10.14236/ewic/ODAK22.1
Bhattacharya, D., et al.: Coswara: a respiratory sounds and symptoms dataset for remote screening of SARS-COV-2 infection. Sci. Data 10(1), 397 (2023)
Duan, H., Wei, Y., Liu, P., Yin, H.: A novel ensemble framework based on k-means and resampling for imbalanced data. Appl. Sci. 10(5), 1684 (2020)
Furtado, A., da Purificação, C.A.C., Badaró, R., Nascimento, E.G.S.: A light deep learning algorithm for CT diagnosis of COVID-19 Pneumonia. Diagnostics 12(7), 1527 (2022). https://doi.org/10.3390/diagnostics12071527
Geertsen, A., Chmelyuk, V.: Dataset of recordings of induced cough (Dec 2020). https://github.com/covid19-cough/dataset
Hasan, M.R., Hasan, M.M., Hossain, M.Z.: How many mel-frequency cepstral coefficients to be utilized in speech recognition? a study with the Bengali language. J. Eng. 2021(12), 817–827 (2021). https://doi.org/10.1049/tje2.12082
Hossain, M.Z., Uddin, M.B., Yang, Y., Ahmed, K.A.: CovidEnvelope: an automated fast approach to diagnose Covid-19 from cough signals. In: 2021 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE), pp. 1–6. IEEE, Brisbane, Australia (Dec 2021). https://doi.org/10.1109/CSDE53843.2021.9718501
Imran, A., et al.: AI4COVID-19: AI enabled preliminary diagnosis for COVID-19 from cough samples via an app. Inform. Med. Unlocked 20, 100378 (2020). https://doi.org/10.1016/j.imu.2020.100378
Kassania, S.H., Kassanib, P.H., Wesolowskic, M.J., Schneidera, K.A., Detersa, R.: Automatic detection of coronavirus disease (Covid-19) in x-ray and CT images: a machine learning based approach. Biocybernet. Biomed. Eng. 41(3), 867–879 (2021)
Kranthi Kumar, L., Alphonse, P.: Covid-19 disease diagnosis with light-weight CNN using modified MFCC and enhanced GFCC from human respiratory sounds. Europ. Phys. J. Special Topics 231(18), 3329–3346 (2022)
Laguarta, J., Hueto, F., Subirana, B.: COVID-19 artificial intelligence diagnosis using only cough recordings. IEEE Open J. Eng. Med. Biol. 1, 275–281 (2020). https://doi.org/10.1109/OJEMB.2020.3026928
Liu, X., Hasan, M.R., Ahmed, K.A., Hossain, M.Z.: Machine learning to analyse omic-data for COVID-19 diagnosis and prognosis. BMC Bioinform. 24(7), 1–20 (2023). https://doi.org/10.1186/s12859-022-05127-6
Mahanta, S.K., Kaushik, D., Jain, S., Van Truong, H., Guha, K.: COVID-19 diagnosis from cough acoustics using convnets and data augmentation (May 2022). arXiv:2110.06123
Mohammed, E.A., Keyhani, M., Sanati-Nezhad, A., Hejazi, S.H., Far, B.H.: An ensemble learning approach to digital corona virus preliminary screening from cough sounds. Sci. Rep. 11(1), 15404 (2021)
Muguli, A., et al.: Dicova challenge: dataset, task, and baseline system for covid-19 diagnosis using acoustics. arXiv preprint arXiv:2103.09148 (2021). 10.48550/arXiv. 2103.09148
Orlandic, L., Teijeiro, T., Atienza, D.: The coughvid crowdsourcing dataset, a corpus for the study of large-scale cough analysis algorithms. Sci. Data 8(1), 156 (2021). https://doi.org/10.1038/s41597-021-00937-4
Pramono, R.X.A., Imtiaz, S.A., Rodriguez-Villegas, E.: A cough-based algorithm for automatic diagnosis of pertussis. PLoS ONE 11(9), e0162128 (2016). https://doi.org/10.1371/journal.pone.0162128
Richards, R.: Evidence on the accuracy of the number of reported covid-19 infections and deaths in lower-middle income countries. K4D Helpdesk Report 856 (2020). https://opendocs.ids.ac.uk/opendocs/handle/20.500.12413/15576
Schuller, B.W., Coppock, H., Gaskell, A.: Detecting covid-19 from breathing and coughing sounds using deep neural networks. arXiv preprint arXiv:2012.14553 (2020)
Sharma, N., et al.: Coswara - a database of breathing, cough, and voice sounds for Covid-19 diagnosis. In: Interspeech 2020, pp. 4811–4815 (Oct 2020). https://doi.org/10.21437/Interspeech.2020–2768
Sharma, N.K., Chetupalli, S.R., Bhattacharya, D., Dutta, D., Mote, P., Ganapathy, S.: The second dicova challenge: dataset and performance analysis for diagnosis of covid-19 using acoustics. In: ICASSP 2022–2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 556–560 (2022). https://doi.org/10.1109/ICASSP43922.2022.9747188
Sheehan, K.B.: Crowdsourcing research: data collection with amazon’s mechanical turk. Commun. Monogr. 85(1), 140–156 (2018). https://doi.org/10.1080/03637751.2017.1342043
Tian, S., Hu, W., Niu, L., Liu, H., Xu, H., Xiao, S.Y.: Pulmonary pathology of early-phase 2019 novel coronavirus (covid-19) pneumonia in two patients with Lung Cancer. J. Thorac. Oncol. 15(5), 700–704 (2020). https://doi.org/10.1016/j.jtho.2020.02.010
World Health Organization: WHO coronavirus (COVID-19) dashboard (2023). https://covid19.who.int/
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Chen, X., Khan, M.A.R., Hasan, M.R., Gedeon, T., Hossain, M.Z. (2024). C3-PO: A Convolutional Neural Network for COVID Onset Prediction from Cough Sounds. In: Rudinac, S., et al. MultiMedia Modeling. MMM 2024. Lecture Notes in Computer Science, vol 14556. Springer, Cham. https://doi.org/10.1007/978-3-031-53311-2_26
Download citation
DOI: https://doi.org/10.1007/978-3-031-53311-2_26
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-53310-5
Online ISBN: 978-3-031-53311-2
eBook Packages: Computer ScienceComputer Science (R0)