A deep CNN-based acoustic model for the identification of lung diseases utilizing extracted MFCC features from respiratory sounds

Alghamdi, Norah Saleh; Zakariah, Mohammed; Karamti, Hanen

doi:10.1007/s11042-024-18703-0

A deep CNN-based acoustic model for the identification of lung diseases utilizing extracted MFCC features from respiratory sounds

Published: 12 March 2024

Volume 83, pages 82871–82903, (2024)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

341 Accesses
Explore all metrics

Abstract

Machine learning algorithms have recently been increasingly used in medical data, particularly in healthcare areas where image processing techniques have played a crucial role. This study aims to utilize artificial intelligence (AI) techniques to forecast respiratory diseases by implementing a deep convolutional neural network (CNN) structure. The study employs an extensive dataset, specifically the Public Breathing Sound Database, which includes breathing sounds from 126 individuals with six different respiratory disorders. Furthermore, the main aim of this study is to tackle the difficulties related to the precise detection of lung disorders by creating a strong and effective model. The study examines the intricacies of pre-processing audio data, augmenting it, and extracting information from it. The primary focus is the utilization of Mel-frequency cepstral coefficients (MFCC) to identify significant characteristics of respiratory sounds. The suggested methodology utilizes a deep CNN structure to analyze retrieved characteristics and accurately identify diseases by detecting patterns and correlations. Moreover, the outcomes demonstrate a significant improvement in the precision of the model following the implementation of data balancing and augmentation strategies. The created model obtains a remarkable accuracy of 97.4% on the validation dataset, showcasing its effectiveness in training. Furthermore, it maintains a high accuracy of 95.1% on the independent test dataset. This research adds to the expanding collection of studies at the crossroads of AI and healthcare and shows great potential for promptly and precisely detecting respiratory disorders using acoustic signals. The results highlight the capacity of deep learning methods to transform diagnostic procedures in respiratory healthcare fundamentally.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A review on lung disease recognition by acoustic signal analysis with deep learning networks

Article Open access 12 June 2023

Deep learning models for detecting respiratory pathologies from raw lung auscultation sounds

Article 26 September 2022

Convolutional neural networks based efficient approach for classification of lung diseases

Article Open access 23 December 2019

Data availability

The dataset used in this study can be found at “https://www.kaggle.com/vbookshelf/respiratory-sound-database”.

Abbreviations

AI:: Artificial Intelligence
ML:: Machine Learning
MFCC:: Mel-frequency cepstral coefficients
PD:: Parkinson's syndrome
SVM:: Support Vector Machine
Grad-CAM:: Gradient-weighted class activation mapping
BiGRU:: Bidirectional GRU
URTI:: Upper Respiratory Tract Infection
ReLU:: Rectified Linear Unit
FFT:: Fast Fourier transform
FP/ FN:: False Positive/ False Negative
ROC:: Receiver Operating Characteristic
CNN:: Convolutional Neural Network
DL:: Deep Learning
COPD:: Chronic Obstructive Pulmonary Disease
ABG:: Arterial Blood Gas test
RSD:: Respiratory Sound dataset
GRU:: Gated Recurrent Unit
TCN:: Gated temporal convolution layer
LRTI:: Lower Respiratory Tract Infection
FC:: Fully Connected
TP/ TN :: True Positive/ True Negative
CPU:: Central Processing Unit

References

Ulukaya S, Sarıca AA, Erdem O, Karaali A (2023) MSCCov19Net: multi-branch deep learning model for COVID-19 detection from cough sounds. Med Biol Eng Comput 61(7):1619–1629. https://doi.org/10.1007/s11517-023-02803-4
Article Google Scholar
Mashika M, van der Haar D (2023) Mel frequency Cepstral coefficients and Support Vector machines for Cough Detection pp 250–259.https://doi.org/10.1007/978-3-031-35748-0_18
Nayak SS, Darji AD, Shah PK (2023) Machine learning approach for detecting Covid-19 from speech signal using Mel frequency magnitude coefficient. Signal Image Video Process 17(6):3155–3162. https://doi.org/10.1007/s11760-023-02537-8
Article Google Scholar
Garcia-Mendez JP et al (2023) Machine learning for automated classification of abnormal lung sounds obtained from public databases: a systematic review. Bioengineering 10(10):1155. https://doi.org/10.3390/bioengineering10101155
Article Google Scholar
Issahaku FY, Liu X, Lu K, Fang X, Danwana SB, Asimeng E (2024) Multimodal deep learning model for Covid-19 detection. Biomed Signal Process Control 91:105906. https://doi.org/10.1016/j.bspc.2023.105906
Article Google Scholar
Prajapati SK, Choudhary TS, Mishra S (2023) Early detection of lung disease using multi-class classifiers. In: IEEE 4th Annual Flagship India Council International Subsections Conference (INDISCON), IEEE, pp 01–06.https://doi.org/10.1109/INDISCON58499.2023.10270105
Lal KN (2023) A lung sound recognition model to diagnoses the respiratory diseases by using transfer learning. Multimed Tools Appl 82(23):36615–36631. https://doi.org/10.1007/s11042-023-14727-0
Article Google Scholar
Shivaanivarsha N, Sriram A, Saravaanan S, Rajesh V (2023) Respiratory sound analysis for lung disease diagnosis. In: International Conference on Ambient Intelligence, Knowledge Informatics and Industrial Electronics (AIKIIE), IEEE, pp 1–4. https://doi.org/10.1109/AIKIIE60097.2023.10390099
Dubey R, Bodade RM, Dubey D (2023) Efficient classification of the adventitious sounds of the lung through a combination of SVM-LSTM-Bayesian optimization algorithm with features based on wavelet bi-phase and bi-spectrum. Res Biomed Eng 39(2):349–363. https://doi.org/10.1007/s42600-023-00270-2
Article Google Scholar
Ge B, Yang H, Ma P, Guo T, Pan J, Wang W (2023) Detection of pulmonary hypertension associated with congenital heart disease based on time-frequency domain and deep learning features. Biomed Signal Process Control 81:104316. https://doi.org/10.1016/j.bspc.2022.104316
Article Google Scholar
Cinyol F, Baysal U, Köksal D, Babaoğlu E, Ulaşlı SS (2023) Incorporating support vector machine to the classification of respiratory sounds by convolutional neural network. Biomed Signal Process Control 79:104093. https://doi.org/10.1016/j.bspc.2022.104093
Article Google Scholar
Zakaria N, Sundaraj K (2023) VGG16-based deep learning architectures for classification of lung sounds into normal, crackles, and wheezes using gammatonegrams. In: 2023 International Conference on Information Technology (ICIT), IEEE, pp 83–88. https://doi.org/10.1109/ICIT58056.2023.10225790
Mridha K, Sarkar S, Kumar D (2021) Respiratory disease classification by CNN using MFCC. In: 2021 IEEE 6th International Conference on Computing, Communication and Automation (ICCCA), IEEE, pp 517–523. https://doi.org/10.1109/ICCCA52192.2021.9666346
Shuvo SB, Ali SN, Swapnil SI, Hasan T, Bhuiyan MIH (2021) A lightweight CNN model for detecting respiratory diseases from lung auscultation sounds using EMD-CWT-Based hybrid scalogram. IEEE J Biomed Health Inform 25(7):2595–2603. https://doi.org/10.1109/JBHI.2020.3048006
Article Google Scholar
Deeven VR, Kumar VN, Padma Sai Y, Akshitha N, Kaivalya M (2023) Pulmonary sound analysis with deep learning for efficient respiratory disease categorization. In Second International Conference on Emerging Trends in Engineering (ICETE 2023). Atlantis Press, pp 68–78
Tariq Z, Shah SK, Lee Y (2022) Feature-based Fusion using CNN for Lung and Heart Sound classification. Sensors 22(4):1521. https://doi.org/10.3390/s22041521
Article Google Scholar
Zulfiqar R, Majeed F, Irfan R, Rauf HT, Benkhelifa E, Belkacem AN (2021) Abnormal respiratory sounds classification using deep CNN through artificial noise addition. Front Med (Lausanne) 8. https://doi.org/10.3389/fmed.2021.714811
Balasubramanian S, Rajadurai P (2023) Machine learning-based classification of pulmonary diseases through real-time lung sounds. Int J Eng Technol Innov 14(1):85–102. https://doi.org/10.46604/ijeti.2023.12294
Article Google Scholar
Sadi TM, Hassan R (2020) Development of classification methods for wheeze and crackle using mel frequency cepstral coefficient (MFCC): a deep learning approach. Int J Percept Cognit Comput 6(2):107–114. https://doi.org/10.31436/ijpcc.v6i2.166
Roy A, Satija U (2023) RDLINet: a novel lightweight inception network for respiratory disease classification using lung sounds. IEEE Trans Instrum Meas 72:1–13. https://doi.org/10.1109/TIM.2023.3292953
Article Google Scholar
Alqudah AM, Qazan S, Obeidat YM (2022) Deep learning models for detecting respiratory pathologies from raw lung auscultation sounds. Soft Comput 26(24):13405–13429. https://doi.org/10.1007/s00500-022-07499-6
Article Google Scholar
Fraiwan M, Fraiwan L, Alkhodari M, Hassanin O (2022) Recognition of pulmonary diseases from lung sounds using convolutional neural networks and long short-term memory. J Ambient Intell Humaniz Comput 13(10):4759–4771. https://doi.org/10.1007/s12652-021-03184-y
Article Google Scholar
Baghel N, Nangia V, Dutta MK (2021) ALSD-Net: Automatic lung sounds diagnosis network from pulmonary signals. Neural Comput Appl 33(24):17103–17118. https://doi.org/10.1007/s00521-021-06302-1
Article Google Scholar
Kim Y, Camacho D, Choi C (2024) Real-time multi-class classification of respiratory diseases through dimensional data combinations. Cognit Comput 16(2):776–787. https://doi.org/10.1007/s12559-023-10228-2
Article Google Scholar
Xia T, Han J, Mascolo C (2022) Exploring machine learning for audio-based respiratory condition screening: A concise review of databases, methods, and open issues. Exp Biol Med 247(22):2053–2061. https://doi.org/10.1177/15353702221115428
Article Google Scholar
Alice RS, Wendling L, Santosh K (2023) 2D respiratory sound analysis to detect lung abnormalities. 46–58. https://doi.org/10.1007/978-3-031-23599-3_5
Huang D-M, Huang J, Qiao K, Zhong N-S, Lu H-Z, Wang W-J (2023) Deep learning-based lung sound analysis for intelligent stethoscope. Mil Med Res 10(1):44. https://doi.org/10.1186/s40779-023-00479-3
Article Google Scholar
Kranthi Kumar L, Alphonse PJA (2022) COVID-19: respiratory disease diagnosis with regularized deep convolutional neural network using human respiratory sounds. Eur Phys J Spec Top 231(18–20):3673–3696. https://doi.org/10.1140/epjs/s11734-022-00649-9
Article Google Scholar
Sonali CS, Kiran J, Chinmayi BS, Suma KV, Easa M (2023) Transformer-based network for Accurate classification of lung auscultation sounds. Crit Rev Biomed Eng 51(6):1–16. https://doi.org/10.1615/CritRevBiomedEng.2023048981
Article Google Scholar
Zhang P, Swaminathan A, Uddin AA (2023) Pulmonary disease detection and classification in patient respiratory audio files using long short-term memory neural networks. Front Med (Lausanne) 10. https://doi.org/10.3389/fmed.2023.1269784
Alqudaihi KS et al (2021) Cough sound detection and diagnosis using artificial intelligence techniques: challenges and opportunities. IEEE Access 9:102327–102344. https://doi.org/10.1109/ACCESS.2021.3097559
Article Google Scholar
Dar JA, Srivastava KK, Mishra A (2023) Lung anomaly detection from respiratory sound database (sound signals). Comput Biol Med 164:107311. https://doi.org/10.1016/j.compbiomed.2023.107311
Article Google Scholar
Choi Y, Lee H (2023) Interpretation of lung disease classification with light attention connected module. Biomed Signal Process Control 84:104695. https://doi.org/10.1016/j.bspc.2023.104695
Article Google Scholar
Basu V, Rana S (2020) Respiratory diseases recognition through respiratory sound with the help of deep neural network. In: 2020 4th International Conference on Computational Intelligence and Networks (CINE), IEEE, pp 1–6. https://doi.org/10.1109/CINE48825.2020.234388
Lo Giudice M, Mammone N, Ieracitano C, Aguglia U, Mandic D, Morabito FC (2022) Explainable Deep Learning Classification of Respiratory Sound for Telemedicine Applications. In International Conference on Applied Intelligence and Informatics. Cham: Springer Nature Switzerland, pp 391–403
Nassif AB, Shahin I, Bader M, Hassan A, Werghi N (2022) COVID-19 detection systems using deep-learning algorithms based on speech and image data. Mathematics 10(4):564. https://doi.org/10.3390/math10040564
Article Google Scholar
Saeed T, Ijaz A, Sadiq I, Qureshi HN, Rizwan A, Imran A (2024) An AI-Enabled bias-free respiratory disease diagnosis model using cough audio. Bioengineering 11(1):55. https://doi.org/10.3390/bioengineering11010055
Article Google Scholar
Dey RK, Das AK (2023) Modified term frequency-inverse document frequency based deep hybrid framework for sentiment analysis. Multimed Tools Appl 82(21):32967–32990. https://doi.org/10.1007/s11042-023-14653-1
Article Google Scholar
Dey RK, Das AK (2022) A simple strategy for handling ‘NOT’ can improve the performance of sentiment analysis pp 255–267. https://doi.org/10.1007/978-981-19-3089-8_25

Download references

Funding

This research was funded by the Deanship of Scientific Research at Princess Nourah bint Abdulrahman University, through the Research Funding Program, Grant No. (FRP-1444-1)

Author information

Authors and Affiliations

Department of Computer Sciences, College of Computer and Information Sciences, Princess Nourah Bint Abdulrahman University, P.O.Box 84428, Riyadh, 11671, Saudi Arabia
Norah Saleh Alghamdi & Hanen Karamti
Department of Computer Science, College of Computer and Information Sciences, King Saud University, P.O. Box 57168, Riyadh, 21574, Saudi Arabia
Mohammed Zakariah

Authors

Norah Saleh Alghamdi
View author publications
You can also search for this author in PubMed Google Scholar
Mohammed Zakariah
View author publications
You can also search for this author in PubMed Google Scholar
Hanen Karamti
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Norah Saleh Alghamdi.

Ethics declarations

Conflict of interest

The authors declare that there is no conflict of interest regarding the publication of this paper.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Alghamdi, N.S., Zakariah, M. & Karamti, H. A deep CNN-based acoustic model for the identification of lung diseases utilizing extracted MFCC features from respiratory sounds. Multimed Tools Appl 83, 82871–82903 (2024). https://doi.org/10.1007/s11042-024-18703-0

Download citation

Received: 15 July 2023
Revised: 10 February 2024
Accepted: 23 February 2024
Published: 12 March 2024
Issue Date: October 2024
DOI: https://doi.org/10.1007/s11042-024-18703-0

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A deep CNN-based acoustic model for the identification of lung diseases utilizing extracted MFCC features from respiratory sounds

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

A review on lung disease recognition by acoustic signal analysis with deep learning networks

Deep learning models for detecting respiratory pathologies from raw lung auscultation sounds

Convolutional neural networks based efficient approach for classification of lung diseases

Data availability

Abbreviations

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

A deep CNN-based acoustic model for the identification of lung diseases utilizing extracted MFCC features from respiratory sounds

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

A review on lung disease recognition by acoustic signal analysis with deep learning networks

Deep learning models for detecting respiratory pathologies from raw lung auscultation sounds

Convolutional neural networks based efficient approach for classification of lung diseases

Data availability

Abbreviations

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation