Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

An ensemble technique to predict Parkinson's disease using machine learning algorithms

Published: 17 July 2024 Publication History
  • Get Citation Alerts
  • Highlights

    Parkinson's Disease (PD) prediction is based on optimised machine learning algorithms and an ensemble feature selection algorithm.
    Three datasets containing voice samples were utilised to analyse performance metrics comprehensively.
    Performance evaluation was conducted using 10-fold cross-validation to calculate mean accuracy with standard deviation, ensuring robust evaluation across all datasets.
    The proposed model, employing ensemble feature selection techniques, demonstrated superior predictive performance across all datasets.

    Abstract

    Parkinson's Disease (PD) is a progressive neurodegenerative disorder affecting motor and non-motor symptoms. Its symptoms develop slowly, making early identification difficult. Machine learning has a significant potential to predict Parkinson's disease on features hidden in voice data. This work aimed to identify the most relevant features from a high-dimensional dataset, which helps accurately classify Parkinson's Disease with less computation time. Three individual datasets with various medical features based on voice have been analyzed in this work. An Ensemble Feature Selection Algorithm (EFSA) technique based on filter, wrapper, and embedding algorithms that pick highly relevant features for identifying Parkinson's Disease is proposed, and the same has been validated on three different datasets based on voice. These techniques can shorten training time to improve model accuracy and minimize overfitting. We utilized different ML models such as K-Nearest Neighbors (KNN), Random Forest, Decision Tree, Support Vector Machine (SVM), Bagging Classifier, Multi-Layer Perceptron (MLP) Classifier, and Gradient Boosting. Each of these models was fine-tuned to ensure optimal performance within our specific context. Moreover, in addition to these established classifiers, we proposed an ensemble classifier is found on a high optimal majority of the votes. Dataset-I achieves classification accuracy with 97.6 %, F1-score 97.9 %, precision with 98 % and recall with 98 %. Dataset-II achieves classification accuracy 90.2 %, F1-score 90.2 %, precision 90.2 %, and recall 90.5 %. Dataset-III achieves 83.3 % accuracy, F1-score 83.3 %, precision 83.5 % and recall 83.3 %. These results have been taken using 13 out of 23, 45 out of 754, and 17 out of 46 features from respective datasets. The proposed EFSA model has performed with higher accuracy and is more efficient than other models for each dataset.

    References

    [1]
    A. Al-Husban, M.M. Abdulridha, A.A.H. Mohamad, A.M. Ibrahim, Biocomposite's multiple uses for a new approach in the diagnosis of Parkinson's disease using a machine learning algorithm, Adsorpt. Sci. Technol. 2022 (2022) 1–7,.
    [2]
    A.M. Ali, F. Salim, F. Saeed, Parkinson's disease detection using filter feature selection and a genetic algorithm with ensemble learning, Diagnostics 13 (17) (2023),.
    [3]
    M. Al-Sarem, F. Saeed, W. Boulila, A.H. Emara, M. Al-Mohaimeed, M. Errais, Feature selection and classification using CatBoost method for improving the performance of predicting Parkinson's disease, Advances in Intelligent Systems and Computing, Springer Science and Business Media Deutschland GmbH, 2021, pp. 189–199,.
    [4]
    E. Avuçlu, A. Elen, Evaluation of train and test performance of machine learning algorithms and Parkinson diagnosis with statistical measurements, Med. Biol. Eng. Comput. 58 (11) (2020) 2775–2788,.
    [5]
    G. Bao, M. Lin, X. Sang, Y. Hou, Y. Liu, Y. Wu, Classification of dysphonic voices in Parkinson's disease with semi-supervised competitive learning algorithm, Biosensors 12 (7) (2022),.
    [6]
    T.J. Bradshaw, Z. Huemann, J. Hu, A. Rahmim, A guide to cross-validation for artificial intelligence in medical imaging, Radiology: Artificial Intelligence, 5, Radiological Society of North America Inc., 2023,.
    [7]
    T.B. Chandra, K. Verma, B.K. Singh, D. Jain, S.S. Netam, Coronavirus disease (COVID-19) detection in Chest X-Ray images using majority voting based classifier ensemble, Expert. Syst. Appl. 165 (2021),.
    [8]
    L.-C. Chang et al., “Machine learning approaches to identify Parkinson's disease using voice signal features.”.
    [9]
    R. Das, A comparison of multiple classification methods for diagnosis of Parkinson disease, Expert. Syst. Appl. 37 (2) (2010) 1568–1572,.
    [10]
    M. Devarajan, L. Ravi, Intelligent cyber-physical system for an efficient detection of Parkinson disease using fog computing, Multimed. Tools. Appl. 78 (23) (2019) 32695–32719,.
    [11]
    J. Dhar, An adaptive intelligent diagnostic system to predict early stage of Parkinson's disease using two-stage dimension reduction with genetically optimized lightgbm algorithm, Neural Comput. Appl. 34 (6) (2022) 4567–4593,.
    [12]
    I.M. El-Hasnony, S.I. Barakat, R.R. Mostafa, Optimized ANFIS model using hybrid metaheuristic algorithms for Parkinson's disease prediction in iot environment, IEEE Access. 8 (2020) 119252–119270,.
    [13]
    Z. Fang, Improved KNN algorithm with information entropy for the diagnosis of Parkinson's disease, in: Proceedings - 2022 International Conference on Machine Learning and Knowledge Engineering, MLKE 2022, 2022, pp. 98–101,.
    [14]
    H.A. Fayed, A.F. Atiya, Speed up grid-search for parameter selection of support vector machines, Appl. Soft Comput. J. 80 (2019) 202–210,.
    [15]
    S. Hawi, J. Alhozami, R. AlQahtani, D. AlSafran, M. Alqarni, L. El Sahmarany, Automatic Parkinson's disease detection based on the combination of long-term acoustic features and Mel frequency cepstral coefficients (MFCC), Biomed. Signal. Process. Control 78 (2022),.
    [16]
    M.M. Hussain, D. Weslin, S. Kumari, S. Umamaheswari, K. Kamalakannan, Enhancing Parkinson's disease identification using ensemble classifier and data augmentation techniques in machine learning, Clin. eHealth 6 (2023) 150–158,.
    [17]
    M.T. Ibarra-Gutiérrez, N. Serrano-García, M. Orozco-Ibarra, Rotenone-induced model of Parkinson's disease: beyond mitochondrial complex I inhibition, Molecular Neurobiology, Springer, 2023,.
    [18]
    D. Jain, A.K. Mishra, S.K. Das, Machine learning based automatic prediction of Parkinson's disease using speech features, Advances in Intelligent Systems and Computing, Springer, 2021, pp. 351–362,.
    [19]
    S. Kaur, H. Aggarwal, R. Rani, Hyper-parameter optimization of deep learning model for prediction of Parkinson's disease, Machine Vision and Applications, Springer, 2020,.
    [20]
    S. Kumari, D. Kumar, M. Mittal, An ensemble approach for classification and prediction of diabetes mellitus using soft voting classifier, Int. J. Cognit. Comput. Eng. 2 (2021) 40–46,.
    [21]
    R. Lamba, T. Gulati, A. Jain, A hybrid feature selection approach for Parkinson's detection based on mutual information gain and recursive feature elimination, Arab. J. Sci. Eng. 47 (8) (2022) 10263–10276,.
    [22]
    M.A. Little, P.E. McSharry, E.J. Hunter, J. Spielman, L.O. Ramig, Suitability of dysphonia measurements for telemonitoring of Parkinson's disease, IEEE Trans. Biomed. Eng. 56 (4) (2009) 1015–1022,.
    [23]
    Y. Liu, Z. Liu, X. Luo, H. Zhao, Diagnosis of Parkinson's disease based on SHAP value feature selection, Biocybern. Biomed. Eng. 42 (3) (2022) 856–869,.
    [24]
    M. Mamun, M.I. Mahmud, M.I. Hossain, A.M. Islam, M.S. Ahammed, M.M. Uddin, Vocal feature guided detection of Parkinson's disease using machine learning algorithms, in: 2022 IEEE 13th Annual Ubiquitous Computing, Electronics and Mobile Communication Conference, UEMCON 2022, 2022, pp. 566–572,.
    [25]
    Y. Mittra and V. Rustagi, “Classification of subjects with Parkinson's Disease using Gait Data Analysis.”.
    [26]
    N. Nahar, F. Ara, M.A.I. Neloy, A. Biswas, M.S. Hossain, K. Andersson, Feature Selection Based Machine Learning to Improve Prediction of Parkinson Disease, Springer International Publishing, 2021,. vol. 12960 LNAI.
    [27]
    L. Naranjo, C.J. Pérez, Y. Campos-Roca, J. Martín, Addressing voice recording replications for Parkinson's disease detection, Expert. Syst. Appl. 46 (2016) 286–292,.
    [28]
    “Parkinson disease.” Accessed 17 January 2023. [Online]. Available: https://www.who.int/news-room/fact-sheets/detail/parkinson-disease.
    [29]
    W. Poewe, et al., Parkinson disease, Nat. Rev. Dis. Primers. 3 (2017) 1–21,.
    [30]
    K. Polat, A hybrid approach to Parkinson disease classification using speech signal: the combination of SMOTE and random forests, in: 2019 Scientific Meeting on Electrical-Electronics and Biomedical Engineering and Computer Science, EBBT 2019, Institute of Electrical and Electronics Engineers Inc., 2019,.
    [31]
    M. Raheem, “Machine Learning Based Idiopathic Parkinson's Disease Detection Using Speech Machine Learning Based Idiopathic Parkinson's Disease Detection Using Speech Data,” 2022.
    [32]
    B. Sabeena, S. Sivakumari, D.M. Teressa, Optimization-based ensemble feature selection algorithm and deep learning classifier for Parkinson's disease, J. Healthc. Eng. 2022 (2022),.
    [33]
    C.O. Sakar, et al., A comparative analysis of speech signal processing algorithms for Parkinson's disease classification and the use of the tunable Q-factor wavelet transform, Appl. Soft Comput. J. 74 (2019) 255–263,.
    [34]
    C.O. Sakar, O. Kursun, Telediagnosis of parkinson's disease using measurements of dysphonia, J. Med. Syst. 34 (4) (2010) 591–599,.
    [35]
    S. Saravanan, et al., A systematic review of Artificial Intelligence (AI) based approaches for the diagnosis of Parkinson's disease, Archives of Computational Methods in Engineering 29 (6) (2022) 3639–3653,.
    [36]
    Z.K. Senturk, Early diagnosis of Parkinson's disease using machine learning algorithms, Med. Hypotheses. 138 (2020),.
    [37]
    B. Shahbaba, R. Neal, Nonlinear models using Dirichlet process mixtures, J. Mach. Learn. Res. 10 (2009) 1829–1850.
    [38]
    P. Sharma, S. Sundaram, M. Sharma, A. Sharma, D. Gupta, Diagnosis of Parkinson's disease using modified grey wolf optimization, Cogn. Syst. Res. 54 (2019) 100–115,.
    [39]
    K.A. Shastry, An ensemble nearest neighbor boosting technique for prediction of Parkinson's disease, Healthc. Anal. 3 (2023),.
    [40]
    G. Tallapureddy, D. Radha, Analysis of ensemble of machine learning algorithms for detection of Parkinson's disease, in: Proceedings - International Conference on Applied Artificial Intelligence and Computing, ICAAIC 2022, Institute of Electrical and Electronics Engineers Inc., 2022, pp. 354–361,.
    [41]
    “UCI Machine Learning Repository: Parkinson Dataset with replicated acoustic features Data Set.” 6 Accessed 2022. [Online]. Available: https://archive.ics.uci.edu/ml/datasets/Parkinson%2BDataset%2Bwith%2Breplicated%2Bacoustic%2Bfeatures%2B.
    [42]
    “UCI Machine Learning Repository: Parkinsons Data Set.” 6 Accessed 2022. [Online]. Available: https://archive.ics.uci.edu/ml/datasets/parkinsons.
    [43]
    “UCI Machine Learning Repository: Parkinson's Disease Classification Data Set.” Accessed 6, 2022. [Online]. Available: https://archive.ics.uci.edu/ml/datasets/Parkinson%27s+Disease+Classification.
    [44]
    L. Yuan, Y. Liu, H.-M. Feng, Parkinson disease prediction using machine learning-based features from speech signal, Serv. Orient. Comput. Appl. (2023),.
    [45]
    Ş. Yücelbaş, Simple logistic hybrid system based on greedy stepwise algorithm for feature analysis to diagnose Parkinson's disease according to gender, Arab. J. Sci. Eng. 45 (3) (2020) 2001–2016,.

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Speech Communication
    Speech Communication  Volume 159, Issue C
    Apr 2024
    90 pages

    Publisher

    Elsevier Science Publishers B. V.

    Netherlands

    Publication History

    Published: 17 July 2024

    Author Tags

    1. Parkinson's disease
    2. Voice disorder
    3. Machine learning
    4. Feature selection
    5. SMOTE
    6. Ensemble and hyper tuning

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 0
      Total Downloads
    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0

    Other Metrics

    Citations

    View Options

    View options

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media