Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Parkinson’s disease classification using nature inspired feature selection and recursive feature elimination

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

New progress in machine learning (ML) have paved the way for solving various existing complex problems, including in the medical field. In the proposed paper, we have investigated methods to detect Parkinson’s Disease (PD) with Nature Inspired Feature Selection (NIFS) using the Zebra Optimization Algorithm and Recursive Feature Elimination Cross Validation (RFECV). A vocal feature-based dataset has been used for PD Detection as it contains data relating to the vocal features of PD patients. It has been proven from research that vocal disorders are observed in the majority of individuals in the preliminary phases of this disease. The number of features has been reduced from 754, in the original dataset, to 40 using feature selection, and the classification results have been obtained for two cases, one being a 70:30 train test split and the other being tenfold cross-validation. We have implemented the proposed technique on 11 different classifiers. Out of these, the Gaussian Process classifier showed the best accuracy for both cases. The accuracy values obtained for cases one and two are 96% and 97.07%, respectively, one of the highest accuracy values obtained for similar research done by other researchers. Additionally, the generalization capability of the model obtained may be enhanced by including more data points in the dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Data availability

The dataset is publicly available.

References

  1. Braga D, Madureira AM, Coelho L, Ajith R (2019) Automatic detection of Parkinson’s disease based on acoustic analysis of speech. Eng Appl Artif Intell 77:148–158. https://doi.org/10.1016/j.engappai.2018.09.018

    Article  Google Scholar 

  2. Wooten GF, Currie LJ, Bovbjerg VE, Lee JK, Patrie J (2004) Are men at greater risk for Parkinson’s disease than women? J Neurol Neurosurg. Psychiatry 75(4):637 LP – 639. https://doi.org/10.1136/jnnp.2003.020982

    Article  Google Scholar 

  3. Arkinson C, Walden H (2018) Parkin function in Parkinson’s disease. Science (80-.) 360(6386):267–268. https://doi.org/10.1126/science.aar6606

    Article  Google Scholar 

  4. Rehman A, Saba T, Mujahid M, Alamri FS, ElHakim N (2023) Parkinson’s disease detection using hybrid lstm-gru deep learning model. Electronics 12(13). https://doi.org/10.3390/electronics12132856

  5. Haas BR, Stewart TH, Zhang J (2012) Premotor biomarkers for Parkinson’s disease - a promising direction of research. Transl Neurodegener 1(1):11. https://doi.org/10.1186/2047-9158-1-11

    Article  Google Scholar 

  6. Shivangi, Johri A, Tripathi A (2019) Parkinson Disease Detection Using Deep Neural Networks, in 2019 Twelfth International Conference on Contemporary Computing (IC3), pp. 1–4. https://doi.org/10.1109/IC3.2019.8844941

  7. Solana-Lavalle G, Galán-Hernández J-C, Rosas-Romero R (2020) Automatic Parkinson disease detection at early stages as a pre-diagnosis tool by using classifiers and a small set of vocal features. Biocybern Biomed Eng 40(1):505–516. https://doi.org/10.1016/j.bbe.2020.01.003

    Article  Google Scholar 

  8. Tuncer T, Dogan S, Acharya UR (2020) Automated detection of Parkinson’s disease using minimum average maximum tree and singular value decomposition method with vowels. Biocybern Biomed Eng 40(1):211–220. https://doi.org/10.1016/j.bbe.2019.05.006

    Article  Google Scholar 

  9. Liu Y, Li Y, Tan X, Wang P, Zhang Y (2021) Local discriminant preservation projection embedded ensemble learning based dimensionality reduction of speech data of Parkinson’s disease. Biomed. Signal Process. Control 63:102165. https://doi.org/10.1016/j.bspc.2020.102165

    Article  Google Scholar 

  10. Karan B, Sahu SS, Orozco-Arroyave JR, Mahto K (2020) Hilbert spectrum analysis for automatic detection and evaluation of Parkinson’s speech. Biomed. Signal Process. Control 61:102050. https://doi.org/10.1016/j.bspc.2020.102050

    Article  Google Scholar 

  11. Xu S, Pan Z (2020) A novel ensemble of random forest for assisting diagnosis of Parkinson’s disease on small handwritten dynamics dataset. Int. J. Med. Inform. 144:104283. https://doi.org/10.1016/j.ijmedinf.2020.104283

    Article  Google Scholar 

  12. Sakar CO et al (2019) A comparative analysis of speech signal processing algorithms for Parkinson’s disease classification and the use of the tunable Q-factor wavelet transform. Appl Soft Comput 74:255–263. https://doi.org/10.1016/j.asoc.2018.10.022

    Article  Google Scholar 

  13. Olivares R et al (2020) An optimized brain-based algorithm for classifying Parkinson’s disease. Appl Sci 10(5). https://doi.org/10.3390/app10051827

  14. Gunduz H (2019) Deep Learning-Based Parkinson’s Disease Classification Using Vocal Feature Sets. IEEE Access 7:115540–115551. https://doi.org/10.1109/ACCESS.2019.2936564

    Article  Google Scholar 

  15. Enireddy V, Gunda K, Kalyani NL, Prakash KB (2020) Nature Inspired Binary Grey Wolf Optimizer for Feature Selection in the detection of neurodegenerative (Parkinson) disease. IJATCSE 9(3):3977–3987. https://doi.org/10.30534/ijatcse/2020/222932020

  16. Hasan KA, Hasan MAM (2020) Classification of Parkinson’s Disease by Analyzing Multiple Vocal Features Sets, in 2020 IEEE Region 10 Symposium (TENSYMP), pp. 758–761. https://doi.org/10.1109/TENSYMP50017.2020.9230842

  17. Polat K (2019) A Hybrid Approach to Parkinson Disease Classification Using Speech Signal: The Combination of SMOTE and Random Forests, in 2019 Scientific Meeting on Electrical-Electronics & Biomedical Engineering and Computer Science (EBBT), pp. 1–3. https://doi.org/10.1109/EBBT.2019.8741725

  18. Varalakshmi P, Tharani Priya B, Anu Rithiga B, Bhuvaneaswari R (2021) Parkinson disease detection based on speech using various machine learning models and deep learning models, in 2021 International Conference on System, Computation, Automation and Networking (ICSCAN), pp. 1–6. https://doi.org/10.1109/ICSCAN53069.2021.9526372

  19. Hoq M, Uddin MN, Park S-B (2021) Vocal feature extraction-based artificial intelligent model for Parkinson’s disease detection. Diagnostics 11(6):1076

    Article  Google Scholar 

  20. Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP (2002) SMOTE: synthetic minority over-sampling technique. J Artif Intell Res 16:321–357

    Article  Google Scholar 

  21. Chandrashekar G, Sahin F (2014) A survey on feature selection methods. Comput Electr Eng 40(1):16–28. https://doi.org/10.1016/j.compeleceng.2013.11.024

    Article  Google Scholar 

  22. El Aboudi N, Benhlima L (2016) Review on wrapper feature selection approaches, in 2016 International Conference on Engineering & MIS (ICEMIS), pp. 1–5. https://doi.org/10.1109/ICEMIS.2016.7745366

  23. Kaur A, Guleria K, Trivedi NK (2021) Feature Selection in Machine Learning: Methods and Comparison, in 2021 International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE), pp. 789–795. https://doi.org/10.1109/ICACITE51222.2021.9404623

  24. Yang X-S (2020) Nature-inspired optimization algorithms, 2nd edn. Academic Press. https://doi.org/10.1016/B978-0-12-821986-7.00002-0

  25. Caro T, Izzo A, Reiner RCJ, Walker H, Stankowich T (2014) The function of zebra stripes. Nat Commun 5:3535. https://doi.org/10.1038/ncomms4535

    Article  Google Scholar 

  26. Trojovská E, Dehghani M, Trojovský P (2022) Zebra optimization algorithm: A new bio-inspired optimization algorithm for solving optimization algorithm. IEEE Access 10:49445–49473

    Article  Google Scholar 

  27. Pastor J, Cohen Y, Hobbs NT (2006) The roles of large herbivores in ecosystem nutrient cycles. Conserv Biol Ser 11:289

    Google Scholar 

  28. Estes RD, Otte D, Wilson EO (2012) The behavior guide to African mammals: including hoofed mammals, carnivores, primates, 20th Anniversary Edn. University of California Press

  29. Wilson AM et al (2018) Biomechanics of predator–prey arms race in lion, zebra, cheetah and impala. Nature 554(7691):183–188

    Article  Google Scholar 

  30. Wottschel V et al (2019) SVM recursive feature elimination analyses of structural brain MRI predicts near-term relapses in patients with clinically isolated syndromes suggestive of multiple sclerosis. NeuroImage Clin 24:102011

    Article  Google Scholar 

  31. Guyon I, Weston J, Barnhill S, Vapnik V (2002) Gene Selection for Cancer Classification using Support Vector Machines. Mach Learn 46(1):389–422. https://doi.org/10.1023/A:1012487302797

    Article  Google Scholar 

  32. Schein AI, Ungar LH (2007) Active learning for logistic regression: an evaluation. Mach Learn 68(3):235–265. https://doi.org/10.1007/s10994-007-5019-5

    Article  Google Scholar 

  33. Wilson E, Tufts DW (1994) Multilayer perceptron design algorithm, in Proceedings of IEEE Workshop on Neural Networks for Signal Processing, pp. 61–68. https://doi.org/10.1109/NNSP.1994.366063

  34. Taunk K, De S, Verma S, Swetapadma A (2019) A Brief Review of Nearest Neighbor Algorithm for Learning and Classification, in 2019 International Conference on Intelligent Computing and Control Systems (ICCS), pp. 1255–1260. https://doi.org/10.1109/ICCS45141.2019.9065747

  35. Al-Mejibli IS, Abd DH, Alwan JK, Rabash AJ (2018) Performance Evaluation of Kernels in Support Vector Machine, in 2018 1st Annual International Conference on Information and Sciences (AiCIS), pp. 96–101. https://doi.org/10.1109/AiCIS.2018.00029

  36. Xiao T, Timo ON, Avellaneda F, Malik Y, Bruda S (2020) An Approach to Evaluating Learning Algorithms for Decision Trees, arXiv Prepr. arXiv2010.13665

  37. Jaiswal JK, Samikannu R (2017) Application of random forest algorithm on feature subset selection and classification and regression, in 2017 World Congress on Computing and Communication Technologies (WCCCT), pp. 65–68. https://doi.org/10.1109/WCCCT.2016.25

  38. Rasmussen CE (2004) Gaussian Processes in Machine Learning BT - Advanced Lectures on Machine Learning: ML Summer Schools 2003, Canberra, Australia, February 2 - 14, 2003, Tübingen, Germany, August 4 - 16, 2003, Revised Lectures, O. Bousquet, U. von Luxburg, and G. Rätsch, Eds., Berlin, Heidelberg: Springer Berlin Heidelberg, pp. 63–71. https://doi.org/10.1007/978-3-540-28650-9_4

  39. Tharwat A (2016) Linear vs. quadratic discriminant analysis classifier: a tutorial. Int J Appl Pattern Recognit 3(2):145–180

    Article  Google Scholar 

  40. Jahromi AH, Taheri M (2017) A non-parametric mixture of Gaussian naive Bayes classifiers based on local independent features, in 2017 Artificial intelligence and signal processing conference (AISP), IEEE, pp. 209–212

  41. Vakili M, Ghamsari M, Rezaei M (2020) Performance analysis and comparison of machine and deep learning algorithms for IoT data classification, arXiv Prepr. arXiv2001.09636

  42. Huang Q et al (2023) A novel image-to-knowledge inference approach for automatically diagnosing tumors. Expert Syst. Appl. 229:120450. https://doi.org/10.1016/j.eswa.2023.120450

    Article  Google Scholar 

  43. Luo Y, Lu Z, Liu L, Huang Q (2023) Deep fusion of human-machine knowledge with attention mechanism for breast cancer diagnosis. Biomed. Signal Process. Control 84:104784. https://doi.org/10.1016/j.bspc.2023.104784

    Article  Google Scholar 

  44. Xi J, Sun D, Chang C, Zhou S, Huang Q (2023) An omics-to-omics joint knowledge association subtensor model for radiogenomics cross-modal modules from genomics and ultrasonic images of breast cancers. Comput. Biol. Med. 155:106672. https://doi.org/10.1016/j.compbiomed.2023.106672

    Article  Google Scholar 

  45. Li G, Xiao L, Wang G, Liu Y, Liu L, Huang Q (2023) Knowledge Tensor-Aided Breast Ultrasound Image Assistant Inference Framework. Healthc (Basel, Switzerland) 11(14). https://doi.org/10.3390/healthcare11142014

  46. Huang Q, Ye L (2022) Multi-Task/Single-Task Joint Learning of Ultrasound BI-RADS Features. IEEE Trans Ultrason Ferroelectr Freq Control 69(2):691–701. https://doi.org/10.1109/TUFFC.2021.3132933

    Article  Google Scholar 

  47. Zhou J, Pan F, Li W, Hu H, Wang W, Huang Q (2022) Feature Fusion for Diagnosis of Atypical Hepatocellular Carcinoma in Contrast- Enhanced Ultrasound. IEEE Trans Ultrason Ferroelectr Freq Control 69(1):114–123. https://doi.org/10.1109/TUFFC.2021.3110590

    Article  Google Scholar 

  48. Lee KY, Park J (2006) Application of Particle Swarm Optimization to Economic Dispatch Problem: Advantages and Disadvantages, in 2006 IEEE PES Power Systems Conference and Exposition, pp. 188–192. https://doi.org/10.1109/PSCE.2006.296295

  49. Heim B, Krismer F, De Marzi R, Seppi K (2017) Magnetic resonance imaging for the diagnosis of Parkinson’s disease. J Neural Transm 124(8):915–964. https://doi.org/10.1007/s00702-017-1717-8

    Article  Google Scholar 

Download references

Funding

This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hemprasad Yashwant Patil.

Ethics declarations

Ethical Statements

1) This material is the authors’ own original work, which has not been previously published elsewhere.

2) The paper is not currently being considered for publication elsewhere.

3) The paper reflects the authors’ own research and analysis in a truthful and complete manner.

4) The paper properly credits the meaningful contributions of co-authors and co-researchers.

5) The results are appropriately placed in the context of prior and existing research.

Consent to participate

Not Applicable (Experimentation is performed on publicly available dataset).

Consent to publish

Not Applicable (Experimentation is performed on publicly available dataset).

Competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chawla, P.K., Nair, M.S., Malkhede, D.G. et al. Parkinson’s disease classification using nature inspired feature selection and recursive feature elimination. Multimed Tools Appl 83, 35197–35220 (2024). https://doi.org/10.1007/s11042-023-16804-w

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-023-16804-w

Keywords