Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Prevalence and Early Prediction of Diabetes Using Machine Learning in North Kashmir: : A Case Study of District Bandipora

Published: 01 January 2022 Publication History

Abstract

Diabetes is one of the biggest health problems that affect millions of people across the world. Uncontrolled diabetes can increase the risk of heart attack, cancer, kidney damage, blindness, and other illnesses. Researchers are motivated to create a Machine Learning methodology that can predict diabetes in the future. Exploiting Machine Learning Algorithms (MLA) is essential if healthcare professionals are able to identify diseases more effectively. In order to improve the medical diagnosis of diabetes this research explored and contrasts various MLA that can identify diabetes risk early. The research includes the analysis on real datasets such as a clinical dataset gathered from a doctor in the Indian district of Bandipora in the years April 2021–Feb2022. MLA are currently important in the healthcare sector due to their prediction abilities. Researchers are using MLA to improve disease prediction and reduce cost. In this Paper author developed a methodology using Machine Learning Algorithms for Diabetes Disease Risk Prediction in North Kashmir. Six MLA have been successfully used in the experimental study such as Random Forest (RF), Multi-Layer Perceptron (MLP), Support Vector Machine (SVM), Gradient Boost (GB), Decision Tree (DT), and Logistic Regression (LR). RF is the most accurate classifier with the uppermost accuracy rate of 98 percent followed by MLP (90.99%), SVM (92%), GBC (97%), DT (96%), and LR (69%), respectively, with the balanced data set. Lastly, this study enables us to effectively identify the prevalence and prediction of diabetes.

References

[1]
B. S. Kim, M. Ahn, W.-W. Cho, G Gao, J. Jang, and D.-W. Cho, “Engineering of diseased human skin equivalent using 3D cell printing for representing pathophysiological hallmarks of type 2 diabetes in vitro,” Biomaterials, vol. 272, 2021.
[2]
K. E. Minges, P. Zimmet, D. J. Magliano, D. W. Dunstan, A. Brown, and J. E. Shaw, “Diabetes prevalence and determinants in Indigenous Australian populations: a systematic review,” Diabetes Research and Clinical Practice, vol. 93, no. 2, pp. 139–149, 2011.
[3]
Q. Zou, K. Qu, Y. Luo, D. Yin, Y. Ju, and H. Tang, “Predicting diabetes mellitus with machine learning techniques,” Frontiers in Genetics, vol. 9, p. 515, 2018.
[4]
S. Gautam, N. Shrestha, S. Mahato, T. P. A. Nguyen, S. R. Mishra, and G. Berg-Beckhoff, “Diabetes among tuberculosis patients and its impact on tuberculosis treatment in South Asia: a systematic review and meta-analysis,” Scientific Reports, vol. 11, no. 1, pp. 2113–2212, 2021.
[5]
J. C. Won, J. H. Lee, J. H. Kim, E. S. Kang, K. C. Won, D. J. Kim, and M. K. Lee, “Diabetes fact sheet in Korea, 2016: an appraisal of current status,” Diabetes & metabolism journal, vol. 42, no. 5, pp. 415–424, 2018.
[6]
Y. K. Qawqzeh, A. S. Bajahzar, M. Jemmali, M. M. Otoom, and A. Thaljaoui, “Classification of diabetes using photoplethysmogram (PPG) waveform analysis logistic regression modeling,” BioMed Research International, vol. 2020, 6 pages, 2020.
[7]
S. Wei, X. Zhao, and C. Miao, “A comprehensive exploration to the machine learning techniques for diabetes identification,” in Proceedings of the 2018 IEEE 4th World Forum on Internet of Things (WF-IoT), pp. 291–295, Singapore, May 2018.
[8]
G. A. Pethunachiyar, “Classification of diabetes patients using kernel based support vector machines,” in Proceedings of the 2020 International Conference on Computer Communication and Informatics (ICCCI), pp. 1–4, Coimbatore, India, June 2020.
[9]
S. Abhari, S. R. Niakan Kalhori, M. Ebrahimi, H. Hasannejadasl, and A. Garavand, “Artificial intelligence applications in type 2 diabetes mellitus care: focus on machine learning methods,” Healthcare informatics research, vol. 25, no. 4, pp. 248–261, 2019.
[10]
M. D. Maniruzzaman, M. J. Rahman, B. Ahammed, and M. M. Abedin, “Classification and prediction of diabetes disease using machine learning paradigm,” Health Information Science and Systems, vol. 8, no. 1, pp. 7–14, 2020.
[11]
Y. Weiss, Y. Elovici, and L. Rokach, “The cash algorithm-cost-sensitive attribute selection using histograms,” Information Sciences, vol. 222, pp. 247–268, 2013.
[12]
M. M. Islam, R. Ferdousi, S. Rahman, and Y. B. Humayra, “Likelihood prediction of diabetes at early stage using data mining techniques,” in Computer Vision and Machine Intelligence in Medical Image Analysis, pp. 113–212, Springer, Singapore, 2020.
[13]
I. Kavakiotis, O. Tsave, A. Salifoglou, N. Maglaveras, I. Vlahavas, and I. Chouvarda, “Machine learning and data mining methods in diabetes research,” Computational and Structural Biotechnology Journal, vol. 15, pp. 104–116, 2017.
[14]
S. Kumari, D. Kumar, and M. Mittal, “An ensemble approach for classification and prediction of diabetes mellitus using soft voting classifier,” International Journal of Cognitive Computing in Engineering, vol. 2, pp. 40–46, 2021.
[15]
V. K. Gunjan, S. Kumar, M. D. Ansari, and Y. Vijayalata, “Prediction of agriculture yields using machine learning algorithms,” in Proceedings of the 2nd International Conference on Recent Trends in Machine Learning, IoT, Smart Cities and Applications, pp. 17–26, Springer, Singapore, Hyderabad, India, March 2022.
[16]
B. Divya, N. Mohan, D. B. Mehta, P. A. Barot, S. G. Langhnoja, S. Sarkar, H. Chatterjee, P. Saha, M. Biswas, K. M. Priyanga, and M. Alamelu, “4th international conference on trends in electronics and informatics (ICOEI),” Navigation, vol. 23, p. 6, 2020.
[17]
U. C. U. Z. İlknur and U. C. Ayla, “Artificial neural networks based-prediction of autism spectrum disorder,” The Journal of Cognitive Systems, vol. 5, no. 2, pp. 78–82, 2020.
[18]
Q. Wang, W. Cao, J. Guo, J. Ren, Y. Cheng, and D. N. Davis, “DMP_MI: an effective diabetes mellitus classification algorithm on imbalanced data with missing values,” IEEE Access, vol. 7, pp. 102232–102238, 2019.
[19]
D. A. Otchere, T. O. Arbi Ganat, R. Gholami, and S. Ridha, “Application of supervised machine learning paradigms in the prediction of petroleum reservoir properties: comparative analysis of ANN and SVM models,” Journal of Petroleum Science and Engineering, vol. 200, 2021.
[20]
F. I. Adiba, T. Islam, M. Shamim Kaiser, M. Mahmud, and M. A. Rahman, “Effect of corpora on classification of fake news using naive Bayes classifier,” International Journal of Automation, Artificial Intelligence and Machine Learning, vol. 1, no. 1, pp. 80–92, 2020.
[21]
Z. Mushtaq, M. F. Ramzan, S. Ali, S. Baseer, A. Samad, and M. Husnain, “Voting classification-based diabetes mellitus prediction using hypertuned machine-learning techniques,” Mobile Information Systems, vol. 2022, 16 pages, 2022.
[22]
H. Liu, J. Li, J. Leng, H. Wang, J. Liu, W. Li, H. Liu, S. Wang, J. Ma, J. C. Chan, Z. Yu, G. Hu, C. Li, and X Yang, “Machine learning risk score for prediction of gestational diabetes in early pregnancy in Tianjin, China,” Diabetes/metabolism research and reviews, vol. 37, no. 5, 2021.
[23]
M. F. Faruque and I. H. Sarker, “Performance analysis of machine learning techniques to predict diabetes mellitus,” in Proceedings of the 2019 International Conference on Electrical, Computer and Communication Engineering (ECCE), IEEE, pp. 1–4, Cox’sBazar, Bangladesh, February 2019.
[24]
Z. Taushanov, H. Verloo, B. Wernli, S. Di Giovanni, A. Von Gunten, and F. Pereira, “Transforming a patient registry into a customized data set for the advanced statistical analysis of health risk factors and for medication-related hospitalization research: retrospective hospital patient registry study,” JMIR medical informatics, vol. 9, no. 5, 2021.
[25]
R. Hajizadeh, A. Ali, and M. Ezoji, “Mutual neighborhood and modified majority voting based KNN classifier for multi-categories classification Pattern Analysis and Applications,” Pattern Analysis and Applications, pp. 1–21, 2022.
[26]
F. P. Kemeth, T. Bertalan, T. Thiem, F. Dietrich, S. J. Moon, C. R. Laing, and I. G. Kevrekidis, “Learning emergent partial differential equations in a learned emergent space,” Nature Communications, vol. 13, no. 1, pp. 3318–3413, 2022.
[27]
S. Shafi and G. A. Ansari, “Early prediction of diabetes disease & classification of algorithms using machine learning approach,” in Proceedings of the International Conference on Smart Data Intelligence, (ICSMDI 2021), Trichy, Tamil Nadu, April 2021.
[28]
H. M. D. Kabir, M. Abdar, A. Khosravi, S. M. J. Jalali, A. F. Atiya, S. Nahavandi, and D. Srinivasan, “Spinalnet: deep neural network with gradual input,” IEEE Transactions on Artificial Intelligence, pp. 1–13, 2022.
[29]
K. Sethi, V. Jaiswal, and M. D. Ansari, “Machine learning based support system for students to select stream (subject),” Recent advances in computer science and communications (Formerly: Recent patents on computer science), vol. 13, no. 3, pp. 336–344, 2020.
[30]
M. Agarwal, V. K. Bohat, M. D. Ansari, A. Sinha, S. K. Gupta, and D. Garg, “A convolution neural network based approach to detect the disease in corn crop,” in Proceedings of the 2019 IEEE 9th international conference on advanced computing (IACC), pp. 176–181, Tiruchirappalli, India, December 2019.
[31]
R. Kumar, M. Saraswat, D. Ather, M. N. Mumtaz Bhutta, S. Basheer, R. N. Thakur, and R. N. Thakur, “Deformation adjustment with single real signature image for biometric verification using CNN,” Computational Intelligence and Neuroscience, vol. 2022, 12 pages, 2022.
[32]
P. Madan, V. Singh, V. Chaudhari, Y. Albagory, A. Dumka, R. Singh, A. Gehlot, M. Rashid, S. S. Alshamrani, and A. S. AlGhamdi, “An optimization-based diabetes prediction model using CNN and Bi-directional LSTM in real-time environment,” Applied Sciences, vol. 12, no. 8, p. 3989, 2022.

Cited By

View all
  • (2023)Market-Based Stock Allocation Using a Hybrid Regression ModelSN Computer Science10.1007/s42979-023-01883-14:4Online publication date: 1-Jun-2023

Index Terms

  1. Prevalence and Early Prediction of Diabetes Using Machine Learning in North Kashmir: A Case Study of District Bandipora
      Index terms have been assigned to the content through auto-classification.

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image Computational Intelligence and Neuroscience
      Computational Intelligence and Neuroscience  Volume 2022, Issue
      2022
      32389 pages
      ISSN:1687-5265
      EISSN:1687-5273
      Issue’s Table of Contents
      This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

      Publisher

      Hindawi Limited

      London, United Kingdom

      Publication History

      Published: 01 January 2022

      Qualifiers

      • Research-article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 14 Jan 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2023)Market-Based Stock Allocation Using a Hybrid Regression ModelSN Computer Science10.1007/s42979-023-01883-14:4Online publication date: 1-Jun-2023

      View Options

      View options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media