Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3429889.3429894acmotherconferencesArticle/Chapter ViewAbstractPublication PagesisaimsConference Proceedingsconference-collections
research-article

Predictive Modeling of Diabetic Kidney Disease using Random Forest Algorithm along with Features Selection

Published: 04 December 2020 Publication History

Abstract

At present, the number of diabetes mellitus patients in China ranks first in the world, and diabetic kidney disease is the most common disease in complications. Therefore, it is necessary to establish a predictive model for early diagnosis of diabetic kidney disease. The model predicts the risk of diabetic kidney disease in the general Asian population, and recognizes high-risk groups, then warns the onset of diabetes. The data were obtained from the electronic medical record of patients in Beijing Pinggu Hospital. Twenty-nine initial candidate indicators including age, ALB, and A/C were selected. The random forest algorithm was used to predict diabetic kidney disease, and the classification accuracy was 89.831%. The importance weight ratio of each factor index was also given, Microalbuminuria (ALB), albumin-to-creatinine ratio (A/C), serum creatinine (SCr), Serum albumin (umALB), and blood urea nitrogen (BUN) accounted for a relatively high proportion of the weight of the characteristic variables. So these five indicators can be the primary indicators of our classification prediction, and the accuracy can reach 87.453%. Some other typical classification algorithms, liking KNN, logistic regression, and decision tree, were compared to classify and predict diabetic kidney disease, and precision recall f1-score and area AUC under ROC curve were used to evaluate these models. By experiments, random forest model was better than other algorithm models on both the classification accuracy and the evaluation indicators. The results can be applied to the screening of patients with high risk of diabetic kidney disease and the guidance of risk intervention measures. Consequently, the detection rate of undiagnosed diabetic kidney disease in the population can be improved, and the prevention effect of diabetic kidney disease can be enhanced as well.

References

[1]
Chinese Diabetes Society Special Issue Standards of care for type2 diabetes In China (2017 edition). Chinese Journal of Practical Internal Medicine, 2018, 38 (04), 292--344.
[2]
Seki N, Nishimura M, Matsumoto T, et al. (2013) Relationship between BNP level and renal function in diabetic kidney disease with microalbuminuria. J Diabetes Complications, 27(1), 92--97.
[3]
Jong HoKim, Sang SooKim, et al. (2017) Nonalbumin proteinuria is a simple and practical predictor of the progression of early-stage type 2 diabetic kidney disease. Journal of Diabetes and its Complications, 31( 2), 395--399
[4]
Nai-Arun N, Moungmai R. (2015) Comparison of Classifiers for the Risk of Diabetes Prediction. Procedia Computer Science, 69, 132--142.
[5]
Tziomalos K, Athyros V G. (2015) Diabetic kidney disease: New Risk Factors and Improvements in Diagnosis. Rev Diabet Stud, 12(1-2), 110--118.
[6]
Lin C C, Chen C C, Chen F N, et al. (2013) Risks of diabetic kidney disease with variation in hemoglobin A1c and fasting plasma glucose. American Journal of Medicine, 126(11), 1--10.
[7]
López, Beatriz, Torrent-Fontbona F, et al. (2018) Single Nucleotide Polymorphism relevance learning with Random Forests for Type 2 diabetes risk prediction. Artificial Intelligence in Medicine, 85, 43--49.
[8]
Wu H, Yang S, Huang Z, et al. (2018) Type 2 diabetes mellitus prediction model based on data mining[J]. Informatics in Medicine Unlocked, 10, 100--107.
[9]
Dong Z, Wang Y, Qiang Q, et al. (2016) Clinical predictors differentiating non-diabetic renal diseases from diabetic kidney disease in a large population of type 2 diabetes patients[J]. Diabetes Res Clin Pract, 121, 112--118.
[10]
Oh S W, Kim S, Na K Y, et al. (2012) Clinical implications of pathologic diagnosis and classification for diabetic kidney disease. Diabetes Res Clin Pract, 97(3), 418--424.
[11]
Kavakiotis I, Tsave O, Salifoglou A, et al. (2017) Machine Learning and Data Mining Methods in Diabetes Research. Computational & Structural Biotechnology Journal, 15(C), 104--116.
[12]
Meza-Palacios R, Aguilar-Lasserre A A, Ureña-Bogarín E L, et al. (2016) Development of a fuzzy expert system for the nephropathy control assessment in patients with type 2 diabetes mellitus. Expert Systems with Applications.
[13]
Irannejad A, Ghajar A, Afarideh M, et al.(2017) Association of peripheral nesfatin-1 with early stage diabetic kidney disease. Pathophysiology, 24(1), 17--22.
[14]
Lindner, C., Bromiley, P. A., Ionita, M. C. (2015). Robust and accurate shape model matching using random forest regression-voting. IEEE Trans. Pattern Anal. Mach. Intell. 37(9), 1862--1874
[15]
Boufenara, M. N., Boufaida, M. (2019) A Machine learning technique dedicated for biological data. International Conference on Theoretical and Applicative Aspects of Computer Science.

Cited By

View all
  • (2024)Machine Learning based Diagnosis of Kidney Abnormality Recognition on CT Scan Images2024 11th International Conference on Computing for Sustainable Global Development (INDIACom)10.23919/INDIACom61295.2024.10498420(1283-1289)Online publication date: 28-Feb-2024
  • (2023)Kidney Impairment Prediction Due to Diabetes Using Extended Ensemble Learning Machine AlgorithmJournal of Machine and Computing10.53759/7669/jmc202303027(312-325)Online publication date: 5-Jul-2023
  • (2021)Predicting and Staging Chronic Kidney Disease using Optimized Random Forest Algorithm2021 International Conference on Information Systems and Advanced Technologies (ICISAT)10.1109/ICISAT54145.2021.9678441(1-8)Online publication date: 27-Dec-2021

Index Terms

  1. Predictive Modeling of Diabetic Kidney Disease using Random Forest Algorithm along with Features Selection

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ISAIMS '20: Proceedings of the 1st International Symposium on Artificial Intelligence in Medical Sciences
    September 2020
    313 pages
    ISBN:9781450388603
    DOI:10.1145/3429889
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 04 December 2020

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Random forest
    2. diabetic kidney disease
    3. risk prediction

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ISAIMS 2020

    Acceptance Rates

    ISAIMS '20 Paper Acceptance Rate 53 of 112 submissions, 47%;
    Overall Acceptance Rate 53 of 112 submissions, 47%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)12
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 25 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Machine Learning based Diagnosis of Kidney Abnormality Recognition on CT Scan Images2024 11th International Conference on Computing for Sustainable Global Development (INDIACom)10.23919/INDIACom61295.2024.10498420(1283-1289)Online publication date: 28-Feb-2024
    • (2023)Kidney Impairment Prediction Due to Diabetes Using Extended Ensemble Learning Machine AlgorithmJournal of Machine and Computing10.53759/7669/jmc202303027(312-325)Online publication date: 5-Jul-2023
    • (2021)Predicting and Staging Chronic Kidney Disease using Optimized Random Forest Algorithm2021 International Conference on Information Systems and Advanced Technologies (ICISAT)10.1109/ICISAT54145.2021.9678441(1-8)Online publication date: 27-Dec-2021

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media