Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Advertisement

An ensemble framework for risk prediction of left atrial thrombus based on undersampling with replacement

  • Original Article
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

    We’re sorry, something doesn't seem to be working properly.

    Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.

Abstract

Left atrial thrombus (LAT) impacts humans greatly and can result in ischemia and necrosis in severe cases. Therefore, health workers appeal to the social community to emphasize the importance of preventive treatment for LAT. This paper proposes an ensemble framework for risk prediction of LAT based on undersampling with replacement (EFRP-UR), addressing the problem of data imbalance. Firstly, in the feature selection process, we adopt the method of separately counting the essential features of data subsets. In view of the characteristics of class imbalance in medical data, we apply our improved undersamling method, “undersampling with replacement", to obtain a number of training subsets, train multiple base-classifiers, and use an iterative method to select the classifiers with better performance for subsequent integration, improving the prediction accuracy of the proposed EFRP-UR. To aim for disease risk prediction, we synthesize the results of different ensemble algorithms in the end to increase the recall rate. Applied to the LAT dataset obtained from the Regional Medical Center, our experimental results prove that the proposed EFRP-UR has improved in accuracy, recall rate and F1 value, compared with any single base-classifier. In addition, if comprehensive data on other diseases exist, EFRP-UR can also be transferred to predict other diseases.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Algorithm 1
Fig. 2
Fig. 3

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

Data availability

Our dataset is provided by Department of Cardiovascular Medicine, Renmin Hospital of Wuhan University. It can be made available on reasonable request.

References

  1. Lip GY, Tse H-F (2007) Management of atrial fibrillation. Lancet 370(9587):604–618

    Google Scholar 

  2. Lippi G, Sanchis-Gomar F, Cervellin G (2021) Global epidemiology of atrial fibrillation: an increasing epidemic and public health challenge. Int J Stroke 16(2):217–221

    Google Scholar 

  3. Lurie A, Wang J, Hinnegan KJ, McIntyre WF, Belley-Côté EP, Amit G, Healey JS, Connolly SJ, Wong JA (2021) Prevalence of left atrial thrombus in anticoagulated patients with atrial fibrillation. J Am Coll Cardiol 77(23):2875–2886

    Google Scholar 

  4. Gaita F, Caponi D, Pianelli M, Scaglione M, Toso E, Cesarani F, Boffano C, Gandini G, Valentini MC, De Ponti R et al (2010) Radiofrequency catheter ablation of atrial fibrillation: A cause of silent thromboembolism? magnetic resonance imaging assessment of cerebral thromboembolism in patients undergoing ablation of atrial fibrillation. Circulation 122(17):1667–1673

    Google Scholar 

  5. Cresti A, García-Fernández MA, Sievert H, Mazzone P, Baratta P, Solari M, Geyer A, De Sensi F, Limbruno U (2019) Prevalence of extra-appendage thrombosis in non-valvular atrial fibrillation and atrial flutter in patients undergoing cardioversion: a large transoesophageal echo study. EuroIntervention 15(3):225–230

    Google Scholar 

  6. Leung DY, Davidson PM, Cranney GB, Walsh WF (1997) Thromboembolic risks of left atrial thrombus detected by transesophageal echocardiogram. Am J Cardiol 79(5):626–629

    Google Scholar 

  7. Gurudevan SV, Shah H, Tolstrup K, Siegel R, Krishnan SC (2010) Septal thrombus in the left atrium: Is the left atrial septal pouch the culprit? JACC Cardiovasc Imaging 3(12):1284–1286

    Google Scholar 

  8. Cresti A, Galli CA, Alimento ML, De Sensi F, Baratta P, D’Aiello I, Limbruno U, Pepi M, Fusini L, Maltagliati AC (2019) Does mitral regurgitation reduce the risks of thrombosis in atrial fibrillation and flutter? J Cardiovasc Med 20(10):660–666

    Google Scholar 

  9. Miller LM, Gal A (2017) Cardiovascular system and lymphatic vessels. Pathol Basis Vet Dis. https://doi.org/10.1016/B978-0-323-35775-3.00010-2

    Article  Google Scholar 

  10. Ali MM, Paul BK, Ahmed K, Bui FM, Quinn JM, Moni MA (2021) Heart disease prediction using supervised machine learning algorithms: performance analysis and comparison. Comput Biol Med 136:104672

    Google Scholar 

  11. Yang L, Sun G, Wang A, Jiang H, Zhang S, Yang Y, Li X, Hao D, Xu M, Shao J (2020) Predictive models of hypertensive disorders in pregnancy based on support vector machine algorithm. Technol Health Care 28(S1):181–186

    Google Scholar 

  12. Chen M, Hao Y, Hwang K, Wang L, Wang L (2017) Disease prediction by machine learning over big data from healthcare communities. Ieee Access 5:8869–8879

    Google Scholar 

  13. Smiti A (2020) When machine learning meets medical world: current status and future challenges. Comput Sci Rev 37:100280

    MathSciNet  Google Scholar 

  14. Rostami M, Forouzandeh S, Berahmand K, Soltani M (2020) Integration of multi-objective pso based feature selection and node centrality for medical datasets. Genomics 112(6):4370–4384

    Google Scholar 

  15. Yan C, Ma J, Luo H, Patel A (2019) Hybrid binary coral reefs optimization algorithm with simulated annealing for feature selection in high-dimensional biomedical datasets. Chemom Intell Lab Syst 184:102–111

    Google Scholar 

  16. Han W, Huang Z, Li S, Jia Y (2019) Distribution-sensitive unbalanced data oversampling method for medical diagnosis. J Med Syst 43(2):1–10

    Google Scholar 

  17. Devarriya D, Gulati C, Mansharamani V, Sakalle A, Bhardwaj A (2020) Unbalanced breast cancer data classification using novel fitness functions in genetic programming. Expert Syst Appl 140:112866

    Google Scholar 

  18. Cai J, Luo J, Wang S, Yang S (2018) Feature selection in machine learning: a new perspective. Neurocomputing 300:70–79

    Google Scholar 

  19. Ventrella P, Delgrossi G, Ferrario G, Righetti M, Masseroli M (2021) Supervised machine learning for the assessment of chronic kidney disease advancement. Comput Methods Programs Biomed 209:106329

    Google Scholar 

  20. Saidi R, Bouaguel W, Essoussi N (2019) Hybrid feature selection method based on the genetic algorithm and pearson correlation coefficient. In: Machine learning paradigms: theory and application, pp 3–24

  21. Johnson P, Vandewater L, Wilson W, Maruff P, Savage G, Graham P, Macaulay LS, Ellis KA, Szoeke C, Martins RN et al (2014) Genetic algorithm with logistic regression for prediction of progression to alzheimer’s disease. BMC Bioinform 15(16):1–14

    Google Scholar 

  22. Wang Y, Makedon F (2004) Application of relief-f feature filtering algorithm to selecting informative genes for cancer classification using microarray data. In: Proceedings. In: 2004 IEEE computational systems bioinformatics conference ( IEEE), 2004. CSB 2004, pp 497–498

  23. Aada A, Tiwari S (2019) Predicting diabetes in medical datasets using machine learning techniques. Int J Sci Eng Res 5(2):257–267

    Google Scholar 

  24. Mohammed R, Rawashdeh J, Abdullah M (2020) Machine learning with oversampling and undersampling techniques: overview study and experimental results. In: 2020 11th international conference on information and communication systems (ICICS) (IEEE), pp 243–248

  25. Kaur P, Gosain A (2018) Comparing the behavior of oversampling and undersampling approach of class imbalance learning by combining class imbalance problem with noise. In: ICT based innovations, pp 23–30

  26. Fernández A, Garcia S, Herrera F, Chawla NV (2018) Smote for learning from imbalanced data: progress and challenges, marking the 15-year anniversary. J Artif Intell Res 61:863–905

    MathSciNet  Google Scholar 

  27. Mishra S (2017) Handling imbalanced data: smote versus random undersampling. Int Res J Eng Technol 4(8):317–320

    Google Scholar 

  28. Edward J, Rosli MM, Seman A (2023) A new multi-class rebalancing framework for imbalance medical data. IEEE Access. https://doi.org/10.1109/ACCESS.2023.3309732

    Article  Google Scholar 

  29. Tang X, Cai L, Meng Y, Gu C, Yang J, Yang J (2021) A novel hybrid feature selection and ensemble learning framework for unbalanced cancer data diagnosis with transcriptome and functional proteomic. IEEE Access 9:51659–51668

    Google Scholar 

  30. Rahim A, Rasheed Y, Azam F, Anwar MW, Rahim MA, Muzaffar AW (2021) An integrated machine learning framework for effective prediction of cardiovascular diseases. IEEE Access 9:106575–106588

    Google Scholar 

  31. Angebrandt Belošević P, Šmalcelj A, Kos N, Kordić K, Golubić K (2022) Left ventricular ejection fraction can predict atrial thrombosis even in non-high-risk individuals with atrial fibrillation. J Clin Med 11(14):3965

    Google Scholar 

  32. Cao M, Guo H, Zhao X, Li X, Sun C (2022) Refinement of chads2 and cha2ds2-vasc scores predict left atrial thrombus or spontaneous echo contrast in nonvalvular atrial fibrillation patients. J Int Med Res 50(1):03000605221074520

    Google Scholar 

  33. Wang F, Zhu M, Wang X, Zhang W, Su Y, Lu Y, Pan X, Gao D, Zhang X, Chen W et al (2018) Predictive value of left atrial appendage lobes on left atrial thrombus or spontaneous echo contrast in patients with non-valvular atrial fibrillation. BMC Cardiovasc Disord 18(1):1–11

    Google Scholar 

  34. Wang Y, Qiao Y, Mao Y, Jiang C, Fan J, Luo K (2020) Numerical prediction of thrombosis risk in left atrium under atrial fibrillation. Math Biosci Eng 17(3):2348–2360

    Google Scholar 

  35. Garcia-Villalba M, Rossini L, Gonzalo A, Vigneault D, Kahn AM, Flores O, McVeigh E, del Alamo de Pedro JC (2018) Patient-specific mapping of left atrial thrombosis risk by computational fluid dynamics. Circulation 138(Suppl_1):15017–15017

  36. Li Z, Pan L, Deng Y, Liu Q, Hidru TH, Liu F, Li C, Cong T, Yang X, Xia Y (2022) Development and validation of a nomogram for estimation of left atrial thrombus or spontaneous echo contrast risk in non-valvular atrial fibrillation patients with low to borderline cha2ds2-vasc score. Int J Gen Med 15:7329–7339

    Google Scholar 

  37. Wang G, Sun J, Ma J, Xu K, Gu J (2014) Sentiment classification: the contribution of ensemble learning. Decis Support Syst 57:77–93

    Google Scholar 

  38. Sun X, Liu L, Wang Z, Miao J, Wang Y, Luo Z, Li G (2017) An optimized multi-classifiers ensemble learning for identification of ginsengs based on electronic nose. Sens Actuators A 266:135–144

    Google Scholar 

  39. Davagdorj K, Pham VH, Theera-Umpon N, Ryu KH (2020) Xgboost-based framework for smoking-induced noncommunicable disease prediction. Int J Environ Res Public Health 17(18):6513

    Google Scholar 

  40. Chen R, Zhang S, Li J, Guo D, Zhang W, Wang X, Tian D, Qu Z, Wang X (2023) A study on predicting the length of hospital stay for chinese patients with ischemic stroke based on the xgboost algorithm. BMC Med Inform Decis Mak 23(1):1–10

    Google Scholar 

  41. Zhang B, Dong X, Hu Y, Jiang X, Li G (2023) Classification and prediction of spinal disease based on the smote-rfe-xgboost model. PeerJ Comput Sci 9:1280

    Google Scholar 

  42. Jothi Prakash V, Karthikeyan N (2021) Enhanced evolutionary feature selection and ensemble method for cardiovascular disease prediction. Interdiscip Sci Comput Life Sci 13(3):389–412

    Google Scholar 

  43. Qasem AG, Lam SS (2023) Prediction of wart treatment response using a hybrid ga-ensemble learning approach. Expert Syst Appl 221:119737

    Google Scholar 

  44. Vijayarani S, Dhayanand S, Phil M (2015) Kidney disease prediction using svm and ann algorithms. Int J Comput Bus Res 6(2):1–12

    Google Scholar 

  45. Vijayarani S, Dhayanand S (2015) Liver disease prediction using svm and naïve bayes algorithms. Int J Sci Eng Technol Res 4(4):816–820

    Google Scholar 

  46. Lin X, Song K, Lim N, Yuan X, Johnson T, Abderrahmani A, Vollenweider P, Stirnadel H, Sundseth S, Lai E et al (2009) Risk prediction of prevalent diabetes in a swiss population using a weighted genetic score-the colaus study. Diabetologia 52:600–608

    Google Scholar 

  47. Lee JM, Cho MS, Cha M-j, Kim J, Gi-Byoung N, Choi K-j (2022) Incidence and predictors of left atrial thrombus in patients with atrial fibrillation under standard anticoagulation therapy. Circulation 146:11864–11864

    Google Scholar 

  48. Zheng N, Zhang J (2022) External validation and comparison of cha2ds2-vasc-raf and cha2ds2-vasc-laf scores for predicting left atrial thrombus and spontaneous echo contrast in patients with non-valvular atrial fibrillation. J Interv Card Electrophysiol 65(2):535–542

    Google Scholar 

  49. Jia F, Tian Y, Lei S, Yang Y, Luo S, He Q (2019) Incidence and predictors of left atrial thrombus in patients with atrial fibrillation prior to ablation in the real world of china. Indian Pacing Electrophysiol J 19(4):134–139

    Google Scholar 

  50. Burczak DR, Julakanti RR, Kara Balla A, Scott CG, Geske JB, Ommen SR, Nkomo VT, Gersh BJ, Noseworthy PA, Siontis KC (2023) Risk of left atrial thrombus in patients with hypertrophic cardiomyopathy and atrial fibrillation. J Am Coll Cardiol 82(3):278–279

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Li Li.

Ethics declarations

Conflict of interest

The study has no conflict of interest to declare by any author.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, L., Fang, D., Ye, Q. et al. An ensemble framework for risk prediction of left atrial thrombus based on undersampling with replacement. Neural Comput & Applic 36, 18613–18625 (2024). https://doi.org/10.1007/s00521-024-10166-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-024-10166-6

Keywords