An ensemble framework for risk prediction of left atrial thrombus based on undersampling with replacement

Li, Li; Fang, Dongshen; Ye, Qiyao; Hu, Tan; Shi, Shaobo

doi:10.1007/s00521-024-10166-6

An ensemble framework for risk prediction of left atrial thrombus based on undersampling with replacement

Original Article
Published: 30 July 2024

Volume 36, pages 18613–18625, (2024)
Cite this article

Neural Computing and Applications Aims and scope Submit manuscript

Li Li ORCID: orcid.org/0000-0003-0611-583X¹,
Dongshen Fang¹,
Qiyao Ye¹,
Tan Hu¹ &
…
Shaobo Shi²

We’re sorry, something doesn't seem to be working properly.

Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.

Abstract

Left atrial thrombus (LAT) impacts humans greatly and can result in ischemia and necrosis in severe cases. Therefore, health workers appeal to the social community to emphasize the importance of preventive treatment for LAT. This paper proposes an ensemble framework for risk prediction of LAT based on undersampling with replacement (EFRP-UR), addressing the problem of data imbalance. Firstly, in the feature selection process, we adopt the method of separately counting the essential features of data subsets. In view of the characteristics of class imbalance in medical data, we apply our improved undersamling method, “undersampling with replacement", to obtain a number of training subsets, train multiple base-classifiers, and use an iterative method to select the classifiers with better performance for subsequent integration, improving the prediction accuracy of the proposed EFRP-UR. To aim for disease risk prediction, we synthesize the results of different ensemble algorithms in the end to increase the recall rate. Applied to the LAT dataset obtained from the Regional Medical Center, our experimental results prove that the proposed EFRP-UR has improved in accuracy, recall rate and F1 value, compared with any single base-classifier. In addition, if comprehensive data on other diseases exist, EFRP-UR can also be transferred to predict other diseases.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Solving the class imbalance problem using ensemble algorithm: application of screening for aortic dissection

Article Open access 28 March 2022

An Ensemble-Based Extra Feature Selection Approach for Predicting Heart Disease

Early Heart Disease Diagnosis Through Feature Optimization with Ensemble Learning-Based Hybrid Bagging and Boosting Techniques

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Data availability

Our dataset is provided by Department of Cardiovascular Medicine, Renmin Hospital of Wuhan University. It can be made available on reasonable request.

References

Lip GY, Tse H-F (2007) Management of atrial fibrillation. Lancet 370(9587):604–618
Google Scholar
Lippi G, Sanchis-Gomar F, Cervellin G (2021) Global epidemiology of atrial fibrillation: an increasing epidemic and public health challenge. Int J Stroke 16(2):217–221
Google Scholar
Lurie A, Wang J, Hinnegan KJ, McIntyre WF, Belley-Côté EP, Amit G, Healey JS, Connolly SJ, Wong JA (2021) Prevalence of left atrial thrombus in anticoagulated patients with atrial fibrillation. J Am Coll Cardiol 77(23):2875–2886
Google Scholar
Gaita F, Caponi D, Pianelli M, Scaglione M, Toso E, Cesarani F, Boffano C, Gandini G, Valentini MC, De Ponti R et al (2010) Radiofrequency catheter ablation of atrial fibrillation: A cause of silent thromboembolism? magnetic resonance imaging assessment of cerebral thromboembolism in patients undergoing ablation of atrial fibrillation. Circulation 122(17):1667–1673
Google Scholar
Cresti A, García-Fernández MA, Sievert H, Mazzone P, Baratta P, Solari M, Geyer A, De Sensi F, Limbruno U (2019) Prevalence of extra-appendage thrombosis in non-valvular atrial fibrillation and atrial flutter in patients undergoing cardioversion: a large transoesophageal echo study. EuroIntervention 15(3):225–230
Google Scholar
Leung DY, Davidson PM, Cranney GB, Walsh WF (1997) Thromboembolic risks of left atrial thrombus detected by transesophageal echocardiogram. Am J Cardiol 79(5):626–629
Google Scholar
Gurudevan SV, Shah H, Tolstrup K, Siegel R, Krishnan SC (2010) Septal thrombus in the left atrium: Is the left atrial septal pouch the culprit? JACC Cardiovasc Imaging 3(12):1284–1286
Google Scholar
Cresti A, Galli CA, Alimento ML, De Sensi F, Baratta P, D’Aiello I, Limbruno U, Pepi M, Fusini L, Maltagliati AC (2019) Does mitral regurgitation reduce the risks of thrombosis in atrial fibrillation and flutter? J Cardiovasc Med 20(10):660–666
Google Scholar
Miller LM, Gal A (2017) Cardiovascular system and lymphatic vessels. Pathol Basis Vet Dis. https://doi.org/10.1016/B978-0-323-35775-3.00010-2
Article Google Scholar
Ali MM, Paul BK, Ahmed K, Bui FM, Quinn JM, Moni MA (2021) Heart disease prediction using supervised machine learning algorithms: performance analysis and comparison. Comput Biol Med 136:104672
Google Scholar
Yang L, Sun G, Wang A, Jiang H, Zhang S, Yang Y, Li X, Hao D, Xu M, Shao J (2020) Predictive models of hypertensive disorders in pregnancy based on support vector machine algorithm. Technol Health Care 28(S1):181–186
Google Scholar
Chen M, Hao Y, Hwang K, Wang L, Wang L (2017) Disease prediction by machine learning over big data from healthcare communities. Ieee Access 5:8869–8879
Google Scholar
Smiti A (2020) When machine learning meets medical world: current status and future challenges. Comput Sci Rev 37:100280
MathSciNet Google Scholar
Rostami M, Forouzandeh S, Berahmand K, Soltani M (2020) Integration of multi-objective pso based feature selection and node centrality for medical datasets. Genomics 112(6):4370–4384
Google Scholar
Yan C, Ma J, Luo H, Patel A (2019) Hybrid binary coral reefs optimization algorithm with simulated annealing for feature selection in high-dimensional biomedical datasets. Chemom Intell Lab Syst 184:102–111
Google Scholar
Han W, Huang Z, Li S, Jia Y (2019) Distribution-sensitive unbalanced data oversampling method for medical diagnosis. J Med Syst 43(2):1–10
Google Scholar
Devarriya D, Gulati C, Mansharamani V, Sakalle A, Bhardwaj A (2020) Unbalanced breast cancer data classification using novel fitness functions in genetic programming. Expert Syst Appl 140:112866
Google Scholar
Cai J, Luo J, Wang S, Yang S (2018) Feature selection in machine learning: a new perspective. Neurocomputing 300:70–79
Google Scholar
Ventrella P, Delgrossi G, Ferrario G, Righetti M, Masseroli M (2021) Supervised machine learning for the assessment of chronic kidney disease advancement. Comput Methods Programs Biomed 209:106329
Google Scholar
Saidi R, Bouaguel W, Essoussi N (2019) Hybrid feature selection method based on the genetic algorithm and pearson correlation coefficient. In: Machine learning paradigms: theory and application, pp 3–24
Johnson P, Vandewater L, Wilson W, Maruff P, Savage G, Graham P, Macaulay LS, Ellis KA, Szoeke C, Martins RN et al (2014) Genetic algorithm with logistic regression for prediction of progression to alzheimer’s disease. BMC Bioinform 15(16):1–14
Google Scholar
Wang Y, Makedon F (2004) Application of relief-f feature filtering algorithm to selecting informative genes for cancer classification using microarray data. In: Proceedings. In: 2004 IEEE computational systems bioinformatics conference ( IEEE), 2004. CSB 2004, pp 497–498
Aada A, Tiwari S (2019) Predicting diabetes in medical datasets using machine learning techniques. Int J Sci Eng Res 5(2):257–267
Google Scholar
Mohammed R, Rawashdeh J, Abdullah M (2020) Machine learning with oversampling and undersampling techniques: overview study and experimental results. In: 2020 11th international conference on information and communication systems (ICICS) (IEEE), pp 243–248
Kaur P, Gosain A (2018) Comparing the behavior of oversampling and undersampling approach of class imbalance learning by combining class imbalance problem with noise. In: ICT based innovations, pp 23–30
Fernández A, Garcia S, Herrera F, Chawla NV (2018) Smote for learning from imbalanced data: progress and challenges, marking the 15-year anniversary. J Artif Intell Res 61:863–905
MathSciNet Google Scholar
Mishra S (2017) Handling imbalanced data: smote versus random undersampling. Int Res J Eng Technol 4(8):317–320
Google Scholar
Edward J, Rosli MM, Seman A (2023) A new multi-class rebalancing framework for imbalance medical data. IEEE Access. https://doi.org/10.1109/ACCESS.2023.3309732
Article Google Scholar
Tang X, Cai L, Meng Y, Gu C, Yang J, Yang J (2021) A novel hybrid feature selection and ensemble learning framework for unbalanced cancer data diagnosis with transcriptome and functional proteomic. IEEE Access 9:51659–51668
Google Scholar
Rahim A, Rasheed Y, Azam F, Anwar MW, Rahim MA, Muzaffar AW (2021) An integrated machine learning framework for effective prediction of cardiovascular diseases. IEEE Access 9:106575–106588
Google Scholar
Angebrandt Belošević P, Šmalcelj A, Kos N, Kordić K, Golubić K (2022) Left ventricular ejection fraction can predict atrial thrombosis even in non-high-risk individuals with atrial fibrillation. J Clin Med 11(14):3965
Google Scholar
Cao M, Guo H, Zhao X, Li X, Sun C (2022) Refinement of chads2 and cha2ds2-vasc scores predict left atrial thrombus or spontaneous echo contrast in nonvalvular atrial fibrillation patients. J Int Med Res 50(1):03000605221074520
Google Scholar
Wang F, Zhu M, Wang X, Zhang W, Su Y, Lu Y, Pan X, Gao D, Zhang X, Chen W et al (2018) Predictive value of left atrial appendage lobes on left atrial thrombus or spontaneous echo contrast in patients with non-valvular atrial fibrillation. BMC Cardiovasc Disord 18(1):1–11
Google Scholar
Wang Y, Qiao Y, Mao Y, Jiang C, Fan J, Luo K (2020) Numerical prediction of thrombosis risk in left atrium under atrial fibrillation. Math Biosci Eng 17(3):2348–2360
Google Scholar
Garcia-Villalba M, Rossini L, Gonzalo A, Vigneault D, Kahn AM, Flores O, McVeigh E, del Alamo de Pedro JC (2018) Patient-specific mapping of left atrial thrombosis risk by computational fluid dynamics. Circulation 138(Suppl_1):15017–15017
Li Z, Pan L, Deng Y, Liu Q, Hidru TH, Liu F, Li C, Cong T, Yang X, Xia Y (2022) Development and validation of a nomogram for estimation of left atrial thrombus or spontaneous echo contrast risk in non-valvular atrial fibrillation patients with low to borderline cha2ds2-vasc score. Int J Gen Med 15:7329–7339
Google Scholar
Wang G, Sun J, Ma J, Xu K, Gu J (2014) Sentiment classification: the contribution of ensemble learning. Decis Support Syst 57:77–93
Google Scholar
Sun X, Liu L, Wang Z, Miao J, Wang Y, Luo Z, Li G (2017) An optimized multi-classifiers ensemble learning for identification of ginsengs based on electronic nose. Sens Actuators A 266:135–144
Google Scholar
Davagdorj K, Pham VH, Theera-Umpon N, Ryu KH (2020) Xgboost-based framework for smoking-induced noncommunicable disease prediction. Int J Environ Res Public Health 17(18):6513
Google Scholar
Chen R, Zhang S, Li J, Guo D, Zhang W, Wang X, Tian D, Qu Z, Wang X (2023) A study on predicting the length of hospital stay for chinese patients with ischemic stroke based on the xgboost algorithm. BMC Med Inform Decis Mak 23(1):1–10
Google Scholar
Zhang B, Dong X, Hu Y, Jiang X, Li G (2023) Classification and prediction of spinal disease based on the smote-rfe-xgboost model. PeerJ Comput Sci 9:1280
Google Scholar
Jothi Prakash V, Karthikeyan N (2021) Enhanced evolutionary feature selection and ensemble method for cardiovascular disease prediction. Interdiscip Sci Comput Life Sci 13(3):389–412
Google Scholar
Qasem AG, Lam SS (2023) Prediction of wart treatment response using a hybrid ga-ensemble learning approach. Expert Syst Appl 221:119737
Google Scholar
Vijayarani S, Dhayanand S, Phil M (2015) Kidney disease prediction using svm and ann algorithms. Int J Comput Bus Res 6(2):1–12
Google Scholar
Vijayarani S, Dhayanand S (2015) Liver disease prediction using svm and naïve bayes algorithms. Int J Sci Eng Technol Res 4(4):816–820
Google Scholar
Lin X, Song K, Lim N, Yuan X, Johnson T, Abderrahmani A, Vollenweider P, Stirnadel H, Sundseth S, Lai E et al (2009) Risk prediction of prevalent diabetes in a swiss population using a weighted genetic score-the colaus study. Diabetologia 52:600–608
Google Scholar
Lee JM, Cho MS, Cha M-j, Kim J, Gi-Byoung N, Choi K-j (2022) Incidence and predictors of left atrial thrombus in patients with atrial fibrillation under standard anticoagulation therapy. Circulation 146:11864–11864
Google Scholar
Zheng N, Zhang J (2022) External validation and comparison of cha2ds2-vasc-raf and cha2ds2-vasc-laf scores for predicting left atrial thrombus and spontaneous echo contrast in patients with non-valvular atrial fibrillation. J Interv Card Electrophysiol 65(2):535–542
Google Scholar
Jia F, Tian Y, Lei S, Yang Y, Luo S, He Q (2019) Incidence and predictors of left atrial thrombus in patients with atrial fibrillation prior to ablation in the real world of china. Indian Pacing Electrophysiol J 19(4):134–139
Google Scholar
Burczak DR, Julakanti RR, Kara Balla A, Scott CG, Geske JB, Ommen SR, Nkomo VT, Gersh BJ, Noseworthy PA, Siontis KC (2023) Risk of left atrial thrombus in patients with hypertrophic cardiomyopathy and atrial fibrillation. J Am Coll Cardiol 82(3):278–279
Google Scholar

Download references

Author information

Authors and Affiliations

School of Electronic Information, Wuhan University, Wuhan, 430072, China
Li Li, Dongshen Fang, Qiyao Ye & Tan Hu
Department of Cardiology, Renmin Hospital of Wuhan University, Wuhan, 430060, China
Shaobo Shi

Authors

Li Li
View author publications
You can also search for this author in PubMed Google Scholar
Dongshen Fang
View author publications
You can also search for this author in PubMed Google Scholar
Qiyao Ye
View author publications
You can also search for this author in PubMed Google Scholar
Tan Hu
View author publications
You can also search for this author in PubMed Google Scholar
Shaobo Shi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Li Li.

Ethics declarations

Conflict of interest

The study has no conflict of interest to declare by any author.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Li, L., Fang, D., Ye, Q. et al. An ensemble framework for risk prediction of left atrial thrombus based on undersampling with replacement. Neural Comput & Applic 36, 18613–18625 (2024). https://doi.org/10.1007/s00521-024-10166-6

Download citation

Received: 12 January 2023
Accepted: 01 July 2024
Published: 30 July 2024
Issue Date: October 2024
DOI: https://doi.org/10.1007/s00521-024-10166-6

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

An ensemble framework for risk prediction of left atrial thrombus based on undersampling with replacement

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Solving the class imbalance problem using ensemble algorithm: application of screening for aortic dissection

An Ensemble-Based Extra Feature Selection Approach for Predicting Heart Disease

Early Heart Disease Diagnosis Through Feature Optimization with Ensemble Learning-Based Hybrid Bagging and Boosting Techniques

Data availability

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

An ensemble framework for risk prediction of left atrial thrombus based on undersampling with replacement

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Solving the class imbalance problem using ensemble algorithm: application of screening for aortic dissection

An Ensemble-Based Extra Feature Selection Approach for Predicting Heart Disease

Early Heart Disease Diagnosis Through Feature Optimization with Ensemble Learning-Based Hybrid Bagging and Boosting Techniques

Explore related subjects

Data availability

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation