Abstract
This study adopts five common machine learning algorithms for predicting consumer personal loan uptake, including Logistic Regression, Support Vector Machine, Multilayer Perceptron, Gradient Boosting Decision Trees Catboost, and Xgboost. The research utilizes data from Thera Bank available in the public database Kaggle, featuring fields like age, work experience, income, family size, average credit card expenditure, education level, home loans, securities account, deposit account, and internet banking usage. The study addresses the issue of imbalanced data using the SMOTE (Synthetic Minority Over-sampling Technique) method and compares the accuracy and stability of predictions using the five models with three different sampling rates to identify the optimal model and key factors. Empirical results show that the Gradient Boosting Catboost model and the Support Vector Machine model perform with stability and precision across different sampling ratios, making them the best models. Moreover, through the Gradient Boosting Xgboost model, the study identifies key features such as educational factors, income, family size, the existence of a deposit account, and annual credit card spending. The findings of this research can provide crucial factors for financial institutions when formulating marketing strategies for personal loans.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Akça, M.F., Sevli, O.: Predicting acceptance of the bank loan offers by using support vector machines. Int. Adv. Res. Eng. J. 6(2), 142–147 (2022)
Agarwal, K., Jain, M., & Kumawat, A.: Comparing classification algorithms on predicting loans. In: Information Systems and Management Science: Conference Proceedings of 3rd International Conference on Information Systems and Management Science (ISMS) 2020 (pp. 240–249). Springer International Publishing (2022). https://doi.org/10.1007/978-3-030-86223-7_21
Amari, S.: A theory of adaptive pattern classifiers. IEEE Trans. Electron. Comput. 3, 299–307 (1967)
Anand, M., Velu, A., Whig, P.: Prediction of loan behaviour with machine learning models for secure banking. J. Comput. Sci. Eng. (JCSE) 3(1), 1–13 (2022)
Arun, K., Ishan, G., Sanmeet, K.: Loan approval prediction based on machine learning approach. IOSR J. Comput. Eng 18(3), 18–21 (2016)
Boser, B.E., Guyon, I.M., Vapnik, V.N.: A training algorithm for optimal margin classifiers. In Proceedings of the Fifth Annual Workshop on Computational Learning Theory, pp. 144–152 (1992)
Cox, D.R.: The regression analysis of binary sequences. J. R. Stat. Soc. Ser. B Stat Methodol. 20(2), 215–232 (1958)
Cramer, J.S.: The origins of logistic regression (2002)
Eletter, S.F., Yaseen, S.G.: Loan decision models for the Jordanian commercial banks. Global Bus. Econ. Rev. 19(3), 323–338 (2017)
Huang, J., Chai, J., Cho, S.: Deep learning in finance and banking: a literature review and classification. Front. Bus. Res. China 14(1), 1–24 (2020)
Ibrahim, A.A., Ridwan, R.L., Muhammed, M.M., Abdulaziz, R.O., Saheed, G.A.: Comparison of the CatBoost classifier with other machine learning methods. Int. J. Adv. Comput. Sci. Appl. 11(11) (2020)
Li, X., Ergu, D., Zhang, D., Qiu, D., Cai, Y., Ma, B.: Prediction of loan default based on multi-model fusion. Procedia Comput. Sci. 199, 757–764 (2022)
Ma, L., Sun, B.: Machine learning and AI in marketing–Connecting computing power to human insights. Int. J. Res. Mark. 37(3), 481–504 (2020)
Nosratabadi, S., et al.: Data science in economics: comprehensive review of advanced machine learning and deep learning methods. Mathematics 8(10), 1799 (2020)
Prasad, K.G.S., Chidvilas, P.V.S., Kumar, V.V.: Customer loan approval classification by supervised learning model. Int. J. Recent Technol. Eng. 8(4), 9898–9901 (2019)
Sreesouthry, S., Ayubkhan, A., Rizwan, M.M., Lokesh, D., Raj, K.P.: Loan prediction using logistic regression in machine learning. Ann. Romanian Soc. Cell Biol. 25(4), 2790–2794 (2021)
Tax, N., et al.: Machine learning for fraud detection in e-commerce: a research agenda. In: Deployable Machine Learning for Security Defense: Second International Workshop, MLHat 2021, Virtual Event, August 15, 2021, Proceedings 2 (pp. 30–54). Springer International Publishing (2021). https://doi.org/10.1007/978-3-030-87839-9_2
Zhang, D., Gong, Y., Yu, L., Wang, X.: P2P online loan willingness prediction and influencing factors analysis based on factor analysis and XGBoost. J. Phys.: Conf. Ser. 1624(4), 042039 (2020). IOP Publishing
Zhu, Y., Zhou, L., Xie, C., Wang, G.J., Nguyen, T.V.: Forecasting SMEs’ credit risk in supply chain finance with an enhanced hybrid ensemble machine learning approach. Int. J. Prod. Econ. 211, 22–33 (2019)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Huang, HI., Wang, CW., Wu, CW. (2024). Predictive Analysis for Personal Loans by Using Machine Learning. In: Nah, F.FH., Siau, K.L. (eds) HCI in Business, Government and Organizations. HCII 2024. Lecture Notes in Computer Science, vol 14720. Springer, Cham. https://doi.org/10.1007/978-3-031-61315-9_13
Download citation
DOI: https://doi.org/10.1007/978-3-031-61315-9_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-61314-2
Online ISBN: 978-3-031-61315-9
eBook Packages: Computer ScienceComputer Science (R0)