Rahman 2020
Rahman 2020
Rahman 2020
Abstract—The number of service providers are being increased effective use of marketing capital. Technology has been used to
very rapidly in every business. In these days, there is no shortage help businesses to retain a competitive edge [3]. Data mining
of options for customers in the banking sector when choosing techniques [4] are a commonly used information technology
where to put their money. As a result, customer churn and
engagement has become one of the top issues for most of the for the extraction of marketing expertise and further guidance
banks. In this paper, a method to predicts the customer churn for business decisions.
in a Bank, using machine learning techniques, which is a branch It is very easy for customers to switch from one organiza-
of artificial intelligence is proposed. The research promotes the tion(Bank) to another for a better service quality or price rates.
exploration of the likelihood of churn by analyzing customer Organizations are convinced that recruiting new customers is
behavior. The KNN, SVM, Decision Tree, and Random Forest
classifiers are used in this study. Also, some feature selection far more expensive and hard than keeping existing clients
methods have been done to find the more relevant features and to [5]. But delivering reliable service on time and in budget
verify system performance. The experimentation was conducted to customers while maintaining a good working partnership
on the churn modeling dataset from Kaggle. The results are with them is another significant challenge for them. They
compared to find an appropriate model with higher precision need to consider consumers and their needs to resolve these
and predictability. As a result, the use of the Random Forest
model after oversampling is better compared to other models in challenges. Among these, one of their primary emphasis will
terms of accuracy. be on client churn. Customer churn takes place when clients
Index Terms—Customer churn in Bank, k-Nearest Neighbor, or subscribers cease to engage incorporation with a company
Support Vector Machine, Decision Tree, Random Forest. or service. For any organization, winning business from new
clients means going via the sales pipeline, using their sales
I. I NTRODUCTION and marketing assets in the cycle. Customer retention, on the
The market is very dynamic and highly competitive nowa- other hand, is usually more budget-effective, because they
days. It is because of the availability of a large number have already gained the confidence and loyalty of current
of service providers. The challenges of service providers customers. So, the need for a system that can efficiently predict
are finding the changing customer behavior and their rising customer churn in the early stages is really important for
expectations. The raising aspirations of current generation any organization. This paper aims to build a framework that
consumers and their diverse demands for connectivity and can predict the client churn in the banking sector using some
innovative, personalized approaches are very distinct from Machine learning techniques [6].
previous generations of consumers. They are well educated
and better informed of emerging approaches. Such advanced II. L ITERATURE R EVIEW
knowledge has changed their purchasing behavior, resulting in The analysis of the client churn in banking is a really broad
a trend of ’analysis-paralysis’ over-analyzing the selling and area. In one of these studies, [7] pursue commercial bank client
purchase scenario, which ultimately helps them to improve churn prediction based on the SVM model. For this work,
their purchase decisions. Therefore, this is a big challenge for a Chinese commercial bank consumer dataset that contains
the new generation service providers to think of innovatively 50,000 customer information is selected. After preprocessing
to fulfil and add values to the customers. records, there are eventually 46,406 valid data records. Two
Corporations need to recognize their consumers. Liu and types of SVM model is selected: linear SVM and SVM
Shih [1] strengthen this argument by implying that increasing with radial basis kernel function. The predictive effect of
competitive pressures on organizations to develop innova- the classification models was greatly improved by the under-
tive marketing approaches, to meet consumer expectations sampling approach. Due to the lopsided features of the actual
and enhance loyalty and retention. Canning [2] argues that commercial bank client churn dataset, the SVM model can not
offering more to all is no longer a viable sales strategy, accurately predict churners and even the general assessment
and a market environment that continues to become more parameters can not calculate the predictive power of the
competitive needs an agenda that emphasizes on the most model. The findings show that the integration of the random
Authorized licensed use limited to: University of Cape Town. Downloaded on May 19,2021 at 00:58:44 UTC from IEEE Xplore. Restrictions apply.
Fourth International Conference on Electronics, Communication and Aerospace Technology (ICECA-2020)
IEEE Xplore Part Number: CFP20J88-ART; ISBN: 978-1-7281-6387-1
Authorized licensed use limited to: University of Cape Town. Downloaded on May 19,2021 at 00:58:44 UTC from IEEE Xplore. Restrictions apply.
Fourth International Conference on Electronics, Communication and Aerospace Technology (ICECA-2020)
IEEE Xplore Part Number: CFP20J88-ART; ISBN: 978-1-7281-6387-1
TABLE II
DATASET DESCRIPTION
1) Irrelevancy: Data or features which have no impact • Gender: Female -> 0 and Male -> 1
on the topic of discussion shall be considered as irrelevant.
Keeping such attributes may sometimes affect the performance C. Feature Selection
of classifiers. While considering the churn dataset, the features In machine learning, the process of identifying a subset of
named Row number, Customer Id, Surname, and Geography appropriate predictors to be used in model building is called
has nothing to do with the prediction. So these features have feature selection. The selection phase of the feature is very
been neglected manually in this study. critical as it will help to shorten the training time, escape the
2) Transformation: Data transformation is the practice of high-dimensionality curse, and above all simplify the model.
turning data from one form into another. Properly structured 1) mRMR: Minimum Redundancy Maximum Relevance
and validated data enhance the quality of data and protect (mRMR) is one of the feature selection methods of filter type.
applications against possible minefields such as null values, For classification problems, it will rate features sequentially
unwanted duplicates, incorrect indexing, and incompatible using the mRMR algorithm [10]. The filter type feature
formats. In this work the following data transformation is selection algorithm evaluates feature significance based on
carried out; feature characteristics, such as the variance of feature and
Authorized licensed use limited to: University of Cape Town. Downloaded on May 19,2021 at 00:58:44 UTC from IEEE Xplore. Restrictions apply.
Fourth International Conference on Electronics, Communication and Aerospace Technology (ICECA-2020)
IEEE Xplore Part Number: CFP20J88-ART; ISBN: 978-1-7281-6387-1
Authorized licensed use limited to: University of Cape Town. Downloaded on May 19,2021 at 00:58:44 UTC from IEEE Xplore. Restrictions apply.
Fourth International Conference on Electronics, Communication and Aerospace Technology (ICECA-2020)
IEEE Xplore Part Number: CFP20J88-ART; ISBN: 978-1-7281-6387-1
Authorized licensed use limited to: University of Cape Town. Downloaded on May 19,2021 at 00:58:44 UTC from IEEE Xplore. Restrictions apply.
Fourth International Conference on Electronics, Communication and Aerospace Technology (ICECA-2020)
IEEE Xplore Part Number: CFP20J88-ART; ISBN: 978-1-7281-6387-1
R EFERENCES
[1] D.-R. Liu and Y.-Y. Shih, “Integrating ahp and data mining for product
recommendation based on customer lifetime value,” Information &
Management, vol. 42, no. 3, pp. 387–400, 2005.
[2] G. Canning Jr, “Do a value analysis of your customer base,” Industrial
Marketing Management, vol. 11, no. 2, pp. 89–93, 1982.
[3] R. W. Stone and D. J. Good, “The assimilation of computer-aided
marketing activities,” Information & management, vol. 38, no. 7, pp.
437–447, 2001.
[4] M.-S. Chen, J. Han, and P. S. Yu, “Data mining: an overview from
a database perspective,” IEEE Transactions on Knowledge and data
Engineering, vol. 8, no. 6, pp. 866–883, 1996.
[5] M.-K. Kim, M.-C. Park, and D.-H. Jeong, “The effects of customer
satisfaction and switching barrier on customer loyalty in korean mobile
telecommunication services,” Telecommunications policy, vol. 28, no. 2,
pp. 145–159, 2004.
[6] I. T. J. Swamidason, “Survey of data mining algorithm’s for intelligent
computing system,” Journal of Trends in Computer Science and Smart
Technology, vol. 01, pp. 14–23, 09 2019.
[7] B. He, Y. Shi, Q. Wan, and X. Zhao, “Prediction of customer attrition of
commercial banks based on svm model,” Procedia Computer Science,
vol. 31, pp. 423–430, 2014.
[8] A. Bilal Zorić, “Predicting customer churn in banking industry using
neural networks,” Interdisciplinary Description of Complex Systems:
INDECS, vol. 14, no. 2, pp. 116–124, 2016.
[9] A. K. Ahmad, A. Jafar, and K. Aljoumaa, “Customer churn prediction
in telecom using machine learning in big data platform,” Journal of Big
Data, vol. 6, no. 1, p. 28, 2019.
[10] Y. Jiang and C. Li, “mrmr-based feature selection for classification
of cotton foreign matter using hyperspectral imaging,” Computers and
Electronics in Agriculture, vol. 119, pp. 191–200, 2015.
[11] L. Beretta and A. Santaniello, “Implementing relieff filters to extract
meaningful features from genetic lifetime datasets,” Journal of biomed-
ical informatics, vol. 44, no. 2, pp. 361–369, 2011.
[12] R. O. Duda, P. E. Hart, and D. G. Stork, Pattern classification. John
Wiley & Sons, 2012.
[13] C. Cortes and V. Vapnik, “Support-vector networks,” Machine learning,
vol. 20, no. 3, pp. 273–297, 1995.
[14] V. Vapnik, The nature of statistical learning theory. Springer science
& business media, 2013.
[15] V. N. Vapnik, “An overview of statistical learning theory,” IEEE trans-
actions on neural networks, vol. 10, no. 5, pp. 988–999, 1999.
[16] J. Raj and V. Ananthi, “Recurrent neural networks and nonlinear predic-
tion in support vector machines,” Journal of Soft Computing Paradigm,
vol. 2019, pp. 33–40, 09 2019.
[17] P. G. Nieto, E. F. Combarro, J. del Coz Dı́az, and E. Montañés, “A svm-
based regression model to study the air quality at local scale in oviedo
urban area (northern spain): A case study,” Applied Mathematics and
Computation, vol. 219, no. 17, pp. 8923–8937, 2013.
[18] S.-G. Cao, Y.-B. Liu, and Y.-P. Wang, “A forecasting and forewarning
model for methane hazard in working face of coal mine based on ls-
svm,” Journal of China University of Mining and Technology, vol. 18,
no. 2, pp. 172–176, 2008.
[19] Y. Tang, “Deep learning using linear support vector machines,” arXiv
preprint arXiv:1306.0239, 2013.
[20] L. Breiman, J. Friedman, C. J. Stone, and R. A. Olshen, Classification
and regression trees. CRC press, 1984.
[21] N. B. Amor, S. Benferhat, and Z. Elouedi, “Qualitative classification
with possibilistic decision trees,” in Modern Information Processing.
Elsevier, 2006, pp. 159–169.
[22] L. Breiman, “Random forests,” Machine learning, vol. 45, no. 1, pp.
5–32, 2001.
Authorized licensed use limited to: University of Cape Town. Downloaded on May 19,2021 at 00:58:44 UTC from IEEE Xplore. Restrictions apply.