Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

A novel time series clustering method with fine-tuned support vector regression for customer behavior analysis

Published: 15 October 2022 Publication History

Highlights

A two-stage customer behavior analysis methodology is proposed.
The methodology performs customer segmentation using historical transactions.
Hybrid SVRGOA is proposed to forecast customers’ behavior accurately.
The empirical results show that the SVRGOA outperforms other standard techniques.

Abstract

Exploring and forecasting customers’ behavior via time series analysis techniques has gained much attention in recent years. In this context, distance-based time series clustering methods are widely utilized to divide customers into segments. However, the performance of distance-based clustering is highly influenced by the distance metrics chosen. Determining a suitable distance metric for raw time series is a challenging task that must consider several factors. Therefore, in this study, we focus on feature-based time series clustering and propose a new featurization technique exploiting the Laplacian feature ranking method to obtain meaningful customer segments. We evaluate the proposed featurization approach with four state-of-the-art clustering methods using the point-of-sale (POS) transaction data. The clustering results indicate that the proposed method outperforms the baseline method in terms of the Silhouette measure. Besides, we present an hybrid support vector regression with the grasshopper optimization (SVRGOA) to forecast customers’ behavior. This method is compared against three benchmarks, and the results reveal that SVRGOA outperforms other models in the majority of cases in terms of the symmetric mean absolute percentage error (SMAPE).

References

[1]
H. Abbasimehr, A. Bahrini, An analytical framework based on the recency, frequency, and monetary model and time series clustering techniques for dynamic segmentation, Expert Systems with Applications 192 (2022),.
[2]
H. Abbasimehr, R. Paki, A. Bahrini, A novel approach based on combining deep learning models with statistical methods for COVID-19 time series forecasting, Neural Computing and Applications 34 (4) (2022) 3135–3149,.
[3]
H. Abbasimehr, M. Shabani, A new methodology for customer behavior analysis using time series clustering, Kybernetes 50 (2) (2019) 221–242,.
[4]
Abbasimehr, H., & Shabani, M. (2020a). Forecasting of Customer Behavior Using Time Series Analysis, Cham.
[5]
H. Abbasimehr, M. Shabani, A new framework for predicting customer behavior in terms of RFM by considering the temporal aspect based on time series techniques, Journal of Ambient Intelligence and Humanized Computing 12 (1) (2021) 515–531,.
[6]
H. Abbasimehr, M. Shabani, M. Yousefi, An optimized model using LSTM network for demand forecasting, Computers & Industrial Engineering 143 (2020),.
[7]
S. Aghabozorgi, A. Seyed Shirkhorshidi, T. Ying Wah, Time-series clustering – A decade review, Information Systems 53 (2015) 16–38,.
[8]
E. Akhondzadeh-Noughabi, A. Albadvi, Mining the dominant patterns of customer shifts between segments by using top-k and distinguishing sequential rules, Management Decision 53 (9) (2015) 1976–2003.
[9]
M. Alborzi, M. Khanbabaei, Using data mining and neural networks techniques to propose a new hybrid customer behaviour analysis and credit scoring model in banking services based on a developed RFM analysis method, International Journal of Business Information Systems 23 (1) (2016) 1–22,.
[10]
O. Arbelaitz, I. Gurrutxaga, J. Muguerza, J.M. Pérez, I. Perona, An extensive comparative study of cluster validity indices, Pattern Recognition 46 (1) (2013) 243–256,.
[11]
M. Awad, R. Khanna, Support Vector Regression, in: M. Awad, R. Khanna (Eds.), Efficient Learning Machines: Theories, Concepts, and Applications for Engineers and System Designers, Apress, Berkeley, CA, 2015, pp. 67–80.
[12]
K. Bandara, C. Bergmeir, S. Smyl, Forecasting across time series databases using recurrent neural networks on groups of similar series: A clustering approach, Expert Systems with Applications 140 (2020),.
[13]
P. Berkhin, A Survey of Clustering Data Mining Techniques, in: J. Kogan, C. Nicholas, M. Teboulle (Eds.), Grouping Multidimensional Data: Recent Advances in Clustering, Springer, Berlin Heidelberg, Berlin, Heidelberg, 2006, pp. 25–71.
[14]
L. Calvet, A. Ferrer, M.I. Gomes, A.A. Juan, D. Masip, Combining statistical learning with metaheuristics for the Multi-Depot Vehicle Routing Problem with market segmentation, Computers & Industrial Engineering 94 (2016) 93–104,.
[15]
C. Chen, L. Geng, S. Zhou, Design and implementation of bank CRM system based on decision tree algorithm, Neural Computing and Applications 33 (14) (2021) 8237–8247,.
[16]
R.K. Dash, T.N. Nguyen, K. Cengiz, A. Sharma, Fine-tuned support vector regression model for stock predictions, Neural Computing and Applications (2021) 1–15.
[17]
Desgraupes, B. (2013). Clustering indices. University of Paris Ouest-Lab Modal’X, 1, 34.
[18]
V. Djurisic, L. Kascelan, S. Rogic, B. Melovic, Bank CRM optimization using predictive classification based on the support vector machine method, Applied Artificial Intelligence 34 (12) (2020) 941–955,.
[19]
O. Doğan, E. Ayçi̇n, Z.A. Bulut, Customer segmentation by using RFM model and clustering methods: A case study in retail industry, International Journal of Contemporary Economics and Administrative Sciences 8 (1) (2018) 1–19.
[20]
Y. Duan, G. Cao, J.S. Edwards, Understanding the impact of business analytics on innovation, European Journal of Operational Research (2018),.
[21]
A. Dursun, M. Caber, Using data mining techniques for profiling profitable hotel customers: An application of RFM analysis, Tourism Management Perspectives 18 (2016) 153–160,.
[22]
D. Eni, Seasonal ARIMA modeling and forecasting of rainfall in Warri Town, Nigeria, Journal of Geoscience and Environment Protection 3 (06) (2015) 91.
[23]
V. Guerola-Navarro, R. Oltra-Badenes, H. Gil-Gomez, J.-A. Gil-Gomez, Customer relationship management (CRM): A bibliometric analysis, International Journal of Services Operations and Informatics 10 (3) (2020) 242–268.
[24]
J. Han, M. Kamber, J. Pei, Data mining: Concepts and techniques, Elsevier Science, 2011.
[25]
X. He, D. Cai, P. Niyogi, Laplacian score for feature selection, Advances in Neural Information Processing Systems 18 (2005) 507–514.
[26]
A. Hughes, Strategic Database Marketing: The Masterplan for Starting and Managing a Profitable, Customer-Based Marketing Program, 4 ed., McGraw-Hill Companies, Incorporated, United State of America, 2011.
[27]
Hyndman, R. J., Wang, E., & Laptev, N. (2015, 14-17 Nov. 2015). Large-Scale Unusual Time Series Detection. Paper presented at the 2015 IEEE International Conference on Data Mining Workshop (ICDMW).
[28]
Kalpakis, K., Gada, D., & Puttagunta, V. (2001). Distance measures for effective clustering of ARIMA time-series. Paper presented at the Data Mining, 2001. ICDM 2001, Proceedings IEEE International Conference on.
[29]
Y. Kang, R.J. Hyndman, K. Smith-Miles, Visualising forecasting algorithm performance using time series instance spaces, International Journal of forecasting 33 (2) (2017) 345–358.
[30]
H. Khobzi, E. Akhondzadeh-Noughabi, B. Minaei-Bidgoli, A new application of RFM clustering for guild segmentation to mine the pattern of using banks’ e-payment services, Journal of Global Marketing 27 (3) (2014) 178–190,.
[31]
T. Kohonen, Essentials of the self-organizing map, Neural Networks 37 (2013) 52–65.
[32]
Y. Li, J. Yang, Y. Wang, Time series prediction method for continuous spares requirement based on recursive decomposition of information compression matrix, Soft Computing (2022) 1–9.
[33]
F. Martínez, M.P. Frías, M.D. Pérez-Godoy, A.J. Rivera, Dealing with seasonality by narrowing the training set in time series forecasting with kNN, Expert Systems with Applications 103 (2018) 38–48.
[34]
F. Martínez, M.P. Frías, M.D. Pérez, A.J. Rivera, A methodology for applying k-nearest neighbor to time series forecasting, Artificial Intelligence Review 52 (2019) 2019–2037,.
[35]
Momin, S., Bohra, T., & Raut, P. (2020). Prediction of Customer Churn Using Machine Learning, Cham.
[36]
A. Mosaddegh, A. Albadvi, M.M. Sepehri, B. Teimourpour, Dynamics of customer segments: A predictor of customer lifetime value, Expert Systems with Applications 172 (2021),.
[37]
P.W. Murray, B. Agard, M.A. Barajas, Market segmentation through data mining: A method to extract behaviors from a noisy data set, Computers & Industrial Engineering 109 (2017) 233–252,.
[38]
J. Nayak, B. Naik, H. Behera, Fuzzy C-means (FCM) clustering algorithm: A decade review from 2000 to 2014, Computational intelligence in data mining-volume 2 (2015) 133–149.
[39]
H.-S. Park, C.-H. Jun, A simple and fast algorithm for K-medoids clustering, Expert Systems with Applications 36 (2) (2009) 3336–3341.
[40]
A. Parvaneh, M. Tarokh, H. Abbasimehr, Combining data mining and group decision making in retailer segmentation based on LRFMP variables, International Journal of Industrial Engineering & Production Research 25 (3) (2014) 197–206.
[41]
A.V. Phan, M.L. Nguyen, L.T. Bui, Feature weighting and SVM parameters optimization based on genetic algorithms for classification problems, Applied Intelligence 46 (2) (2017) 455–469.
[42]
S.A. Priyambada, M. Er, B.N. Yahya, T. Usagawa, Profile-based cluster evolution analysis: Identification of migration patterns for understanding student learning behavior, IEEE Access 9 (2021) 101718–101728,.
[43]
R. Ramon-Gonen, R. Gelbard, Cluster evolution analysis: Identification and detection of similar clusters and migration patterns, Expert Systems with Applications 83 (2017) 363–378,.
[44]
M.B. Richman, T. Trafalis, I. Adrianto, Multiple imputation through machine learning algorithms, Paper presented at the Fifth conference on artificial intelligence applications to environmental science, 2007.
[45]
Saas, A., Guitart, A., & Á, P. (2016, 20-23 Sept. 2016). Discovering playing patterns: Time series clustering of free-to-play game data. Paper presented at the 2016 IEEE Conference on Computational Intelligence and Games (CIG).
[46]
S. Saremi, S. Mirjalili, A. Lewis, Grasshopper Optimisation Algorithm: Theory and application, Advances in Engineering Software 105 (2017) 30–47,.
[47]
A. Seret, S.K. vanden Broucke, B. Baesens, J. Vanthienen, A dynamic understanding of customer behavior processes based on clustering and sequence mining, Expert Systems with Applications 41 (10) (2014) 4648–4657.
[48]
Sharma, T., Gupta, P., Nigam, V., & Goel, M. (2020). Customer Churn Prediction in Telecommunications Using Gradient Boosted Trees, Singapore.
[49]
N. Singh, P. Singh, M. Gupta, An inclusive survey on machine learning for CRM: A paradigm shift, DECISION 47 (4) (2020) 447–457,.
[50]
M. Song, X. Zhao, E. Hailhong, Z. Ou, Statistics-based CRM approach via time series segmenting RFM on large scale data, Knowledge-Based Systems 132 (2017) 21–29,.
[51]
Syaputra, A., Zulkarnain, & Laoh, E. (2020, 19-20 Nov. 2020). Customer Segmentation on Returned Product Customers Using Time Series Clustering Analysis. Paper presented at the 2020 International Conference on ICT for Smart Society (ICISS).
[52]
X. Wang, K. Smith, R. Hyndman, Characteristic-based clustering for time series data, Data Mining and Knowledge Discovery 13 (3) (2006) 335–364,.
[53]
J.T. Wei, S.-Y. Lin, Y.-Z. Yang, H.-H. Wu, The application of data mining and RFM model in market segmentation of a veterinary hospital, Journal of Statistics and Management Systems (2019) 1–17.
[54]
X.-S. Yang, Nature-inspired metaheuristic algorithms, Luniver Press, 2008.
[55]
Zhou, Y., Hu, Z., & Liu, Y. (2021, 28-29 Jan. 2021). Analyzing User Behavior Patterns in Casual Games Using Time Series Clustering. Paper presented at the 2021 2nd International Conference on Computing and Data Science (CDS).

Index Terms

  1. A novel time series clustering method with fine-tuned support vector regression for customer behavior analysis
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image Expert Systems with Applications: An International Journal
        Expert Systems with Applications: An International Journal  Volume 204, Issue C
        Oct 2022
        1098 pages

        Publisher

        Pergamon Press, Inc.

        United States

        Publication History

        Published: 15 October 2022

        Author Tags

        1. Customer relationship management (CRM)
        2. Customer behavior
        3. Feature-based time series clustering
        4. Time series forecasting
        5. Support vector regression

        Qualifiers

        • Research-article

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • 0
          Total Citations
        • 0
          Total Downloads
        • Downloads (Last 12 months)0
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 15 Feb 2025

        Other Metrics

        Citations

        View Options

        View options

        Figures

        Tables

        Media

        Share

        Share

        Share this Publication link

        Share on social media