Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Improved Customer Lifetime Value Prediction With Sequence-To-Sequence Learning and Feature-Based Models

Published: 10 May 2021 Publication History

Abstract

The prediction of the Customer Lifetime Value (CLV) is an important asset for tool-supported marketing by customer relationship managers. Since standard methods based on purchase recency, frequency, and past profit and revenue statistics often have limited predictive power, advanced machine learning (ML) techniques were applied to this task in recent years. However, existing approaches are often not fully capable of modeling certain temporal patterns that can be commonly found in practice, such as periodic purchasing behavior of customers. To address these shortcomings, we propose a novel method for CLV prediction based on a combination of several ML techniques. At its core, our method consists of a tailored deep learning approach based on encoder–decoder sequence-to-sequence recurrent neural networks with augmented temporal convolutions. This model is then combined with gradient boosting machines (GBMs) and a set of novel features in a hybrid framework. Empirical evaluations based on real-world data from a larger e-commerce company and a public dataset from the domain of online retail show that already the sequence-based model leads to competitive performance results. Stacking it with the GBM model is synergistic and further improves accuracy, indicating that the two models capture different patterns in the data.

References

[1]
Md Al-Amin, Md Saiful Islam, and Shapan Das Uzzal. 2017. Sentiment analysis of Bengali comments with Word2Vec and sentiment information of words. In 2017 International Conference on Electrical, Computer and Communication Engineering (ECCE). IEEE, 186--190.
[2]
Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2014. Neural machine translation by jointly learning to align and translate. arXiv:1409.0473 (2014).
[3]
Jan Beyersmann, Arthur Allignol, and Martin Schumacher. 2011. Competing Risks And Multistate Models With R. Springer Science & Business Media.
[4]
Derya Birant. 2011. Data mining using RFM analysis. In Knowledge-Oriented Applications in Data Mining. InTech.
[5]
Robert C. Blattberg, Byung-Do Kim, and Scott A. Neslin. 2008. Database Marketing – Analyzing and Managing Customers. Springer.
[6]
Sharad Borle, Siddharth S. Singh, and Dipak C. Jain. 2008. Customer lifetime value measurement. Management Science 54, 1 (2008), 100--112.
[7]
Léon Bottou. 2012. Stochastic gradient descent tricks. In Neural Networks: Tricks of the Trade. Springer, 421--436.
[8]
Peter J. Brockwell and Richard A. Davis. 2016. Introduction to Time Series and Forecasting (3 ed.). Springer.
[9]
Jan Roelf Bult and Tom Wansbeek. 1995. Optimal selection for direct mail. Marketing Science 14, 4 (1995), 378--394.
[10]
Benjamin Paul Chamberlain, Angelo Cardoso, Chak H. Liu, Roberto Pagliari, and Marc Peter Deisenroth. 2017. Customer lifetime value prediction using embeddings. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 1753--1762.
[11]
Daqing Chen, Sai Laing Sain, and Kun Guo. 2012. Data mining for the online retail industry: A case study of RFM model-based customer segmentation using data mining. Journal of Database Marketing & Customer Strategy Management 19, 3 (2012), 197--208.
[12]
Tianqi Chen and Carlos Guestrin. 2016. XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’16). ACM, New York, NY, USA, 785--794.
[13]
Kyunghyun Cho, Bart Van Merriënboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. 2014. Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv:1406.1078 (2014).
[14]
Junyoung Chung, Caglar Gulcehre, KyungHyun Cho, and Yoshua Bengio. 2014. Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv:1412.3555 (2014).
[15]
Peter S. Fader, Bruce G. S. Hardie, and Ka Lok Lee. 2005. “Counting your customers” the easy way: An alternative to the Pareto/NBD model. Marketing Science 24, 2 (2005), 275--284.
[16]
Peter S. Fader, Bruce G. S. Hardie, and Ka Lok Lee. 2005. RFM and CLV: Using iso-value curves for customer base analysis. Journal of Marketing Research 42, 4 (2005), 415--430.
[17]
Ian Goodfellow, Yoshua Bengio, Aaron Courville, and Yoshua Bengio. 2016. Deep Learning. Vol. 1. MIT Press Cambridge.
[18]
Alex Graves, Greg Wayne, and Ivo Danihelka. 2014. Neural turing machines. arXiv:1410.5401 (2014).
[19]
Riccardo Guidotti, Michele Coscia, Dino Pedreschi, and Diego Pennacchioli. 2015. Behavioral entropy and profitability in retail. In 2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA). 1--10.
[20]
R. Guidotti, G. Rossetti, L. Pappalardo, F. Giannotti, and D. Pedreschi. 2019. Personalized market basket prediction with temporal annotated recurring sequences. IEEE Transactions on Knowledge and Data Engineering 31, 11 (2019), 2151--2163.
[21]
Sunil Gupta, Dominique Hanssens, Bruce Hardie, Wiliam Kahn, V. Kumar, Nathaniel Lin, Nalini Ravishanker, and S. Sriram. 2006. Modeling customer lifetime value. Journal of Service Research 9, 2 (2006), 139--155.
[22]
Sunil Gupta and Valarie Zeithaml. 2006. Customer metrics and their impact on financial performance. Marketing Science 25, 6 (2006), 718--739.
[23]
Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural Computation 9, 8 (1997), 1735--1780.
[24]
Chun-Yao Huang. 2012. To model, or not to model: Forecasting for customer prioritization. International Journal of Forecasting 28, 2 (2012), 497--506.
[25]
Dipak Jain and Siddhartha S. Singh. 2002. Customer lifetime value research in marketing: A review and future directions. Journal of Interactive Marketing 16, 2 (2002), 34--46.
[26]
Mahboubeh Khajvand, Kiyana Zolfaghar, Sarah Ashoori, and Somayeh Alizadeh. 2011. Estimating customer lifetime value based on RFM analysis of customer purchase behavior: Case study. Procedia Computer Science 3 (2011), 57--63.
[27]
Gary Koop and Dimitris Korobilis. 2010. Bayesian multivariate time series methods for empirical macroeconomics. Foundations and Trends in Econometrics 3, 4 (2010), 267--358.
[28]
Vineet Kumar and Werner Reinartz. 2018. Customer Relationship Management: Concept, Strategy, and Tools. Springer.
[29]
Richard Lewis and Gregory C. Reinsel. 1985. Prediction of multivariate time series by autoregressive model fitting. Journal of Multivariate Analysis 16, 3 (1985), 393--411.
[30]
Xiao-Hui Li, Caleb Chen Cao, Yuhan Shi, Wei Bai, Han Gao, Luyu Qiu, Cong Wang, Yuanyuan Gao, Shenjia Zhang, Xun Xue, et al. 2020. A survey of data-driven and knowledge-aware eXplainable AI. IEEE Transactions on Knowledge and Data Engineering (2020).
[31]
Kenneth S. Lorek and G. Lee Willinger. 1996. A multivariate time-series prediction model for cash-flow data. The Accounting Review 71, 1 (1996), 81--102.
[32]
Scott M. Lundberg and Su-In Lee. 2017. A unified approach to interpreting model predictions. In Advances in Neural Information Processing Systems. 4765--4774.
[33]
Spyros Makridakis, Evangelos Spiliotis, and Vassilios Assimakopoulos. 2018. Statistical and machine learning forecasting methods: Concerns and ways forward. PLoS ONE 13, 3 (2018).
[34]
Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S. Corrado, and Jeff Dean. 2013. Distributed representations of words and phrases and their compositionality. In Advances in Neural Information Processing Systems. 3111--3119.
[35]
Phillip E. Pfeifer and Robert L. Carraway. 2000. Modeling customer relationships as Markov chains. Journal of Interactive Marketing 14, 2 (2000), 43.
[36]
David C. Schmittlein, Donald G. Morrison, and Richard Colombo. 1987. Counting your customers: Who-are they and what will they do next? Management Science 33, 1 (1987), 1--24.
[37]
Ilya Sutskever, Oriol Vinyals, and Quoc V. Le. 2014. Sequence to sequence learning with neural networks. In Advances in Neural Information Processing Systems. 3104--3112.
[38]
Ali Vanderveld, Addhyan Pandey, Angela Han, and Rajesh Parekh. 2016. An engagement-based customer lifetime value system for e-commerce. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 293--302.
[39]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Advances in Neural Information Processing Systems. 5998--6008.
[40]
Zhi-Hua Zhou and Ji Feng. 2017. Deep forest. arXiv:1702.08835 (2017).
[41]
Barret Zoph and Quoc V. Le. 2016. Neural architecture search with reinforcement learning. arXiv:1611.01578 (2016).

Cited By

View all
  • (2024)Skewness-aware Boosting Regression Trees for Customer Contribution Prediction in Financial Precision MarketingCompanion Proceedings of the ACM Web Conference 202410.1145/3589335.3648346(461-470)Online publication date: 13-May-2024
  • (2024)Customer lifetime value (CLV) insights for strategic marketing success and its impact on organizational financial performanceCogent Business & Management10.1080/23311975.2024.236132111:1Online publication date: 11-Jun-2024
  • (2024)Predicting e-commerce CLV with neural networks: The role of NPS, ATV, and CESJournal of Economy and Technology10.1016/j.ject.2024.04.0042(174-189)Online publication date: Nov-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Knowledge Discovery from Data
ACM Transactions on Knowledge Discovery from Data  Volume 15, Issue 5
October 2021
508 pages
ISSN:1556-4681
EISSN:1556-472X
DOI:10.1145/3461317
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 10 May 2021
Accepted: 01 December 2020
Revised: 01 November 2020
Received: 01 February 2019
Published in TKDD Volume 15, Issue 5

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Customer lifetime value
  2. machine learning
  3. neural networks

Qualifiers

  • Research-article
  • Research
  • Refereed

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)259
  • Downloads (Last 6 weeks)18
Reflects downloads up to 30 Aug 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Skewness-aware Boosting Regression Trees for Customer Contribution Prediction in Financial Precision MarketingCompanion Proceedings of the ACM Web Conference 202410.1145/3589335.3648346(461-470)Online publication date: 13-May-2024
  • (2024)Customer lifetime value (CLV) insights for strategic marketing success and its impact on organizational financial performanceCogent Business & Management10.1080/23311975.2024.236132111:1Online publication date: 11-Jun-2024
  • (2024)Predicting e-commerce CLV with neural networks: The role of NPS, ATV, and CESJournal of Economy and Technology10.1016/j.ject.2024.04.0042(174-189)Online publication date: Nov-2024
  • (2024)A Decade of Churn Prediction Techniques in the TelCo Domain: A SurveySN Computer Science10.1007/s42979-024-02722-75:4Online publication date: 6-Apr-2024
  • (2023)Customer Behavior Prediction using Deep Learning Techniques for Online Purchasing2023 2nd International Conference for Innovation in Technology (INOCON)10.1109/INOCON57975.2023.10101102(1-7)Online publication date: 3-Mar-2023
  • (2023)Effect of Low-Level Interaction Data in Repeat Purchase Prediction TaskInternational Journal of Human–Computer Interaction10.1080/10447318.2023.217597340:10(2515-2533)Online publication date: 17-Feb-2023
  • (2023)How to really quantify the economic value of customer information in corporate databasesHumanities and Social Sciences Communications10.1057/s41599-023-01654-610:1Online publication date: 13-Apr-2023
  • (2023)Interpretable Embeddings for Geographic Transactional Activity AnalysisProcedia Computer Science10.1016/j.procs.2023.12.038229(357-366)Online publication date: 2023
  • (2023)METO-S2S: A S2S based vessel trajectory prediction method with Multiple-semantic Encoder and Type-Oriented DecoderOcean Engineering10.1016/j.oceaneng.2023.114248277(114248)Online publication date: Jun-2023
  • (2023)A meta-learning based stacked regression approach for customer lifetime value predictionJournal of Economy and Technology10.1016/j.ject.2023.09.0011(197-207)Online publication date: Nov-2023
  • Show More Cited By

View Options

Get Access

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media