Abstract
Existing sales forecasting models are not comprehensive and flexible enough to consider dynamic changes and nonlinearities in sales time-series at the store and product levels. To capture different big data characteristics in sales forecasting data, such as seasonal and trend variations, this study develops a hybrid model combining adaptive trend estimated series (ATES) with a deep neural network model. ATES is first used to model seasonal effects and incorporate holiday, weekend, and marketing effects on sales. The deep neural network model is then proposed to model residuals by capturing complex high-level spatiotemporal features from the data. The proposed hybrid model is equipped with a feature-extraction component that automatically detects the patterns and trends in time-series, which makes the forecasting model robust against noise and time-series length. To validate the proposed hybrid model, a large volume of sales data is processed with a three-dimensional data model to effectively support business decisions at the product-specific store level. To demonstrate the effectiveness of the proposed model, a comparative analysis is performed with several state-of-the-art sales forecasting methods. Here, we show that the proposed hybrid model outperforms existing models for forecasting horizons ranging from one to 12 months.
Similar content being viewed by others
References
Ali, Ö. G., & Gürlek, R. (2020). Automatic interpretable retail forecasting with promotional scenarios. International Journal of Forecasting, 36(4), 1389–1406.
Ali, O. G., & Pinar, E. (2016). Multi-period-ahead forecasting with residual extrapolation and information sharing-utilizing a multitude of retail series. International Journal of Forecasting, 32(2), 502–517.
Arunraj, N. S., & Ahrens, D. (2015). A hybrid seasonal autoregressive integrated moving average and quantile regression for daily food sales forecasting. International Journal of Production Economics, 170, 321–335.
Berry, L. R., Helman, P., & West, M. (2020). Probabilistic forecasting of heterogeneous consumer transaction-sales time series. International Journal of Forecasting, 36(2), 552–569.
Bohanec, M., Borštnar, M. K., & Robnik-Šikonja, M. (2017). Explaining machine learning models in sales predictions. Expert Systems with Applications, 71, 416–428.
Boone, T., Ganeshan, R., Jain, A., et al. (2019). Forecasting sales in the supply chain: Consumer analytics in the big data era. International Journal of Forecasting, 35(1), 170–180.
Bose, R. (2009). Advanced analytics: Opportunities and challenges. Industrial Management & Data Systems, 109(2), 155–172.
Box, G. E., Jenkins, G. M., Reinsel, G. C., et al. (2015). Time series analysis: Forecasting and control. Wiley.
Caiado, J., Crato, N., & Poncela, P. (2020). A fragmented-periodogram approach for clustering big data time series. Advances in Data Analysis and Classification, 14(1), 117–146.
Chen, F., & Ou, T. (2011). Sales forecasting system based on gray extreme learning machine with Taguchi method in retail industry. Expert Systems with Applications, 38(3), 1336–1345.
Chen, I. F., & Lu, C. J. (2017). Sales forecasting by combining clustering and machine-learning techniques for computer retailing. Neural Computing and Applications, 28(9), 2633–2647.
Choi, T. M., Hui, C. L., Ng, S. F., et al. (2011). Color trend forecasting of fashionable products with very few historical data. IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), 42(6), 1003–1010.
Chu, C. W., & Zhang, G. P. (2003). A comparative study of linear and nonlinear models for aggregate retail sales forecasting. International Journal of Production Economics, 86(3), 217–231.
Chu, T. H., Nguyen, Q. U., & Cao, V. L. (2018) Semantics based substituting technique for reducing code bloat in genetic programming. In Proceedings of the ninth international symposium on information and communication technology (pp. 77–83).
Ding, J., Chen, Z., Xiaolong, L., & Lai, B. (2020) Sales forecasting based on catboost. In 2020 2nd international conference on information technology and computer application (ITCA) (pp. 636–639). IEEE.
Disney, S. M., Ponte, B., & Wang, X. (2021). Exploring the nonlinear dynamics of the lost-sales order-up-to policy. International Journal of Production Research, 59(19), 5809–5830.
do Nascimento Camelo, H., Lucio, P. S., Junior, J. B. V. L., et al. (2018). Innovative hybrid models for forecasting time series applied in wind generation based on the combination of time series models with artificial neural networks. Energy, 151, 347–357.
Eachempati, P., Srivastava, P. R., Kumar, A., et al. (2022). Can customer sentiment impact firm value? An integrated text mining approach. Technological Forecasting and Social Change, 174(121), 265.
Efat, M. I. A., Bashar, R., Uddin, K. I., & Bhuiyan, T. (2018) Trend estimation of stock market: An intelligent decision system. In International conference on cyber security and computer science (ICONCS’18) (pp. 44–49).
Ferreira, S. L., Caires, A. O., Borges, Td. S., et al. (2017). Robustness evaluation in analytical methods optimized using experimental designs. Microchemical Journal, 131, 163–169.
Flores, B. E. (1989). The utilization of the Wilcoxon test to compare forecasting methods: A note. International Journal of Forecasting, 5(4), 529–535.
Gahirwal, M, (2013) Inter time series sales forecasting. arXiv preprint arXiv:1303.0117
Ganesan, V. A., Divi, S., Moudhgalya, N. B., Sriharsha, U., & Vijayaraghavan, V. (2019) Forecasting food sales in a multiplex using dynamic artificial neural networks. In Science and information conference (pp. 69–80). Springer.
Gelper, S., Fried, R., & Croux, C. (2010). Robust forecasting with exponential and Holt–Winters smoothing. Journal of Forecasting, 29(3), 285–300.
Harsoor, A. S., & Patil, A. (2015). Forecast of sales of Walmart store using big data applications. International Journal of Research in Engineering and Technology, 4(6), 51–59.
Huang, T., Fildes, R., & Soopramanien, D. (2019). Forecasting retailer product sales in the presence of structural change. European Journal of Operational Research, 279(2), 459–470.
Iwok, I. A. (2016). Seasonal modelling of Fourier series with linear trend. International Journal of Statistics and Probability, 5(6), 65–72.
Jha, A., Ray, S., Seaman, B., & Dhillon, I. S. (2015) Clustering to forecast sparse time-series data. In 2015 IEEE 31st international conference on data engineering (pp. 1388–1399). IEEE.
Ji, S., Wang, X., Zhao, W., & Guo, D. (2019). An application of a three-stage xgboost-based model to sales forecasting of a cross-border e-commerce enterprise. Mathematical Problems in Engineering. https://doi.org/10.1155/2019/8503252
Jiménez, F., Sánchez, G., García, J. M., et al. (2017). Multi-objective evolutionary feature selection for online sales forecasting. Neurocomputing, 234, 75–92.
Kaggle (2018) Corporación favorita grocery sales forecasting. Retrieved February 3, 2020, from https://www.kaggle.com/c/favorita-grocery-sales-forecasting
Kechyn, G., Yu, L., Zang, Y., & Kechyn, S. (2018) Sales forecasting using WaveNet within the framework of the Kaggle competition. arXiv preprint arXiv:1803.04037
Kharfan, M., Chan, V. W. K., & Firdolas Efendigil, T. (2021). A data-driven forecasting approach for newly launched seasonal products by leveraging machine-learning approaches. Annals of Operations Research, 303(1), 159–174.
Kim, T. Y., & Cho, S. B. (2019). Predicting residential energy consumption using CNN–LSTM neural networks. Energy, 182, 72–81.
Klimberg, R., & Ratick, S. (2000) A new measure of relative forecast error. In INFORMS fall meeting
Kolassa, S. (2016). Evaluating predictive count data distributions in retail sales forecasting. International Journal of Forecasting, 32(3), 788–803.
Kraus, M., Feuerriegel, S., & Oztekin, A. (2020). Deep learning in business analytics and operations research: Models, applications and managerial implications. European Journal of Operational Research, 281(3), 628–641.
Kuleshov, V., Fenner, N., & Ermon, S. (2018) Accurate uncertainties for deep learning using calibrated regression. In International conference on machine learning, PMLR (pp. 2796–2804).
Kumar, A., Shankar, R., & Aljohani, N. R. (2020). A big data driven framework for demand-driven forecasting with effects of marketing-mix variables. Industrial Marketing Management, 90, 493–507.
Li, C., Cheang, B., Luo, Z., et al. (2021). An exponential factorization machine with percentage error minimization to retail sales forecasting. ACM Transactions on Knowledge Discovery from Data (TKDD), 15(2), 1–32.
Li, C., & Lim, A. (2018). A greedy aggregation-decomposition method for intermittent demand forecasting in fashion retailing. European Journal of Operational Research, 269(3), 860–869.
Li, M., Ji, S., & Liu, G. (2018). Forecasting of Chinese e-commerce sales: an empirical comparison of Arima, nonlinear autoregressive neural network, and a combined ARIMA–NARNN model. Mathematical Problems in Engineering, 2018, 1–12.
Liang, Y., Wu, J., Wang, W., Cao, Y., Zhong, B., Chen, Z., & Li, Z. (2019) Product marketing prediction based on xgboost and lightGBM algorithm. In Proceedings of the 2nd international conference on artificial intelligence and pattern recognition (pp. 150–153).
Lim, B., Arık, S. Ö., Loeff, N., et al. (2021). Temporal fusion transformers for interpretable multi-horizon time series forecasting. International Journal of Forecasting, 37(4), 1748–1764.
Liu, N., Ren, S., Choi, T. M., et al. (2013). Sales forecasting for fashion retailing service industry: A review. Mathematical Problems in Engineering, 738, 675.
Loureiro, A. L., Miguéis, V. L., & da Silva, L. F. (2018). Exploring the use of deep neural networks for sales forecasting in fashion retail. Decision Support Systems, 114, 81–93.
Lu, C. J., Lee, T. S., & Lian, C. M. (2012). Sales forecasting for computer wholesalers: A comparison of multivariate adaptive regression splines and artificial neural networks. Decision Support Systems, 54(1), 584–596.
Ma, S., & Fildes, R. (2021). Retail sales forecasting with meta-learning. European Journal of Operational Research, 288(1), 111–128.
Ma, S., Fildes, R., & Huang, T. (2016). Demand forecasting with high dimensional data: The case of SKU retail sales forecasting with intra-and inter-category promotional information. European Journal of Operational Research, 249(1), 245–257.
Misiorek, A., Trueck, S., & Weron, R. (2006). Point and interval forecasting of spot electricity prices: Linear vs non-linear time series models. Studies in Nonlinear Dynamics & Econometrics. https://doi.org/10.2202/1558-3708.1362
Navratil, M., & Kolkova, A. (2019). Decomposition and forecasting time series in business economy using prophet forecasting model. Central European Business Review, 8(4), 26–39.
Noh, J., Park, H. J., Kim, J. S., et al. (2020). Gated recurrent unit with genetic algorithm for product demand forecasting in supply chain management. Mathematics, 8(4), 565.
Pan, H., Zhou, H., et al. (2020). Study on convolutional neural network and its application in data mining and sales forecasting for e-commerce. Electronic Commerce Research, 20(2), 297–320.
Paria, B., Sen, R., Ahmed, A., & Das, A. (2022) Hierarchically regularized deep forecasting. arXiv preprint arXiv:2106.07630
Pavlyshenko, B. (2018) Using stacking approaches for machine learning models. In 2018 IEEE second international conference on data stream mining & processing (DSMP) (pp. 255–258). IEEE.
Pavlyshenko, B. M. (2016) Linear, machine learning and probabilistic approaches for time series analysis. In 2016 IEEE first international conference on data stream mining & processing (DSMP) (pp. 377–381). IEEE.
Pavlyshenko, B. M. (2019). Machine-learning models for sales time series forecasting. Data, 4(1), 15.
Proietti, T., & Lütkepohl, H. (2013). Does the Box–Cox transformation help in forecasting macroeconomic time series? International Journal of Forecasting, 29(1), 88–99.
Ramos, P., Santos, N., & Rebelo, R. (2015). Performance of state space and Arima models for consumer retail sales forecasting. Robotics and Computer-Integrated Manufacturing, 34, 151–163.
Ren, S., Chan, H. L., & Siqin, T. (2020). Demand forecasting in retail operations for fashionable products: Methods, practices, and real case study. Annals of Operations Research, 291(1), 761–777.
Sagaert, Y. R., Aghezzaf, E. H., Kourentzes, N., & Desmet, B. (2018a). Tactical sales forecasting using a very large set of macroeconomic indicators. European Journal of Operational Research, 264(2), 558–569.
Sagaert, Y. R., Aghezzaf, E. H., Kourentzes, N., & Desmet, B. (2018b). Temporal big data for tire industry tactical sales forecasting. Interfaces, 48(2), 121–129.
Škare, M., & Porada-Rochoń, M. (2020). Forecasting financial cycles: Can big data help? Technological and Economic Development of Economy, 26(5), 974–988.
Smyl, S. (2020). A hybrid method of exponential smoothing and recurrent neural networks for time series forecasting. International Journal of Forecasting, 36(1), 75–85.
Sprangers, O., Schelter, S., & de Rijke, M. (2022). Parameter-efficient deep probabilistic forecasting. International Journal of Forecasting. https://doi.org/10.1016/j.ijforecast.2021.11.011
Sun, Z. L., Choi, T. M., Au, K. F., et al. (2008). Sales forecasting using extreme learning machine with applications in fashion retailing. Decision Support Systems, 46(1), 411–419.
Taylor, J. W. (2010). Exponentially weighted methods for forecasting intraday time series with multiple seasonal cycles. International Journal of Forecasting, 26(4), 627–646.
Taylor, J. W. (2011). Multi-item sales forecasting with total and split exponential smoothing. Journal of the Operational Research Society, 62(3), 555–563.
Taylor, S. J., & Letham, B. (2018). Forecasting at scale. The American Statistician, 72(1), 37–45.
Tehrani, A. F., & Ahrens, D. (2016) Improved forecasting and purchasing of fashion products based on the use of big data techniques. In Supply management research (pp. 293–312). Springer.
Teunter, R. H., Syntetos, A. A., & Babai, M. Z. (2011). Intermittent demand: Linking forecasting to inventory obsolescence. European Journal of Operational Research, 214(3), 606–615.
Thomassey, S. (2010). Sales forecasts in clothing industry: The key success factor of the supply chain management. International Journal of Production Economics, 128(2), 470–483.
Ulrich, M., Jahnke, H., Langrock, R., et al. (2021). Distributional regression for demand forecasting in e-grocery. European Journal of Operational Research, 294(3), 831–842.
Vairagade, N., Logofatu, D., Leon, F., & Muharemi, F. (2019) Demand forecasting using random forest and artificial neural network for supply chain management. In International conference on computational collective intelligence (pp. 328–339). Springer.
von Sachs, R. (2020). Nonparametric spectral analysis of multivariate time series. Annual Review of Statistics and Its Application, 7, 361–386.
Wang, X., Smith, K., & Hyndman, R. (2006). Characteristic-based clustering for time series data. Data Mining and Knowledge Discovery, 13(3), 335–364.
Wang, X., Smith-Miles, K., & Hyndman, R. (2009). Rule induction for forecasting method selection: Meta-learning the characteristics of univariate time series. Neurocomputing, 72(10–12), 2581–2594.
Weng, T., Liu, W., & Xiao, J. (2019). Supply chain sales forecasting based on lightGBM and LSTM combination model. Industrial Management & Data Systems, 120(2), 265–279.
Wong, W., & Guo, Z. (2010). A hybrid intelligent model for medium-term sales forecasting in fashion retail supply chains using extreme learning machine and harmony search algorithm. International Journal of Production Economics, 128(2), 614–624.
Wu, L., Kong, C., Hao, X., et al. (2020). A short-term load forecasting method based on GRU–CNN hybrid neural network model. Mathematical Problems in Engineering, 1428, 104.
Xia, M., & Wong, W. K. (2014). A seasonal discrete grey forecasting model for fashion retailing. Knowledge-Based Systems, 57, 119–126.
Zhang, G. P. (2003). Time series forecasting using a hybrid Arima and neural network model. Neurocomputing, 50, 159–175.
Zhang, G. P., & Qi, M. (2005). Neural network forecasting for seasonal and trend time series. European Journal of Operational Research, 160(2), 501–514.
Zhang, Q., Yang, L. T., Chen, Z., et al. (2018). A survey on deep learning for big data. Information Fusion, 42, 146–157.
Zhao, K., & Wang, C. (2017) Sales forecast in e-commerce using convolutional neural network. arXiv preprint arXiv:1708.07946
Acknowledgements
This article was directed by Software Evaluation and Re-Engineering Research Lab (SERER Lab) and supported by the scientific research project of the Czech Sciences Foundation Grant No. 19-15498S.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix 1: Forecasting error performance for the Walmart sales dataset
Appendix 1: Forecasting error performance for the Walmart sales dataset
Statistical time-series methods | ||||||
---|---|---|---|---|---|---|
Time dimension | ARIMA | HWES | PROPHET | SES | SARIMA | ATES |
RMSE | ||||||
One month | 0.1072 | 0.1630 | 0.2135 | 0.1678 | 0.1147 | 0.0824 |
One quarter | 0.2833 | 0.3345 | 0.3342 | 0.8877 | 0.2567 | 0.2100 |
Half year | 0.3976 | 0.5782 | 0.4123 | 1.2340 | 0.4123 | 0.3321 |
One year | 0.7560 | 0.8876 | 0.5678 | 1.3456 | 0.6318 | 0.4567 |
MAPE | ||||||
One month | 14.58% | 18.96% | 23.78% | 17.90% | 13.59% | 8.62% |
One quarter | 18.56% | 24.80% | 25.23% | 28.31% | 21.93% | 17.24% |
Half year | 26.79% | 31.21% | 28.35% | 47.10% | 25.02% | 23.77% |
One year | 41.35% | 44.87% | 31.41% | 55.42% | 32.28% | 27.66% |
PFE | ||||||
One Month | 4.66% | 4.99% | 5.67% | 4.96% | 4.92% | 4.23% |
One quarter | 8.56% | 8.92% | 9.22% | 10.12% | 8.26% | 7.38% |
Half year | 11.23% | 11.76% | 11.22% | 16.78% | 11.42% | 10.26% |
One year | 13.44% | 13.77% | 13.23% | 20.03% | 13.18% | 11.23% |
Neural network models | |||||||
---|---|---|---|---|---|---|---|
Time dimension | ANN | CNN | DNN | LSTM | GRU | CNN + LSTM | GRU–CNN–LSTM |
RMSE | |||||||
One month | 0.2105 | 0.1194 | 0.1044 | 0.1223 | 0.1072 | 0.1361 | 0.0918 |
One quarter | 0.3012 | 0.1988 | 0.2109 | 0.2467 | 0.2234 | 0.2123 | 0.1876 |
Half year | 0.3987 | 0.3184 | 0.2908 | 0.3123 | 0.2987 | 0.2911 | 0.2843 |
One year | 0.6881 | 0.4765 | 0.4908 | 0.5467 | 0.4654 | 0.4678 | 0.4123 |
MAPE | |||||||
One month | 24.58% | 15.11% | 13.23% | 13.79% | 14.17% | 14.09% | 12.11% |
One quarter | 30.21% | 24.45% | 23.78% | 21.90% | 19.54% | 20.10% | 17.00% |
Half year | 41.89% | 31.22% | 29.09% | 33.89% | 28.22% | 29.21% | 23.22% |
One year | 49.56% | 41.22% | 39.54% | 40.22% | 38.78% | 39.22% | 31.87% |
PFE | |||||||
One month | 4.90% | 4.60% | 4.39% | 4.12% | 4.34% | 4.31% | 4.01% |
One quarter | 8.23% | 8.37% | 9.01% | 8.47% | 8.21% | 8.01% | 7.64% |
Half year | 11.23% | 10.89% | 10.18% | 10.23% | 10.01% | 11.21% | 9.87% |
One year | 13.78% | 13.58% | 13.15% | 13.45% | 12.69% | 12.27% | 11.78% |
Hybrid time-series/neural network models | ||||
---|---|---|---|---|
Time dimension | ATES + CNN | ATES + DNN | ATES + GRU | ATES + GRU–CNN–LSTM |
RMSE | ||||
One month | 0.1241 | 0.1356 | 0.1230 | 0.0851 |
One quarter | 0.1723 | 0.1733 | 0.2145 | 0.1409 |
Half year | 0.2562 | 0.2312 | 0.2876 | 0.2133 |
One year | 0.3578 | 0.3289 | 0.3512 | 0.2809 |
MAPE | ||||
One month | 16.12% | 15.32% | 14.93% | 12.43% |
One quarter | 18.21% | 18.81% | 18.99% | 16.22% |
Half year | 24.34% | 26.22% | 26.44% | 22.65% |
One year | 30.18% | 31.45% | 30.70% | 28.11% |
PFE | ||||
One month | 4.32% | 4.12% | 4.22% | 2.43% |
One quarter | 7.88% | 7.78% | 7.65% | 5.38% |
Half year | 10.83% | 10.67% | 10.27% | 8.88% |
One year | 13.32% | 12.51% | 13.01% | 11.32% |
Rights and permissions
About this article
Cite this article
Efat, M.I.A., Hajek, P., Abedin, M.Z. et al. Deep-learning model using hybrid adaptive trend estimated series for modelling and forecasting sales. Ann Oper Res 339, 297–328 (2024). https://doi.org/10.1007/s10479-022-04838-6
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10479-022-04838-6