Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3459637.3481942acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article
Open access

Online Multi-horizon Transaction Metric Estimation with Multi-modal Learning in Payment Networks

Published: 30 October 2021 Publication History
  • Get Citation Alerts
  • Abstract

    Predicting metrics associated with entities' transnational behavior within payment processing networks is essential for system monitoring. Multivariate time series, aggregated from the past transaction history, can provide valuable insights for such prediction. The general multivariate time series prediction problem has been well studied and applied across several domains, including manufacturing, medical, and entomology. However, new domain-related challenges associated with the data such as concept drift and multi-modality have surfaced in addition to the real-time requirements of handling the payment transaction data at scale. In this work, we study the problem of multivariate time series prediction for estimating transaction metrics associated with entities in the payment transaction database. We propose a model with five unique components to estimate the transaction metrics from multi-modality data. Four of these components capture interaction, temporal, scale, and shape perspectives, and the fifth component fuses these perspectives together. We also propose a hybrid offline/online training scheme to address concept drift in the data and fulfill the real-time requirements. Combining the estimation model with a graphical user interface, the prototype transaction metric estimation system has demonstrated its potential benefit as a tool for improving a payment processing company's system monitoring capability.

    Supplementary Material

    MP4 File (CIKM21-afp2132.mp4)
    Presentation video

    References

    [1]
    Shaojie Bai, J Zico Kolter, and Vladlen Koltun. 2018. An empirical evaluation of generic convolutional and recurrent networks for sequence modeling. arXiv preprint arXiv:1803.01271 (2018).
    [2]
    Albert Bifet and Ricard Gavalda. 2007. Learning from time-changing data with adaptive windowing. In Proceedings of the 2007 SIAM international conference on data mining. SIAM, 443--448.
    [3]
    George EP Box, Gwilym M Jenkins, Gregory C Reinsel, and Greta M Ljung. 2015. Time series analysis: forecasting and control. John Wiley & Sons.
    [4]
    Kyunghyun Cho, Bart Van Merriënboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. 2014. Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078 (2014).
    [5]
    Jan G De Gooijer and Rob J Hyndman. 2006. 25 years of time series forecasting. International journal of forecasting, Vol. 22, 3 (2006), 443--473.
    [6]
    Christos Faloutsos, Valentin Flunkert, Jan Gasthaus, Tim Januschowski, and Yuyang Wang. 2019. Forecasting Big Time Series: Theory and Practice. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. ACM, 3209--3210.
    [7]
    Chenyou Fan, Yuze Zhang, Yi Pan, Xiaoyue Li, Chi Zhang, Rong Yuan, Di Wu, Wensheng Wang, Jian Pei, and Heng Huang. 2019. Multi-horizon time series forecasting with temporal attention learning. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 2527--2535.
    [8]
    Jo ao Gama, Indr.e vZ liobait.e, Albert Bifet, Mykola Pechenizkiy, and Abdelhamid Bouchachia. 2014. A survey on concept drift adaptation. ACM computing surveys (CSUR), Vol. 46, 4 (2014), 1--37.
    [9]
    Gharghabi et al. 2017. Matrix profile VIII: domain agnostic online semantic segmentation at superhuman performance levels. In 2017 IEEE International Conference on Data Mining (ICDM). IEEE, 117--126.
    [10]
    Gharghabi et al. 2019. Domain agnostic online semantic segmentation for multi-dimensional time series. Data mining and knowledge discovery, Vol. 33, 1 (2019), 96--130.
    [11]
    Steven CH Hoi, Doyen Sahoo, Jing Lu, and Peilin Zhao. 2018. Online learning: A comprehensive survey. arXiv preprint arXiv:1802.02871(2018).
    [12]
    Xiaowei Jia, Ankush Khandelwal, Guruprasad Nayak, James Gerber, Kimberly Carlson, Paul West, and Vipin Kumar. 2017. Incremental dual-memory lstm in land cover prediction. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 867--876.
    [13]
    Girish Kapur, Bharat Prasad, and Nathan Nickels. 2020. Solving Merchant Attrition using Machine Learning. https://medium.com/@ODSC/solving-merchant-attrition-using-machine-learning-989e41e81b0e.
    [14]
    Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
    [15]
    Bartosz Krawczyk, Leandro L Minku, Jo ao Gama, Jerzy Stefanowski. 2017. Ensemble learning for data stream analysis: A survey. Information Fusion, Vol. 37 (2017), 132--156.
    [16]
    Yann A LeCun, Léon Bottou, Genevieve B Orr, and Klaus-Robert Müller. 2012. Efficient backprop. In Neural networks: Tricks of the trade. Springer, 9--48.
    [17]
    Anjin Liu, Yiliao Song, Guangquan Zhang, and Jie Lu. 2017. Regional concept drift detection and density synchronized drift adaptation. In IJCAI International Joint Conference on Artificial Intelligence.
    [18]
    Viktor Losing, Barbara Hammer, and Heiko Wersing. 2016. KNN classifier with self adjusting memory for heterogeneous concept drift. In 2016 IEEE 16th international conference on data mining (ICDM). IEEE, 291--300.
    [19]
    Viktor Losing, Barbara Hammer, and Heiko Wersing. 2018. Incremental on-line learning: A review and comparison of state of the art algorithms. Neurocomputing, Vol. 275 (2018), 1261--1274.
    [20]
    Jie Lu, Anjin Liu, Fan Dong, Feng Gu, Joao Gama, and Guangquan Zhang. 2018. Learning under concept drift: A review. IEEE Transactions on Knowledge and Data Engineering, Vol. 31, 12 (2018), 2346--2363.
    [21]
    Laurens van der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE. Journal of machine learning research, Vol. 9, Nov (2008), 2579--2605.
    [22]
    Andreu Mora. 2019. Predicting and monitoring payment volumes with Spark and ElasticSearch. https://www.adyen.com/blog/predicting-and-monitoring-payment-volumes-with-spark-and-elasticsearch.
    [23]
    Abdullah Mueen, Yan Zhu, Chin-Chia Michael Yeh, Kaveh Kamgar, Krishnamurthy Viswanathan, Chetan Gupta, and Eamonn Keogh. 2017. The fastest similarity search algorithm for time series subsequences under euclidean distance.
    [24]
    Oracle Corporation and/or its affiliates. 2021. MySQL#8482;. https://www.mysql.com/.
    [25]
    Nikunj C Oza. 2005. Online bagging and boosting. In 2005 IEEE international conference on systems, man and cybernetics, Vol. 3. Ieee, 2340--2345.
    [26]
    Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, et al. 2019. Pytorch: An imperative style, high-performance deep learning library. In Advances in neural information processing systems. 8026--8037.
    [27]
    Jamie Pearson. [n.d.]. Real-time Payments Analytics - Prediction. https://www.ir.com/blog/payments/real-time-payments-analytics-prediction.
    [28]
    John Platt. 1998. Sequential minimal optimization: A fast algorithm for training support vector machines. (1998).
    [29]
    Thanawin Rakthanmanon, Bilson Campana, Abdullah Mueen, Gustavo Batista, Brandon Westover, Qiang Zhu, Jesin Zakaria, and Eamonn Keogh. 2012. Searching and mining trillions of time series subsequences under dynamic time warping. In Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining. 262--270.
    [30]
    Robert E Schapire. 2003. The boosting approach to machine learning: An overview. In Nonlinear estimation and classification. Springer, 149--171.
    [31]
    Jeffrey C Schlimmer and Richard H Granger. 1986. Incremental learning from noisy data. Machine learning, Vol. 1, 3 (1986), 317--354.
    [32]
    Rajat Sen, Hsiang-Fu Yu, and Inderjit S. Dhillon. 2019. Think globally, act locally: A deep neural network approach to high-dimensional time series forecasting. In Advances in Neural Information Processing Systems. 4838--4847.
    [33]
    Shun-Yao Shih, Fan-Keng Sun, and Hung-Yi Lee. 2019. Temporal pattern attention for multivariate time series forecasting. Machine Learning, Vol. 108, 8--9 (2019), 1421--1441.
    [34]
    Swabha Swayamdipta, Roy Schwartz, Nicholas Lourie, Yizhong Wang, Hannaneh Hajishirzi, Noah A Smith, and Yejin Choi. 2020. Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics. arXiv preprint arXiv:2009.10795 (2020).
    [35]
    Souhaib Ben Taieb and Amir F Atiya. 2015. A bias and variance analysis for multistep-ahead time series forecasting. IEEE transactions on neural networks and learning systems, Vol. 27, 1 (2015), 62--76.
    [36]
    The Apache Software Foundation. 2021 a. Apache Hadoop#8482;. https://hadoop.apache.org/.
    [37]
    The Apache Software Foundation. 2021 b. Apache Hive#8482;. https://hive.apache.org/.
    [38]
    Arun Thomas. 2020. Atm cash-out: The biggest threat we ignore. https://medium.com/@netsentries/atm-cash-out-the-biggest-threat-we-ignore-d0d4f3e50096.
    [39]
    Andrew H Van Benschoten, Austin Ouyang, Francisco Bischoff, and Tyler W Marrs. 2020. MPA: a novel cross-language API for time series analysis. Journal of Open Source Software, Vol. 5, 49 (2020), 2179.
    [40]
    Ruofeng Wen, Kari Torkkola, Balakrishnan Narayanaswamy, and Dhruv Madeka. 2017. A multi-horizon quantile recurrent forecaster. arXiv preprint arXiv:1711.11053 (2017).
    [41]
    Chin-Chia Michael Yeh. 2018. Towards a Near Universal Time Series Data Mining Tool: Introducing the Matrix Profile. arXiv preprint arXiv:1811.03064 (2018).
    [42]
    Chin-Chia Michael Yeh, Dhruv Gelda, Zhongfang Zhuang, Yan Zheng, Liang Gou, and Wei Zhang. 2020 a. Towards a Flexible Embedding Learning Framework. arXiv preprint arXiv:2009.10989 (2020).
    [43]
    Chin-Chia Michael Yeh, Nickolas Kavantzas, and Eamonn Keogh. 2017. Matrix profile vi: meaningful multidimensional motif discovery. In 2017 IEEE international conference on data mining (ICDM). IEEE, 565--574.
    [44]
    Chin-Chia Michael Yeh, Yan Zhu, Liudmila Ulanova, Nurjahan Begum, Yifei Ding, Hoang Anh Dau, Diego Furtado Silva, Abdullah Mueen, and Eamonn Keogh. 2016. Matrix profile I: all pairs similarity joins for time series: a unifying view that includes motifs, discords and shapelets. In 2016 IEEE 16th international conference on data mining (ICDM). Ieee, 1317--1322.
    [45]
    Chin-Chia Michael Yeh, Yan Zhu, Liudmila Ulanova, Nurjahan Begum, Yifei Ding, Hoang Anh Dau, Zachary Zimmerman, Diego Furtado Silva, Abdullah Mueen, and Eamonn Keogh. 2018. Time series joins, motifs, discords and shapelets: a unifying view that exploits the matrix profile. Data Mining and Knowledge Discovery, Vol. 32, 1 (2018), 83--123.
    [46]
    Chin-Chia Michael Yeh, Zhongfang Zhuang, Wei Zhang, and Liang Wang. 2020 b. Multi-future Merchant Transaction Prediction. arXiv preprint arXiv:2007.05303 (2020).
    [47]
    Chin-Chia Michael Yeh, Zhongfang Zhuang, Yan Zheng, Liang Wang, Junpeng Wang, and Wei Zhang. 2020 c. Merchant Category Identification Using Credit Card Transactions. arXiv preprint arXiv:2011.02602 (2020).
    [48]
    Shujian Yu and Zubin Abraham. 2017. Concept drift detection with hierarchical hypothesis testing. In Proceedings of the 2017 SIAM International Conference on Data Mining. SIAM, 768--776.
    [49]
    Wei Zhang, Liang Wang, Robert Christensen, Yan Zheng, Liang Gou, and Hao Yang. 2020. Transaction sequence processing with embedded real-time decision feedback. US Patent App. 16/370,426.
    [50]
    Zhu et al. 2016. Matrix profile ii: Exploiting a novel algorithm and gpus to break the one hundred million barrier for time series motifs and joins. In 2016 IEEE 16th international conference on data mining (ICDM). IEEE, 739--748.
    [51]
    Zhongfang Zhuang, Chin-Chia Michael Yeh, Liang Wang, Wei Zhang, and Junpeng Wang. 2020. Multi-stream RNN for Merchant Transaction Prediction. arXiv preprint arXiv:2008.01670 (2020).
    [52]
    2010. Learning under concept drift: an overview. arXiv preprint arXiv:1010.4784 (2010).

    Cited By

    View all
    • (2023)Multitask Learning for Time Series Data with 2D Convolution2023 International Conference on Machine Learning and Applications (ICMLA)10.1109/ICMLA58977.2023.00010(9-16)Online publication date: 15-Dec-2023
    • (2023)Sketching Multidimensional Time Series for Fast Discord Mining2023 IEEE International Conference on Big Data (BigData)10.1109/BigData59044.2023.10386363(443-452)Online publication date: 15-Dec-2023
    • (2023)Forecasting online adaptation methods for energy domainEngineering Applications of Artificial Intelligence10.1016/j.engappai.2023.106499123:PCOnline publication date: 1-Aug-2023

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CIKM '21: Proceedings of the 30th ACM International Conference on Information & Knowledge Management
    October 2021
    4966 pages
    ISBN:9781450384469
    DOI:10.1145/3459637
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 30 October 2021

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. financial technology
    2. online learning
    3. regression
    4. time series

    Qualifiers

    • Research-article

    Conference

    CIKM '21
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

    Upcoming Conference

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)73
    • Downloads (Last 6 weeks)8
    Reflects downloads up to 27 Jul 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Multitask Learning for Time Series Data with 2D Convolution2023 International Conference on Machine Learning and Applications (ICMLA)10.1109/ICMLA58977.2023.00010(9-16)Online publication date: 15-Dec-2023
    • (2023)Sketching Multidimensional Time Series for Fast Discord Mining2023 IEEE International Conference on Big Data (BigData)10.1109/BigData59044.2023.10386363(443-452)Online publication date: 15-Dec-2023
    • (2023)Forecasting online adaptation methods for energy domainEngineering Applications of Artificial Intelligence10.1016/j.engappai.2023.106499123:PCOnline publication date: 1-Aug-2023

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media