DOI: 10.1145/3511808.3557095

Sparse Attentive Memory Network for Click-through Rate Prediction with Long Sequences

Published: 17 October 2022

Abstract

Sequential recommendation predicts users' next behaviors from their historical interactions. Recommending with longer sequences improves recommendation accuracy and increases the degree of personalization, but as sequences grow longer, existing works have not yet addressed two main challenges. First, modeling long-range intra-sequence dependencies becomes harder as sequence length increases. Second, long sequences demand memory efficiency and fast computation. In this paper, we propose a Sparse Attentive Memory (SAM) network for long sequential user behavior modeling. SAM supports efficient training and real-time inference for user behavior sequences with lengths on the scale of thousands. In SAM, we model the target item as the query and the long sequence as the knowledge database, where the former continuously elicits relevant information from the latter. SAM simultaneously models target-sequence dependencies and long-range intra-sequence dependencies with O(L) complexity and O(1) sequential updates, a combination otherwise achievable only by the self-attention mechanism at O(L²) complexity. Extensive empirical results demonstrate that our proposed solution is effective not only for long user behavior modeling but also for short sequence modeling. Implemented on sequences of length 1000, SAM has been successfully deployed on one of the largest international E-commerce platforms, with inference time within 30 ms and a substantial 7.30% click-through rate improvement in the online A/B test. To the best of our knowledge, this is the first end-to-end long user sequence modeling framework that models intra-sequence and target-sequence dependencies with this degree of efficiency and has been successfully deployed on a large-scale real-time industrial recommender system.
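
The complexity contrast stated in the abstract can be made concrete. Below is a minimal NumPy sketch (illustrative only, not the authors' implementation; the function names and shapes are hypothetical): a single target-item query attending over an L-length behavior sequence computes one score per position, O(L·d) work in total, whereas full self-attention over the same sequence materializes an L×L score matrix, O(L²·d) work.

    import numpy as np

    def target_attention(target, sequence):
        # One query (the target item) attends over all L positions: O(L*d).
        scores = sequence @ target                    # (L,): one dot product per position
        weights = np.exp(scores - scores.max())
        weights /= weights.sum()                      # softmax over the L positions
        return weights @ sequence                     # (d,): attentive read-out

    def full_self_attention(sequence):
        # Every position attends to every other position: O(L^2 * d).
        scores = sequence @ sequence.T                # (L, L) score matrix
        scores -= scores.max(axis=-1, keepdims=True)
        weights = np.exp(scores)
        weights /= weights.sum(axis=-1, keepdims=True)
        return weights @ sequence                     # (L, d)

    L, d = 1000, 64                                   # the length scale reported in the paper
    rng = np.random.default_rng(0)
    sequence = rng.standard_normal((L, d))
    target = rng.standard_normal(d)
    print(target_attention(target, sequence).shape)   # (64,)
    print(full_self_attention(sequence).shape)        # (1000, 64)

Because the query side stays a fixed number of vectors rather than growing with L, reads of this form keep the cost linear in sequence length; SAM's actual memory read/write mechanism is described in the full paper.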

Supplementary Material

MP4 File (CIKM22-app095.mp4)
Presentation video for the Sparse Attentive Memory (SAM) network, an end-to-end framework that models sequences on the scale of thousands for recommender systems. SAM models both intra-sequence and target-sequence dependencies with O(L) complexity and O(1) sequential updates, and has been deployed on the large-scale real-time recommender system for item recommendation at Alibaba Group.




    Information & Contributors


    Published In

    CIKM '22: Proceedings of the 31st ACM International Conference on Information & Knowledge Management
    October 2022
    5274 pages
ISBN: 9781450392365
DOI: 10.1145/3511808
General Chairs: Mohammad Al Hasan, Li Xiong
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.


    Publisher

    Association for Computing Machinery

    New York, NY, United States



    Author Tags

    1. click-through rate prediction
    2. long sequences
    3. long user behavior modeling
    4. memory networks
    5. sequential recommenders

    Qualifiers

    • Research-article

    Conference

    CIKM '22

    Acceptance Rates

    CIKM '22 Paper Acceptance Rate 621 of 2,257 submissions, 28%;
    Overall Acceptance Rate 1,861 of 8,427 submissions, 22%



Bibliometrics & Citations

    Article Metrics

• Downloads (Last 12 months): 50
• Downloads (Last 6 weeks): 4
    Reflects downloads up to 25 Dec 2024

Cited By
• (2025) ENCODE: Breaking the Trade-Off Between Performance and Efficiency in Long-Term User Behavior Modeling. IEEE Transactions on Knowledge and Data Engineering 37, 1 (Jan 2025), 265-277. https://doi.org/10.1109/TKDE.2024.3486445
• (2024) A User-State Based Interest Transfer Network for Cross-Domain Recommendation. Companion Proceedings of the ACM Web Conference 2024, 662-665 (13 May 2024). https://doi.org/10.1145/3589335.3651465
• (2024) Multi-level sequence denoising with cross-signal contrastive learning for sequential recommendation. Neural Networks 179 (Nov 2024), 106480. https://doi.org/10.1016/j.neunet.2024.106480
• (2024) A knowledge-enhanced interest segment division attention network for click-through rate prediction. Neural Computing and Applications (17 Sep 2024). https://doi.org/10.1007/s00521-024-10330-y
• (2023) BASM: A Bottom-up Adaptive Spatiotemporal Model for Online Food Ordering Service. 2023 IEEE 39th International Conference on Data Engineering (ICDE), 3549-3562 (Apr 2023). https://doi.org/10.1109/ICDE55515.2023.00271
