Efficient On-Device Session-Based Recommendation

Authors: Xin Xia, Junliang Yu, Qinyong Wang, Chaoqun Yang, Nguyen Quoc Viet Hung, Hongzhi Yin

ACM Transactions on Information Systems, Volume 41, Issue 4

Article No.: 102, Pages 1 - 24

https://doi.org/10.1145/3580364

Published: 22 March 2023

Abstract

On-device session-based recommendation systems have attracted increasing attention because they combine low energy and resource consumption with privacy protection while still delivering promising recommendation performance. To fit powerful neural session-based recommendation models onto resource-constrained mobile devices, tensor-train decomposition and its variants have been widely applied: they reduce the memory footprint by decomposing the embedding table into smaller tensors, and they show great potential for compressing recommendation models. However, these compression techniques significantly increase local inference time, because forming each item embedding requires a complex index-list generation step followed by a series of tensor multiplications. The resulting on-device recommender cannot provide real-time responses and recommendations. To improve online recommendation efficiency, we propose to learn compact item representations based on compositional encoding. Specifically, each item is represented by a compositional code consisting of several codewords, and we learn an embedding vector for each codeword rather than for each item. The item embedding is then formed by composing the codeword embedding vectors drawn from different embedding matrices (i.e., codebooks). Since the codebooks can be extremely small, the recommender model fits on resource-constrained devices and can store the codebooks locally for fast inference. In addition, to prevent the loss of model capacity caused by compression, we propose a bidirectional self-supervised knowledge distillation framework. Extensive experimental results on two benchmark datasets demonstrate that, compared with existing methods, the proposed on-device recommender not only achieves an 8× inference speedup with a large compression ratio but also shows superior recommendation performance. The code is released at https://github.com/xiaxin1998/EODRec.
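To make the compositional-encoding idea concrete, the following is a minimal sketch rather than the released implementation. It assumes that codeword vectors are composed by element-wise summation (the abstract does not state the composition operator), and it uses random codes and codebooks where the paper learns them. The function names `compose_item_embedding` and `compression_ratio`, and the sizes `M`, `K`, and `d`, are hypothetical choices for illustration.

```python
import math
import random


def compose_item_embedding(code, codebooks):
    """Form an item embedding from one codeword vector per codebook.

    code: list of M codeword indices, one per codebook.
    codebooks: list of M codebooks, each a list of K vectors of length d.
    Assumption: composition is an element-wise sum.
    """
    dim = len(codebooks[0][0])
    embedding = [0.0] * dim
    for codebook, index in zip(codebooks, code):
        for j, value in enumerate(codebook[index]):
            embedding[j] += value
    return embedding


def compression_ratio(num_items, dim, num_codebooks, codebook_size, float_bits=32):
    """Size of a full embedding table divided by size of codes plus codebooks."""
    full_bits = num_items * dim * float_bits
    # Each codeword index needs ceil(log2(K)) bits.
    bits_per_codeword = max(1, math.ceil(math.log2(codebook_size)))
    code_bits = num_items * num_codebooks * bits_per_codeword
    codebook_bits = num_codebooks * codebook_size * dim * float_bits
    return full_bits / (code_bits + codebook_bits)


# Toy setup: M codebooks, K codewords each, embedding dimension d.
random.seed(0)
M, K, d = 4, 32, 8
codebooks = [
    [[random.gauss(0.0, 1.0) for _ in range(d)] for _ in range(K)]
    for _ in range(M)
]
# Hypothetical compositional codes for 10 items (learned in the paper, random here).
item_codes = [[random.randrange(K) for _ in range(M)] for _ in range(10)]

item_embedding = compose_item_embedding(item_codes[3], codebooks)
ratio = compression_ratio(num_items=40_000, dim=128, num_codebooks=M, codebook_size=K)
```

Under these assumptions, a lookup costs M table reads and M vector additions, with no tensor multiplications, which is the source of the inference-time advantage over tensor-train decomposition that the abstract describes. The device stores only the M small codebooks plus a short integer code per item.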
