Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3637528.3671593acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article
Open access

Achieving a Better Tradeoff in Multi-stage Recommender Systems through Personalization

Published: 24 August 2024 Publication History

Abstract

Recommender systems in social media websites provide value to their communities by recommending engaging content and meaningful connections. Scaling high-quality recommendations to billions of users in real-time requires sophisticated ranking models operating on a vast number of potential items to recommend, becoming prohibitively expensive computationally. A common technique "funnels'' these items through progressively complex models ("multi-stage''), each ranking fewer items but at higher computational cost for greater accuracy. This architecture introduces a trade-off between the cost of ranking items and providing users with the best recommendations. A key observation we make in this paper is that, all else equal, ranking more items indeed improves the overall objective but has diminishing returns. Following this observation, we provide a rigorous formulation through the framework of DR-submodularity, and argue that for a certain class of objectives (reward functions), it is possible to improve the trade-off between performance and computational cost in multi-stage ranking systems with strong theoretical guarantees. We show that this class of reward functions that provide this guarantee is large and robust to various noise models. Finally, we describe extensive experimentation of our method on three real-world recommender systems in Facebook, achieving 8.8% reduction in overall compute resources with no significant impact on recommendation quality, compared to a 0.8% quality loss in a non-personalized budget allocation.

Supplemental Material

WEBM File - KDD promo video
Promo video for our paper "Achieving Achieving a Better Tradeoff in Multi-stage Recommender Systems through Personalization".

References

[1]
Parag Agrawal. 2024. Building a Large-Scale Recommendation System: People You May Know. https://www.linkedin.com/blog/engineering/recommendations/ building-a-large-scale-recommendation-system-people-you-may-know
[2]
Raj Agrawal, Chandler Squires, Karren Yang, Karthikeyan Shanmugam, and Caroline Uhler. 2019. Abcd-strategy: Budgeted experimental design for targeted causal structure discovery. In The 22nd International Conference on Artificial Intelligence and Statistics. PMLR, 3400--3409.
[3]
Rohan Anil, Sandra Gadanho, Da Huang, Nijith Jacob, Zhuoshu Li, Dong Lin, Todd Phillips, Cristina Pop, Kevin Regan, Gil I Shamir, et al. 2022. On the factory floor: ML engineering for industrial-scale ads recommendation models. arXiv preprint arXiv:2209.05310 (2022).
[4]
Nima Asadi and Jimmy Lin. 2013. Effectiveness/Efficiency Tradeoffs for Candidate Generation in Multi-Stage Retrieval Architectures. In Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval (Dublin, Ireland) (SIGIR '13). Association for Computing Machinery, New York, NY, USA, 997--1000. https://doi.org/10.1145/2484028.2484132
[5]
An Bian, Kfir Levy, Andreas Krause, and Joachim M Buhmann. 2017. Continuous DR-submodular maximization: Structure and algorithms. Advances in Neural Information Processing Systems 30 (2017).
[6]
Christian Borgs, Michael Brautbar, Jennifer Chayes, and Brendan Lucier. 2014. Maximizing social influence in nearly optimal time. In Proceedings of the twenty-fifth annual ACM-SIAM symposium on Discrete algorithms. SIAM, 946--957.
[7]
Amanda Bower, Kristian Lum, Tomo Lazovich, Kyra Yee, and Luca Belli. 2022. Random Isn't Always Fair: Candidate Set Imbalance and Exposure Inequality in Recommender Systems. arXiv preprint arXiv:2209.05000 (2022).
[8]
Andrei Z Broder, David Carmel, Michael Herscovici, Aya Soffer, and Jason Zien. 2003. Efficient query evaluation using a two-level retrieval process. In Proceedings of the twelfth international conference on Information and knowledge management. 426--434.
[9]
Niv Buchbinder and Moran Feldman. 2023. Constrained Submodular Maximization via New Bounds for DR-Submodular Functions. arXiv preprint arXiv:2311.01129 (2023).
[10]
Emanuele Bugliarello. 2022. Mostra: A Flexible Balancing Framework to Trade-off User, Artist and Platform Objectives for Music Sequencing. In Proceedings of the ACM Web Conference 2022. 1484--1493.
[11]
Lin Chen, Hamed Hassani, and Amin Karbasi. 2018. Online continuous submodular maximization. In International Conference on Artificial Intelligence and Statistics. PMLR, 1896--1905.
[12]
Wei Chen,Wei Hu, Fu Li, Jian Li, Yu Liu, and Pinyan Lu. 2016. Combinatorial multiarmed bandit with general reward functions. Advances in Neural Information Processing Systems 29 (2016).
[13]
Wei Chen, Yajun Wang, and Siyu Yang. 2009. Efficient influence maximization in social networks. In Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining. 199--208.
[14]
Charles LA Clarke, J Shane Culpepper, and Alistair Moffat. 2016. Assessing efficiency-effectiveness tradeoffs in multi-stage retrieval systems without using relevance judgments. Information Retrieval Journal 19 (2016), 351--377.
[15]
Paul Covington, Jay Adams, and Emre Sargin. 2016. Deep neural networks for youtube recommendations. In Proceedings of the 10th ACM conference on recommender systems. 191--198.
[16]
Khalid El-Arini, Gaurav Veda, Dafna Shahaf, and Carlos Guestrin. 2009. Turning down the noise in the blogosphere. In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Paris, France, June 28 - July 1, 2009, John F. Elder IV, Françoise Fogelman-Soulié, Peter A. Flach, and Mohammed Javeed Zaki (Eds.). ACM, 289--298. https://doi.org/10.1145/ 1557019.1557056
[17]
Siyu Gu, Xiang-Rong Sheng, Biye Jiang, Siyuan Lou, Shuguang Han, Hongbo Deng, and Bo Zheng. 2022. On Ranking Consistency of Pre-ranking Stage Consistency of Pre-ranking Stage. arXiv preprint arXiv:2205.01289 (2022).
[18]
Jiri Hron, Karl Krauth, Michael Jordan, and Niki Kilbertus. 2021. On component interactions in two-stage recommender systems. Advances in neural information processing systems 34 (2021), 2744--2757.
[19]
Xu Huang, Defu Lian, Jin Chen, Liu Zheng, Xing Xie, and Enhong Chen. 2023. Cooperative Retriever and Ranker in Deep Recommenders. In Proceedings of the ACM Web Conference 2023. 1150--1161.
[20]
Kalervo Järvelin and Jaana Kekäläinen. 2002. Cumulated gain-based evaluation of IR techniques. ACM Transactions on Information Systems (TOIS) 20, 4 (2002), 422--446.
[21]
Mohammad Karimi, Mario Lucic, Hamed Hassani, and Andreas Krause. 2017. Stochastic submodular maximization: The case of coverage functions. Advances in Neural Information Processing Systems 30 (2017).
[22]
Jon Kleinberg and Maithra Raghu. 2018. Team performance with test scores. ACM Transactions on Economics and Computation (TEAC) 6, 3--4 (2018), 1--26.
[23]
Andreas Krause and Daniel Golovin. 2014. Submodular function maximization. Tractability 3, 71--104 (2014), 3.
[24]
Lada, Akos and Wang, Meihong and Yan, Tak. 2021. How does News Feed predict what you want to see? https://tech.facebook.com/engineering/2021/01/newsfeed-ranking/
[25]
Minghan Li, Xinyu Zhang, Ji Xin, Hongyang Zhang, and Jimmy Lin. 2022. Certified Error Control of Candidate Set Pruning for Two-Stage Relevance Ranking. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, Yoav Goldberg, Zornitsa Kozareva, and Yue Zhang (Eds.). Association for Computational Linguistics, Abu Dhabi, United Arab Emirates, 333--345. https://doi.org/10.18653/v1/2022.emnlp-main.23
[26]
Weiwen Liu, Yunjia Xi, Jiarui Qin, Fei Sun, Bo Chen, Weinan Zhang, Rui Zhang, and Ruiming Tang. 2022. Neural Re-ranking in Multi-stage Recommender Systems: A Review. In Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence (IJCAI-22).
[27]
Jiaqi Ma, Zhe Zhao, Xinyang Yi, Ji Yang, Minmin Chen, Jiaxi Tang, Lichan Hong, and Ed H Chi. 2020. Off-policy learning in two-stage recommender systems. In Proceedings of The Web Conference 2020. 463--473.
[28]
Takanori Maehara, Akihiro Yabe, and Ken-ichi Kawarabayashi. 2015. Budget allocation problem with multiple advertisers: A game theoretic view. In International Conference on Machine Learning. PMLR, 428--437.
[29]
Anay Mehrotra and Nisheeth K Vishnoi. 2023. Maximizing Submodular Functions for Recommendation in the Presence of Biases. In Proceedings of the ACM Web Conference 2023. 3625--3636.
[30]
Aranyak Mehta, Uri Nadav, Alexandros Psomas, and Aviad Rubinstein. 2020. Hitting the high notes: Subset selection for maximizing expected order statistics. Advances in Neural Information Processing Systems 33 (2020), 15800--15810.
[31]
Meta Inc. 2023. Facebook People You May Know AI system. https://transparency. fb.com/features/explaining-ranking/fb-people-you-may-know/
[32]
Meta Inc. 2023. Instagram Suggested Accounts AI system. https://transparency.fb. com/features/explaining-ranking/ig-suggested-accounts/
[33]
Meta Inc. 2023. Scaling the Instagram Explore recommendations system. https://engineering.fb.com/2023/08/09/ml-applications/scaling-instagramexplore-recommendations-system/
[34]
Marko Mitrovic, Ehsan Kazemi, Morteza Zadimoghaddam, and Amin Karbasi. 2018. Data summarization at scale: A two-stage submodular approach. In International Conference on Machine Learning. PMLR, 3596--3605.
[35]
Nastaran Okati, Stratis Tsirtsis, and Manuel Gomez Rodriguez. 2023. On the within-group fairness of screening classifiers. In International Conference on Machine Learning. PMLR, 26495--26516.
[36]
Jiarui Qin, Jiachen Zhu, Bo Chen, Zhirong Liu, Weiwen Liu, Ruiming Tang, Rui Zhang, Yong Yu, and Weinan Zhang. 2022. RankFlow: Joint Optimization of Multi-Stage Cascade Ranking Systems as Flows. In Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. 814--824.
[37]
Aytunc Sahin, Joachim Buhmann, and Andreas Krause. 2020. Constrained maximization of lattice submodular functions. In IN: ICML 2020 workshop on Negative Dependence and Submodularity for ML, Vienna, Austria, PMLR, Vol. 119.
[38]
Alberto Schiabel, Vyacheslav Kungurtsev, and Jakub Marecek. 2021. Randomized Algorithms for Monotone Submodular Function Maximization on the Integer Lattice. arXiv preprint arXiv:2111.10175 (2021).
[39]
Danny Segev and Sahil Singla. 2021. Efficient approximation schemes for stochastic probing and prophet problems. In Proceedings of the 22nd ACM Conference on Economics and Computation. 793--794.
[40]
Tasuku Soma, Naonori Kakimura, Kazuhiro Inaba, and Ken-ichi Kawarabayashi. 2014. Optimal budget allocation: Theoretical guarantee and efficient algorithm. In International Conference on Machine Learning. PMLR, 351--359.
[41]
Tasuku Soma and Yuichi Yoshida. 2017. Non-monotone dr-submodular function maximization. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 31.
[42]
Tasuku Soma and Yuichi Yoshida. 2018. Maximizing monotone submodular functions over the integer lattice. Mathematical Programming 172 (2018), 539--563.
[43]
Jinbo Song, Ruoran Huang, Xinyang Wang, Wei Huang, Qian Yu, Mingming Chen, Yafei Yao, Chaosheng Fan, Changping Peng, Zhangang Lin, Jinghe Hu, and Jingping Shao. 2022. Rethinking Large-Scale Pre-Ranking System: Entire-Chain Cross-Domain Models. In Proceedings of the 31st ACM International Conference on Information & Knowledge Management (Atlanta, GA, USA) (CIKM '22). Association for Computing Machinery, New York, NY, USA, 4495--4499. https://doi.org/10.1145/3511808.3557683
[44]
Jan Vondrák. 2007. Submodularity in combinatorial optimization. Charles University, Prague (2007).
[45]
Stefan Wager and Susan Athey. 2018. Estimation and Inference of Heterogeneous Treatment Effects using Random Forests. J. Amer. Statist. Assoc. 113, 523 (2018), 1228--1242. https://doi.org/10.1080/01621459.2017.1319839 arXiv:https://doi.org/10.1080/01621459.2017.1319839
[46]
Lequn Wang and Thorsten Joachims. 2023. Uncertainty Quantification for Fairness in Two-Stage Recommender Systems. In Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining. 940--948.
[47]
Lequn Wang, Thorsten Joachims, and Manuel Gomez Rodriguez. 2022. Improving screening processes via calibrated subset selection. In International Conference on Machine Learning. PMLR, 22702--22726.
[48]
Xuewei Wang, Qiang Jin, Shengyu Huang, Min Zhang, Xi Liu, Zhengli Zhao, Yukun Chen, Zhengyu Zhang, Jiyan Yang, Ellie Wen, Sagar Chordia, Wenlin Chen, and Qin Huang. 2023. Towards the Better Ranking Consistency: A Multi-task Learning Framework for Early Stage Ads Ranking. arXiv:2307.11096 [cs.IR]
[49]
Xinyang Yi, Ji Yang, Lichan Hong, Derek Zhiyuan Cheng, Lukasz Heldt, Aditee Kumthekar, Zhe Zhao, Li Wei, and Ed Chi. 2019. Sampling-bias-corrected neural modeling for large corpus item recommendations. In Proceedings of the 13th ACM Conference on Recommender Systems (Copenhagen, Denmark) (RecSys '19). Association for Computing Machinery, New York, NY, USA, 269--277. https: //doi.org/10.1145/3298689.3346996
[50]
Guangyi Zhang, Nikolaj Tatti, and Aristides Gionis. 2022. Ranking with submodular functions on a budget. Data mining and knowledge discovery 36, 3 (2022), 1197--1218.
[51]
Zhixuan Zhang, Yuheng Huang, Dan Ou, Sen Li, Longbin Li, Qingwen Liu, and Xiaoyi Zeng. 2023. Rethinking the Role of Pre-ranking in Large-scale E-Commerce Searching System. arXiv preprint arXiv:2305.13647 (2023).

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
KDD '24: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
August 2024
6901 pages
ISBN:9798400704901
DOI:10.1145/3637528
This work is licensed under a Creative Commons Attribution-NonCommercial International 4.0 License.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 August 2024

Check for updates

Author Tags

  1. multi-stage ranking
  2. personalization
  3. recommender systems
  4. submodularity

Qualifiers

  • Research-article

Conference

KDD '24
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 530
    Total Downloads
  • Downloads (Last 12 months)530
  • Downloads (Last 6 weeks)530
Reflects downloads up to 03 Oct 2024

Other Metrics

Citations

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media