Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3534678.3539092acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article
Open access

Deconfounding Duration Bias in Watch-time Prediction for Video Recommendation

Published: 14 August 2022 Publication History

Abstract

Watch-time prediction remains to be a key factor in reinforcing user engagement via video recommendations. It has become increasingly important given the ever-growing popularity of online videos. However, prediction of watch time not only depends on the match between the user and the video but is often mislead by the duration of the video itself. With the goal of improving watch time, recommendation is always biased towards videos with long duration. Models trained on this imbalanced data face the risk of bias amplification, which misguides platforms to over-recommend videos with long duration but overlook the underlying user interests. This paper presents the first work to study duration bias in watch-time prediction for video recommendation. We employ a causal graph illuminating that duration is a confounding factor that concurrently affects video exposure and watch-time prediction---the first effect on video causes the bias issue and should be eliminated, while the second effect on watch time originates from video intrinsic characteristics and should be preserved. To remove the undesired bias but leverage the natural effect, we propose a Duration-Deconfounded Quantile-based (D2Q) watch-time prediction framework, which allows for scalability to perform on industry production systems. Through extensive offline evaluation and live experiments, we showcase the effectiveness of this duration-deconfounding framework by significantly outperforming the state-of-the-art baselines. We have fully launched our approach on Kuaishou App, which has substantially improved real-time video consumption due to more accurate watch-time predictions.

Supplemental Material

MP4 File
Watch-time prediction is a key factor in video recommendation. However, with the platform goal of improving watch time, recommendation is always biased towards long videos. Models trained on this imbalanced data face the risk of bias amplification, misguiding platforms to over-recommend videos with long duration but overlook the underlying user interests. We study this duration bias in watch-time prediction. We employ a causal graph showing duration?s confounding effects on video exposure and watch-time prediction -- the one on video causes the bias, while the other on watch time is intrinsic. To remove the undesired bias but leverage the natural effect, we propose a Duration-Deconfounded Quantile-based (D2Q) watch-time prediction framework, with scalability for real applications. We show that D2Q significantly outperforms state-of-the-art baselines in extensive offline evaluation and live experiments. We have fully launched D2Q on Kuaishou App and substantially improved real-time video consumption.

References

[1]
Alex Beutel, Jilin Chen, Tulsee Doshi, Hai Qian, Li Wei, Yi Wu, Lukasz Heldt, Zhe Zhao, Lichan Hong, Ed H Chi, et al. 2019. Fairness in recommendation ranking through pairwise comparisons. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 2212--2220.
[2]
Asia J Biega, Krishna P Gummadi, and GerhardWeikum. 2018. Equity of attention: Amortizing individual fairness in rankings. In The 41st international acm sigir conference on research & development in information retrieval. 405--414.
[3]
Stephen Bonner and Flavian Vasile. 2018. Causal embeddings for recommendation. In Proceedings of the 12th ACM conference on recommender systems. 104--112.
[4]
Jiawei Chen, Hande Dong, Xiang Wang, Fuli Feng, Meng Wang, and Xiangnan He. 2020. Bias and debias in recommender system: A survey and future directions. arXiv preprint arXiv:2010.03240 (2020).
[5]
Qiwei Chen, Changhua Pei, Shanshan Lv, Chao Li, Junfeng Ge, and Wenwu Ou. 2021. End-to-End User Behavior Retrieval in Click-Through RatePrediction Model. arXiv preprint arXiv:2108.04468 (2021).
[6]
Konstantina Christakopoulou, Madeleine Traverse, Trevor Potter, Emma Marriott, Daniel Li, Chris Haulk, Ed H Chi, and Minmin Chen. 2020. Deconfounding User Satisfaction Estimation from Response Rate Bias. In Fourteenth ACM Conference on Recommender Systems. 450--455.
[7]
Paul Covington, Jay Adams, and Emre Sargin. 2016. Deep neural networks for youtube recommendations. In Proceedings of the 10th ACM conference on recommender systems. 191--198.
[8]
James Davidson, Benjamin Liebald, Junning Liu, Palash Nandy, Taylor Van Vleet, Ullas Gargi, Sujoy Gupta, Yu He, Mike Lambert, Blake Livingston, et al. 2010. The YouTube video recommendation system. In Proceedings of the fourth ACM conference on Recommender systems. 293--296.
[9]
Kuaishou financial reports 2021. 2022. Taylor series. https://newsfile.futunn.com/notice/2021/09/17/9941452-0.PDF. Online; accessed 08-Jan-2022.
[10]
Yingqiang Ge, Shuchang Liu, Ruoyuan Gao, Yikun Xian, Yunqi Li, Xiangyu Zhao, Changhua Pei, Fei Sun, Junfeng Ge, Wenwu Ou, et al. 2021. Towards longterm fairness in recommendation. In Proceedings of the 14th ACM International Conference on Web Search and Data Mining. 445--453.
[11]
Alois Gruson, Praveen Chandar, Christophe Charbuillet, James McInerney, Samantha Hansen, Damien Tardieu, and Ben Carterette. 2019. Offline evaluation to make decisions about playlistrecommendation algorithms. In Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining. 420--428.
[12]
Daya Guo, Jiangshui Hong, Binli Luo, Qirui Yan, and Zhangming Niu. 2019. Multi-modal representation learning for short video understanding and recommendation. In 2019 IEEE International Conference on Multimedia & ExpoWorkshops (ICMEW). IEEE, 687--690.
[13]
Huifeng Guo, Jinkai Yu, Qing Liu, Ruiming Tang, and Yuzhou Zhang. 2019. PAL: a position-bias aware learning framework for CTR prediction in live recommender systems. In Proceedings of the 13th ACM Conference on Recommender Systems. 452--456.
[14]
Shantanu Gupta, Zachary C Lipton, and David Childers. 2021. Estimating treatment effects with observed confounders and mediators. In Uncertainty in Artificial Intelligence. PMLR, 982--991.
[15]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition. 770--778.
[16]
Daniel G Horvitz and Donovan J Thompson. 1952. A generalization of sampling without replacement from a finite universe. Journal of the American statistical Association 47, 260 (1952), 663--685.
[17]
Thuc Duy Le, Lin Liu, Anna Tsykin, Gregory J Goodall, Bing Liu, Bing-Yu Sun, and Jiuyong Li. 2013. Inferring microRNA--mRNA causal regulatory relationships from expression data. Bioinformatics 29, 6 (2013), 765--771.
[18]
Rishabh Mehrotra, James McInerney, Hugues Bouchard, Mounia Lalmas, and Fernando Diaz. 2018. Towards a fair marketplace: Counterfactual evaluation of the trade-off between relevance, fairness & satisfaction in recommendation systems. In Proceedings of the 27th acm international conference on information and knowledge management. 2243--2251.
[19]
Ruth M Mickey and Sander Greenland. 1989. The impact of confounder selection criteria on effect estimation. American journal of epidemiology 129, 1 (1989), 125--137.
[20]
Judea Pearl. 2009. Causality. Cambridge university press.
[21]
Judea Pearl. 2012. The do-calculus revisited. arXiv preprint arXiv:1210.4852 (2012).
[22]
Qi Pi, Guorui Zhou, Yujing Zhang, Zhe Wang, Lejian Ren, Ying Fan, Xiaoqiang Zhu, and Kun Gai. 2020. Search-based user interest modeling with lifelong sequential behavior data for click-through rate prediction. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management. 2685--2692.
[23]
Miquel Porta. 2008. A dictionary of epidemiology. Oxford university press.
[24]
Yanru Qu, Han Cai, Kan Ren, Weinan Zhang, Yong Yu, Ying Wen, and Jun Wang. 2016. Product-based neural networks for user response prediction. In 2016 IEEE 16th International Conference on Data Mining (ICDM). IEEE, 1149--1154.
[25]
Matthew Richardson, Ewa Dominowska, and Robert Ragno. 2007. Predicting clicks: estimating the click-through rate for new ads. In Proceedings of the 16th international conference on World Wide Web. 521--530.
[26]
Tobias Schnabel, Adith Swaminathan, Ashudeep Singh, Navin Chandak, and Thorsten Joachims. 2016. Recommendations as treatments: Debiasing learning and evaluation. In international conference on machine learning. PMLR, 1670-- 1679.
[27]
Harald Steck. 2018. Calibrated recommendations. In Proceedings of the 12th ACM conference on recommender systems. 154--162.
[28]
Linpeng Tang, Qi Huang, Amit Puntambekar, Ymir Vigfusson, Wyatt Lloyd, and Kai Li. 2017. Popularity prediction of facebook videos for higher quality streaming. In 2017 {USENIX} Annual Technical Conference ({USENIX} {ATC} 17). 111--123.
[29]
Peng Wang, Yunsheng Jiang, Chunxu Xu, and Xiaohui Xie. 2019. Overview of Content-Based Click-Through Rate Prediction Challenge for Video Recommendation. In Proceedings of the 27th ACM International Conference on Multimedia. 2593--2596.
[30]
Qi Wang, Dongmei Hao, Fangbai Li, Xiaoying Guan, and Pengcheng Chen. 2020. Development of a new framework to identify pathways from socioeconomic development to environmental pollution. Journal of Cleaner Production 253 (2020), 119962.
[31]
Wenjie Wang, Fuli Feng, Xiangnan He, Xiang Wang, and Tat-Seng Chua. 2021. Deconfounded recommendation for alleviating bias amplification. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 1717--1725.
[32]
Xuanhui Wang, Nadav Golbandi, Michael Bendersky, Donald Metzler, and Marc Najork. 2018. Position bias estimation for unbiased learning to rank in personal search. In Proceedings of the Eleventh ACM International Conference onWeb Search and Data Mining. 610--618.
[33]
Neil A Weiss, Paul T Holmes, and Michael Hardy. 2006. A course in probability. Pearson Addison Wesley Boston, Massachusetts, USA.
[34]
Wikipedia. 2022. Taylor series. https://en.wikipedia.org/wiki/Taylor_series. Online; accessed 08-Jan-2022.
[35]
Chuhan Wu, Fangzhao Wu, Xiting Wang, Yongfeng Huang, and Xing Xie. 2021. Fairrec: fairness-aware news recommendation with decomposed adversarial learning. AAAI.
[36]
Siqi Wu, Marian-Andrei Rizoiu, and Lexing Xie. 2018. Beyond views: Measuring and predicting engagement in online videos. In Twelfth international AAAI conference on web and social media.
[37]
Mengyue Yang, Quanyu Dai, Zhenhua Dong, Xu Chen, Xiuqiang He, and Jun Wang. 2021. Top-N Recommendation with Counterfactual User Preference Simulation. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management. 2342--2351.
[38]
Meike Zehlike and Carlos Castillo. 2020. Reducing disparate exposure in ranking: A learning to rank approach. In Proceedings of The Web Conference 2020. 2849--2855.
[39]
Yang Zhang, Fuli Feng, Xiangnan He, Tianxin Wei, Chonggang Song, Guohui Ling, and Yongdong Zhang. 2021. Causal Intervention for Leveraging Popularity Bias in Recommendation. arXiv preprint arXiv:2105.06067 (2021).
[40]
Yu Zheng, Chen Gao, Xiang Li, Xiangnan He, Yong Li, and Depeng Jin. 2021. Disentangling User Interest and Conformity for Recommendation with Causal Embedding. In Proceedings of the Web Conference 2021. 2980--2991.
[41]
Guorui Zhou, Xiaoqiang Zhu, Chenru Song, Ying Fan, Han Zhu, Xiao Ma, Yanghui Yan, Junqi Jin, Han Li, and Kun Gai. 2018. Deep interest network for click-through rate prediction. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 1059--1068.

Cited By

View all
  • (2025)FlyCache: Recommendation-Driven Edge Caching Architecture for Full Life Cycle of Video StreamingDigital Communications and Networks10.1016/j.dcan.2025.01.001Online publication date: Jan-2025
  • (2025)Startup delay aware short video ordering: Problem, model, and a reinforcement learning based algorithmPeer-to-Peer Networking and Applications10.1007/s12083-024-01898-218:2Online publication date: 17-Jan-2025
  • (2024)Enhancing User-Item Interaction Through Counterfactual Classifier For Sequential RecommendationApplied Mathematics and Nonlinear Sciences10.2478/amns-2024-24819:1Online publication date: 3-Sep-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
KDD '22: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
August 2022
5033 pages
ISBN:9781450393850
DOI:10.1145/3534678
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 14 August 2022

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. causal intervention
  2. duration bias
  3. video recommendation
  4. watch-time prediction

Qualifiers

  • Research-article

Conference

KDD '22
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Upcoming Conference

KDD '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1,082
  • Downloads (Last 6 weeks)156
Reflects downloads up to 22 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2025)FlyCache: Recommendation-Driven Edge Caching Architecture for Full Life Cycle of Video StreamingDigital Communications and Networks10.1016/j.dcan.2025.01.001Online publication date: Jan-2025
  • (2025)Startup delay aware short video ordering: Problem, model, and a reinforcement learning based algorithmPeer-to-Peer Networking and Applications10.1007/s12083-024-01898-218:2Online publication date: 17-Jan-2025
  • (2024)Enhancing User-Item Interaction Through Counterfactual Classifier For Sequential RecommendationApplied Mathematics and Nonlinear Sciences10.2478/amns-2024-24819:1Online publication date: 3-Sep-2024
  • (2024)Not All Videos Become Outdated: Short-Video Recommendation by Learning to Deconfound Release Interval BiasProceedings of the 18th ACM Conference on Recommender Systems10.1145/3640457.3688113(179-188)Online publication date: 8-Oct-2024
  • (2024)Touch the Core: Exploring Task Dependence Among Hybrid Targets for RecommendationProceedings of the 18th ACM Conference on Recommender Systems10.1145/3640457.3688101(329-339)Online publication date: 8-Oct-2024
  • (2024)Counteracting Duration Bias in Video Recommendation via Counterfactual Watch TimeProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3637528.3671817(4455-4466)Online publication date: 25-Aug-2024
  • (2024)GradCraft: Elevating Multi-task Recommendations through Holistic Gradient CraftingProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3637528.3671585(4774-4783)Online publication date: 25-Aug-2024
  • (2024)Towards a Causal Decision-Making Framework for Recommender SystemsACM Transactions on Recommender Systems10.1145/36291692:2(1-34)Online publication date: 14-May-2024
  • (2024)Enhancing Playback Performance in Video Recommender Systems with an On-Device Gating and Ranking FrameworkProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3680076(5031-5037)Online publication date: 21-Oct-2024
  • (2024)LabelCraft: Empowering Short Video Recommendations with Automated Label CraftingProceedings of the 17th ACM International Conference on Web Search and Data Mining10.1145/3616855.3635816(28-37)Online publication date: 4-Mar-2024
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media