Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3534678.3539270acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

A Generalized Doubly Robust Learning Framework for Debiasing Post-Click Conversion Rate Prediction

Published: 14 August 2022 Publication History

Abstract

Post-click conversion rate (CVR) prediction is an essential task for discovering user interests and increasing platform revenues in a range of industrial applications. One of the most challenging problems of this task is the existence of severe selection bias caused by the inherent self-selection behavior of users and the item selection process of systems. Currently, doubly robust (DR) learning approaches achieve the state-of-the-art performance for debiasing CVR prediction. However, in this paper, by theoretically analyzing the bias, variance and generalization bounds of DR methods, we find that existing DR approaches may have poor generalization caused by inaccurate estimation of propensity scores and imputation errors, which often occur in practice. Motivated by such analysis, we propose a generalized learning framework that not only unifies existing DR methods, but also provides a valuable opportunity to develop a series of new debiasing techniques to accommodate different application scenarios. Based on the framework, we propose two new DR methods, namely DR-BIAS and DR-MSE. DR-BIAS directly controls the bias of DR loss, while DR-MSE balances the bias and variance flexibly, which achieves better generalization performance. In addition, we propose a novel tri-level joint learning optimization method for DR-MSE in CVR prediction, and an efficient training algorithm correspondingly. We conduct extensive experiments on both real-world and semi-synthetic datasets, which validate the effectiveness of our proposed methods.

Supplemental Material

MP4 File
One of the most challenging problems of post-click conversion rate (CVR) prediction is the existence of severe selection bias caused by the inherent self-selection behavior of users and the item selection process of systems. Currently, doubly robust (DR) learning approaches achieve the state-of-the-art performance for debiasing CVR prediction. However, by theoretically analyzing DR methods, we find that existing DR approaches may have poor generalization caused by inaccurate estimation of propensity scores and imputation errors. Motivated by such analysis, we propose a generalized learning framework that not only unifies existing DR methods, but also provides a valuable opportunity to develop a series of new debiasing techniques to accommodate different application scenarios. Based on the framework, we propose two new DR methods, namely DR-BIAS and DR-MSE. DR-BIAS directly controls the bias of DR loss, while DR-MSE balances the bias and variance flexibly, which achieves better generalization performance.

References

[1]
Martín Abadi, Ashish Agarwal, Paul Barham, et al. 2015. TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems.
[2]
Heejung Bang and James M. Robins. 2005. Doubly robust estimation in missing data and causal inference models. Biometrics 61 (2005), 962--972.
[3]
Stephen Bonner and Flavian Vasile. 2018. Causal embeddings for recommendation. In RecSys. 104--112.
[4]
Yin-Wen Chang, Cho-Jui Hsieh, Kai-Wei Chang, Michael Ringgaard, and Chih-Jen Lin. 2010. Training and testing low-degree polynomial data mappings via linear SVM. Journal of Machine Learning Research 11, 4 (2010).
[5]
Olivier Chapelle. 2014. Modeling delayed feedback in display advertising. In KDD. ACM, 1097--1105.
[6]
Jiawei Chen, Hande Dong, Yang Qiu, Xiangnan He, Xin Xin, Liang Chen, Guli Lin, and Keping Yang. 2021. AutoDebias: Learning to Debias for Recommendation. In SIGIR.
[7]
Heng-Tze Cheng, Levent Koc, Jeremiah Harmsen, Tal Shaked, Tushar Chandra, Hrishi Aradhye, Glen Anderson, Greg Corrado, Wei Chai, Mustafa Ispir, et al. 2016. Wide & deep learning for recommender systems. In 1st DLRS workshop.
[8]
Muhammad Junaid Effendi and Syed Abbas Ali. 2017. Click Through Rate Prediction for Contextual Advertisment Using Linear Regression. IJCSIS (2017).
[9]
Huifeng Guo, Ruiming Tang, Yunming Ye, Zhenguo Li, and Xiuqiang He. 2017. Deepfm: a factorization-machine based neural network for ctr prediction. In IJCAI.
[10]
Siyuan Guo, Lixin Zou, Yiding Liu, Wenwen Ye, Suqi Cheng, Shuaiqiang Wang, Hechang Chen, Dawei Yin, and Yi Chang. 2021. Enhanced Doubly Robust Learning for Debiasing Post-Click Conversion Rate Estimation. In SIGIR.
[11]
G. W. Imbens and D. B. Rubin. 2015. Causal Inference For Statistics Social and Biomedical Science. Cambridge University Press.
[12]
Simon Jenni and Paolo Favaro. 2018. Deep bilevel learning. In Proceedings of the European conference on computer vision (ECCV). 618--633.
[13]
Christopher C. Johnson. 2014. Logistic Matrix Factorization for Implicit Feedback Data.
[14]
Yuchin Juan, Yong Zhuang,Wei-Sheng Chin, and Chih-Jen Lin. 2016. Field-aware factorization machines for CTR prediction. In RecSys. 43--50.
[15]
Diederik P. Kingma and Jimmy Ba. 2015. Adam: A Method for Stochastic Optimization. In ICLR, Yoshua Bengio and Yann LeCun (Eds.).
[16]
Yehuda Koren, Robert Bell, Chris Volinsky, et al. 2009. Matrix factorization techniques for recommender systems. Computer 42, 8 (2009), 30--37.
[17]
Dugang Liu, Pengxiang Cheng, Zhenhua Dong, Xiuqiang He, Weike Pan, and Zhong Ming. 2020. Ageneral knowledge distillation framework for counterfactual recommendation via uniform data. In SIGIR. 831--840.
[18]
Quan Lu, Shengjun Pan, Liang Wang, Junwei Pan, Fengdan Wan, and Hongxia Yang. 2017. A Practical Framework of Conversion Rate Prediction for Online Display Advertising. In ADKDD. ACM, 9:1--9:9.
[19]
Xiao Ma, Liqin Zhao, Guan Huang, ZhiWang, Zelin Hu, Xiaoqiang Zhu, and Kun Gai. 2018. Entire Space Multi-Task Model: An Effective Approach for Estimating Post-Click Conversion Rate. In SIGIR. 1137--1140.
[20]
Steffen Rendle. 2010. Factorization machines. In ICDM.
[21]
Matthew Richardson, Ewa Dominowska, and Robert Ragno. 2007. Predicting clicks: estimating the click-through rate for new ads. In WWW.
[22]
P. R. Rosenbaum and D. B. Rubin. 1983. The central role of the propensity score in observational studies for causal. Biometric 70 (1983), 41--55.
[23]
D. B. Rubin. 1974. Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of educational psychology 66 (1974), 688--701.
[24]
Yuta Saito. 2020. Doubly Robust Estimator for Ranking Metrics with Post-Click Conversions. In RecSys. ACM, 92--100.
[25]
Yuta Saito, Gota Morishita, and Shota Yasui. 2020. Dual Learning Algorithm for Delayed Conversions. In SIGIR. ACM, 1849--1852.
[26]
Yuta Saito, Suguru Yaginuma, Yuta Nishino, Hayato Sakata, and Kazuhide Nakata. 2020. Unbiased Recommender Learning from Missing-Not-At-Random Implicit Feedback. In WSDM.
[27]
Tobias Schnabel, Adith Swaminathan, Ashudeep Singh, Navin Chandak, and Thorsten Joachims. 2016. Recommendations as Treatments: Debiasing Learning and Evaluation. In ICML. 1670--1679.
[28]
Tobias Schnabel, Adith Swaminathan, Ashudeep Singh, Navin Chandak, and Thorsten Joachims. 2016. Recommendations as treatments: Debiasing learning and evaluation. arXiv preprint arXiv:1602.05352 (2016).
[29]
Yumin Su, Liang Zhang, Quanyu Dai, Bo Zhang, Jinyao Yan, Dan Wang, Yongjun Bao, Sulong Xu, Yang He, and Weipeng Yan. 2020. An Attention-based Model for Conversion Rate Prediction with Delayed Feedback via Post-click Calibration. In IJCAI. 3522--3528.
[30]
Roman Vershynin. 2018. High-Dimensional Probability: An Introduction with Applications in Data Science. Cambridge University Press.
[31]
RuoxiWang, Bin Fu, Gang Fu, and MingliangWang. 2017. Deep & cross network for ad click predictions. In ADKDD. 1--7.
[32]
Xiaojie Wang, Rui Zhang, Yu Sun, and Jianzhong Qi. 2019. Doubly Robust Joint Learning for Recommendation on Data Missing Not at Random. In ICML.
[33]
Xiaojie Wang, Rui Zhang, Yu Sun, and Jianzhong Qi. 2021. Combating Selection Biases in Recommender Systems with a Few Unbiased Ratings. In WSDM.
[34]
Yanshi Wang, Jie Zhang, Qing Da, and Anxiang Zeng. 2021. Delayed Feedback Modeling for the Entire Space Conversion Rate Prediction. In AAAI.
[35]
YikaiWang, Liang Zhang, Quanyu Dai, Fuchun Sun, Bo Zhang, Yang He,Weipeng Yan, and Yongjun Bao. 2019. Regularized Adversarial Sampling and Deep Timeaware Attention for Click-Through Rate Prediction. In CIKM. 349--358.
[36]
Hong Wen, Jing Zhang, Yuan Wang, Fuyu Lv, Wentian Bao, Quan Lin, and Keping Yang. 2020. Entire Space Multi-Task Modeling via Post-Click Behavior Decomposition for Conversion Rate Prediction. In SIGIR. ACM, 2377--2386.
[37]
Liang Wu, Diane Hu, Liangjie Hong, and Huan Liu. 2018. Turning Clicks into Purchases: Revenue Optimization for Product Search in E-Commerce. In SIGIR.
[38]
Peng Wu, Haoxuan Li, Yuhao Deng, Wenjie Hu, Quanyu Dai, Zhenhua Dong, Jie Sun, Rui Zhang, and Xiao-Hua Zhou. 2022. On the Opportunity of Causal Learning in Recommendation Systems: Foundation, Estimation, Prediction and Challenges. IJCAI.
[39]
Wenhao Zhang, Wentian Bao, Xiao-Yang Liu, Keping Yang, Quan Lin, Hong Wen, and Ramin Ramezani. 2020. Large-scale Causal Approaches to Debiasing Post-click Conversion Rate Estimation with Multi-task Learning. In WWW.

Cited By

View all
  • (2024)Utilizing Non-click Samples via Semi-supervised Learning for Conversion Rate PredictionProceedings of the 18th ACM Conference on Recommender Systems10.1145/3640457.3688151(350-359)Online publication date: 8-Oct-2024
  • (2024)DDPO: Direct Dual Propensity Optimization for Post-Click Conversion Rate EstimationProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3657817(1179-1188)Online publication date: 10-Jul-2024
  • (2024)Causally Debiased Time-aware RecommendationProceedings of the ACM Web Conference 202410.1145/3589334.3645400(3331-3342)Online publication date: 13-May-2024
  • Show More Cited By

Index Terms

  1. A Generalized Doubly Robust Learning Framework for Debiasing Post-Click Conversion Rate Prediction

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      KDD '22: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
      August 2022
      5033 pages
      ISBN:9781450393850
      DOI:10.1145/3534678
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 14 August 2022

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. doubly robust learning
      2. post-click conversion rate
      3. recommender systems
      4. selection bias

      Qualifiers

      • Research-article

      Funding Sources

      • The Ministry of Science and Technology of China

      Conference

      KDD '22
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)101
      • Downloads (Last 6 weeks)6
      Reflects downloads up to 10 Oct 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Utilizing Non-click Samples via Semi-supervised Learning for Conversion Rate PredictionProceedings of the 18th ACM Conference on Recommender Systems10.1145/3640457.3688151(350-359)Online publication date: 8-Oct-2024
      • (2024)DDPO: Direct Dual Propensity Optimization for Post-Click Conversion Rate EstimationProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3657817(1179-1188)Online publication date: 10-Jul-2024
      • (2024)Causally Debiased Time-aware RecommendationProceedings of the ACM Web Conference 202410.1145/3589334.3645400(3331-3342)Online publication date: 13-May-2024
      • (2024)A Click Conversion Rate Model of E-Commerce Platforms Aiming at Effective Data SparseIEEE Transactions on Emerging Topics in Computational Intelligence10.1109/TETCI.2024.33569098:2(1744-1755)Online publication date: Apr-2024
      • (2024)Uncovering the Propensity Identification Problem in Debiased Recommendations2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00056(653-666)Online publication date: 13-May-2024
      • (2024)Unbiased Recommendation Through Invariant Representation LearningMachine Learning and Knowledge Discovery in Databases. Applied Data Science Track10.1007/978-3-031-70381-2_18(280-296)Online publication date: 22-Aug-2024
      • (2023)Removing hidden confounding in recommendationProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3668502(54614-54626)Online publication date: 10-Dec-2023
      • (2023)Optimal transport for treatment effect estimationProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3666359(5404-5418)Online publication date: 10-Dec-2023
      • (2023)Propensity mattersProceedings of the 40th International Conference on Machine Learning10.5555/3618408.3619239(20182-20194)Online publication date: 23-Jul-2023
      • (2023)Discriminative-invariant representation learning for unbiased recommendationProceedings of the Thirty-Second International Joint Conference on Artificial Intelligence10.24963/ijcai.2023/252(2270-2278)Online publication date: 19-Aug-2023
      • Show More Cited By

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media