DOI: 10.1145/3637528.3671570

A Self-boosted Framework for Calibrated Ranking

Published: 24 August 2024

Abstract

Scale-calibrated ranking systems are now ubiquitous in real-world applications, pursuing accurate ranking quality and calibrated probabilistic predictions simultaneously. For instance, in an advertising ranking system, the predicted click-through rate (CTR) is used for ranking and must also be calibrated for downstream cost-per-click ad bidding. Recently, multi-objective methods have been widely adopted as the standard approach to Calibrated Ranking (CR): they combine two loss functions, a pointwise loss that focuses on calibrated absolute values and a ranking loss that emphasizes relative orderings. However, when applied to industrial online applications, existing multi-objective CR approaches still suffer from two crucial limitations. First, previous methods need to aggregate the full candidate list within a single mini-batch to compute the ranking loss. Such an aggregation strategy precludes extensive data shuffling, which has long been proven beneficial for preventing overfitting, and thus degrades training effectiveness. Second, existing multi-objective methods apply the two inherently conflicting loss functions to a single probabilistic prediction, which results in a sub-optimal trade-off between calibration and ranking.
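To make the conflict concrete, here is a minimal sketch of the standard multi-objective CR loss described above, written in PyTorch; the function name multi_objective_cr_loss and the weight alpha are illustrative assumptions, not the paper's implementation.

    import torch
    import torch.nn.functional as F

    def multi_objective_cr_loss(logits, labels, alpha=0.5):
        # logits: (list_size,) raw scores for one query's full candidate list
        # labels: (list_size,) binary click labels, as floats
        # Pointwise term: binary cross-entropy keeps sigmoid(logits)
        # calibrated against the empirical click rate.
        point_loss = F.binary_cross_entropy_with_logits(logits, labels)
        # Listwise term: softmax cross-entropy over the whole list rewards
        # correct relative ordering, but is invariant to shifting all
        # logits by a constant, so it pulls against the calibrated scale.
        num_pos = labels.sum().clamp(min=1.0)
        rank_loss = -(F.log_softmax(logits, dim=0) * labels).sum() / num_pos
        # alpha trades off the two objectives on the SAME prediction.
        return alpha * point_loss + (1.0 - alpha) * rank_loss

Note that the listwise term can only be computed once a query's entire candidate list is gathered into one mini-batch (the first limitation), and that both terms act on the same logits (the second limitation).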
To tackle these two limitations, we propose a Self-Boosted framework for Calibrated Ranking (SBCR). In SBCR, the ranking scores predicted by the online deployed model are dumped into context features. With these additional context features, each single item can perceive the overall distribution of scores in the whole ranking list, so the ranking loss can be constructed without sample aggregation. Since the deployed model is a few versions older than the training model, the dumped predictions reveal what the model failed to learn and keep boosting it to correct previously mis-predicted items. Moreover, a calibration module is introduced to decouple the pointwise loss and the ranking loss: the two losses are applied before and after the calibration module separately, which elegantly addresses the sub-optimal trade-off problem. We conduct comprehensive experiments on industrial-scale datasets and online A/B tests, demonstrating that SBCR achieves advanced performance on both calibration and ranking. Our method has been deployed on the video search system of Kuaishou, yielding significant improvements in CTR and in the total time users spend on Kuaishou.
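The following sketch reflects one reading of this architecture; the layer sizes, the encoding of the dumped scores as context features, and the exact form of the calibration module are assumptions for illustration, not the authors' released code.

    import torch
    import torch.nn as nn

    class SBCRSketch(nn.Module):
        def __init__(self, feature_dim, score_ctx_dim=4):
            super().__init__()
            # score_ctx: context features built from the scores the online
            # deployed model predicted for the WHOLE list (e.g. this item's
            # own dumped score plus mean/max/min over the list) - a
            # hypothetical encoding; the paper's features may differ.
            self.ranker = nn.Sequential(
                nn.Linear(feature_dim + score_ctx_dim, 64), nn.ReLU(),
                nn.Linear(64, 1))
            # Calibration module: a small learned mapping from the ranking
            # score to a calibrated logit (hypothetical form).
            self.calibrator = nn.Sequential(
                nn.Linear(1, 16), nn.ReLU(), nn.Linear(16, 1))

        def forward(self, features, score_ctx):
            x = torch.cat([features, score_ctx], dim=-1)
            s = self.ranker(x).squeeze(-1)   # ranking loss is applied to s
            p = self.calibrator(s.unsqueeze(-1)).squeeze(-1)
            return s, p                      # pointwise loss on sigmoid(p)

Because the ranking loss sees only the pre-calibration score s and the pointwise loss only the post-calibration output p, the two objectives no longer compete on a single prediction.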

Supplemental Material

MP4 File - A Self-boosted Framework for Calibrated Ranking
In real-world applications, scale-calibrated ranking aims for accurate rankings and calibrated predictions. Existing multi-objective approaches suffer from two limitations. First, they require aggregating a query's full candidate list within a mini-batch to compute the ranking loss, which precludes data shuffling and degrades training effectiveness. Second, they apply conflicting loss functions to a single prediction, resulting in sub-optimal trade-offs. To address these, we propose a Self-Boosted framework. It incorporates the scores predicted by the online deployed model into the context features, so that the ranking loss can be constructed without sample aggregation. Moreover, a calibration module is introduced to decouple the pointwise loss and the ranking loss. Comprehensive results demonstrate the advanced performance of the proposed framework.


Cited By

  • Kale: Elastic GPU Scheduling for Online DL Model Training. In Proceedings of the 2024 ACM Symposium on Cloud Computing, 36-51. DOI: 10.1145/3698038.3698532. Online publication date: 20 Nov 2024.


Published In

KDD '24: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
August 2024, 6901 pages
ISBN: 9798400704901
DOI: 10.1145/3637528

Publisher

Association for Computing Machinery, New York, NY, United States


Author Tags

  1. calibrated ranking
  2. learning-to-rank
  3. search ranking system

Qualifiers

  • Research-article

Conference

KDD '24

Acceptance Rates

Overall acceptance rate: 1,133 of 8,635 submissions (13%)



Article Metrics

  • Downloads (last 12 months): 149
  • Downloads (last 6 weeks): 15
Reflects downloads up to 20 Feb 2025

