research-article

External Evaluation of Ranking Models under Extreme Position-Bias

Authors:

Yaron Fairstein,

Elad Haramaty,

Arnon Lazerson,

Liane Lewin-EytanAuthors Info & Claims

WSDM '22: Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining

Pages 252 - 261

https://doi.org/10.1145/3488560.3498420

Published: 15 February 2022 Publication History

Get Access

Abstract

Implicit feedback from users behavior is a natural and scalable source for training and evaluating ranking models in human-interactive systems. However, inherent biases such as the position bias are key obstacles to its effective usage. This is further accentuated in cases of extreme bias, where behavioral feedback can be collected exclusively on the top ranked result. In fact, in such cases, state-of-art debiasing methods cannot be applied. A prominent use case of extreme position bias is the voice shopping medium, where only a small amount of information can be presented to the user during a single interaction, resulting in user behavioral signals that are almost exclusively limited to the top offer. There is no way to know how the user would have reacted to a different offer than the top one he was actually exposed to. Thus, any new ranker we wish to evaluate with respect to a behavioral metric, requires online experimentation. We propose a novel approach, based on anexternal estimator model, for accurately predicting offline the performance of a new ranker. The accuracy of our solution is proven theoretically, as well as demonstrated by a line of experiments. In these experiments, we focus on the use case of purchase prediction, and show that our estimator can accurately predict offline the purchase rate of different rankers over a segment of voice shopping traffic. Our prediction is validated online, as being compared to the actual performance obtained by each ranker when being exposed to users.

Supplementary Material

MP4 File (WSDM22-fp282.mp4)

We consider the setting of model training and evaluation in the case of extreme position bias, In which the behavioral feedback is limited almost exclusively to the top offer (motivated by the voice shopping medium). In this setting there is no way to know how the user would have reacted to a different offer than the top one he was actually exposed to. Thus, any new ranker we wish to evaluate with respect to a behavioral metric, requires online experimentation. In the talk, we introduce a novel approach, based on an external estimator model, for accurately predicting offline the performance of a new ranker. We demonstrate the accuracy of our solution by a line of experiments, in which, we focus on the use case of purchase prediction, and show that our estimator can accurately predict offline the purchase rate of different rankers over a segment of voice shopping traffic. Our prediction is validated online, as being compared to the actual performance obtained by each ranker when being exposed to user.

Download
25.02 MB

References

[1]

Qingyao Ai, Keping Bi, Cheng Luo, Jiafeng Guo, and W Bruce Croft. 2018. Unbiased learning to rank with unbiased propensity estimation. In The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval . 385--394.

Abstract

Supplementary Material

References

Cited By

Index Terms

Recommendations

Position Bias Estimation for Unbiased Learning to Rank in Personal Search

Click-Conversion Multi-Task Model with Position Bias Mitigation for Sponsored Search in eCommerce

Correcting for Selection Bias in Learning-to-rank Systems

Comments

Information

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Get Access

Login options

Full Access

View options

PDF

eReader

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations