Accelerated Convergence for Counterfactual Learning to Rank

Jagerman, Rolf; de Rijke, Maarten

doi:10.1145/3397271.3401069

Computer Science > Machine Learning

arXiv:2005.10615 (cs)

[Submitted on 21 May 2020]

Title:Accelerated Convergence for Counterfactual Learning to Rank

Authors:Rolf Jagerman, Maarten de Rijke

View PDF

Abstract:Counterfactual Learning to Rank (LTR) algorithms learn a ranking model from logged user interactions, often collected using a production system. Employing such an offline learning approach has many benefits compared to an online one, but it is challenging as user feedback often contains high levels of bias. Unbiased LTR uses Inverse Propensity Scoring (IPS) to enable unbiased learning from logged user interactions. One of the major difficulties in applying Stochastic Gradient Descent (SGD) approaches to counterfactual learning problems is the large variance introduced by the propensity weights. In this paper we show that the convergence rate of SGD approaches with IPS-weighted gradients suffers from the large variance introduced by the IPS weights: convergence is slow, especially when there are large IPS weights. To overcome this limitation, we propose a novel learning algorithm, called CounterSample, that has provably better convergence than standard IPS-weighted gradient descent methods. We prove that CounterSample converges faster and complement our theoretical findings with empirical results by performing extensive experimentation in a number of biased LTR scenarios -- across optimizers, batch sizes, and different degrees of position bias.

Comments:	SIGIR 2020 full conference paper
Subjects:	Machine Learning (cs.LG); Information Retrieval (cs.IR); Machine Learning (stat.ML)
Cite as:	arXiv:2005.10615 [cs.LG]
	(or arXiv:2005.10615v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2005.10615
Related DOI:	https://doi.org/10.1145/3397271.3401069

Submission history

From: Rolf Jagerman [view email]
[v1] Thu, 21 May 2020 12:53:36 UTC (3,641 KB)

Computer Science > Machine Learning

Title:Accelerated Convergence for Counterfactual Learning to Rank

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Accelerated Convergence for Counterfactual Learning to Rank

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators