Robust ranking models via risk-sensitive optimization
L Wang, PN Bennett, K Collins-Thompson - Proceedings of the 35th …, 2012 - dl.acm.org
L Wang, PN Bennett, K Collins-Thompson
Proceedings of the 35th international ACM SIGIR conference on Research and …, 2012•dl.acm.orgMany techniques for improving search result quality have been proposed. Typically, these
techniques increase average effectiveness by devising advanced ranking features and/or by
developing sophisticated learning to rank algorithms. However, while these approaches
typically improve average performance of search results relative to simple baselines, they
often ignore the important issue of robustness. That is, although achieving an average gain
overall, the new models often hurt performance on many queries. This limits their application …
techniques increase average effectiveness by devising advanced ranking features and/or by
developing sophisticated learning to rank algorithms. However, while these approaches
typically improve average performance of search results relative to simple baselines, they
often ignore the important issue of robustness. That is, although achieving an average gain
overall, the new models often hurt performance on many queries. This limits their application …
Many techniques for improving search result quality have been proposed. Typically, these techniques increase average effectiveness by devising advanced ranking features and/or by developing sophisticated learning to rank algorithms. However, while these approaches typically improve average performance of search results relative to simple baselines, they often ignore the important issue of robustness. That is, although achieving an average gain overall, the new models often hurt performance on many queries. This limits their application in real-world retrieval scenarios. Given that robustness is an important measure that can negatively impact user satisfaction, we present a unified framework for jointly optimizing effectiveness and robustness. We propose an objective that captures the tradeoff between these two competing measures and demonstrate how we can jointly optimize for these two measures in a principled learning framework. Experiments indicate that ranking models learned this way significantly decreased the worst ranking failures while maintaining strong average effectiveness on par with current state-of-the-art models.
ACM Digital Library