Google Scholar

A study of smoothing methods for language models applied to ad hoc information retrieval

C Zhai, J Lafferty - ACM Sigir Forum, 2017 - dl.acm.org

C Zhai, J Lafferty

ACM Sigir Forum, 2017•dl.acm.org

Language modeling approaches to information retrieval are attractive and promising because they connect the problem of retrieval with that of language model estimation, which has been studied extensively in other application areas such as speech recognition. The basic idea of these approaches is to estimate a language model for each document, and then rank documents by the likelihood of the query according to the estimated language model. A core problem in language model estimation is smoothing, which adjusts the maximum likelihood estimator so as to correct the inaccuracy due to data sparseness. In this paper, we study the problem of language model smoothing and its influence on retrieval performance. We examine the sensitivity of retrieval performance to the smoothing parameters and compare several popular smoothing methods on different test collection.

ACM Digital Library

Show moreShow less

Save Cite Cited by 2008 Related articles All 28 versions

Cite

Advanced search

Saved to My library

A study of smoothing methods for language models applied to ad hoc information retrieval