Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1772690.1772836acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
poster

Optimizing two stage bigram language models for IR

Published: 26 April 2010 Publication History

Abstract

Although higher order language models (LMs) have shown benefit of capturing word dependencies for Information retrieval(IR), the tuning of the increased number of free parameters remains a formidable engineering challenge. Consequently,in many real world retrieval systems, applying higher order LMs is an exception rather than the rule. In this study, we address the parameter tuning problem using a framework based on a linear ranking model in which different component models are incorporated as features. Using unigram and bigram LMs with 2 stage smoothing as examples, we show that our method leads to a bigram LM that outperforms significantly its unigram counterpart and the well-tuned BM25 model.

References

[1]
J. Gao Haoliang Qi, X. Xia, and J.-Y. Nie, "Linear discriminant model for information retrieval," in SIGIR, 2005, pp. 290--297.
[2]
D. Metzler and W. B. Croft, "A markov random field model for term dependencies," in SIGIR, 2005, pp. 472--479.
[3]
C. Zhai and J. Lafferty, "Two-stage language models for information retrieval," in SIGIR, 2002, pp. 49--56.
[4]
J. Gao, J.-Y. Nie, G. Wu, and G. Cao, "Dependence language model for information retrieval," in SIGIR, 2004, pp. 170--177.
[5]
C. Zhai and J. Lafferty, "A study of smoothing methods for language models applied to information retrieval," ACM Trans. Inf. Syst., vol. 22, no. 2, pp. 179--214, 2004.
[6]
M. J. D. Powell, "An efficient method for finding the minimum of a function of several variables without calculating derivatives," The Computer Journal, vol. 7, no. 2, pp. 155--162, 1964.
[7]
M. Taylor, H. Zaragoza, N. Craswell, S. Robertson, and C. Burges, "Optimisation methods for ranking functions with multiple parameters," in CIKM, 2006, pp. 585--593.
[8]
K. Jarvelin and J. Kekalainen, "Cumulated gain-based evaluation of IR techniques," ACM Trans. Inf. Syst., vol. 20, no. 4, pp. 422--446, 2002.

Cited By

View all
  • (2013)Can predicate-argument structures be used for contextual opinion retrieval from blogs?World Wide Web10.1007/s11280-012-0170-816:5-6(763-791)Online publication date: 1-Nov-2013
  • (2011)Unsupervised query segmentation using clickthrough for information retrievalProceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval10.1145/2009916.2009957(285-294)Online publication date: 24-Jul-2011
  • (2011)Sentence-level contextual opinion retrievalProceedings of the 20th international conference companion on World wide web10.1145/1963192.1963351(403-408)Online publication date: 28-Mar-2011
  • Show More Cited By

Index Terms

  1. Optimizing two stage bigram language models for IR

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    WWW '10: Proceedings of the 19th international conference on World wide web
    April 2010
    1407 pages
    ISBN:9781605587998
    DOI:10.1145/1772690

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 26 April 2010

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. bigram LM
    2. parameter tuning
    3. retrieval model

    Qualifiers

    • Poster

    Conference

    WWW '10
    WWW '10: The 19th International World Wide Web Conference
    April 26 - 30, 2010
    North Carolina, Raleigh, USA

    Acceptance Rates

    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)4
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 25 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2013)Can predicate-argument structures be used for contextual opinion retrieval from blogs?World Wide Web10.1007/s11280-012-0170-816:5-6(763-791)Online publication date: 1-Nov-2013
    • (2011)Unsupervised query segmentation using clickthrough for information retrievalProceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval10.1145/2009916.2009957(285-294)Online publication date: 24-Jul-2011
    • (2011)Sentence-level contextual opinion retrievalProceedings of the 20th international conference companion on World wide web10.1145/1963192.1963351(403-408)Online publication date: 28-Mar-2011
    • (2011)Using predicate-argument structures for context-dependent opinion retrievalProceedings of the 7th international conference on Advanced Data Mining and Applications - Volume Part II10.1007/978-3-642-25856-5_29(386-402)Online publication date: 17-Dec-2011
    • (2011)Semantic-based opinion retrieval using predicate-argument structures and subjective adjectivesProceedings of the 7th Asia conference on Information Retrieval Technology10.1007/978-3-642-25631-8_34(372-385)Online publication date: 18-Dec-2011

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    EPUB

    View this article in ePub.

    ePub

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media