Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Incremental blind feedback: An effective approach to automatic query expansion

Published: 03 October 2014 Publication History
  • Get Citation Alerts
  • Abstract

    Automatic query expansion (AQE) is a useful technique for enhancing the effectiveness of information retrieval systems. In this article, we propose a novel AQE algorithm which first adopts a systematic incremental approach to choose feedback documents from the top retrieved set and then selects the expansion terms aggregating the scores from each feedback set. We also devise a term selection measure and a number of weighting schemes based on easily computable features. A set of experiments with a large number of standard test collections reveals that the proposed incremental blind feedback algorithm outperforms a number of state-of-the-art query expansion methods with remarkable significance and consistency.

    References

    [1]
    IJsbrand Jan Aalbersberg. 1992. Incremental relevance feedback. In Proceedings of the 15th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’92). 11--22.
    [2]
    Gianni Amati and Cornelis Joost Van Rijsbergen. 2002. Probabilistic models of information retrieval based on measuring the divergence from randomness. ACM Trans. Inf. Syst. 20, 4, 357--389.
    [3]
    R. Attar and A. S. Fraenkel. 1977. Local feedback in full-text retrieval systems. J. ACM 24, 3, 397--417.
    [4]
    Michael Bendersky, Donald Metzler, and W. Bruce Croft. 2011. Parameterized concept weighting in verbose queries. In Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’11). 605--614.
    [5]
    Bodo Billerbeck and Justin Zobel. 2004. Questioning query expansion: An examination of behaviour and parameters. In Proceedings of the 15th Australasian Database Conference (ADC’04). Vol. 27, 69--76.
    [6]
    Guihong Cao, Jian-Yun Nie, Jianfeng Gao, and Stephen Robertson. 2008. Selecting good expansion terms for pseudo-relevance feedback. In Proceedings of the 31st International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’08). 243--250.
    [7]
    Claudio Carpineto, Renato de Mori, Giovanni Romano, and Brigitte Bigi. 2001. An information-theoretic approach to automatic query expansion. ACM Trans. Inf. Syst. 19, 1, 1--27.
    [8]
    Claudio Carpineto and Giovanni Romano. 2012. A survey of automatic query expansion in information retrieval. ACM Comput. Surv. 44, 1, 1:1--1:50.
    [9]
    Claudio Carpineto, Giovanni Romano, and Vittorio Giannini. 2002. Improving retrieval feedback with multiple term-ranking function combination. ACM Trans. Inf. Syst. 20, 3, 259--290.
    [10]
    Kevyn Collins-Thompson. 2009. Reducing the risk of query expansion via robust constrained optimization. In Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM’09). 837--846.
    [11]
    Kevyn Collins-Thompson and Jamie Callan. 2007. Estimation and use of uncertainty in pseudo-relevance feedback. In Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’07). 303--310.
    [12]
    Ljiljana Dolamic and Jacques Savoy. 2009. Indexing and stemming approaches for the Czech language. Inf. Process. Manage. 45, 6, 714--720.
    [13]
    Ljiljana Dolamic and Jacques Savoy. 2010. Comparative study of indexing and search strategies for the Hindi, Marathi, and Bengali languages. ACM Trans. Asian Lang. Inform. Process. 9, 3.
    [14]
    Tamas Endre Doszkocs. 1982. AID: An Associative Interactive Dictionary for Online Bibliographic Searching. University of Maryland.
    [15]
    Nasreen Abdul Jaleel, James Allan, W. Bruce Croft, Fernando Diaz, Leah S. Larkey, Xiaoyan Li, Mark D. Smucker, and Courtney Wade. 2004. UMass at TREC 2004: Novelty and HARD. In Proceedings of the Text REtrieval Conference.
    [16]
    Eyal Krikon, Oren Kurland, and Michael Bendersky. 2010. Utilizing inter-passage and inter-document similarities for reranking search results. ACM Trans. Inf. Syst. 29, 1, 3:1--3:28.
    [17]
    Victor Lavrenko and W. Bruce Croft. 2001. Relevance-based language models. In Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval. 120--127.
    [18]
    Kyung Soon Lee, W. Bruce Croft, and James Allan. 2008. A cluster-based resampling method for pseudo-relevance feedback. In Proceedings of the 31st International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’08). 235--242.
    [19]
    Yuanhua Lv and ChengXiang Zhai. 2009. A comparative study of methods for estimating query language models with pseudo feedback. In Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM’09). 1895--1898.
    [20]
    Yuanhua Lv and ChengXiang Zhai. 2010. Positional relevance model for pseudo-relevance feedback. In Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’10). 579--586.
    [21]
    Prasenjit Majumder, Mandar Mitra, Dipasree Pal, Ayan Bandyopadhyay, Samaresh Maiti, Sukomal Pal, Deboshree Modak, and Sucharita Sanyal. 2010. The FIRE 2008 evaluation exercise. Trans. Asian Lang. Inform. Process. 9, 3, 10:1--10:24.
    [22]
    Donald Metzler and W. Bruce Croft. 2005. A Markov random field model for term dependencies. In Proceedings of the 28th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’05). 472--479.
    [23]
    Jun Miao, Jimmy Xiangji Huang, and Zheng Ye. 2012. Proximity-based Rocchio’s model for pseudo relevance.In Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’12). 535--544.
    [24]
    Mandar Mitra, Amit Singhal, and Chris Buckley. 1998. Improving automatic query expansion. In Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval. 206--214.
    [25]
    M. F. Porter. 1997. An algorithm for suffix stripping. In Readings in Information Retrieval, 313--316.
    [26]
    S. E. Robertson. 1991. On term selection for query expansion. J. Doc. 46, 4, 359--364.
    [27]
    Stephen E. Robertson and Karen Sparck Jones. 1988. Relevance weighting of search terms. In Document Retrieval Systems. 143--160.
    [28]
    Tetsuya Sakai, Toshihiko Manabe, and Makoto Koyama. 2005. Flexible pseudo-relevance feedback via selective sampling. ACM Trans. Asian Lang. Inform. Process. 4, 2, 111--135.
    [29]
    Gerard Salton (Ed.). 1971. The SMART Retrieval System: Experiments in Automatic Document Processing. Prentice Hall, Englewood, Cliffs, NJ.
    [30]
    Jinxi Xu and W. Bruce Croft. 1996. Query expansion using local and global document analysis. In Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval. 4--11.
    [31]
    Jinxi Xu and W. Bruce Croft. 2000. Improving the effectiveness of information retrieval with local context analysis. ACM Trans. Inf. Syst. 18, 1, 79--112.

    Cited By

    View all
    • (2023)Recent Query Reformulation Approaches for Information Retrieval System - A SurveyRecent Advances in Computer Science and Communications10.2174/266625581566622040409192016:1Online publication date: Jan-2023
    • (2022)UFTDRDH: A novel user-oriented solution for full-text database retrieval in Chinese digital humanitiesJournal of Information Science10.1177/01655515221133529(016555152211335)Online publication date: 22-Nov-2022
    • (2020)An Algorithm of Query Expansion for Chinese EMR Retrieval by Improving Expansion Term Weights and Retrieval ScoresIEEE Access10.1109/ACCESS.2020.30330178(200063-200072)Online publication date: 2020
    • Show More Cited By

    Index Terms

    1. Incremental blind feedback: An effective approach to automatic query expansion

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Transactions on Asian Language Information Processing
      ACM Transactions on Asian Language Information Processing  Volume 13, Issue 3
      September 2014
      83 pages
      ISSN:1530-0226
      EISSN:1558-3430
      DOI:10.1145/2676410
      Issue’s Table of Contents
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 03 October 2014
      Accepted: 01 April 2014
      Revised: 01 February 2014
      Received: 01 November 2013
      Published in TALIP Volume 13, Issue 3

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Query expansion
      2. document ranking
      3. pseudo-relevance feedback
      4. query refinement
      5. search

      Qualifiers

      • Research-article
      • Research
      • Refereed

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)6
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 26 Jul 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2023)Recent Query Reformulation Approaches for Information Retrieval System - A SurveyRecent Advances in Computer Science and Communications10.2174/266625581566622040409192016:1Online publication date: Jan-2023
      • (2022)UFTDRDH: A novel user-oriented solution for full-text database retrieval in Chinese digital humanitiesJournal of Information Science10.1177/01655515221133529(016555152211335)Online publication date: 22-Nov-2022
      • (2020)An Algorithm of Query Expansion for Chinese EMR Retrieval by Improving Expansion Term Weights and Retrieval ScoresIEEE Access10.1109/ACCESS.2020.30330178(200063-200072)Online publication date: 2020
      • (2020)An ambiguous tag-based query reformulation technique for an effective semantic-based social image researchProcedia Computer Science10.1016/j.procs.2020.08.053176(508-520)Online publication date: 2020
      • (2020)Neural joint attention code search over structure embeddings for software Q&A sitesJournal of Systems and Software10.1016/j.jss.2020.110773(110773)Online publication date: Aug-2020
      • (2019)Query expansion techniques for information retrieval: A surveyInformation Processing & Management10.1016/j.ipm.2019.05.00956:5(1698-1735)Online publication date: Oct-2019
      • (2019)A knowledge-based semantic framework for query expansionInformation Processing & Management10.1016/j.ipm.2019.04.00756:5(1605-1617)Online publication date: Oct-2019
      • (2019)Modeling positive and negative feedback for improving document retrievalExpert Systems with Applications10.1016/j.eswa.2018.11.035120(253-261)Online publication date: May-2019
      • (2016)Query Expansion Based on Crowd Knowledge for Code SearchIEEE Transactions on Services Computing10.1109/TSC.2016.25601659:5(771-783)Online publication date: 1-Sep-2016

      View Options

      Get Access

      Login options

      Full Access

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media