Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3166072.3166074acmotherconferencesArticle/Chapter ViewAbstractPublication PagesadcsConference Proceedingsconference-collections
short-paper

Automatic Term Reweighting for Query Expansion

Published: 07 December 2017 Publication History

Abstract

Query expansion is used to overcome the vocabulary mismatch between the documents and queries, but it can lead to query drift. We propose an automatic term reweighting strategy for BM25 ranking functions. Using expansion terms obtained from general purpose thesauri, we found that reweighting through term frequency merging is more effective than standard query expansion. Instead of appending the new terms directly to the original query, we merge the term frequencies with the original query term. This reduces the impact of spurious expansion terms being over represented in the modified query.

References

[1]
N. Abdul-Jaleel, J. Allan, W.B. Croft, F. Diaz, L. Larkey, X. Li, M.D. Smucker, and C. Wade. 2004. UMass at TREC 2004: Novelty and HARD. In TREC 2004.
[2]
J. Bai, D. Song, P. Bruza, J.-Y. Nie, and G. Cao. 2005. Query Expansion Using Term Relationships in Language Models for Information Retrieval. In CIKM '05. 688--695.
[3]
G. W. Furnas, T. K. Landauer, L. M. Gomez, and S. T. Dumais. 1987. The Vocabulary Problem in Human-system Communication. CACM 30, 11 (1987), 964--971.
[4]
B. L. Humphreys, D. A. Lindberg, H. M. Schoolman, and G. O. Barnett. 1998. The Unified Medical Language System: an informatics research collaboration. J Am Med Inform Assoc 5, 1 (1998), 1--11.
[5]
L. Lee. 2007. IDF Revisited: A Simple New Derivation Within the Robertson-Spärck Jones Probabilistic Model. In SIGIR '07. 751--752.
[6]
F. Martínez-Santiago, M. A. García-Cumbreras, and L. A. Ureña Lòpez. 2006. Does Pseudo-relevance Feedback Improve Distributed Information Retrieval Systems? IP&M 42, 5 (2006), 1151--1162.
[7]
G. A. Miller. 1995. WordNet: A Lexical Database for English. CACM 38, 11 (1995), 39--41.
[8]
M. Mitra, A. Singhal, and C. Buckley. 1998. Improving Automatic Query Expansion. In SIGIR '98. 206--214.
[9]
J.J. Rocchio. 1971. Relevance feedback in information retrieval. In The Smart retrieval system - experiments in automatic document processing, G. Salton (Ed.). Englewood Cliffs, NJ: Prentice-Hall, 313--323.
[10]
G. Salton and M. E. Lesk. 1968. Computer Evaluation of Indexing and Text Processing. J. ACM 15, 1 (1968), 8--36.
[11]
A. Trotman, C. L. A. Clarke, I. Ounis, S. Culpepper, M.-A. Cartright, and S. Geva. 2012. Open Source Information Retrieval: A Report on the SIGIR 2012 Workshop. SIGIR Forum 46, 2 (2012), 95--101.
[12]
A. Trotman, A. Puurula, and B. Burgess. 2014. Improvements to BM25 and Language Models Examined. In ADCS '14. 58:58--58:65.
[13]
E. M. Voorhees. 1994. Query Expansion Using Lexical-semantic Relations. In SIGIR '94. 61--69.
[14]
J. Xu and W. B. Croft. 2000. Improving the Effectiveness of Information Retrieval with Local Context Analysis. TOIS 18, 1 (2000), 79--112.
[15]
L. Zhao and J. Callan. 2010. Term Necessity Prediction. In CIKM 2010. 259--268.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ADCS '17: Proceedings of the 22nd Australasian Document Computing Symposium
December 2017
76 pages
ISBN:9781450363914
DOI:10.1145/3166072
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

  • Queensland University of Technology
  • CSIRO: Commonwealth Scientific and Industrial Research Organisation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 December 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Ad-hoc Retrieval
  2. Query Expansion
  3. Roget
  4. Thesaurus
  5. WordNet
  6. tf-merging

Qualifiers

  • Short-paper
  • Research
  • Refereed limited

Conference

ADCS 2017
ADCS 2017: The 22nd Australasian Document Computing Symposium
December 7 - 8, 2017
QLD, Brisbane, Australia

Acceptance Rates

Overall Acceptance Rate 30 of 57 submissions, 53%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)4
  • Downloads (Last 6 weeks)0
Reflects downloads up to 16 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Evaluation of semantic relations impact in query expansion-based retrieval systemsKnowledge-Based Systems10.1016/j.knosys.2023.111183283:COnline publication date: 11-Jan-2024
  • (2022)Managing and Retrieving Bilingual Documents Using Artificial Intelligence-Based Ontological FrameworkComputational Intelligence and Neuroscience10.1155/2022/46369312022Online publication date: 1-Jan-2022
  • (2019)A Taxonomy and Survey of Semantic Approaches for Query ExpansionIEEE Access10.1109/ACCESS.2019.28946797(17823-17833)Online publication date: 2019
  • (2018)Refining Query Expansion Terms using Query ContextProceedings of the 23rd Australasian Document Computing Symposium10.1145/3291992.3292000(1-4)Online publication date: 11-Dec-2018
  • (2018)Designing a Novel Framework for Precision Medicine Information RetrievalSmart Health10.1007/978-3-030-03649-2_16(167-178)Online publication date: 26-Oct-2018

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media