Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2983323.2983750acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Word Vector Compositionality based Relevance Feedback using Kernel Density Estimation

Published: 24 October 2016 Publication History

Abstract

A limitation of standard information retrieval (IR) models is that the notion of term composionality is restricted to pre-defined phrases and term proximity. Standard text based IR models provide no easy way of representing semantic relations between terms that are not necessarily phrases, such as the equivalence relationship between `osteoporosis' and the terms `bone' and `decay'. To alleviate this limitation, we introduce a relevance feedback (RF) method which makes use of word embedded vectors. We leverage the fact that the vector addition of word embeddings leads to a semantic composition of the corresponding terms, e.g. addition of the vectors for `bone' and `decay' yields a vector that is likely to be close to the vector for the word `osteoporosis'. Our proposed RF model enables incorporation of semantic relations by exploiting term compositionality with embedded word vectors. We develop our model for RF as a generalization of the relevance model (RLM). Our experiments demonstrate that our word embedding based RF model significantly outperforms the RLM model on standard TREC test collections, namely the TREC 6,7,8 and Robust ad-hoc and the TREC 9 and 10 WT10G test collections.

References

[1]
A. Berger and J. Lafferty. Information retrieval as statistical translation. In SIGIR '99, pages 222--229, 1999.
[2]
C. L. A. Clarke, N. Craswell, and I. Soboroff. Overview of the TREC 2004 terabyte track. In TREC '04, 2004.
[3]
S. Clinchant and E. Gaussier. A theoretical analysis of pseudo-relevance feedback models. In ICTIR '13, pages 6--13, 2013.
[4]
K. Collins-Thompson, C. Macdonald, P. N. Bennett, F. Diaz, and E. M. Voorhees. TREC 2014 web track overview. In Proc. of TREC 2014, 2014.
[5]
S. C. Deerwester, S. T. Dumais, T. K. Landauer, G. W. Furnas, and R. A. Harshman. Indexing by latent semantic analysis. JASIS, 41(6):391--407, 1990.
[6]
F. Diaz. Condensed list relevance models. In ICTIR '15, pages 313--316, New York, NY, USA, 2015. ACM.
[7]
M. Efron, J. Lin, J. He, and A. de Vries. Temporal feedback for tweet search with non-parametric density estimation. In Proc. of SIGIR '14, pages 33--42, 2015.
[8]
D. Ganguly, J. Leveling, and G. J. F. Jones. Topical relevance model. In AIRS '12, pages 326--335, 2012.
[9]
D. Ganguly, D. Roy, M. Mitra, and G. J. F. Jones. Word embedding based generalized language model for information retrieval. In SIGIR'15, pages 795--798, 2015.
[10]
T. Goodwin and S. M. Harabagiu. UTD at TREC 2014: Query expansion for clinical decision support. In Proc. of TREC 2014, 2014.
[11]
M. Grbovic, N. Djuric, V. Radosavljevic, F. Silvestri, and N. Bhamidipati. Context- and content-aware embeddings for query rewriting in sponsored search. In Proc. of SIGIR 2015, pages 383--392, 2015.
[12]
D. Hiemstra. Using Language Models for Information Retrieval. PhD thesis, Center of Telematics and Information Technology, AE Enschede, 2000.
[13]
T. Hofmann. Probabilistic latent semantic indexing. In Proc. of SIGIR'99, pages 50--57, 1999.
[14]
N. A. Jaleel, J. Allan, W. B. Croft, F. Diaz, L. S. Larkey, X. Li, M. D. Smucker, and C. Wade. Umass at TREC 2004: Novelty and HARD. In Proc. of TREC '04, 2004.
[15]
V. Lavrenko and B. W. Croft. Relevance based language models. In Proc. of SIGIR '01, pages 120--127, 2001.
[16]
C. Lioma, J. G. Simonsen, B. Larsen, and N. D. Hansen. Non-compositional term dependence for information retrieval. In Proc. of SIGIR '15, pages 595--604, 2015.
[17]
Y. Lv and C. Zhai. A comparative study of methods for estimating query language models with pseudo feedback. In Proc. of CIKM '09, pages 1895--1898, 2009.
[18]
D. Metzler and W. B. Croft. Latent concept expansion using markov random fields. In Proc. of SIGIR '07, pages 311--318, 2007.
[19]
T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean. Distributed representations of words and phrases and their compositionality. In Proc. of NIPS '13, pages 3111--3119, 2013.
[20]
D. Pal, M. Mitra, and K. Datta. Improving query expansion using wordnet. JAIST, 65(12):2469--2478, 2014.
[21]
A. Sordoni, Y. Bengio, and J.-Y. Nie. Learning concept embeddings for query expansion by quantum entropy minimization. In Proc. of AAAI '14, 2014.
[22]
I. Vulic and M. Moens. Monolingual and cross-lingual information retrieval models based on (bilingual) word embeddings. In Proc. of SIGIR '15, pages 363--372, 2015.
[23]
X. Wei and W. B. Croft. LDA-based document models for ad-hoc retrieval. In SIGIR '06, pages 178--185, 2006.
[24]
X. Yi and J. Allan. A comparative study of utilizing topic models for information retrieval. In Proc. of ECIR '09, pages 29--41, 2009.
[25]
C. Zhai and J. Lafferty. A study of smoothing methods for language models applied to information retrieval. TOIS, 22(2):179--214, Apr. 2004.
[26]
G. Zheng and J. Callan. Learning to reweight terms with distributed representations. In Proc. of SIGIR'15, pages 575--584, 2015.

Cited By

View all
  • (2024)A Deep Learning Approach for Selective Relevance FeedbackAdvances in Information Retrieval10.1007/978-3-031-56060-6_13(189-204)Online publication date: 16-Mar-2024
  • (2023)Semantics-aware query expansion using pseudo-relevance feedbackJournal of Information Science10.1177/01655515231184831Online publication date: 22-Jul-2023
  • (2022)A Relative Information Gain-based Query Performance Prediction Framework with Generated Query VariantsACM Transactions on Information Systems10.1145/354511241:2(1-31)Online publication date: 21-Dec-2022
  • Show More Cited By

Index Terms

  1. Word Vector Compositionality based Relevance Feedback using Kernel Density Estimation

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CIKM '16: Proceedings of the 25th ACM International on Conference on Information and Knowledge Management
    October 2016
    2566 pages
    ISBN:9781450340731
    DOI:10.1145/2983323
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 24 October 2016

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. kernel density estimation
    2. relevance feedback
    3. word compositionality
    4. word vector embedding

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    CIKM'16
    Sponsor:
    CIKM'16: ACM Conference on Information and Knowledge Management
    October 24 - 28, 2016
    Indiana, Indianapolis, USA

    Acceptance Rates

    CIKM '16 Paper Acceptance Rate 160 of 701 submissions, 23%;
    Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

    Upcoming Conference

    CIKM '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)24
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 01 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)A Deep Learning Approach for Selective Relevance FeedbackAdvances in Information Retrieval10.1007/978-3-031-56060-6_13(189-204)Online publication date: 16-Mar-2024
    • (2023)Semantics-aware query expansion using pseudo-relevance feedbackJournal of Information Science10.1177/01655515231184831Online publication date: 22-Jul-2023
    • (2022)A Relative Information Gain-based Query Performance Prediction Framework with Generated Query VariantsACM Transactions on Information Systems10.1145/354511241:2(1-31)Online publication date: 21-Dec-2022
    • (2022)Local or Global? A Comparative Study on Applications of Embedding Models for Information RetrievalProceedings of the 5th Joint International Conference on Data Science & Management of Data (9th ACM IKDD CODS and 27th COMAD)10.1145/3493700.3493701(115-119)Online publication date: 8-Jan-2022
    • (2022)Deep-QPPProceedings of the Fifteenth ACM International Conference on Web Search and Data Mining10.1145/3488560.3498491(201-209)Online publication date: 11-Feb-2022
    • (2022)Kernel density estimation based factored relevance model for multi-contextual point-of-interest recommendationInformation Retrieval10.1007/s10791-021-09400-925:1(44-90)Online publication date: 1-Mar-2022
    • (2021)I Know What You Need: Investigating Document Retrieval Effectiveness with Partial Session ContextsACM Transactions on Information Systems10.1145/348866740:3(1-30)Online publication date: 17-Nov-2021
    • (2021)Tag embedding based personalized point of interest recommendation systemInformation Processing and Management: an International Journal10.1016/j.ipm.2021.10269058:6Online publication date: 1-Nov-2021
    • (2019)Contextualized Relevance Feedback for Precision Medicine2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)10.1109/BIBM47256.2019.8983396(1673-1680)Online publication date: Nov-2019
    • (2019)Estimating Gaussian mixture models in the local neighbourhood of embedded word vectors for query performance predictionInformation Processing and Management: an International Journal10.1016/j.ipm.2018.10.00956:3(1026-1045)Online publication date: 1-May-2019
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media