Accurate and effective latent concept modeling for ad hoc information retrieval

R Deveaud, E SanJuan, P Bellot - Document numérique, 2014 - cairn.info
Document numérique, 2014cairn.info
A keyword query is the representation of the information need of a user, and is the result of a
complex cognitive process which often results in under-specification. We propose an
unsupervised method namely Latent Concept Modeling (LCM) for mining and modeling
latent search concepts in order to recreate the conceptual view of the original information
need. We use Latent Dirichlet Allocation (LDA) to exhibit highly-specific query-related topics
from pseudo-relevant feedback documents. We define these topics as the latent concepts of …
A keyword query is the representation of the information need of a user, and is the result of a complex cognitive process which often results in under-specification. We propose an unsupervised method namely Latent Concept Modeling (LCM) for mining and modeling latent search concepts in order to recreate the conceptual view of the original information need. We use Latent Dirichlet Allocation (LDA) to exhibit highly-specific query-related topics from pseudo-relevant feedback documents. We define these topics as the latent concepts of the user query. We perform a thorough evaluation of our approach over two large ad-hoc TREC collections. Our findings reveal that the proposed method accurately models latent concepts, while being very effective in a query expansion retrieval setting.
Cairn.info