Relevance Feedback & Query Expansion
Outline:
What is relevance feedback?
What is query expansion?
The difference between relevance feedback and query expansion
Advantages and disadvantages
Methods
Global Methods
Users often run into the limits of a straightforward retrieval system, because the same concept can be expressed with different words. For example, you would want a search for "aircraft" to also match "plane". The methods for tackling this problem in IR split into two major classes: global methods and local methods.
Global methods are techniques for expanding or reformulating query terms independent of the query and the results returned from it, so that changes in the query wording cause the new query to match other semantically similar terms. Methods/approaches:
Query expansion/reformulation with a thesaurus
Query expansion via automatic thesaurus generation
Techniques like spelling correction
Query expansion
Users give additional input on query words or phrases, either suggesting terms themselves or saying whether they regard the system's suggestions as good or bad search terms.
12/8/2013
The central question in the use of query expansion is how to generate alternative or expanded queries for the user. The most common form of query expansion is using some form of thesaurus. For each term, t, in a query, the query can be automatically expanded with synonyms and related words of t from the thesaurus.
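As a minimal sketch of the thesaurus-based expansion described above, the following Python snippet expands each query term with its thesaurus entries. The `THESAURUS` dictionary and the `expand_query` function are hypothetical names invented for illustration; a real system would use a curated or automatically generated thesaurus and would probably weight added terms lower than the original ones.

```python
# Hypothetical toy thesaurus (an assumption for this sketch):
# each term maps to its synonyms and related words.
THESAURUS = {
    "aircraft": ["plane", "airplane"],
    "plane": ["aircraft", "airplane"],
    "cheap": ["inexpensive", "budget"],
}


def expand_query(query, thesaurus):
    """Return the query terms, each followed by its thesaurus entries."""
    expanded = []
    seen = set()
    for term in query.lower().split():
        # Keep the original term first, then its synonyms and related words.
        for candidate in [term] + thesaurus.get(term, []):
            if candidate not in seen:  # avoid duplicate terms
                seen.add(candidate)
                expanded.append(candidate)
    return expanded


print(expand_query("cheap aircraft", THESAURUS))
# ['cheap', 'inexpensive', 'budget', 'aircraft', 'plane', 'airplane']
```

Terms that are missing from the thesaurus pass through unchanged, so the expanded query always contains at least the original terms.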
Local Methods
Local methods are techniques that adjust a query relative to the documents that initially appear to match the query. Methods/approaches:
Relevance feedback
Pseudo-relevance feedback
Indirect relevance feedback
Relevance feedback
The idea of relevance feedback is to involve the user in the retrieval process so as to improve the final result set. In particular, the user gives feedback on the relevance of documents in an initial set of results. Procedure:
1. The user issues a short, simple query.
2. The system returns an initial set of retrieval results.
3. The user marks some returned documents as relevant or not relevant.
4. The system computes a better representation of the information need based on the user feedback.
5. The system displays a revised set of retrieval results.
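The slides do not say how the system computes the better representation of the information need. One well-known option is the Rocchio update, sketched below: the query vector is moved toward the centroid of the documents marked relevant and away from the centroid of those marked not relevant. The function names, the dictionary-based vectors, and the default weights `alpha`, `beta`, and `gamma` are assumptions chosen for illustration, not values taken from the slides.

```python
def centroid(docs):
    """Average term-weight vector of a list of documents (dicts: term -> weight)."""
    if not docs:
        return {}
    total = {}
    for doc in docs:
        for term, weight in doc.items():
            total[term] = total.get(term, 0.0) + weight
    return {term: weight / len(docs) for term, weight in total.items()}


def rocchio(query, relevant, nonrelevant, alpha=1.0, beta=0.75, gamma=0.15):
    """Rocchio-style update; the weights are illustrative assumptions."""
    rel_c = centroid(relevant)
    non_c = centroid(nonrelevant)
    updated = {}
    for term in set(query) | set(rel_c) | set(non_c):
        weight = (alpha * query.get(term, 0.0)
                  + beta * rel_c.get(term, 0.0)
                  - gamma * non_c.get(term, 0.0))
        if weight > 0:  # terms with non-positive weight are dropped
            updated[term] = weight
    return updated


# Step 3 of the procedure: the user's judgments on the initial results.
query = {"plane": 1.0}
marked_relevant = [{"plane": 1.0, "aircraft": 1.0}, {"aircraft": 1.0, "jet": 1.0}]
marked_nonrelevant = [{"plane": 1.0, "geometry": 1.0}]

# Step 4: the revised query now also carries weight on "aircraft" and "jet".
print(rocchio(query, marked_relevant, marked_nonrelevant))
```

The revised query would then be run against the collection again to produce the revised result set of step 5.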
Pseudo-relevance feedback
Also known as blind relevance feedback. This method provides automatic local analysis: it automates the manual part of relevance feedback so that the user gets improved retrieval performance without an extended interaction. How?
1. Do normal retrieval to find an initial set of most relevant documents.
2. ASSUME that the top k ranked documents are relevant.
3. Do relevance feedback as before under this assumption.
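The three steps can be sketched on a toy collection as follows. Everything here is an assumption made for illustration: the `DOCS` collection, the raw term-count scoring, the choice of `k=1`, and the weight `beta` applied to terms from the assumed-relevant documents. It is a sketch of the idea, not a production ranking function.

```python
from collections import Counter

# Hypothetical toy collection (an assumption for this sketch).
DOCS = {
    "d1": "plane aircraft safety report plane",
    "d2": "aircraft maintenance safety checklist",
    "d3": "plane geometry lecture",
    "d4": "dinner recipes",
}


def vectorize(text):
    """Bag-of-words term counts."""
    return Counter(text.lower().split())


def retrieve(query_vec, doc_vecs):
    """Return ids of documents with a positive dot-product score, best first."""
    scores = {
        doc_id: sum(w * vec.get(t, 0) for t, w in query_vec.items())
        for doc_id, vec in doc_vecs.items()
    }
    hits = [d for d in scores if scores[d] > 0]
    return sorted(hits, key=lambda d: (-scores[d], d))


def pseudo_relevance_feedback(query, docs, k=1, beta=0.75):
    doc_vecs = {doc_id: vectorize(text) for doc_id, text in docs.items()}
    query_vec = dict(vectorize(query))

    # Step 1: normal retrieval.
    initial = retrieve(query_vec, doc_vecs)

    # Step 2: ASSUME the top k ranked documents are relevant.
    top_k = initial[:k]

    # Step 3: relevance feedback under that assumption
    # (add the averaged, down-weighted terms of the top k documents).
    expanded = dict(query_vec)
    for doc_id in top_k:
        for term, count in doc_vecs[doc_id].items():
            expanded[term] = expanded.get(term, 0.0) + beta * count / len(top_k)

    return initial, retrieve(expanded, doc_vecs)


initial, revised = pseudo_relevance_feedback("plane", DOCS)
print(initial)  # ['d1', 'd3']
print(revised)  # ['d1', 'd3', 'd2']
```

In this toy run, d2 never mentions "plane" but is retrieved after feedback because it shares "aircraft" and "safety" with the top-ranked document. If the top-ranked document had been off topic, the same mechanism would have pulled the query in the wrong direction, which is the main risk of assuming relevance blindly.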
Summary
Query expansion is often effective in increasing recall. However, there is a high cost to manually producing a thesaurus and then updating it to keep up with scientific changes. In general, a domain-specific thesaurus is required: general thesauri and dictionaries give far too little coverage of the rich domain-particular vocabularies of most scientific fields.
Summary (continued)
Query expansion may also significantly decrease precision, particularly when the query contains ambiguous terms. Overall, query expansion is less successful than relevance feedback, though it may be as good as pseudo-relevance feedback. Successful use of relevance feedback requires queries for which the set of relevant documents is medium to large. Full relevance feedback is often painful for the user, and its implementation is not very efficient in most IR systems.