Playing by the rules: mining query associations to predict search performance

Y Kim, A Hassan, RW White, YM Wang - … on Web search and data mining, 2013 - dl.acm.org
Y Kim, A Hassan, RW White, YM Wang
Proceedings of the sixth ACM international conference on Web search and data …, 2013 - dl.acm.org
Understanding the characteristics of queries where a search engine is failing is important for improving engine performance. Previous work largely relies on user-interaction features (e.g., clickthrough statistics) to identify such underperforming queries. However, relying on interaction behavior means that searchers need to become dissatisfied and need to exhibit that in their search behavior, by which point it may be too late to help them. In this paper, we propose a method to generate underperforming query identification rules instantly using topical and lexical attributes. The method first generates query attributes using sources such as topics, concepts (entities), and keywords in queries. Then, association rules are learned by exploiting the FP-growth algorithm and decision trees using underperforming query examples. We develop a query classification model capable of accurately estimating dissatisfaction using the generated rules, and demonstrate significant performance gains over state-of-the-art query performance prediction models.
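To make the rule-learning idea concrete, here is a minimal sketch of mining "attribute set implies underperforming" rules from labeled queries. It is not the authors' implementation: it enumerates small attribute sets by brute force instead of using FP-growth, it omits the decision-tree step, and the toy data, attribute names, thresholds, and the functions `mine_rules` and `predict` are all hypothetical.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy data: (query attributes, is_underperforming).
QUERIES = [
    ({"topic:health", "kw:symptoms"}, True),
    ({"topic:health", "kw:symptoms", "entity:flu"}, True),
    ({"topic:health", "kw:clinic"}, False),
    ({"topic:sports", "kw:scores"}, False),
    ({"topic:sports", "kw:symptoms"}, True),
    ({"topic:sports", "kw:scores", "entity:nba"}, False),
]


def mine_rules(queries, min_support=2, min_confidence=0.8, max_size=2):
    """Return (itemset, support, confidence) for itemset -> underperforming.

    Brute-force enumeration stands in for FP-growth here; it is only
    practical for small attribute sets.
    """
    total = Counter()  # queries containing the itemset
    bad = Counter()    # underperforming queries containing the itemset
    for attrs, underperforming in queries:
        for size in range(1, max_size + 1):
            for itemset in combinations(sorted(attrs), size):
                total[itemset] += 1
                if underperforming:
                    bad[itemset] += 1
    rules = []
    for itemset, count in total.items():
        confidence = bad[itemset] / count
        if bad[itemset] >= min_support and confidence >= min_confidence:
            rules.append((itemset, bad[itemset], confidence))
    # Highest confidence first, then highest support, then itemset order.
    return sorted(rules, key=lambda r: (-r[2], -r[1], r[0]))


def predict(rules, attrs):
    """Flag a query as underperforming if any rule's itemset matches it."""
    return any(set(itemset) <= attrs for itemset, _, _ in rules)


RULES = mine_rules(QUERIES)
```

Because the rules depend only on attributes derived from the query text, `predict` can run as soon as a query is issued, without waiting for click behavior.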