Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2348283.2348355acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Generating reformulation trees for complex queries

Published: 12 August 2012 Publication History

Abstract

Search queries have evolved beyond keyword queries. Many complex queries such as verbose queries, natural language question queries and document-based queries are widely used in a variety of applications. Processing these complex queries usually requires a series of query operations, which results in multiple sequences of reformulated queries. However, previous query representations, either the "bag of words" method or the recently proposed "query distribution" method, cannot effectively model these query sequences, since they ignore the relationships between two queries. In this paper, a reformulation tree framework is proposed to organize multiple sequences of reformulated queries as a tree structure, where each path of the tree corresponds to a sequence of reformulated queries. Specifically, a two-level reformulation tree is implemented for verbose queries. This tree effectively combines two query operations, i.e., subset selection and query substitution, within the same framework. Furthermore, a weight estimation approach is proposed to assign weights to each node of the reformulation tree by considering the relationships with other nodes and directly optimizing retrieval performance. Experiments on TREC collections show that this reformulation tree based representation significantly outperforms the state-of-the-art techniques.

References

[1]
N. Balasubramanian, G. Kumaran, and V. Carvalho. Exploring reductions for long web queries. In SIGIR10, pages 571--578, 2010.
[2]
M. Bendersky and W. B. Croft. Discovering key concepts in verbose queries. In SIGIR08, pages 491--498, Singapore, 2008.
[3]
M. Bendersky, D. Metzler, and W. B. Croft. Learning concept importance using a weighted dependence model. In WSDM10, New York City, NY, 2010.
[4]
P. Boldi, F. Bonchi, C. Castillo, D. Donato, A. Gionis, and S. Vigna. The query-flow graph: model and applications. In CIKM08, pages 609--618, 2008.
[5]
Z. Cao, T. Qin, T.-Y. Liu, M.-F. Tsai, and H. Li. Learning to rank: from pairwise approach to listwise approach. In ICML07, pages 129--136, 2007.
[6]
J. Guo, G. Xu, H. Li, and X. Cheng. A unified and discriminative model for query refinement. In SIGIR08, pages 379--386, Singapore, 2008.
[7]
B. He and I. Ounis. Inferring query performance using pre-retrieval predictors. In SPIRE04, pages 43--54, 2004.
[8]
J. Jeon, W. B. Croft, and J. H. Lee. Finding similar questions in large question and answer archives. In CIKM05, pages 84--90, 2005.
[9]
V. Jijkoun and M. de Rijke. Retrieving answers from frequently asked questions pages on the web. In CIKM05, pages 76--83, 2005.
[10]
R. Jones, B. Rey, O. Madani, and W. Greiner. Generating query substitutions. In WWW06, pages 387--396, Ediburgh, Scotland, 2006.
[11]
G. Kumaran and J. Allan. A case for shorter queries, and helping users creat them. In ACL07, pages 220--227, Rochester, New York, 2007.
[12]
G. Kumaran and V. R. Carvalho. Reducing long queries using query quality predictors. In SIGIR09, pages 564--571, Boston, MA, 2009.
[13]
L. Larkey. A patent search and classification system. In DL99, pages 179--187, Berkeley, CA, 1999.
[14]
V. Lavrenko and W. B. Croft. Relevance based language models. In SIGIR01, pages 120--127, New Orleans, LA, 2001.
[15]
M. Lease. An improved Markov random field model for supporting verbose queries. In SIGIR09, pages 476--483, Boston, MA, 2009.
[16]
M. Lease, J. Allan, and W. B. Croft. Regression rank: learning to meet the opportunity of descriptive queries. In SIGIR05, pages 472--479, Salvador, Brazil, 2005.
[17]
Q. Mei, K. Klinkner, R. Kumar, and A. Tomkins. An analysis framework for search sequences. In CIKM09, pages 1991--1994, 2009.
[18]
D. Metzler and W. B. Croft. Combining the language model and inference network approaches to retrieval. Information Processing and Management, 40(5):735--750, 2004.
[19]
D. Metzler and W. B. Croft. A Markov random field model for term dependencies. In SIGIR05, pages 472--479, Salvador, Brazil, 2005.
[20]
D. Metzler and W. B. Croft. Latent concept expansion using markov random fields. In SIGIR07, pages 311--318, Amsterdam, the Netherlands, 2007.
[21]
F. Peng, N. Ahmed, X. Li, and Y. Lu. Context sensitive stemming for web search. In SIGIR07, pages 639--646, Amsterdam, the Netherlands, 2007.
[22]
J. M. Ponte and W. B. Croft. A language modeling approach to information retrieval. In SIGIR98, pages 275--281, Melbourne, Australia, 1998.
[23]
M. F. Porter. An algorithm for suffix stripping. Program, 14(3):130--137, 1980.
[24]
S. Riezler, A. Vasserman, I. Tsochantaridis, V. Mittal, and Y. Liu. Statistical machine translation for query expansion in answer retrieval. In ACL07, pages 464--471, Prague, Czech Republic, June 2007. Association for Computational Linguistics.
[25]
Y. Z. S. Cronen-Townsend and W. B. Croft. Predicting query performance. In SIGIR02, pages 299--306, 2002.
[26]
X. Wang and C. Zhai. Mining term association patterns from search logs for effective query reformulation. In CIKM08, pages 479--488, Napa Valley, CA, 2008.
[27]
X. Xue and W. B. Croft. Representing queries as distributions. In SIGIR10 Workshop on Query Representation and Understanding, pages 9--12, Geneva, Switzerland, 2010.
[28]
X. Xue and W. B. Croft. Modeling subset distributions for verbose queries. In SIGIR11, pages 1133--1134, 2011.
[29]
X. Xue, W. B. Croft, and D. A. Smith. Modeling reformulation using passage analysis. In CIKM10, pages 1497--1500, 2010.
[30]
X. Xue, S. Huston, and W. B. Croft. Improving verbose queries using subset distribution. In CIKM10, pages 1059--1068, 2010.
[31]
C. Zhai and J. Lafferty. A study of smoothing methods for language models applied to ad hoc information retrieval. In SIGIR01, pages 334--342, New Orleans, LA, 2001.

Cited By

View all
  • (2020)Word Embedding-Based Reformulation for Long Queries in Information SearchWeb Information Systems and Applications10.1007/978-3-030-60029-7_19(202-214)Online publication date: 23-Sep-2020
  • (2017)New technique to deal with verbose queries in social book searchProceedings of the International Conference on Web Intelligence10.1145/3106426.3106481(799-806)Online publication date: 23-Aug-2017
  • (2015)Information Retrieval with Verbose QueriesProceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/2766462.2767877(1121-1124)Online publication date: 9-Aug-2015
  • Show More Cited By

Index Terms

  1. Generating reformulation trees for complex queries

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SIGIR '12: Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
    August 2012
    1236 pages
    ISBN:9781450314725
    DOI:10.1145/2348283
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 12 August 2012

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. information retrieval
    2. reformulation tree
    3. verbose query

    Qualifiers

    • Research-article

    Conference

    SIGIR '12
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 792 of 3,983 submissions, 20%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 12 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2020)Word Embedding-Based Reformulation for Long Queries in Information SearchWeb Information Systems and Applications10.1007/978-3-030-60029-7_19(202-214)Online publication date: 23-Sep-2020
    • (2017)New technique to deal with verbose queries in social book searchProceedings of the International Conference on Web Intelligence10.1145/3106426.3106481(799-806)Online publication date: 23-Aug-2017
    • (2015)Information Retrieval with Verbose QueriesProceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/2766462.2767877(1121-1124)Online publication date: 9-Aug-2015
    • (2014)Finding similar queries based on query representation analysisWorld Wide Web10.1007/s11280-013-0233-517:5(1161-1188)Online publication date: 1-Sep-2014
    • (2013)Retrieving opinions from discussion forumsProceedings of the 22nd ACM international conference on Information & Knowledge Management10.1145/2505515.2507861(1225-1228)Online publication date: 27-Oct-2013

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media