Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Learning from homologous queries and semantically related terms for query auto completion

Published: 01 July 2016 Publication History

Abstract

We propose a learning to rank based query auto completion model (L2R-QAC) that exploits contributions from so-called homologous queries for a QAC candidate, in which two kinds of homologous queries are taken into account.We propose semantic features for QAC, using the semantic relatedness of terms inside a query candidate and of pairs of terms from a candidate and from queries previously submitted in the same session.We analyze the effectiveness of our L2R-QAC model with newly added features, and find that it significantly outperforms state-of-the-art QAC models, either based on learning to rank or on popularity. Query auto completion (QAC) models recommend possible queries to web search users when they start typing a query prefix. Most of today's QAC models rank candidate queries by popularity (i.e., frequency), and in doing so they tend to follow a strict query matching policy when counting the queries. That is, they ignore the contributions from so-called homologous queries, queries with the same terms but ordered differently or queries that expand the original query. Importantly, homologous queries often express a remarkably similar search intent. Moreover, today's QAC approaches often ignore semantically related terms. We argue that users are prone to combine semantically related terms when generating queries.We propose a learning to rank-based QAC approach, where, for the first time, features derived from homologous queries and semantically related terms are introduced. In particular, we consider: (i) the observed and predicted popularity of homologous queries for a query candidate; and (ii) the semantic relatedness of pairs of terms inside a query and pairs of queries inside a session. We quantify the improvement of the proposed new features using two large-scale real-world query logs and show that the mean reciprocal rank and the success rate can be improved by up to 9% over state-of-the-art QAC models.

References

[1]
E. Agichtein, C. Castillo, D. Donato, A. Gionis, G. Mishne, Finding high-quality content in social media, 2008.
[2]
Z. Bar-Yossef, N. Kraus, Context-sensitive query auto-completion, 2011.
[3]
C.J. Burges, K.M. Svore, P.N. Bennett, A. Pastusiak, Q. Wu, Learning to rank using an ensemble of lambda-gradient models, J. Mach. Learn. Res., 14 (2011) 25-35.
[4]
F. Cai, S. Liang, M. de Rijke, Personalized document re-ranking based on bayesian probabilistic matrix factorization, 2014.
[5]
F. Cai, S. Liang, M. de Rijke, Time-sensitive personalized query auto-completion, 2014.
[6]
H. Cao, D. Jiang, J. Pei, Q. He, Z. Liao, E. Chen, H. Li, Context-aware query suggestion by mining click-through and session data, 2008.
[7]
C. Chatfield, Chapman and Hall, New York, 2004.
[8]
S. Chien, N. Immorlica, Semantic similarity between search engine queries using temporal correlation, 2005.
[9]
A. Chuklin, I. Markov, M. de Rijke, Click Models for Web Search, Morgan & Claypool Publishers, 2015.
[10]
Proc. 2009 workshop on web search click data, ACM, 2009.
[11]
H. Duan, B.-J. P. Hsu, Online spelling correction for query completion, 2011.
[12]
J. Gama, I. ¿liobait¿, A. Bifet, M. Pechenizkiy, A. Bouchachia, A survey on concept drift adaptation, ACM Comput. Surv., 46 (2014) 44:1-44:37.
[13]
N.G. Golbandi, L.K. Katzir, Y.K. Koren, R.L. Lempel, Expediting search trend detection via prediction of query counts, 2013.
[14]
J. Guo, X. Cheng, G. Xu, X. Zhu, Intent-aware query similarity, 2011.
[15]
L. Han, A. L. Kashyap, T. Finin, J. Mayfield, J. Weese, UMBC EBIQUITY-CORE: Semantic textual similarity systems, ACL, 2013.
[16]
K. Hofmann, B. Mitra, F. Radlinski, M. Shokouhi, An eye-tracking study of user interactions with query auto completion, 2014.
[17]
J.-Y. Jiang, Y.-Y. Ke, P.-Y. Chien, P.-J. Cheng, Learning user reformulation behavior for query auto-completion, 2014.
[18]
R. Jones, B. Rey, O. Madani, W. Greiner, Generating query substitutions, 2006.
[19]
A. Kulkarni, J. Teevan, K.M. Svore, S.T. Dumais, Understanding temporal query dynamics, 2011.
[20]
Y. Li, A. Dong, H. Wang, H. Deng, Y. Chang, C. Zhai, A two-dimensional click model for query auto-completion, 2014.
[21]
Z. Liao, D. Jiang, E. Chen, J. Pei, H. Cao, H. Li, Mining concept sequences from large-scale search logs for context-aware query suggestion, ACM Trans. Intell. Syst. Technol., 3 (2011) Article17.
[22]
T.-Y. Liu, Learning to rank for information retrieval, Foundations and Trends in Information Retrieval, 3 (2003) 225-331.
[23]
Y. Liu, R. Song, Y. Chen, J.-Y. Nie, J.-R. Wen, Adaptive query suggestion for difficult queries, 2012.
[24]
H. Ma, H. Yang, I. King, M.R. Lyu, Learning latent semantic relations from clickthrough data for query suggestion, 2008.
[25]
Q. Mei, D. Zhou, K. Church, Query suggestion using hitting time, 2008.
[26]
T. Mikolov, K. Chen, G.S. Corrado, J. Dean, Efficient estimation of word representations in vector space, 2013.
[27]
T. Mikolov, I. Sutskever, K. Chen, G.S. Corrado, J. Dean, Distributed representations of words and phrases and their compositionality, 2013.
[28]
B. Mitra, M. Shokouhi, F. Radlinski, K. Hofmann, On user interactions with query auto-completion, 2014.
[29]
U. Ozertem, O. Chapelle, P. Donmez, E. Velipasaoglu, Learning to suggest: A machine learning framework for ranking query suggestions, 2012.
[30]
G. Pass, A. Chowdhury, C. Torgeson, A picture of search, 2006.
[31]
R.L.T. Santos, C. Macdonald, I. Ounis, Learning to rank query suggestions for adhoc and diversity search, Inf. Retr., 16 (2013) 429-451.
[32]
C. Sengstock, M. Gertz, Conquer: A system for efficient context-aware query suggestions, 2011.
[33]
M. Shokouhi, Learning to personalize query auto-completion, 2013.
[34]
M. Shokouhi, K. Radinsky, Time-sensitive query auto-completion, 2012.
[35]
S. Whiting, J.M. Jose, Recent and robust query auto-completion, 2014.

Cited By

View all
  • (2024)AI-safe Autocompletion with RAG and Relevance CurationProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3679078(5562-5563)Online publication date: 21-Oct-2024
  • (2023)Multi-Objective Ranking to Boost Navigational Suggestions in eCommerce AutoCompleteCompanion Proceedings of the ACM Web Conference 202310.1145/3543873.3584649(469-474)Online publication date: 30-Apr-2023
  • (2022)TIPSJournal of Information Science10.1177/016555152096869048:4(524-543)Online publication date: 1-Aug-2022
  • Show More Cited By
  1. Learning from homologous queries and semantically related terms for query auto completion

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Information Processing and Management: an International Journal
    Information Processing and Management: an International Journal  Volume 52, Issue 4
    July 2016
    230 pages

    Publisher

    Pergamon Press, Inc.

    United States

    Publication History

    Published: 01 July 2016

    Author Tags

    1. Learning to rank
    2. Query auto completion
    3. Query suggestion
    4. Semantics

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 24 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)AI-safe Autocompletion with RAG and Relevance CurationProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3679078(5562-5563)Online publication date: 21-Oct-2024
    • (2023)Multi-Objective Ranking to Boost Navigational Suggestions in eCommerce AutoCompleteCompanion Proceedings of the ACM Web Conference 202310.1145/3543873.3584649(469-474)Online publication date: 30-Apr-2023
    • (2022)TIPSJournal of Information Science10.1177/016555152096869048:4(524-543)Online publication date: 1-Aug-2022
    • (2022)Applying burst-tries for error-tolerant prefix searchInformation Retrieval10.1007/s10791-022-09416-925:4(481-518)Online publication date: 1-Dec-2022
    • (2021)Neural Instant Search for Music and PodcastProceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining10.1145/3447548.3467188(2984-2992)Online publication date: 14-Aug-2021
    • (2020)ReQueProceedings of the 29th ACM International Conference on Information & Knowledge Management10.1145/3340531.3412775(3165-3172)Online publication date: 19-Oct-2020
    • (2019)Joint Neural Collaborative Filtering for Recommender SystemsACM Transactions on Information Systems10.1145/334311737:4(1-30)Online publication date: 14-Aug-2019
    • (2019)Personalized query suggestion diversification in information retrievalFrontiers of Computer Science: Selected Publications from Chinese Universities10.1007/s11704-018-7283-x14:3Online publication date: 19-Dec-2019
    • (2018)Query-oriented text summarization based on multiobjective evolutionary algorithms and word embeddingsJournal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology10.3233/JIFS-16950634:5(3235-3244)Online publication date: 1-Jan-2018
    • (2018)A Prospect-Guided global query expansion strategy using word embeddingsInformation Processing and Management: an International Journal10.1016/j.ipm.2017.09.00154:1(1-13)Online publication date: 1-Jan-2018
    • Show More Cited By

    View Options

    View options

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media