Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3437963.3441833acmconferencesArticle/Chapter ViewAbstractPublication PageswsdmConference Proceedingsconference-collections
research-article

Event-Driven Query Expansion

Published: 08 March 2021 Publication History
  • Get Citation Alerts
  • Abstract

    A significant number of event-related queries are issued in Web search. In this paper, we seek to improve retrieval performance by leveraging events and specifically target the classic task of query expansion. We propose a method to expand an event-related query by first detecting the events related to it. Then, we derive the candidates for expansion as terms semantically related to both the query and the events. To identify the candidates, we utilize a novel mechanism to simultaneously embed words and events in the same vector space. We show that our proposed method of leveraging events improves query expansion performance significantly compared with state-of-the-art methods on various newswire TREC datasets.

    Supplementary Material

    MP4 File (Event-Driven_Query_Expansion_WSDM_2021.mp4)
    Presentation video

    References

    [1]
    Gianni Amati and Cornelis Joost Van Rijsbergen. 2002. Probabilistic models of information retrieval based on measuring the divergence from randomness. ACM Transactions on Information Systems (TOIS), Vol. 20, 4 (2002), 357--389.
    [2]
    Hiteshwar Kumar Azad and Akshay Deepak. 2019 a. A new approach for query expansion using Wikipedia and WordNet. Information Sciences, Vol. 492 (2019), 147--163.
    [3]
    Hiteshwar Kumar Azad and Akshay Deepak. 2019 b. Query expansion techniques for information retrieval: A survey. Information Processing & Management, Vol. 56, 5 (2019), 1698--1735.
    [4]
    Judit Bar-Ilan, Zheng Zhu, and Mark Levene. 2009. Topic-Specific Analysis of Search Queries. In Proceedings of the 2009 Workshop on Web Search Click Data (Barcelona, Spain) (WSCD '09). Association for Computing Machinery, New York, NY, USA, 35--42. https://doi.org/10.1145/1507509.1507515
    [5]
    Richard H Byrd, Peihuang Lu, Jorge Nocedal, and Ciyou Zhu. 1995. A limited memory algorithm for bound constrained optimization. SIAM Journal on scientific computing, Vol. 16, 5 (1995), 1190--1208.
    [6]
    Ricardo Campos, Gaël Dias, Al'ipio M Jorge, and Adam Jatowt. 2014. Survey of temporal information retrieval and related applications. ACM Computing Surveys (CSUR), Vol. 47, 2 (2014), 1--41.
    [7]
    Claudio Carpineto and Giovanni Romano. 2012. A Survey of Automatic Query Expansion in Information Retrieval. ACM Comput. Surv., Vol. 44, 1 (Jan. 2012).
    [8]
    Alexis Conneau, Guillaume Lample, Marc'Aurelio Ranzato, Ludovic Denoyer, and Hervé Jégou. 2017. Word Translation Without Parallel Data. arXiv preprint arXiv:1710.04087 (2017).
    [9]
    Supratim Das, Arunav Mishra, Klaus Berberich, and Vinay Setty. 2017. Estimating event focus time using neural word embeddings. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management. 2039--2042.
    [10]
    Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018).
    [11]
    Valerio Di Carlo, Federico Bianchi, and Matteo Palmonari. 2019. Training temporal word embeddings with a compass. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 6326--6334.
    [12]
    Fernando Diaz, Bhaskar Mitra, and Nick Craswell. 2016. Query expansion with locally-trained word embeddings. arXiv preprint arXiv:1605.07891 (2016).
    [13]
    Xiao Ding, Yue Zhang, Ting Liu, and Junwen Duan. 2016. Knowledge-driven event embedding for stock prediction. In Proceedings of coling 2016, the 26th international conference on computational linguistics: Technical papers. 2133--2142.
    [14]
    Evgeniy Gabrilovich, Shaul Markovitch, et al. 2007. Computing semantic relatedness using wikipedia-based explicit semantic analysis. In IJcAI, Vol. 7. 1606--1611.
    [15]
    Seyyedeh Newsha Ghoreishi and Aixin Sun. 2013. Predicting event-relatedness of popular queries. In Proceedings of the 22nd ACM international conference on Information & Knowledge Management. 1193--1196.
    [16]
    William L Hamilton, Jure Leskovec, and Dan Jurafsky. 2016. Diachronic Word Embeddings Reveal Statistical Laws of Semantic Change. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Vol. 1. 1489--1501.
    [17]
    Ayyoob Imani, Amir Vakili, Ali Montazer, and Azadeh Shakery. 2019. Deep neural networks for query expansion using word embeddings. In European Conference on Information Retrieval. Springer, 203--210.
    [18]
    Nattiya Kanhabua and Avishek Anand. 2016. Temporal information retrieval. In Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval. 1235--1238.
    [19]
    Nattiya Kanhabua, Tu Ngoc Nguyen, and Wolfgang Nejdl. 2015. Learning to detect event-related queries for web search. In Proceedings of the 24th International Conference on World Wide Web. 1339--1344.
    [20]
    Nattiya Kanhabua and Kjetil Nørvåg. 2010. Determining time of queries for re-ranking search results. In International Conference on Theory and Practice of Digital Libraries. Springer, 261--272.
    [21]
    Saar Kuzi, Anna Shtok, and Oren Kurland. 2016. Query expansion using word embeddings. In Proceedings of the 25th ACM international on conference on information and knowledge management. 1929--1932.
    [22]
    Ye Ma, Lu Zong, Yikang Yang, and Jionglong Su. 2019. News2vec: News Network Embedding with Subnode Information. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). 4845--4854.
    [23]
    Craig Macdonald, Richard McCreadie, Rodrygo LT Santos, and Iadh Ounis. 2012. From puppy to maturity: Experiences in developing Terrier. Proc. of OSIR at SIGIR (2012), 60--63.
    [24]
    Christopher D Manning, Prabhakar Raghavan, and Hinrich Schütze. 2008. Introduction to information retrieval .Cambridge university press.
    [25]
    Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean. 2013a. Distributed representations of words and phrases and their compositionality. In Advances in neural information processing systems. 3111--3119.
    [26]
    Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean. 2013b. Distributed representations of words and phrases and their compositionality. In Advances in neural information processing systems. 3111--3119.
    [27]
    Jamal Abdul Nasir, Iraklis Varlamis, and Samreen Ishfaq. 2019. A knowledge-based semantic framework for query expansion. Information Processing & Management, Vol. 56, 5 (2019), 1605--1617.
    [28]
    Sérgio Nunes, Cristina Ribeiro, and Gabriel David. 2008. Use of temporal expressions in web search. In European Conference on Information Retrieval. Springer, 580--584.
    [29]
    Ramith Padaki, Zhuyun Dai, and Jamie Callan. 2020. Rethinking Query Expansion for BERT Reranking. In Advances in Information Retrieval: 42nd European Conference on IR Research, ECIR 2020, Lisbon, Portugal, April 14--17, 2020, Proceedings, Part II 42. Springer, 297--304.
    [30]
    Greg Pass, Abdur Chowdhury, and Cayley Torgeson. 2006. A picture of search. In Proceedings of the 1st international conference on Scalable information systems. 1--es.
    [31]
    Kira Radinsky, Fernando Diaz, Susan Dumais, Milad Shokouhi, Anlei Dong, and Yi Chang. 2013. Temporal web dynamics and its application to information retrieval. In Proceedings of the sixth ACM international conference on Web search and data mining. 781--782.
    [32]
    Radim Rehurek and Petr Sojka. 2010. Software framework for topic modelling with large corpora. In In Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks. Citeseer.
    [33]
    Stephen E Robertson, Steve Walker, Susan Jones, Micheline M Hancock-Beaulieu, Mike Gatford, et al. 1995. Okapi at TREC-3. Nist Special Publication Sp, Vol. 109 (1995), 109.
    [34]
    Guy D Rosin, Eytan Adar, and Kira Radinsky. 2017. Learning Word Relatedness over Time. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing. 1168--1178.
    [35]
    Guy D Rosin and Kira Radinsky. 2019. Generating Timelines by Modeling Semantic Change. In Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL). 186--195.
    [36]
    Dwaipayan Roy, Debjyoti Paul, Mandar Mitra, and Utpal Garain. 2016. Using word embeddings for automatic query expansion. arXiv preprint arXiv:1606.07608 (2016).
    [37]
    Vinay Setty and Katja Hose. 2018. Event2Vec: Neural embeddings for news events. In The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval. 1013--1016.
    [38]
    Milad Shokouhi and Kira Radinsky. 2012. Time-sensitive query auto-completion. In Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval. 601--610.
    [39]
    Ikuya Yamada, Akari Asai, Jin Sakuma, Hiroyuki Shindo, Hideaki Takeda, Yoshiyasu Takefuji, and Yuji Matsumoto. 2020. Wikipedia2Vec: An Efficient Toolkit for Learning and Visualizing the Embeddings of Words and Entities from Wikipedia. arXiv preprint 1812.06280v3 (2020).
    [40]
    Ikuya Yamada, Hiroyuki Shindo, Hideaki Takeda, and Yoshiyasu Takefuji. 2016. Joint Learning of the Embedding of Words and Entities for Named Entity Disambiguation. In Proceedings of The 20th SIGNLL Conference on Computational Natural Language Learning. Association for Computational Linguistics, 250--259. https://doi.org/10.18653/v1/K16--1025
    [41]
    Zijun Yao, Yifan Sun, Weicong Ding, Nikhil Rao, and Hui Xiong. 2018. Dynamic word embeddings for evolving semantic discovery. In Proceedings of the eleventh acm international conference on web search and data mining. 673--681.
    [42]
    Hamed Zamani and W Bruce Croft. 2016a. Embedding-based query language models. In Proceedings of the 2016 ACM international conference on the theory of information retrieval. 147--156.
    [43]
    Hamed Zamani and W Bruce Croft. 2016b. Estimating embedding vectors for queries. In Proceedings of the 2016 ACM International Conference on the Theory of Information Retrieval. 123--132.
    [44]
    Hamed Zamani and W Bruce Croft. 2017. Relevance-based word embedding. In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval. 505--514.
    [45]
    Chengxiang Zhai and John Lafferty. 2017. A study of smoothing methods for language models applied to ad hoc information retrieval. In ACM SIGIR Forum, Vol. 51. ACM New York, NY, USA, 268--276.
    [46]
    Xiaojuan Zhang, Shuguang Han, and Wei Lu. 2018. Automatic prediction of news intent for search queries. The Electronic Library (2018).

    Cited By

    View all
    • (2024)Event-Specific Document Ranking Through Multi-stage Query Expansion Using an Event Knowledge GraphAdvances in Information Retrieval10.1007/978-3-031-56060-6_22(333-348)Online publication date: 16-Mar-2024
    • (2023)Enhancing information retrieval with semantic query expansion: a Word2Vec-based approachFourth International Conference on Signal Processing and Computer Science (SPCS 2023)10.1117/12.3012203(57)Online publication date: 21-Dec-2023
    • (2023)Who can verify this? Finding authorities for rumor verification in TwitterInformation Processing and Management: an International Journal10.1016/j.ipm.2023.10336660:4Online publication date: 26-Jul-2023
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    WSDM '21: Proceedings of the 14th ACM International Conference on Web Search and Data Mining
    March 2021
    1192 pages
    ISBN:9781450382977
    DOI:10.1145/3437963
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 08 March 2021

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. query expansion
    2. temporal semantics
    3. word embeddings

    Qualifiers

    • Research-article

    Conference

    WSDM '21

    Acceptance Rates

    Overall Acceptance Rate 498 of 2,863 submissions, 17%

    Upcoming Conference

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)39
    • Downloads (Last 6 weeks)1

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Event-Specific Document Ranking Through Multi-stage Query Expansion Using an Event Knowledge GraphAdvances in Information Retrieval10.1007/978-3-031-56060-6_22(333-348)Online publication date: 16-Mar-2024
    • (2023)Enhancing information retrieval with semantic query expansion: a Word2Vec-based approachFourth International Conference on Signal Processing and Computer Science (SPCS 2023)10.1117/12.3012203(57)Online publication date: 21-Dec-2023
    • (2023)Who can verify this? Finding authorities for rumor verification in TwitterInformation Processing and Management: an International Journal10.1016/j.ipm.2023.10336660:4Online publication date: 26-Jul-2023
    • (2023)Analysis of Recent Query Expansion Techniques for Information Retrieval SystemsProceedings of the International Conference on Intelligent Computing, Communication and Information Security10.1007/978-981-99-1373-2_29(375-383)Online publication date: 4-Jul-2023
    • (2022)Unsupervised Key Event Detection from Massive Text CorporaProceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3534678.3539395(2535-2544)Online publication date: 14-Aug-2022
    • (2022)Leveraging World Events to Predict E-Commerce Consumer Demand under AnomalyProceedings of the Fifteenth ACM International Conference on Web Search and Data Mining10.1145/3488560.3498452(430-438)Online publication date: 11-Feb-2022

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media