research-article

Query Resolution for Conversational Search with Limited Supervision

Authors:

Nikos Voskarides,

Evangelos Kanoulas,

Maarten de RijkeAuthors Info & Claims

SIGIR '20: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval

Pages 921 - 930

https://doi.org/10.1145/3397271.3401130

Published: 25 July 2020 Publication History

Abstract

In this work we focus on multi-turn passage retrieval as a crucial component of conversational search. One of the key challenges in multi-turn passage retrieval comes from the fact that the current turn query is often underspecified due to zero anaphora, topic change, or topic return. Context from the conversational history can be used to arrive at a better expression of the current turn query, defined as the task of query resolution. In this paper, we model the query resolution task as a binary term classification problem: for each term appearing in the previous turns of the conversation decide whether to add it to the current turn query or not. We propose QuReTeC (Query Resolution by Term Classification), a neural query resolution model based on bidirectional transformers. We propose a distant supervision method to automatically generate training data by using query-passage relevance labels. Such labels are often readily available in a collection either as human annotations or inferred from user interactions. We show that QuReTeC outperforms state-of-the-art models, and furthermore, that our distant supervision method can be used to substantially reduce the amount of human-curated data required to train QuReTeC. We incorporate QuReTeC in a multi-turn, multi-stage passage retrieval architecture and demonstrate its effectiveness on the TREC CAsT dataset.

Supplementary Material

MP4 File (3397271.3401130.mp4)

Presentation video for the SIGIR 2020 paper: "Query Resolution for Conversational Search with Limited Supervision", by Voskarides, Li, Ren, Kanoulas and de Rijke. We propose QuReTeC, a neural query resolution model for conversational search based on bidirectional transformers.

Download
15.02 MB

References

[1]

Nasreen Abdul-jaleel, James Allan, W Bruce Croft, O Diaz, Leah Larkey, Xiaoyan Li, Mark D Smucker, and Courtney Wade. 2004. UMass at TREC 2004: Novelty and HARD. In TREC. NIST.

[2]

Payal Bajaj, Daniel Campos, Nick Craswell, Li Deng, Jianfeng Gao, Xiaodong Liu, Rangan Majumder, Andrew McNamara, Bhaskar Mitra, Tri Nguyen, Mir Rosenberg, Xia Song, Alina Stoica, Saurabh Tiwary, and Tong Wang. 2018. MS MARCO: A Human Generated MAchine Reading COmprehension Dataset. arXiv preprint arXiv:1611.09268 (2018).

[3]

Nicholas J Belkin. 1980. Anomalous States of Knowledge as A Basis for Information Retrieval. Canadian Journal of Information Science, Vol. 5, 1 (1980), 133--143.

[4]

Nicholas J Belkin, Colleen Cool, Adelheit Stein, and Ulrich Thiel. 1995. Cases, Scripts, and Information-seeking Strategies: On the Design of Interactive Information Retrieval Systems. Expert Systems with Applications, Vol. 9, 3 (1995), 379--395.

[5]

Ben Carterette, Paul Clough, Mark Hall, Evangelos Kanoulas, and Mark Sanderson. 2016. Evaluating Retrieval over Sessions: The TREC Session Track 2011--2014. In SIGIR. ACM, 685--688.

[6]

Ben Carterette, Evangelos Kanoulas, Mark Hall, and Paul Clough. 2014. Overview of the TREC 2014 Session Track. Technical Report. TREC.

[7]

Eunsol Choi, He He, Mohit Iyyer, Mark Yatskar, Wen-tau Yih, Yejin Choi, Percy Liang, and Luke Zettlemoyer. 2018. QuAC: Question Answering in Context. In EMNLP. ACL, 2174--2184.

[8]

Gordon V. Cormack, Charles L A Clarke, and Stefan Buettcher. 2009. Reciprocal Rank Fusion Outperforms Condorcet and Individual Rank Learning Methods. In SIGIR. ACM, 758.

[9]

W. Bruce Croft and R. H. Thompson. 1987. I3R: A New Approach to The Design of Document Retrieval Systems. JASIST, Vol. 38, 6 (1987), 389--404.

[10]

J. Shane Culpepper, Fernando Diaz, and Mark D. Smucker. 2018. Research Frontiers in Information Retrieval: Report from the Third Strategic Workshop on Information Retrieval in Lorne (SWIRL 2018). SIGIR Forum, Vol. 52, 1 (2018), 34--90.

Digital Library

[11]

Jeffrey Dalton, Chenyan Xiong, and Jamie Callan. 2019. CAsT 2019: The Conversational Assistance Track Overview. In TREC 2019. NIST.

[12]

Mostafa Dehghani, Hamed Zamani, Aliaksei Severyn, Jaap Kamps, and W. Bruce Croft. 2017. Neural Ranking Models with Weak Supervision. In SIGIR. ACM, 65--74.

[13]

Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In NAACL-HLT. ACL, 4171--4186.

[14]

Laura Dietz, Manisha Verma, Filip Radlinski, and Nick Craswell. 2017. TREC Complex Answer Retrieval Overview. In TREC. NIST.

[15]

Ahmed Elgohary, Denis Peskov, and Jordan Boyd-Graber. 2019. Can You Unpack That? Learning to Rewrite Questions-in-Context. In EMNLP. ACL, 5920--5926.

[16]

Marzieh Fadaee, Arianna Bisazza, and Christof Monz. 2017. Data Augmentation for Low-Resource Neural Machine Translation. In ACL. ACL, 567--573.

[17]

Jianfeng Gao, Michel Galley, and Lihong Li. 2018. Neural Approaches to Conversational AI. In ACL. ACL, 2--7.

[18]

Dongyi Guan, Hui Yang, and Nazli Goharian. 2012. Effective Structured Query Formulation for Session Search. In TREC. NIST.

[19]

Thorsten Joachims. 2002. Optimizing search Engines using Clickthrough Data. In KDD. ACM Press, 133.

[20]

Mandar Joshi, Omer Levy, Daniel S. Weld, and Luke Zettlemoyer. 2019. BERT for Coreference Resolution: Baselines and Analysis. arXiv preprint arXiv:1908.09091 (2019).

[21]

Vineet Kumar and Sachindra Joshi. 2017. Incomplete Follow-up Question Resolution Using Retrieval Based Sequence to Sequence Learning. In SIGIR. ACM, 705--714.

[22]

Victor Lavrenko and W. Bruce Croft. 2001. Relevance Based Language Models. In SIGIR. ACM, 120--127.

[23]

Jiwei Li, Michel Galley, Chris Brockett, Jianfeng Gao, and Bill Dolan. 2016. A Diversity-Promoting Objective Function for Neural Conversation Models. In NAACL-HLT. ACL, 110--119.

[24]

Yongjie Lin, Yi Chern Tan, and Robert Frank. 2019. Open Sesame: Getting Inside BERT's Linguistic Knowledge. arXiv preprint arXiv:1906.01698 (2019).

[25]

Sean MacAvaney, Andrew Yates, Arman Cohan, and Nazli Goharian. 2019. CEDR: Contextualized Embeddings for Document Ranking. In SIGIR. ACM, 1101--1104.

Digital Library

[26]

Mike Mintz, Steven Bills, Rion Snow, and Dan Jurafsky. 2009. Distant Supervision for Relation Extraction without Labeled Data. In ACL. ACL, 1003--1011.

[27]

Mandar Mitra, Amit Singhal, and Chris Buckley. 1998. Improving Automatic Query Expansion. In SIGIR. ACM, 206--214.

[28]

Tri Nguyen, Mir Rosenberg, Xia Song, Jianfeng Gao, Saurabh Tiwary, Rangan Majumder, and Li Deng. 2016. MS MARCO: A Human Generated MAchine Reading COmprehension Dataset. arXiv preprint arXiv:1611.09268 (2016).

[29]

Rodrigo Nogueira and Kyunghyun Cho. 2017. Task-Oriented Query Reformulation with Reinforcement Learning. In EMNLP. ACL, 574--583.

[30]

Robert N Oddy. 1977. Information Retrieval through Man-machine Dialogue. Journal of Documentation, Vol. 33, 1 (1977), 1--14.

[31]

Kezban Dilek Onal, Ye Zhang, Ismail Sengor Altingovde, Md Mustafizur Rahman, Pinar Karagoz, Alex Braylan, Brandon Dang, Heng-Lu Chang, Henna Kim, Quinten McNamara, Aaron Angert, Edward Banner, Vivek Khetan, Tyler McDonnell, An Thanh Nguyen, Dan Xu, Byron C. Wallace, Maarten de Rijke, and Matthew Lease. 2018. Neural Information Retrieval: At the End of the Early Years. Information Retrieval Journal, Vol. 21, 2-3 (June 2018), 111--182.

Digital Library

[32]

Baolin Peng, Xiujun Li, Jianfeng Gao, Jingjing Liu, and Kam-Fai Wong. 2018. Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy Learning. In ACL. ACL, 2182--2192.

[33]

Yifan Qiao, Chenyan Xiong, Zhenghao Liu, and Zhiyuan Liu. 2019. Understanding the Behaviors of BERT in Ranking. arXiv preprint arXiv:1904.07531 (2019).

[34]

Chen Qu, Liu Yang, Minghui Qiu, Yongfeng Zhang, Cen Chen, W. Bruce Croft, and Mohit Iyyer. 2019. Attentive History Selection for Conversational Question Answering. In CIKM. ACM, 1391--1400.

[35]

Filip Radlinski and Nick Craswell. 2017. A Theoretical Framework for Conversational Search. In CHIIR. ACM, 117--126.

[36]

Dinesh Raghu, Sathish Indurthi, Jitendra Ajmera, and Sachindra Joshi. 2015. A statistical approach for Non-Sentential Utterance Resolution for Interactive QA System. In SIGDIAL. ACL, 335--343.

[37]

Siva Reddy, Danqi Chen, and Christopher D. Manning. 2019. CoQA: A Conversational Question Answering Challenge. TACL, Vol. 7 (2019), 249--266.

[38]

Pengjie Ren, Zhumin Chen, Christof Monz, Jun Ma, and Maarten de Rijke. 2020. Thinking Globally, Acting Locally: Distantly Supervised Global-to-local Knowledge Selection for Background based Conversation. In AAAI.

[39]

Alan Ritter, Sam Clark, Mausam, and Oren Etzioni. 2011. Named Entity Recognition in Tweets: An Experimental Study. In EMNLP. ACL, 1524--1534.

[40]

Abigail See, Peter J. Liu, and Christopher D. Manning. 2017. Get To The Point: Summarization with Pointer-Generator Networks. In ACL. ACL, 1073--1083.

[41]

Marc Sloan, Hui Yang, and Jun Wang. 2015. A Term-based Methodology for Query Reformulation Understanding. Information Retrieval, Vol. 18, 2 (2015), 145--165.

Digital Library

[42]

Christophe Van Gysel, Evangelos Kanoulas, and Maarten de Rijke. 2016. Lexical Query Modeling in Session Search. In ICTIR. ACM, 69--72.

[43]

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention is All you Need. In NIPS. Curran Associates, Inc., 5998--6008.

[44]

Nikos Voskarides, Edgar Meij, Ridho Reinanda, Abhinav Khaitan, Miles Osborne, Giorgio Stefanoni, Kambadur Prabhanjan, and Maarten de Rijke. 2018. Weakly-supervised Contextualization of Knowledge Graph Facts. In SIGIR. ACM, 765--774.

[45]

Alexandra Vtyurina, Denis Savenkov, Eugene Agichtein, and Charles L. A. Clarke. 2017. Exploring Conversational Search With Humans, Assistants, and Wizards. In CHI. ACM, 2187--2193.

[46]

Lidan Wang, Jimmy Lin, and Donald Metzler. 2011. A Cascade Ranking Model for Efficient Ranked Retrieval. In SIGIR. ACM, 105--114.

[47]

Hui Yang, Dongyi Guan, and Sicong Zhang. 2015. The Query Change Model: Modeling Session Search as a Markov Decision Process. ACM Transactions on Information Systems, Vol. 33, 4 (2015), 20.

Digital Library

[48]

Wei Yang, Haotian Zhang, and Jimmy Lin. 2019. Simple Applications of BERT for Ad Hoc Document Retrieval. arXiv preprint arXiv:1903.10972 (2019).

[49]

Yaosheng Yang, Wenliang Chen, Zhenghua Li, Zhengqiu He, and Min Zhang. 2018. Distantly Supervised NER with Partial Annotation Learning and Reinforcement Learning. In COLING. ACL, 2159--2169.

[50]

Mark Yatskar. 2019. A Qualitative Comparison of CoQA, SQuAD 2.0 and QuAC. In NAACL-HLT. ACL, 2318--2323.

[51]

Chengxiang Zhai and John Lafferty. 2001. A Study of Smoothing Methods for Language Models Applied to Ad Hoc Information Retrieval. In SIGIR. ACM, 334--342.

Cited By

Kostric IBalog KHui Yang GWang HHan SHauff CZuccon GZhang Y(2024)A Surprisingly Simple yet Effective Multi-Query Rewriting Method for Conversational Passage RetrievalProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3657933(2271-2275)Online publication date: 10-Jul-2024
https://dl.acm.org/doi/10.1145/3626772.3657933
Mo FYi BMao KQu CHuang KNie JChua TNgo CKumar RLauw HKa-Wei Lee R(2024)ConvSDG: Session Data Generation for Conversational SearchCompanion Proceedings of the ACM Web Conference 202410.1145/3589335.3651940(1634-1642)Online publication date: 13-May-2024
https://dl.acm.org/doi/10.1145/3589335.3651940
Zamiri MQiang YNikolaev FZhu DKotov AChua TNgo CKa-Wei Lee RKumar RLauw H(2024)Benchmark and Neural Architecture for Conversational Entity Retrieval from a Knowledge GraphProceedings of the ACM Web Conference 202410.1145/3589334.3645676(1519-1528)Online publication date: 13-May-2024
https://dl.acm.org/doi/10.1145/3589334.3645676
Show More Cited By

Index Terms

Query Resolution for Conversational Search with Limited Supervision
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
      1. Discourse, dialogue and pragmatics
2. Information systems
  1. Information retrieval
    1. Information retrieval query processing
      1. Query reformulation

Recommendations

Few-Shot Generative Conversational Query Rewriting
SIGIR '20: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval

Conversational query rewriting aims to reformulate a concise conversational query to a fully specified, context-independent query that can be effectively handled by existing information retrieval systems. This paper presents a few-shot generative ...
Contextualizing and Expanding Conversational Queries without Supervision
Most conversational passage retrieval systems try to resolve conversational dependencies by using an intermediate query resolution step. To do so, they synthesize conversational data or assume the availability of large-scale question rewriting datasets. ...
Query Tracking for E-commerce Conversational Search: A Machine Comprehension Perspective
CIKM '18: Proceedings of the 27th ACM International Conference on Information and Knowledge Management

With the development of dialog techniques, conversational search has attracted more and more attention as it enables users to interact with the search engine in a natural and efficient manner. However, comparing with the natural language understanding ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

SIGIR '20: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval

July 2020

2548 pages

ISBN:9781450380164

DOI:10.1145/3397271

General Chairs:
Jimmy Huang
York University, Canada
,
Yi Chang
Jilin University, China
,
Xueqi Cheng
Chinese Academy of Sciences, China
,
Program Chairs:
Jaap Kamps
University of Amsterdam, Netherlands
,
Vanessa Murdock
Amazon, U.S.A.
,
Ji-Rong Wen
Renmin University of China, China
,
Yiqun Liu
Tsinghua University, China

Copyright © 2020 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

SIGIR: ACM Special Interest Group on Information Retrieval

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 July 2020

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

Nederlandse Organisatie voor Wetenschappelijk Onderzoek

Conference

SIGIR '20

Sponsor:

SIGIR

SIGIR '20: The 43rd International ACM SIGIR conference on research and development in Information Retrieval

July 25 - 30, 2020

Virtual Event, China

Acceptance Rates

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

49
Total Citations
View Citations
511
Total Downloads

Downloads (Last 12 months)53
Downloads (Last 6 weeks)5

Reflects downloads up to 11 Sep 2024

Other Metrics

View Author Metrics

Citations

Cited By

Kostric IBalog KHui Yang GWang HHan SHauff CZuccon GZhang Y(2024)A Surprisingly Simple yet Effective Multi-Query Rewriting Method for Conversational Passage RetrievalProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3626772.3657933(2271-2275)Online publication date: 10-Jul-2024
https://dl.acm.org/doi/10.1145/3626772.3657933
Mo FYi BMao KQu CHuang KNie JChua TNgo CKumar RLauw HKa-Wei Lee R(2024)ConvSDG: Session Data Generation for Conversational SearchCompanion Proceedings of the ACM Web Conference 202410.1145/3589335.3651940(1634-1642)Online publication date: 13-May-2024
https://dl.acm.org/doi/10.1145/3589335.3651940
Zamiri MQiang YNikolaev FZhu DKotov AChua TNgo CKa-Wei Lee RKumar RLauw H(2024)Benchmark and Neural Architecture for Conversational Entity Retrieval from a Knowledge GraphProceedings of the ACM Web Conference 202410.1145/3589334.3645676(1519-1528)Online publication date: 13-May-2024
https://dl.acm.org/doi/10.1145/3589334.3645676
Tran HYates AWeikum G(2024)Conversational Search with Tail EntitiesAdvances in Information Retrieval10.1007/978-3-031-56060-6_20(303-317)Online publication date: 16-Mar-2024
https://doi.org/10.1007/978-3-031-56060-6_20
Krasakis AYates AKanoulas E(2023)Contextualizing and Expanding Conversational Queries without SupervisionACM Transactions on Information Systems10.1145/363262242:3(1-30)Online publication date: 17-Nov-2023
https://dl.acm.org/doi/10.1145/3632622
Acharya PFrommholz IHopfgartner FLee MOakes MLalmas MZhang MSantos R(2023)Towards Effective Modeling and Exploitation of Search and User Context in Conversational Information RetrievalProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3616005(5161-5164)Online publication date: 21-Oct-2023
https://dl.acm.org/doi/10.1145/3583780.3616005
Malmi EDong YMallinson JChuklin AAdamek JMirylenka DStahlberg FKrause SKumar SSeveryn ASingh ASun YAkoglu LGunopulos DYan XKumar ROzcan FYe J(2023)Fast Text Generation with Text-Editing ModelsProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3580305.3599579(5815-5816)Online publication date: 6-Aug-2023
https://dl.acm.org/doi/10.1145/3580305.3599579
Mo FNie JHuang KMao KZhu YLi PLiu YSingh ASun YAkoglu LGunopulos DYan XKumar ROzcan FYe J(2023)Learning to Relate to Previous Turns in Conversational SearchProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3580305.3599411(1722-1732)Online publication date: 6-Aug-2023
https://dl.acm.org/doi/10.1145/3580305.3599411
Mao KQian HMo FDou ZLiu BCheng XCao Z(2023)Learning Denoised and Interpretable Session Representation for Conversational SearchProceedings of the ACM Web Conference 202310.1145/3543507.3583265(3193-3202)Online publication date: 30-Apr-2023
https://dl.acm.org/doi/10.1145/3543507.3583265
Ju JLin STsai MWang CChen HDuh WHuang HKato MMothe JPoblete B(2023)Improving Conversational Passage Re-ranking with View EnsembleProceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3539618.3592002(2077-2081)Online publication date: 19-Jul-2023
https://dl.acm.org/doi/10.1145/3539618.3592002
Show More Cited By

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents