Understanding Dataset Design Choices for Multi-hop Reasoning

Chen, Jifan; Durrett, Greg

Computer Science > Computation and Language

arXiv:1904.12106 (cs)

[Submitted on 27 Apr 2019]

Title:Understanding Dataset Design Choices for Multi-hop Reasoning

Authors:Jifan Chen, Greg Durrett

View PDF

Abstract:Learning multi-hop reasoning has been a key challenge for reading comprehension models, leading to the design of datasets that explicitly focus on it. Ideally, a model should not be able to perform well on a multi-hop question answering task without doing multi-hop reasoning. In this paper, we investigate two recently proposed datasets, WikiHop and HotpotQA. First, we explore sentence-factored models for these tasks; by design, these models cannot do multi-hop reasoning, but they are still able to solve a large number of examples in both datasets. Furthermore, we find spurious correlations in the unmasked version of WikiHop, which make it easy to achieve high performance considering only the questions and answers. Finally, we investigate one key difference between these datasets, namely span-based vs. multiple-choice formulations of the QA task. Multiple-choice versions of both datasets can be easily gamed, and two models we examine only marginally exceed a baseline in this setting. Overall, while these datasets are useful testbeds, high-performing models may not be learning as much multi-hop reasoning as previously thought.

Comments:	NAACL 2019
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:1904.12106 [cs.CL]
	(or arXiv:1904.12106v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1904.12106

Submission history

From: Jifan Chen [view email]
[v1] Sat, 27 Apr 2019 04:36:57 UTC (110 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2019-04

Change to browse by:

cs
cs.AI

References & Citations

DBLP - CS Bibliography

listing | bibtex

Jifan Chen
Greg Durrett

export BibTeX citation

Computer Science > Computation and Language

Title:Understanding Dataset Design Choices for Multi-hop Reasoning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Understanding Dataset Design Choices for Multi-hop Reasoning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators