Tiered Reasoning for Intuitive Physics: Toward Verifiable Commonsense Language Understanding

Storks, Shane; Gao, Qiaozi; Zhang, Yichi; Chai, Joyce

arXiv:2109.04947 (cs)

[Submitted on 10 Sep 2021 (v1), last revised 10 May 2022 (this version, v3)]

Abstract: Large-scale, pre-trained language models (LMs) have achieved human-level performance on a breadth of language understanding tasks. However, evaluations based only on end-task performance shed little light on machines' true ability in language understanding and reasoning. In this paper, we highlight the importance of evaluating the underlying reasoning process in addition to end performance. Toward this goal, we introduce Tiered Reasoning for Intuitive Physics (TRIP), a novel commonsense reasoning dataset with dense annotations that enable multi-tiered evaluation of machines' reasoning process. Our empirical results show that while large LMs can achieve high end performance, they struggle to support their predictions with valid evidence. The TRIP dataset and our baseline results will motivate verifiable evaluation of commonsense reasoning and facilitate future research toward developing better language understanding and reasoning models.

Comments:	Accepted to Findings of EMNLP 2021
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2109.04947 [cs.CL]
	(or arXiv:2109.04947v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2109.04947

Submission history

From: Shane Storks
[v1] Fri, 10 Sep 2021 15:47:22 UTC (1,700 KB)
[v2] Sat, 9 Oct 2021 23:57:34 UTC (1,736 KB)
[v3] Tue, 10 May 2022 17:58:29 UTC (1,736 KB)
