Vera: A General-Purpose Plausibility Estimation Model for Commonsense Statements

Liu, Jiacheng; Wang, Wenya; Wang, Dianzhuo; Smith, Noah A.; Choi, Yejin; Hajishirzi, Hannaneh

Computer Science > Computation and Language

arXiv:2305.03695 (cs)

[Submitted on 5 May 2023 (v1), last revised 18 Oct 2023 (this version, v3)]

Title:Vera: A General-Purpose Plausibility Estimation Model for Commonsense Statements

Authors:Jiacheng Liu, Wenya Wang, Dianzhuo Wang, Noah A. Smith, Yejin Choi, Hannaneh Hajishirzi

View PDF

Abstract:Despite the much discussed capabilities of today's language models, they are still prone to silly and unexpected commonsense failures. We consider a retrospective verification approach that reflects on the correctness of LM outputs, and introduce Vera, a general-purpose model that estimates the plausibility of declarative statements based on commonsense knowledge. Trained on ~7M commonsense statements created from 19 QA datasets and two large-scale knowledge bases, and with a combination of three training objectives, Vera is a versatile model that effectively separates correct from incorrect statements across diverse commonsense domains. When applied to solving commonsense problems in the verification format, Vera substantially outperforms existing models that can be repurposed for commonsense verification, and it further exhibits generalization capabilities to unseen tasks and provides well-calibrated outputs. We find that Vera excels at filtering LM-generated commonsense knowledge and is useful in detecting erroneous commonsense statements generated by models like ChatGPT in real-world settings.

Comments:	EMNLP 2023 main conference
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2305.03695 [cs.CL]
	(or arXiv:2305.03695v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2305.03695

Submission history

From: Jiacheng Liu [view email]
[v1] Fri, 5 May 2023 17:15:32 UTC (15,025 KB)
[v2] Tue, 23 May 2023 16:25:26 UTC (14,737 KB)
[v3] Wed, 18 Oct 2023 14:48:51 UTC (8,067 KB)

Computer Science > Computation and Language

Title:Vera: A General-Purpose Plausibility Estimation Model for Commonsense Statements

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Vera: A General-Purpose Plausibility Estimation Model for Commonsense Statements

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators