ChatGPT is a Knowledgeable but Inexperienced Solver: An Investigation of Commonsense Problem in Large Language Models

Bian, Ning; Han, Xianpei; Sun, Le; Lin, Hongyu; Lu, Yaojie; He, Ben; Jiang, Shanshan; Dong, Bin

Computer Science > Computation and Language

arXiv:2303.16421 (cs)

[Submitted on 29 Mar 2023 (v1), last revised 19 Apr 2024 (this version, v3)]

Title:ChatGPT is a Knowledgeable but Inexperienced Solver: An Investigation of Commonsense Problem in Large Language Models

Authors:Ning Bian, Xianpei Han, Le Sun, Hongyu Lin, Yaojie Lu, Ben He, Shanshan Jiang, Bin Dong

View PDF HTML (experimental)

Abstract:Large language models (LLMs) have made significant progress in NLP. However, their ability to memorize, represent, and leverage commonsense knowledge has been a well-known pain point. In this paper, we specifically focus on ChatGPT, a widely used and easily accessible LLM, and ask the following questions: (1) Can ChatGPT effectively answer commonsense questions? (2) Is ChatGPT aware of the underlying commonsense knowledge for answering a specific question? (3) Is ChatGPT knowledgeable in commonsense? (4) Can ChatGPT effectively leverage commonsense for answering questions? We conduct a series of experiments on 11 datasets to evaluate ChatGPT's commonsense abilities, including answering commonsense questions, identifying necessary knowledge, generating knowledge descriptions, and using knowledge descriptions to answer questions again. Experimental results show that: (1) ChatGPT can achieve good QA accuracies in commonsense tasks, while still struggling with certain domains of datasets. (2) ChatGPT is knowledgeable, and can accurately generate most of the commonsense knowledge using knowledge prompts. (3) Despite its knowledge, ChatGPT is an inexperienced commonsense problem solver, which cannot precisely identify the needed commonsense for answering a specific question. These findings raise the need to explore improved mechanisms for effectively incorporating commonsense into LLMs like ChatGPT, such as better instruction following and commonsense guidance.

Comments:	Accepted by LREC-COLING 2024
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2303.16421 [cs.CL]
	(or arXiv:2303.16421v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2303.16421

Submission history

From: Ning Bian [view email]
[v1] Wed, 29 Mar 2023 03:05:43 UTC (270 KB)
[v2] Tue, 12 Mar 2024 03:14:18 UTC (210 KB)
[v3] Fri, 19 Apr 2024 04:57:37 UTC (210 KB)

Computer Science > Computation and Language

Title:ChatGPT is a Knowledgeable but Inexperienced Solver: An Investigation of Commonsense Problem in Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:ChatGPT is a Knowledgeable but Inexperienced Solver: An Investigation of Commonsense Problem in Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators