Reasoning Abilities of Large Language Models: In-Depth Analysis on the Abstraction and Reasoning Corpus

Lee, Seungpil; Sim, Woochang; Shin, Donghyeon; Seo, Wongyu; Park, Jiwon; Lee, Seokki; Hwang, Sanha; Kim, Sejin; Kim, Sundong

Computer Science > Computation and Language

arXiv:2403.11793 (cs)

[Submitted on 18 Mar 2024 (v1), last revised 12 Sep 2024 (this version, v2)]

Title:Reasoning Abilities of Large Language Models: In-Depth Analysis on the Abstraction and Reasoning Corpus

Authors:Seungpil Lee, Woochang Sim, Donghyeon Shin, Wongyu Seo, Jiwon Park, Seokki Lee, Sanha Hwang, Sejin Kim, Sundong Kim

View PDF HTML (experimental)

Abstract:The existing methods for evaluating the inference abilities of Large Language Models (LLMs) have been results-centric, making it difficult to assess the inference process. We introduce a new approach using the Abstraction and Reasoning Corpus (ARC) dataset to evaluate the inference and contextual understanding abilities of large language models in a process-centric manner. ARC demands rigorous logical structures for problem-solving, making it a benchmark that facilitates the comparison of model inference abilities with humans. Experimental results confirm that while large language models possess weak inference abilities, they still lag in terms of logical coherence, compositionality, and productivity. Our experiments highlight the reasoning capabilities of LLMs, proposing development paths for achieving human-level reasoning.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Emerging Technologies (cs.ET); Symbolic Computation (cs.SC)
Cite as:	arXiv:2403.11793 [cs.CL]
	(or arXiv:2403.11793v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2403.11793

Submission history

From: Sundong Kim [view email]
[v1] Mon, 18 Mar 2024 13:50:50 UTC (8,423 KB)
[v2] Thu, 12 Sep 2024 23:08:08 UTC (2,674 KB)

Computer Science > Computation and Language

Title:Reasoning Abilities of Large Language Models: In-Depth Analysis on the Abstraction and Reasoning Corpus

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Reasoning Abilities of Large Language Models: In-Depth Analysis on the Abstraction and Reasoning Corpus

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators