Comparing Visual Reasoning in Humans and AI

Murlidaran, Shravan; Wang, William Yang; Eckstein, Miguel P.

Computer Science > Artificial Intelligence

arXiv:2104.14102 (cs)

[Submitted on 29 Apr 2021]

Title:Comparing Visual Reasoning in Humans and AI

Authors:Shravan Murlidaran, William Yang Wang, Miguel P. Eckstein

View PDF

Abstract:Recent advances in natural language processing and computer vision have led to AI models that interpret simple scenes at human levels. Yet, we do not have a complete understanding of how humans and AI models differ in their interpretation of more complex scenes. We created a dataset of complex scenes that contained human behaviors and social interactions. AI and humans had to describe the scenes with a sentence. We used a quantitative metric of similarity between scene descriptions of the AI/human and ground truth of five other human descriptions of each scene. Results show that the machine/human agreement scene descriptions are much lower than human/human agreement for our complex scenes. Using an experimental manipulation that occludes different spatial regions of the scenes, we assessed how machines and humans vary in utilizing regions of images to understand the scenes. Together, our results are a first step toward understanding how machines fall short of human visual reasoning with complex scenes depicting human behaviors.

Subjects:	Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neurons and Cognition (q-bio.NC)
Cite as:	arXiv:2104.14102 [cs.AI]
	(or arXiv:2104.14102v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2104.14102

Submission history

From: Shravan Murlidaran [view email]
[v1] Thu, 29 Apr 2021 04:44:13 UTC (1,019 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.AI

< prev | next >

new | recent | 2021-04

Change to browse by:

cs
cs.CV
q-bio
q-bio.NC

References & Citations

DBLP - CS Bibliography

listing | bibtex

William Yang Wang
Miguel P. Eckstein

export BibTeX citation

Computer Science > Artificial Intelligence

Title:Comparing Visual Reasoning in Humans and AI

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Comparing Visual Reasoning in Humans and AI

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators