Zero-shot Visual Question Answering using Knowledge Graph

Chen, Zhuo; Chen, Jiaoyan; Geng, Yuxia; Pan, Jeff Z.; Yuan, Zonggang; Chen, Huajun

Computer Science > Artificial Intelligence

arXiv:2107.05348 (cs)

[Submitted on 12 Jul 2021 (v1), last revised 18 Oct 2021 (this version, v4)]

Title:Zero-shot Visual Question Answering using Knowledge Graph

Authors:Zhuo Chen, Jiaoyan Chen, Yuxia Geng, Jeff Z. Pan, Zonggang Yuan, Huajun Chen

View PDF

Abstract:Incorporating external knowledge to Visual Question Answering (VQA) has become a vital practical need. Existing methods mostly adopt pipeline approaches with different components for knowledge matching and extraction, feature learning, this http URL, such pipeline approaches suffer when some component does not perform well, which leads to error propagation and poor overall performance. Furthermore, the majority of existing approaches ignore the answer bias issue -- many answers may have never appeared during training (i.e., unseen answers) in real-word application. To bridge these gaps, in this paper, we propose a Zero-shot VQA algorithm using knowledge graphs and a mask-based learning mechanism for better incorporating external knowledge, and present new answer-based Zero-shot VQA splits for the F-VQA dataset. Experiments show that our method can achieve state-of-the-art performance in Zero-shot VQA with unseen answers, meanwhile dramatically augment existing end-to-end models on the normal F-VQA task.

Comments:	accepted at the International Semantic Web Conference '21 (ISWC 2021)
Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2107.05348 [cs.AI]
	(or arXiv:2107.05348v4 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2107.05348

Submission history

From: Zhuo Chen [view email]
[v1] Mon, 12 Jul 2021 12:17:18 UTC (6,089 KB)
[v2] Tue, 13 Jul 2021 02:50:38 UTC (6,089 KB)
[v3] Wed, 14 Jul 2021 11:37:13 UTC (6,089 KB)
[v4] Mon, 18 Oct 2021 02:01:02 UTC (6,089 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.AI

< prev | next >

new | recent | 2021-07

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Zhuo Chen
Jiaoyan Chen
Yuxia Geng
Jeff Z. Pan
Huajun Chen

export BibTeX citation

Computer Science > Artificial Intelligence

Title:Zero-shot Visual Question Answering using Knowledge Graph

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Zero-shot Visual Question Answering using Knowledge Graph

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators