A Read-Write Memory Network for Movie Story Understanding

Na, Seil; Lee, Sangho; Kim, Jisung; Kim, Gunhee

Computer Science > Computer Vision and Pattern Recognition

arXiv:1709.09345v1 (cs)

[Submitted on 27 Sep 2017 (this version), latest version 16 Mar 2018 (v4)]

Title:A Read-Write Memory Network for Movie Story Understanding

Authors:Seil Na, Sangho Lee, Jisung Kim, Gunhee Kim

View PDF

Abstract:We propose a novel memory network model named Read-Write Memory Network (RWMN) to perform question and answering tasks for large-scale, multimodal movie story understanding. The key focus of our RWMN model is to design the read network and the write network that consist of multiple convolutional layers, which enable memory read and write operations to have high capacity and flexibility. While existing memory-augmented network models treat each memory slot as an independent block, our use of multi-layered CNNs allows the model to read and write sequential memory cells as chunks, which is more reasonable to represent a sequential story because adjacent memory blocks often have strong correlations. For evaluation, we apply our model to all the six tasks of the MovieQA benchmark, and achieve the best accuracies on several tasks, especially on the visual QA task. Our model shows a potential to better understand not only the content in the story, but also more abstract information, such as relationships between characters and the reasons for their actions.

Comments:	accepted at ICCV'17
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
Cite as:	arXiv:1709.09345 [cs.CV]
	(or arXiv:1709.09345v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1709.09345

Submission history

From: Seil Na [view email]
[v1] Wed, 27 Sep 2017 06:02:57 UTC (1,685 KB)
[v2] Thu, 12 Oct 2017 11:36:34 UTC (1,685 KB)
[v3] Fri, 3 Nov 2017 08:40:50 UTC (1,685 KB)
[v4] Fri, 16 Mar 2018 13:43:15 UTC (1,685 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:A Read-Write Memory Network for Movie Story Understanding

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:A Read-Write Memory Network for Movie Story Understanding

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators