Visual Navigation with Spatial Attention

Mayo, Bar; Hazan, Tamir; Tal, Ayellet

Computer Science > Computer Vision and Pattern Recognition

arXiv:2104.09807 (cs)

[Submitted on 20 Apr 2021]

Title:Visual Navigation with Spatial Attention

Authors:Bar Mayo, Tamir Hazan, Ayellet Tal

View PDF

Abstract:This work focuses on object goal visual navigation, aiming at finding the location of an object from a given class, where in each step the agent is provided with an egocentric RGB image of the scene. We propose to learn the agent's policy using a reinforcement learning algorithm. Our key contribution is a novel attention probability model for visual navigation tasks. This attention encodes semantic information about observed objects, as well as spatial information about their place. This combination of the "what" and the "where" allows the agent to navigate toward the sought-after object effectively. The attention model is shown to improve the agent's policy and to achieve state-of-the-art results on commonly-used datasets.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2104.09807 [cs.CV]
	(or arXiv:2104.09807v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2104.09807

Submission history

From: Bar Mayo [view email]
[v1] Tue, 20 Apr 2021 07:39:52 UTC (15,901 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2021-04

Change to browse by:

cs
cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

Tamir Hazan
Ayellet Tal

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Visual Navigation with Spatial Attention

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Visual Navigation with Spatial Attention

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators