Multi-Object Navigation with dynamically learned neural implicit representations

Marza, Pierre; Matignon, Laetitia; Simonin, Olivier; Wolf, Christian

Computer Science > Computer Vision and Pattern Recognition

arXiv:2210.05129 (cs)

[Submitted on 11 Oct 2022 (v1), last revised 27 Sep 2023 (this version, v2)]

Title:Multi-Object Navigation with dynamically learned neural implicit representations

Authors:Pierre Marza, Laetitia Matignon, Olivier Simonin, Christian Wolf

View PDF

Abstract:Understanding and mapping a new environment are core abilities of any autonomously navigating agent. While classical robotics usually estimates maps in a stand-alone manner with SLAM variants, which maintain a topological or metric representation, end-to-end learning of navigation keeps some form of memory in a neural network. Networks are typically imbued with inductive biases, which can range from vectorial representations to birds-eye metric tensors or topological structures. In this work, we propose to structure neural networks with two neural implicit representations, which are learned dynamically during each episode and map the content of the scene: (i) the Semantic Finder predicts the position of a previously seen queried object; (ii) the Occupancy and Exploration Implicit Representation encapsulates information about explored area and obstacles, and is queried with a novel global read mechanism which directly maps from function space to a usable embedding space. Both representations are leveraged by an agent trained with Reinforcement Learning (RL) and learned online during each episode. We evaluate the agent on Multi-Object Navigation and show the high impact of using neural implicit representations as a memory source.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
Cite as:	arXiv:2210.05129 [cs.CV]
	(or arXiv:2210.05129v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2210.05129

Submission history

From: Pierre Marza [view email]
[v1] Tue, 11 Oct 2022 04:06:34 UTC (9,088 KB)
[v2] Wed, 27 Sep 2023 11:17:18 UTC (8,903 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Multi-Object Navigation with dynamically learned neural implicit representations

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Multi-Object Navigation with dynamically learned neural implicit representations

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators