L2B: Learning to Balance the Safety-Efficiency Trade-off in Interactive Crowd-aware Robot Navigation

Nishimura, Mai; Yonetani, Ryo

arXiv:2003.09207 (cs)

[Submitted on 20 Mar 2020 (v1), last revised 7 Oct 2020 (this version, v2)]

Abstract: This work presents a deep reinforcement learning framework for interactive navigation in crowded places. Our proposed approach, the Learning to Balance (L2B) framework, enables mobile robot agents to steer safely towards their destinations by avoiding collisions with a crowd while, when necessary, actively clearing a path by asking nearby pedestrians to make room so that travel stays efficient. We observe that the safety and efficiency requirements in crowd-aware navigation trade off against each other in the presence of social dilemmas between the agent and the crowd. On the one hand, intervening in pedestrian paths too often to gain immediate efficiency disrupts the natural crowd flow and may eventually put everyone, including the agent itself, at risk of collision. On the other hand, staying silent to avoid every single collision makes the agent's travel inefficient. Based on this observation, our L2B framework augments the reward function used in learning an interactive navigation policy to penalize frequent active path clearing and passive collision avoidance, which substantially improves the balance of the safety-efficiency trade-off. We evaluate our L2B framework in a challenging crowd simulation and demonstrate its superiority, in terms of both navigation success and collision rate, over a state-of-the-art navigation approach.
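
The reward augmentation described in the abstract can be illustrated with a minimal sketch. This is not the authors' reward function: the event flags, the names `RewardWeights`, `balanced_reward`, and `episode_return`, and every numeric weight are hypothetical, chosen only to show how small per-step penalties on path clearing and passive avoidance might discourage overusing either behavior.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RewardWeights:
    # All values are illustrative assumptions, not taken from the paper.
    goal: float = 1.0         # bonus for reaching the destination
    collision: float = -0.25  # penalty for colliding with a pedestrian
    clearing: float = -0.05   # cost of asking pedestrians to make room
    avoidance: float = -0.05  # cost of a passive evasive maneuver


def balanced_reward(reached_goal, collided, asked_to_clear, avoided_passively,
                    weights=RewardWeights()):
    """Per-step reward: task terms plus penalties on both interaction modes."""
    reward = 0.0
    if reached_goal:
        reward += weights.goal
    if collided:
        reward += weights.collision
    if asked_to_clear:
        reward += weights.clearing
    if avoided_passively:
        reward += weights.avoidance
    return reward


def episode_return(steps, weights=RewardWeights()):
    """Undiscounted sum of per-step rewards over a list of event tuples."""
    return sum(balanced_reward(*step, weights=weights) for step in steps)


if __name__ == "__main__":
    # Each step is (reached_goal, collided, asked_to_clear, avoided_passively).
    frequent_clearing = [(False, False, True, False)] * 4 + [(True, False, False, False)]
    sparse_clearing = ([(False, False, True, False)]
                       + [(False, False, False, False)] * 3
                       + [(True, False, False, False)])
    print(episode_return(frequent_clearing), episode_return(sparse_clearing))
```

Under these assumed weights, an episode that reaches the goal while asking pedestrians to move at every step earns a lower return than one that asks only once, which is the kind of pressure toward balance the abstract describes.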

Comments:	Accepted at IROS 2020. Project site: this https URL
Subjects:	Robotics (cs.RO); Machine Learning (cs.LG)
Cite as:	arXiv:2003.09207 [cs.RO]
	(or arXiv:2003.09207v2 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2003.09207

Submission history

From: Mai Nishimura
[v1] Fri, 20 Mar 2020 11:40:29 UTC (376 KB)
[v2] Wed, 7 Oct 2020 18:30:02 UTC (1,308 KB)
