Blending Data-Driven Priors in Dynamic Games

Lidard, Justin; Hu, Haimin; Hancock, Asher; Zhang, Zixu; Contreras, Albert Gimó; Modi, Vikash; DeCastro, Jonathan; Gopinath, Deepak; Rosman, Guy; Leonard, Naomi Ehrich; Santos, María; Fisac, Jaime Fernández

Computer Science > Robotics

arXiv:2402.14174 (cs)

[Submitted on 21 Feb 2024 (v1), last revised 7 Jul 2024 (this version, v3)]

Title:Blending Data-Driven Priors in Dynamic Games

Authors:Justin Lidard, Haimin Hu, Asher Hancock, Zixu Zhang, Albert Gimó Contreras, Vikash Modi, Jonathan DeCastro, Deepak Gopinath, Guy Rosman, Naomi Ehrich Leonard, María Santos, Jaime Fernández Fisac

View PDF HTML (experimental)

Abstract:As intelligent robots like autonomous vehicles become increasingly deployed in the presence of people, the extent to which these systems should leverage model-based game-theoretic planners versus data-driven policies for safe, interaction-aware motion planning remains an open question. Existing dynamic game formulations assume all agents are task-driven and behave optimally. However, in reality, humans tend to deviate from the decisions prescribed by these models, and their behavior is better approximated under a noisy-rational paradigm. In this work, we investigate a principled methodology to blend a data-driven reference policy with an optimization-based game-theoretic policy. We formulate KLGame, an algorithm for solving non-cooperative dynamic game with Kullback-Leibler (KL) regularization with respect to a general, stochastic, and possibly multi-modal reference policy. Our method incorporates, for each decision maker, a tunable parameter that permits modulation between task-driven and data-driven behaviors. We propose an efficient algorithm for computing multi-modal approximate feedback Nash equilibrium strategies of KLGame in real time. Through a series of simulated and real-world autonomous driving scenarios, we demonstrate that KLGame policies can more effectively incorporate guidance from the reference policy and account for noisily-rational human behaviors versus non-regularized baselines. Website with additional information, videos, and code: this https URL.

Comments:	20 pages, 12 figures
Subjects:	Robotics (cs.RO); Artificial Intelligence (cs.AI); Systems and Control (eess.SY); Optimization and Control (math.OC)
Cite as:	arXiv:2402.14174 [cs.RO]
	(or arXiv:2402.14174v3 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2402.14174

Submission history

From: Justin Lidard [view email]
[v1] Wed, 21 Feb 2024 23:22:32 UTC (23,875 KB)
[v2] Fri, 23 Feb 2024 22:53:50 UTC (23,875 KB)
[v3] Sun, 7 Jul 2024 02:54:35 UTC (29,753 KB)

Computer Science > Robotics

Title:Blending Data-Driven Priors in Dynamic Games

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Blending Data-Driven Priors in Dynamic Games

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators