Global Convergence of Policy Gradient for Linear-Quadratic Mean-Field Control/Game in Continuous Time.

AllVideos Books News Images Maps Shopping

Scholarly articles for Global Convergence of Policy Gradient for Linear-Quadratic Mean-Field Control/Game in Continuous Time.

scholar.google.com › citations

… linear-quadratic mean-field control/game in continuous …
Wang · Cited by 32

[2008.06845] Global Convergence of Policy Gradient for Linear-Quadratic ...

Aug 16, 2020 · In this paper, we study the policy gradient method for the linear-quadratic mean-field control and game, where we assume each agent has ...

[PDF] Global Convergence of Policy Gradient for Linear-Quadratic Mean ...

proceedings.mlr.press › ...

To present its formal definition, we define Λ1(µ) as the optimal policy in Π given the mean- field µ, and define Λ2(µ, π) as the mean-field state/action.

[PDF] Global Convergence of Policy Gradient for Linear-Quadratic Mean ...

arxiv.org › pdf

Aug 16, 2020 · To present its formal definition, we define Λ1(µ) as the optimal policy in Π given the mean-field state µ, and define Λ2(µ, π) as the mean-field ...

(PDF) Global Convergence of Policy Gradient for Linear-Quadratic Mean ...

www.researchgate.net › ... › Gradient

Aug 16, 2020 · Therefore, it has motivated new research directions for mean-field control (MFC) and mean-field game (MFG). In this paper, we study the policy ...

Global Convergence of Policy Gradient for Linear-Quadratic Mean ...

slideslive.com › global-convergence-of-p...

Jul 19, 2021 · Global Convergence of Policy Gradient for Linear-Quadratic Mean-Field Control/Game in Continuous Time · Speakers · Organizer · About ICML 2021.

[PDF] Global Convergence of Policy Gradient for Linear-Quadratic Mean ...

proceedings.mlr.press › ...

Supplementary Material: Global Convergence of Policy Gradient for. Linear-Quadratic Mean-Field Control/Game in Continuous Time. Weichen Wang. ∗. , Jiequn Han.

[PDF] Linear-Quadratic Mean-Field Reinforcement Learning

www.semanticscholar.org › paper › Line...

This work proves rigorously the convergence of exact and model-free policy gradient methods in a mean-field linear-quadratic setting and provides graphical ...

Convergence of Policy Gradient Methods for Finite-Horizon ...

epubs.siam.org › doi › abs

We propose geometry-aware gradient descents for the mean and covariance of the policy using the Fisher geometry and the Bures–Wasserstein geometry, respectively ...

Toward a Theoretical Foundation of Policy Optimization for Learning ...

www.annualreviews.org › journals › ann...

May 3, 2023 · 148. Wang W, Han J, Yang Z, Wang Z 2021. Global convergence of policy gradient for linear-quadratic mean-field control/game in continuous time.

[PDF] Decentralized Policy Gradient Method for Mean-Field Linear Quadratic ...

realworldml.github.io › files › 13_...

We study discrete-time linear-quadratic MARL under mean-field settings with exchangeable finite n ... Mean field games. Japanese journal of mathematics, 2.