A Distributional Analysis of Sampling-Based Reinforcement Learning Algorithms

Amortila, Philip; Precup, Doina; Panangaden, Prakash; Bellemare, Marc G.

Computer Science > Machine Learning

arXiv:2003.12239 (cs)

[Submitted on 27 Mar 2020]

Title:A Distributional Analysis of Sampling-Based Reinforcement Learning Algorithms

Authors:Philip Amortila, Doina Precup, Prakash Panangaden, Marc G. Bellemare

View PDF

Abstract:We present a distributional approach to theoretical analyses of reinforcement learning algorithms for constant step-sizes. We demonstrate its effectiveness by presenting simple and unified proofs of convergence for a variety of commonly-used methods. We show that value-based methods such as TD($\lambda$) and $Q$-Learning have update rules which are contractive in the space of distributions of functions, thus establishing their exponentially fast convergence to a stationary distribution. We demonstrate that the stationary distribution obtained by any algorithm whose target is an expected Bellman update has a mean which is equal to the true value function. Furthermore, we establish that the distributions concentrate around their mean as the step-size shrinks. We further analyse the optimistic policy iteration algorithm, for which the contraction property does not hold, and formulate a probabilistic policy improvement property which entails the convergence of the algorithm.

Comments:	AISTATS 2020
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:2003.12239 [cs.LG]
	(or arXiv:2003.12239v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2003.12239

Submission history

From: Philip Amortila [view email]
[v1] Fri, 27 Mar 2020 05:13:29 UTC (167 KB)

Computer Science > Machine Learning

Title:A Distributional Analysis of Sampling-Based Reinforcement Learning Algorithms

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:A Distributional Analysis of Sampling-Based Reinforcement Learning Algorithms

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators