The Asymptotic Convergence-Rate of Q-learning.

AllImages Books Videos Maps News Shopping

Search tools

Scholarly articles for The Asymptotic Convergence-Rate of Q-learning.

scholar.google.com › citations

The asymptotic convergence-rate of Q-learning
Szepesvári · Cited by 208

… non-asymptotic convergence for double q-learning
Zhao · Cited by 6

Learning rates for Q-learning
Even-Dar · Cited by 45

[PDF] The Asymptotic Convergence-Rate of Q-learning - Columbia Blogs

blogs.cuit.columbia.edu › 2019/07

In this paper we show that for discounted MDPs with discount factor, > 1/2 the asymptotic rate of convergence of Q-Iearning.

The Asymptotic Convergence-Rate of Q-learning - NIPS papers

papers.nips.cc › paper › 1383-the-asymp...

In this paper we show that for discounted MDPs with discount factor, > 1/2 the asymptotic rate of convergence of Q-Iearning if R(1 - ,) < 1/2 and O( Jlog log ...

[PDF] The Asymptotic Convergence-Rate of Q-learning - University of Alberta

www.ualberta.ca › NeurIPS97.ps.pdf

> 1=2 the asymptotic rate of convergence of Q-learning is O(1=tR(1 )) if R(1. ) < 1=2 and O( p log logt=t) otherwise provided that the state-action pairs are ...

The asymptotic convergence-rate of Q-learning - ACM Digital Library

dl.acm.org › doi

Dec 1, 1997 · In this paper we show that for discounted MDPs with discount factor γ > 1/2 the asymptotic rate of convergence of Q-learning is O(1/tR(1-γ)) ...

The Asymptotic Convergence-Rate of Q-learning. - ResearchGate

www.researchgate.net › ... › Q-Learning

In this paper we show that for discounted MDPs with discount factor $\gamma>1/2$ the asymptotic rate of convergence of Q-learning is O($1/t^{R(1-\gamma$)}) ...

Asymptotic Convergence and Performance of Multi-Agent Q ...

arxiv.org › cs

Jan 23, 2023 · We show a sufficient condition on the rate of exploration such that the Q-Learning dynamics is guaranteed to converge to a unique equilibrium in ...

The Asymptotic Convergence-Rate of Q-learning | - Columbia Blogs

blogs.cuit.columbia.edu › the_asymptotic...

Jul 11, 2019 · The asymptotic rate of convergence of Q-learning is Ο( 1/tR(1-γ) ), if R(1-γ)<0.5, where R=Pmin/Pmax, P is state-action occupation frequency.

[PDF] Learning Rates for Q-learning

www.jmlr.org › papers › volume5

In this paper we derive convergence rates for Q-learning. We show an interesting relationship between the convergence rate and the learning rate used in Q- ...

[PDF] Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms

www.cis.upenn.edu › papers › qlea...

In this paper, we provide such a framework, and use it to derive the first finite-time convergence rates (sample size bounds) for both Q-learning and the ...

[PDF] Asymptotic Convergence and Performance of Multi-Agent Q ...

www.southampton.ac.uk › pdfs

May 29, 2023 · We show a sufficient condi- tion on the rate of exploration such that the Q-Learning dynamics. Is guaranteed to converge to a unique equilibrium ...