Two-Timescale Stochastic Approximation Convergence Rates with Applications to Reinforcement Learning

Dalal, Gal; Szorenyi, Balazs; Thoppe, Gugan; Mannor, Shie

arXiv:1703.05376v2 (cs)

[Submitted on 15 Mar 2017 (v1), revised 31 May 2017 (this version, v2), latest version 4 Jun 2018 (v5)]

Abstract: Two-timescale Stochastic Approximation (SA) algorithms are widely used in Reinforcement Learning (RL). Their iterates have two parts that are updated with distinct stepsizes. In this work we provide a recipe for analyzing two-timescale SA. Using it, we develop the first convergence rate result for them. From this result we extract key insights on stepsize selection. As an application, we obtain convergence rates for two-timescale RL algorithms such as GTD(0), GTD2, and TDC.
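
To make the "two parts updated with distinct stepsizes" concrete, here is a minimal sketch of a generic two-timescale SA iteration on a toy scalar problem. It is not taken from the paper: the drift functions, the noise model, the polynomial stepsizes n^(-a) and n^(-b), and the name `two_timescale_sa` are all assumptions chosen for illustration. The slow iterate `theta` uses the smaller stepsize and the fast iterate `w` uses the larger one, so `w` approximately tracks its equilibrium for the current `theta`.

```python
import random


def two_timescale_sa(n_steps=20000, a=0.9, b=0.6, noise=0.1, seed=0):
    """Toy two-timescale SA (illustrative assumption, not the paper's setup).

    Slow iterate:  theta <- theta + alpha_n * ((1 - w)     + noise)
    Fast iterate:  w     <- w     + beta_n  * ((theta - w) + noise)
    The noiseless coupled system has the unique fixed point theta = w = 1.
    """
    rng = random.Random(seed)
    theta, w = 0.0, 0.0
    for n in range(1, n_steps + 1):
        alpha = n ** (-a)  # slow stepsize (decays faster since a > b)
        beta = n ** (-b)   # fast stepsize; alpha / beta -> 0 as n grows
        g_theta = (1.0 - w) + noise * rng.gauss(0.0, 1.0)
        g_w = (theta - w) + noise * rng.gauss(0.0, 1.0)
        # Both parts are updated from the same previous (theta, w) pair.
        theta, w = theta + alpha * g_theta, w + beta * g_w
    return theta, w


if __name__ == "__main__":
    theta, w = two_timescale_sa()
    print(f"theta = {theta:.3f}, w = {w:.3f}")  # both should end up close to 1
```

Algorithms such as GTD(0), GTD2, and TDC follow the same pattern with vector-valued iterates; the exponents `a` and `b` here are hypothetical knobs for experimenting with how the stepsize choice affects convergence.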

Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:1703.05376 [cs.AI]
	(or arXiv:1703.05376v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.1703.05376

Submission history

From: Gal Dalal
[v1] Wed, 15 Mar 2017 20:23:45 UTC (42 KB)
[v2] Wed, 31 May 2017 16:35:17 UTC (59 KB)
[v3] Thu, 7 Sep 2017 07:12:14 UTC (59 KB)
[v4] Wed, 28 Feb 2018 12:13:00 UTC (381 KB)
[v5] Mon, 4 Jun 2018 18:33:57 UTC (285 KB)
