Posterior Sampling for Large Scale Reinforcement Learning

Theocharous, Georgios; Wen, Zheng; Abbasi-Yadkori, Yasin; Vlassis, Nikos

Computer Science > Machine Learning

arXiv:1711.07979 (cs)

[Submitted on 21 Nov 2017 (v1), last revised 22 Oct 2018 (this version, v3)]

Title:Posterior Sampling for Large Scale Reinforcement Learning

Authors:Georgios Theocharous, Zheng Wen, Yasin Abbasi-Yadkori, Nikos Vlassis

View PDF

Abstract:We propose a practical non-episodic PSRL algorithm that unlike recent state-of-the-art PSRL algorithms uses a deterministic, model-independent episode switching schedule. Our algorithm termed deterministic schedule PSRL (DS-PSRL) is efficient in terms of time, sample, and space complexity. We prove a Bayesian regret bound under mild assumptions. Our result is more generally applicable to multiple parameters and continuous state action problems. We compare our algorithm with state-of-the-art PSRL algorithms on standard discrete and continuous problems from the literature. Finally, we show how the assumptions of our algorithm satisfy a sensible parametrization for a large class of problems in sequential recommendations.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:1711.07979 [cs.LG]
	(or arXiv:1711.07979v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1711.07979

Submission history

From: Georgios Theocharous [view email]
[v1] Tue, 21 Nov 2017 00:43:24 UTC (791 KB)
[v2] Wed, 6 Dec 2017 23:55:15 UTC (794 KB)
[v3] Mon, 22 Oct 2018 22:06:00 UTC (229 KB)

Computer Science > Machine Learning

Title:Posterior Sampling for Large Scale Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Posterior Sampling for Large Scale Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators