Selecting the State-Representation in Reinforcement Learning

Maillard, Odalric-Ambrym; Munos, Rémi; Ryabko, Daniil

arXiv:1302.2552 (cs)

[Submitted on 11 Feb 2013]

Abstract: The problem of selecting the right state-representation in a reinforcement learning problem is considered. Several models of the observations (functions mapping past observations to a finite set) are given, and it is known that for at least one of these models the resulting state dynamics are indeed Markovian. Knowing neither which of the models is the correct one nor the probabilistic characteristics of the resulting MDP, it is required to obtain as much reward as the optimal policy for the correct model (or for the best of the correct models, if there are several). We propose an algorithm that achieves this with a regret of order T^{2/3}, where T is the time horizon.
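
As an illustration of what a "model" means in the abstract (a function mapping past observations to a finite set), the following minimal Python sketch defines a few candidate state-representations and applies them to one observation history. It is not the algorithm proposed in the paper. The names `last_k`, `parity`, `models`, and `history` are hypothetical, observations are assumed to come from a finite alphabet (here 0 and 1), and actions and rewards are left out of the history for brevity.

```python
from typing import Callable, Dict, Hashable, Sequence, Tuple

# Assumption: a history is the sequence of past observations, oldest first.
History = Sequence[int]
# A model maps a history to a state drawn from a finite set.
Model = Callable[[History], Hashable]


def last_k(k: int) -> Model:
    """Hypothetical model: the state is the tuple of the last k observations."""
    def phi(history: History) -> Tuple[int, ...]:
        # history[-0:] would return the whole history, so k = 0 is handled separately.
        return tuple(history[-k:]) if k > 0 else ()
    return phi


def parity(history: History) -> int:
    """Hypothetical model: the state is the parity of the sum of all observations."""
    return sum(history) % 2


# Candidate models handed to the learner; which one (if any) induces
# Markovian state dynamics is not known in advance.
models: Dict[str, Model] = {
    "last_1": last_k(1),
    "last_2": last_k(2),
    "parity": parity,
}

history = [0, 1, 1, 0, 1]
states = {name: phi(history) for name, phi in models.items()}

for name, state in states.items():
    print(f"{name}: {state}")
```

Each model compresses the same history into a different state. In the setting of the abstract, the learner must act well without being told which of these compressions yields a Markov decision process.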

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:1302.2552 [cs.LG]
	(or arXiv:1302.2552v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1302.2552
Journal reference:	NIPS 2011, pp. 2627-2635

Submission history

From: Daniil Ryabko
[v1] Mon, 11 Feb 2013 17:49:38 UTC (24 KB)

