New inference strategies for solving Markov Decision Processes using reversible jump MCMC

Hoffman, Matthias; Kueck, Hendrik; de Freitas, Nando; Doucet, Arnaud

Computer Science > Machine Learning

arXiv:1205.2643 (cs)

[Submitted on 9 May 2012]

Abstract: In this paper we build on previous work that uses inference techniques, in particular Markov Chain Monte Carlo (MCMC) methods, to solve parameterized control problems. We propose a number of modifications to make this approach more practical in general, higher-dimensional spaces. We first introduce a new target distribution that can incorporate more reward information from sampled trajectories. We also show how to break strong correlations between the policy parameters and the sampled trajectories in order to sample more freely. Finally, we show how to combine these techniques in a principled manner to obtain estimates of the optimal policy.
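The general idea of solving a parameterized control problem by sampling can be illustrated with a small sketch. It is not the reversible jump sampler or the target distribution proposed in the paper. It assumes one common formulation in which a non-negative reward acts as an unnormalized likelihood, so that policy parameters are drawn from a density proportional to prior times expected reward. The sampler used is plain pseudo-marginal Metropolis-Hastings, and the one-step dynamics, the goal at 2.0, the Gaussian prior, and all function names are hypothetical choices made for illustration.

```python
import math
import random


def simulate_reward(theta, rng):
    """Roll out one single-step trajectory and return its non-negative reward."""
    # Hypothetical toy dynamics: the state reached is the action plus noise.
    x = theta + rng.gauss(0.0, 0.5)
    # Hypothetical reward: largest when the state lands on the goal at 2.0.
    return math.exp(-0.5 * (x - 2.0) ** 2)


def estimate_target(theta, rng, n_rollouts=20):
    """Monte Carlo estimate of prior(theta) * E[reward | theta] (unnormalized)."""
    prior = math.exp(-0.5 * (theta / 5.0) ** 2)  # broad Gaussian prior (assumed)
    total = sum(simulate_reward(theta, rng) for _ in range(n_rollouts))
    return prior * total / n_rollouts


def sample_policies(n_steps=5000, step_size=0.5, seed=0):
    """Pseudo-marginal Metropolis-Hastings over the policy parameter theta."""
    rng = random.Random(seed)
    theta = 0.0
    weight = estimate_target(theta, rng)
    samples = []
    for _ in range(n_steps):
        # Symmetric random-walk proposal, so the proposal ratio cancels.
        proposal = theta + rng.gauss(0.0, step_size)
        proposal_weight = estimate_target(proposal, rng)
        # Accept with probability min(1, proposal_weight / weight). The stored
        # estimate for the current state is reused rather than recomputed.
        if rng.random() * weight < proposal_weight:
            theta, weight = proposal, proposal_weight
        samples.append(theta)
    return samples
```

Calling sample_policies() and discarding an initial stretch of the chain (for example the first 1000 draws) leaves samples that tend to concentrate around parameters with high expected reward, here near the hypothetical goal. In this toy setting each parameter is scored only through averaged rollouts that are redrawn for every proposal.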

Comments:	Appears in Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence (UAI2009)
Subjects:	Machine Learning (cs.LG); Systems and Control (eess.SY); Optimization and Control (math.OC); Computation (stat.CO); Machine Learning (stat.ML)
Report number:	UAI-P-2009-PG-223-231
Cite as:	arXiv:1205.2643 [cs.LG]
	(or arXiv:1205.2643v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1205.2643

Submission history

From: Matthias Hoffman [via AUAI proxy]
[v1] Wed, 9 May 2012 15:26:47 UTC (2,894 KB)
