In this paper, we bring techniques from operations research to bear on the problem of choosing optimal actions in partially observable stochastic domains. We begin by introducing the theory of Markov decision processes (MDPs) and partially observable MDPs (POMDPs). We then outline a novel algorithm for solving POMDPs off line and show how, in some cases, a finite-memory controller can be extracted from the solution to a POMDP. We conclude with a discussion of the complexity of finding exact solutions to POMDPs and of some possibilities for finding approximate solutions.
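The POMDP machinery the abstract refers to rests on maintaining a belief state, a probability distribution over hidden states, updated by Bayes' rule after each action and observation. As a minimal sketch (a toy two-state "tiger"-style problem with illustrative numbers, not the paper's own algorithm), the update is b'(s') ∝ O(s', o) Σ_s T(s, s') b(s):

```python
import numpy as np

# Illustrative two-state POMDP (hypothetical toy problem, not from the paper):
# states: 0 = tiger-left, 1 = tiger-right. The "listen" action leaves the
# state unchanged; observations are noisy hints about the tiger's location.
T_listen = np.array([[1.0, 0.0],    # T[s, s']: transition probabilities
                     [0.0, 1.0]])
O_listen = np.array([[0.85, 0.15],  # O[s', o]: P(hear-left / hear-right | s')
                     [0.15, 0.85]])

def belief_update(b, T, O, obs):
    """One Bayes-filter step: b'(s') ∝ O[s', obs] * sum_s T[s, s'] * b(s)."""
    unnormalized = O[:, obs] * (T.T @ b)
    return unnormalized / unnormalized.sum()

b = np.array([0.5, 0.5])                          # uniform prior
b = belief_update(b, T_listen, O_listen, obs=0)   # hear a noise on the left
print(np.round(b, 3))                             # → [0.85 0.15]
```

Because the belief state is a sufficient statistic for the history of actions and observations, a POMDP can be recast as a fully observable MDP over belief space, which is the reformulation the off-line solution methods in the paper build on.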
Cited By
- Schmidhuber J (2015). Deep learning in neural networks, Neural Networks, 61:C, (85-117), Online publication date: 1-Jan-2015.
- Paquet S, Tobin L and Chaib-draa B An online POMDP algorithm used by the police force agents in the RoboCupRescue simulation RoboCup 2005, (196-207)
- Dini D, Lent M, Carpenter P and Iyer K Building robust planning and execution systems for virtual worlds Proceedings of the Second AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment, (29-35)
- Youngblood G, Cook D and Holder L (2005). Managing Adaptive Versatile environments, Pervasive and Mobile Computing, 1:4, (373-403), Online publication date: 1-Dec-2005.
- Paquet S, Tobin L and Chaib-draa B An online POMDP algorithm for complex multiagent environments Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems, (970-977)
- Yang Q and Cheng H Mining Plans for Customer-Class Transformation Proceedings of the Third IEEE International Conference on Data Mining
- McMahan H, Gordon G and Blum A Planning in the presence of cost functions controlled by an adversary Proceedings of the Twentieth International Conference on Machine Learning, (536-543)
- Hansen E Solving POMDPs by searching in policy space Proceedings of the Fourteenth conference on Uncertainty in artificial intelligence, (211-219)
- Schmidhuber J, Zhao J and Wiering M (1997). Shifting Inductive Bias with Success-Story Algorithm, Adaptive Levin Search, and Incremental Self-Improvement, Machine Learning, 28:1, (105-130), Online publication date: 1-Jul-1997.
- Goldsmith J, Littman M and Mundhenk M The complexity of plan existence and evaluation in probabilistic domains Proceedings of the Thirteenth conference on Uncertainty in artificial intelligence, (182-189)
- Poole D A framework for decision-theoretic planning I Proceedings of the Twelfth international conference on Uncertainty in artificial intelligence, (436-445)