High-Probability Sample Complexities for Policy Evaluation With Linear Function Approximation
Abstract
References
Recommendations
Least Squares Policy Evaluation Algorithms with Linear Function Approximation
We consider policy evaluation algorithms within the context of infinite-horizon dynamic programming problems with discounted cost. We focus on discrete-time dynamic systems with a large number of states, and we discuss two methods, which use simulation, ...
Variance-aware off-policy evaluation with linear function approximation
NIPS '21: Proceedings of the 35th International Conference on Neural Information Processing SystemsWe study the off-policy evaluation (OPE) problem in reinforcement learning with linear function approximation, which aims to estimate the value function of a target policy based on the offline data collected by a behavior policy. We propose to ...
Finite-sample analysis for SARSA with linear function approximation
NIPS'19: Proceedings of the 33rd International Conference on Neural Information Processing SystemsSARSA is an on-policy algorithm to learn a Markov decision process policy in reinforcement learning. We investigate the SARSA algorithm with linear function approximation under the non-i.i.d. data, where a single sample trajectory is available. With a ...
Comments
Information & Contributors
Information
Published In
Publisher
IEEE Press
Publication History
Qualifiers
- Research-article
Contributors
Other Metrics
Bibliometrics & Citations
Bibliometrics
Article Metrics
- 0Total Citations
- 0Total Downloads
- Downloads (Last 12 months)0
- Downloads (Last 6 weeks)0