Online Estimation and Inference for Robust Policy Evaluation in Reinforcement Learning.

AllBooks News Videos Images Maps Shopping

Online Estimation and Inference for Robust Policy Evaluation in ... - arXiv

Oct 4, 2023 · This paper bridges the gap between robust statistics and statistical inference in reinforcement learning, offering a more versatile and reliable ...

[PDF] Online Estimation and Inference for Robust Policy Evaluation in ... - arXiv

arxiv.org › pdf

Oct 4, 2023 · proposed algorithm for robust policy evaluation in reinforcement learning. Theoretical results on convergence rates, asymptotic normality ...

Online Estimation and Inference for Robust Policy Evaluation in ... - PolyU

www.polyu.edu.hk › ama › events › 202...

Jan 31, 2024 · Online Estimation and Inference for Robust Policy Evaluation in Reinforcement Learning. Distinguished Lecture / Joint Seminar Series.

Online Bootstrap Inference For Policy Evaluation In Reinforcement Learning

www.tandfonline.com › doi › abs

In this article, we study the use of the online bootstrap method for inference in RL policy evaluation. In particular, we focus on the temporal difference (TD) ...

Online Bootstrap Inference For Policy Evaluation in Reinforcement Learning

www.researchgate.net › publication › 35...

In this paper, we study the use of the online bootstrap method for statistical inference in RL. In particular, we focus on the temporal difference (TD) learning ...

[PDF] Online Bootstrap Inference For Policy Evaluation In Reinforcement Learning

par.nsf.gov › servlets › purl

Jun 29, 2022 · (2018) proposed a statistical inference method for M-estimation problems based on fixed step-size SGD. Chen et al. (2020) derived two kinds of ...

Seminar on Statistics - Online Estimation and Inference for Robust ...

calendar.hkust.edu.hk › events › departm...

Jul 18, 2023 · In this paper, we propose a robust policy evaluation algorithm in reinforcement learning, to feature outlier contamination and heavy-tailed ...

[PDF] Reliable Off-policy Evaluation for Reinforcement Learning

optimization-online.org › 2021/01

In a sequential decision-making problem, off-policy evaluation estimates the expected cumulative reward of a target policy using logged trajectory data ...

Track: Reinforcement Learning - ICML 2024

icml.cc › virtual › session

We study representation learning for Offline Reinforcement Learning (RL), focusing on the important task of Offline Policy Evaluation (OPE).

[PDF] More Robust Doubly Robust Off-policy Evaluation

proceedings.mlr.press › ...

We study the problem of off-policy evaluation. (OPE) in reinforcement learning (RL), where the goal is to estimate the performance of a policy.