Data-Efficient Policy Evaluation Through Behavior Policy Search.

scholar.google.com › citations

Data-efficient policy evaluation through behavior policy …
Hanna · Cited by 47

Data-Efficient Policy Evaluation Through Behavior Policy Search

Jun 12, 2017 · We present a behavior policy search algorithm and empirically demonstrate its effectiveness in lowering the mean squared error of policy ...

[PDF] Data-Efficient Policy Evaluation Through Behavior Policy Search

proceedings.mlr.press › ...

We consider the task of evaluating a policy for a Markov decision process (MDP). The standard unbiased technique for evaluating a policy is to.

[PDF] Data-Efficient Policy Evaluation Through Behavior Policy Search

arxiv.org › pdf

Jun 12, 2017 · We consider the task of evaluating a policy for a Markov decision process (MDP). The standard unbiased technique for evaluating a policy is ...

Data-efficient policy evaluation through behavior policy search

dl.acm.org › doi › abs

Aug 6, 2017 · We present a behavior policy search algorithm and empirically demonstrate its effectiveness in lowering the mean squared error of policy ...

[PDF] Data-efficient Policy Evaluation Through Behavior Policy Search

www.cs.utexas.edu › posters › hann...

Data-efficient Policy Evaluation Through Behavior Policy Search. JOSIAH HANNA ... search with Behavior Policy Gradient (BPG) to Monte Carlo policy evaluation ...

People also search for

Data efficient policy evaluation through behavior policy search arxiv

Data efficient policy evaluation through behavior policy search scott niekum

Data-Efficient Policy Evaluation Through Behavior Policy Search

www.researchgate.net › publication › 31...

We present a behavior policy search algorithm and empirically demonstrate its effectiveness in lowering the mean squared error of policy performance estimates.

[PDF] Data-Efficient Policy Evaluation Through Behavior Policy Search

www.semanticscholar.org › paper

A novel policy evaluation sub-problem is proposed, behavior policy search: searching for a behavior policy that reduces mean squared error, and it is shown ...

Data-Efficient Policy Evaluation Through Behavior Policy Search

www.research.ed.ac.uk › publications › d...

Aug 11, 2017 · Dive into the research topics of 'Data-Efficient Policy Evaluation Through Behavior Policy Search'. Together they form a unique fingerprint.

Data-Efficient Policy Evaluation Through Behavior Policy Search

www.cs.utexas.edu › ~pstone › Papers

We present a behavior policy search algorithm and empirically demonstrate its effectiveness in lowering the mean squared error of policy performance estimates.

[PDF] Data-efficient Policy Evaluation through Behavior Policy Search

pdfs.semanticscholar.org › ...

Aug 8, 2017 · Data-efficient Policy Evaluation through Behavior Policy Search. 1 ... Data-efficient Policy Evaluation through Behavior Policy Search. 20.

Scholarly articles for Data-Efficient Policy Evaluation Through Behavior Policy Search.

Data-Efficient Policy Evaluation Through Behavior Policy Search

[PDF] Data-Efficient Policy Evaluation Through Behavior Policy Search

[PDF] Data-Efficient Policy Evaluation Through Behavior Policy Search

Data-efficient policy evaluation through behavior policy search

[PDF] Data-efficient Policy Evaluation Through Behavior Policy Search

Data-Efficient Policy Evaluation Through Behavior Policy Search

[PDF] Data-Efficient Policy Evaluation Through Behavior Policy Search

Data-Efficient Policy Evaluation Through Behavior Policy Search

Data-Efficient Policy Evaluation Through Behavior Policy Search

[PDF] Data-efficient Policy Evaluation through Behavior Policy Search