CS Ph.D. student at Cornell / interested in (offline) reinforcement learning and off-policy evaluation / scope-rl, awesome-offline-rl
Pinned Loading
-
hakuhodo-technologies/scope-rl
hakuhodo-technologies/scope-rl PublicSCOPE-RL: A python library for offline reinforcement learning, off-policy evaluation, and selection
-
hanjuku-kaso/awesome-offline-rl
hanjuku-kaso/awesome-offline-rl PublicAn index of algorithms for offline reinforcement learning (offline-rl)
-
wsdm2022-cascade-dr
wsdm2022-cascade-dr Public(WSDM2022 Best Paper Award Runner-Up) "Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model"
-
st-tech/zr-obp
st-tech/zr-obp PublicOpen Bandit Pipeline: a python library for bandit algorithms and off-policy evaluation
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.