Protecting against evaluation overfitting in empirical reinforcement learning

Whiteson, S.; Tanner, B.; Taylor, M.E.; Stone, P.

doi:https://doi.org/10.1109/ADPRL.2011.5967363

item 1 out of 1

Author: S. Whiteson
B. Tanner
M.E. Taylor
P. Stone
Year: 2011
Title: Protecting against evaluation overfitting in empirical reinforcement learning
Event: 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL 2011), Paris, France
Book/source title: Proceedings of the IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL 2011)
Pages (from-to): 120-127
Publisher: IEEE
ISBN: 9781424498871
Document type: Conference contribution
Faculty: Faculty of Science (FNWI)
Institute: Informatics Institute (IVI)
Abstract: Empirical evaluations play an important role in machine learning. However, the usefulness of any evaluation depends on the empirical methodology employed. Designing good empirical methodologies is difficult in part because agents can overfit test evaluations and thereby obtain misleadingly high scores. We argue that reinforcement learning is particularly vulnerable to environment overfitting and propose as a remedy generalized methodologies, in which evaluations are based on multiple environments sampled from a distribution. In addition, we consider how to summarize performance when scores from different environments may not have commensurate values. Finally, we present proof-of-concept results demonstrating how these methodologies can validate an intuitively useful range-adaptive tile coding method.
URL: go to publisher's site
Language: English
Persistent Identifier: https://hdl.handle.net/11245/1.345492

Disclaimer/Complaints regulations

If you believe that digital publication of certain material infringes any of your rights or (privacy) interests, please let the Library know, stating your reasons. In case of a legitimate complaint, the Library will make the material inaccessible and/or remove it from the website. Please Ask the Library, or send a letter to: Library of the University of Amsterdam, Secretariat, Singel 425, 1012 WP Amsterdam, The Netherlands. You will be contacted as soon as possible.