Think Too Fast Nor Too Slow: The Computational Trade-off Between Planning And Reinforcement Learning

Moerland, Thomas M.; Deichler, Anna; Baldi, Simone; Broekens, Joost; Jonker, Catholijn M.

arXiv:2005.07404 (cs)

[Submitted on 15 May 2020]

Abstract: Planning and reinforcement learning are two key approaches to sequential decision making. Multi-step approximate real-time dynamic programming, a recently successful algorithm class of which AlphaZero [Silver et al., 2018] is an example, combines both by nesting planning within a learning loop. However, the combination of planning and learning introduces a new question: how should we balance the time spent on planning, learning, and acting? The importance of this trade-off has not been explicitly studied before. We show that it is actually of key importance, with computational results indicating that we should plan neither too long nor too short. Conceptually, we identify a new spectrum of planning-learning algorithms which ranges from exhaustive search (long planning) to model-free RL (no planning), with optimal performance achieved midway.
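
The trade-off described in the abstract can be illustrated with simple budget accounting. The sketch below is not taken from the paper: the function name `split_budget`, the unit of "calls", and the assumption that each real environment step costs a fixed number of planning calls plus one acting call are all hypothetical. It only shows how a fixed total budget moves between the two ends of the spectrum, and says nothing about which setting performs best.

```python
def split_budget(total_budget: int, plan_budget: int) -> dict:
    """Hypothetical accounting of a fixed computational budget.

    Assumption (not from the paper): every real environment step costs
    `plan_budget` planning calls plus exactly 1 acting call.
    """
    if total_budget < 1 or plan_budget < 0:
        raise ValueError("total_budget must be >= 1 and plan_budget >= 0")
    cost_per_step = plan_budget + 1
    # Number of real steps (and hence learning targets) we can afford.
    real_steps = total_budget // cost_per_step
    return {
        "plan_budget": plan_budget,
        "real_steps": real_steps,
        "planning_calls": real_steps * plan_budget,
    }


if __name__ == "__main__":
    # plan_budget = 0 corresponds to the model-free end (no planning);
    # plan_budget = total - 1 spends everything on a single decision.
    for n in (0, 9, 99, 999):
        print(split_budget(1000, n))
```

Under this accounting, a larger planning budget per step leaves fewer real steps to learn from, which is the tension the abstract points to.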

Subjects:	Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2005.07404 [cs.AI]
	(or arXiv:2005.07404v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2005.07404

Submission history

From: Thomas Moerland
[v1] Fri, 15 May 2020 08:20:08 UTC (2,385 KB)
