Will it Blend? Composing Value Functions in Reinforcement Learning

van Niekerk, Benjamin; James, Steven; Earle, Adam; Rosman, Benjamin

Computer Science > Machine Learning

arXiv:1807.04439 (cs)

[Submitted on 12 Jul 2018]

Title:Will it Blend? Composing Value Functions in Reinforcement Learning

Authors:Benjamin van Niekerk, Steven James, Adam Earle, Benjamin Rosman

View PDF

Abstract:An important property for lifelong-learning agents is the ability to combine existing skills to solve unseen tasks. In general, however, it is unclear how to compose skills in a principled way. We provide a "recipe" for optimal value function composition in entropy-regularised reinforcement learning (RL) and then extend this to the standard RL setting. Composition is demonstrated in a video game environment, where an agent with an existing library of policies is able to solve new tasks without the need for further learning.

Comments:	The 2nd Lifelong Learning: A Reinforcement Learning Approach (LLARLA) Workshop, Stockholm, Sweden, FAIM 2018
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1807.04439 [cs.LG]
	(or arXiv:1807.04439v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1807.04439

Submission history

From: Steven James [view email]
[v1] Thu, 12 Jul 2018 06:43:12 UTC (2,793 KB)

Computer Science > Machine Learning

Title:Will it Blend? Composing Value Functions in Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Will it Blend? Composing Value Functions in Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators