Self-organization of action hierarchy and compositionality by reinforcement learning with recurrent neural networks

Han, Dongqi; Doya, Kenji; Tani, Jun

Computer Science > Machine Learning

arXiv:1901.10113 (cs)

[Submitted on 29 Jan 2019 (v1), last revised 26 Nov 2019 (this version, v6)]

Title:Self-organization of action hierarchy and compositionality by reinforcement learning with recurrent neural networks

Authors:Dongqi Han, Kenji Doya, Jun Tani

View PDF

Abstract:Recurrent neural networks (RNNs) for reinforcement learning (RL) have shown distinct advantages, e.g., solving memory-dependent tasks and meta-learning. However, little effort has been spent on improving RNN architectures and on understanding the underlying neural mechanisms for performance gain. In this paper, we propose a novel, multiple-timescale, stochastic RNN for RL. Empirical results show that the network can autonomously learn to abstract sub-goals and can self-develop an action hierarchy using internal dynamics in a challenging continuous control task. Furthermore, we show that the self-developed compositionality of the network enhances faster re-learning when adapting to a new task that is a re-composition of previously learned sub-goals, than when starting from scratch. We also found that improved performance can be achieved when neural activities are subject to stochastic rather than deterministic dynamics.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1901.10113 [cs.LG]
	(or arXiv:1901.10113v6 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1901.10113

Submission history

From: Dongqi Han [view email]
[v1] Tue, 29 Jan 2019 05:34:47 UTC (6,486 KB)
[v2] Tue, 12 Feb 2019 07:18:16 UTC (6,486 KB)
[v3] Tue, 5 Mar 2019 13:54:51 UTC (4,750 KB)
[v4] Fri, 3 May 2019 09:58:53 UTC (3,301 KB)
[v5] Thu, 23 May 2019 15:32:04 UTC (8,209 KB)
[v6] Tue, 26 Nov 2019 08:31:31 UTC (8,154 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2019-01

Change to browse by:

cs
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Dongqi Han
Kenji Doya
Jun Tani

export BibTeX citation

Computer Science > Machine Learning

Title:Self-organization of action hierarchy and compositionality by reinforcement learning with recurrent neural networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Self-organization of action hierarchy and compositionality by reinforcement learning with recurrent neural networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators