Clustered Reinforcement Learning

Ma, Xiao; Zhao, Shen-Yi; Li, Wu-Jun

Computer Science > Machine Learning

arXiv:1906.02457 (cs)

[Submitted on 6 Jun 2019]

Abstract: Exploration strategy design is one of the challenging problems in reinforcement learning (RL), especially when the environment contains a large state space or sparse rewards. During exploration, the agent tries to discover novel areas or high-reward (quality) areas. In most existing methods, the novelty and quality in the neighboring area of the current state are not well utilized to guide the exploration of the agent. To tackle this problem, we propose a novel RL framework, called clustered reinforcement learning (CRL), for efficient exploration in RL. CRL adopts clustering to divide the collected states into several clusters, based on which a bonus reward reflecting both novelty and quality in the neighboring area (cluster) of the current state is given to the agent. Experiments on a continuous control task and several Atari 2600 games show that CRL can outperform other state-of-the-art methods to achieve the best performance in most cases.
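
The abstract describes the mechanism only in outline: collected states are clustered, and each state receives a bonus that reflects both the novelty and the quality of its cluster. The sketch below illustrates that idea in plain Python. It is not the paper's formula, which the abstract does not give. The function names (`kmeans`, `nearest`, `cluster_bonus`), the use of k-means, the parameters `beta` and `eta`, and the specific novelty term (inverse square root of the cluster's visit count) and quality term (mean reward in the cluster) are all assumptions made for illustration.

```python
import math
import random


def nearest(state, centers):
    """Index of the center closest to `state` (Euclidean distance)."""
    return min(range(len(centers)), key=lambda i: math.dist(state, centers[i]))


def kmeans(states, k, iters=20, seed=0):
    """Plain Lloyd's k-means over a list of equal-length float tuples.

    Assumption: k-means stands in for whatever clustering the paper uses.
    """
    rng = random.Random(seed)
    centers = rng.sample(states, k)
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for s in states:
            groups[nearest(s, centers)].append(s)
        # New center = mean of its members; an empty cluster keeps its old center.
        centers = [
            tuple(sum(dim) / len(g) for dim in zip(*g)) if g else c
            for g, c in zip(groups, centers)
        ]
    return centers


def cluster_bonus(states, rewards, centers, beta=0.1, eta=0.5):
    """Per-state bonus mixing cluster quality and cluster novelty.

    Hypothetical formula (not taken from the paper):
        bonus = beta * (eta * mean_reward_in_cluster
                        + (1 - eta) / sqrt(visits_to_cluster))
    """
    k = len(centers)
    counts = [0] * k
    reward_sums = [0.0] * k
    labels = [nearest(s, centers) for s in states]
    for label, r in zip(labels, rewards):
        counts[label] += 1
        reward_sums[label] += r
    bonuses = []
    for label in labels:
        novelty = 1.0 / math.sqrt(counts[label])      # rarely visited cluster -> larger
        quality = reward_sums[label] / counts[label]  # higher mean reward -> larger
        bonuses.append(beta * (eta * quality + (1 - eta) * novelty))
    return bonuses


if __name__ == "__main__":
    # Three states in a dense region and one isolated state, all with zero reward.
    states = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (5.0, 5.0)]
    rewards = [0.0, 0.0, 0.0, 0.0]
    centers = kmeans(states, k=2)
    print(cluster_bonus(states, rewards, centers, beta=1.0, eta=0.5))
```

With all rewards at zero, only the novelty term contributes, so the isolated state gets a larger bonus than the three states that share a cluster. Setting `eta` closer to 1 shifts the bonus toward clusters where higher rewards were observed. In a training loop, such a bonus would typically be added to the environment reward before the policy update.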

Comments:	16 pages, 3 figures
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:1906.02457 [cs.LG]
	(or arXiv:1906.02457v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1906.02457

Submission history

From: Xiao Ma
[v1] Thu, 6 Jun 2019 07:35:02 UTC (310 KB)
