Identification and Off-Policy Learning of Multiple Objectives Using Adaptive Clustering

Karimpanal, Thommen George; Wilhelm, Erik

doi:10.1016/j.neucom.2017.04.074

Computer Science > Artificial Intelligence

arXiv:1705.06342 (cs)

[Submitted on 17 May 2017]

Title:Identification and Off-Policy Learning of Multiple Objectives Using Adaptive Clustering

Authors:Thommen George Karimpanal, Erik Wilhelm

View PDF

Abstract:In this work, we present a methodology that enables an agent to make efficient use of its exploratory actions by autonomously identifying possible objectives in its environment and learning them in parallel. The identification of objectives is achieved using an online and unsupervised adaptive clustering algorithm. The identified objectives are learned (at least partially) in parallel using Q-learning. Using a simulated agent and environment, it is shown that the converged or partially converged value function weights resulting from off-policy learning can be used to accumulate knowledge about multiple objectives without any additional exploration. We claim that the proposed approach could be useful in scenarios where the objectives are initially unknown or in real world scenarios where exploration is typically a time and energy intensive process. The implications and possible extensions of this work are also briefly discussed.

Comments:	Accepted in Neurocomputing: Special Issue on Multiobjective Reinforcement Learning: Theory and Applications, 24 pages, 6 figures
Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:1705.06342 [cs.AI]
	(or arXiv:1705.06342v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.1705.06342
Journal reference:	Neurocomputing 263, 39-47, 2017
Related DOI:	https://doi.org/10.1016/j.neucom.2017.04.074

Submission history

From: Thommen Karimpanal George [view email]
[v1] Wed, 17 May 2017 20:55:15 UTC (601 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.AI

< prev | next >

new | recent | 2017-05

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Thommen George Karimpanal
Erik Wilhelm

export BibTeX citation

Computer Science > Artificial Intelligence

Title:Identification and Off-Policy Learning of Multiple Objectives Using Adaptive Clustering

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Identification and Off-Policy Learning of Multiple Objectives Using Adaptive Clustering

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators