World Discovery Models

Azar, Mohammad Gheshlaghi; Piot, Bilal; Pires, Bernardo Avila; Grill, Jean-Bastien; Altché, Florent; Munos, Rémi

arXiv:1902.07685 (cs)

[Submitted on 20 Feb 2019 (v1), last revised 1 Mar 2019 (this version, v3)]

Abstract: As humans, we are driven by a strong desire to seek novelty in our world. Moreover, upon observing a novel pattern, we are capable of refining our understanding of the world based on the new information: humans can discover their world. The outstanding ability of the human mind for discovery has led to many breakthroughs in science, art and technology. Here we investigate the possibility of building an agent capable of discovering its world using modern AI technology. In particular, we introduce NDIGO, Neural Differential Information Gain Optimisation, a self-supervised discovery model that seeks new information in order to construct a global view of its world from partial and noisy observations. Our experiments on controlled 2-D navigation tasks show that NDIGO outperforms state-of-the-art information-seeking methods in terms of the quality of the learned representation. The improvement in performance is particularly significant in the presence of white or structured noise, where other information-seeking methods follow the noise instead of discovering their world.

Subjects:	Artificial Intelligence (cs.AI); Applications (stat.AP); Machine Learning (stat.ML)
Cite as:	arXiv:1902.07685 [cs.AI]
	(or arXiv:1902.07685v3 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.1902.07685

Submission history

From: Mohammad Gheshlaghi Azar
[v1] Wed, 20 Feb 2019 18:07:18 UTC (4,600 KB) (withdrawn)
[v2] Thu, 21 Feb 2019 15:21:34 UTC (4,594 KB)
[v3] Fri, 1 Mar 2019 20:25:58 UTC (4,597 KB)
