Playing Atari with Deep Reinforcement Learning

Mnih, Volodymyr; Kavukcuoglu, Koray; Silver, David; Graves, Alex; Antonoglou, Ioannis; Wierstra, Daan; Riedmiller, Martin

Computer Science > Machine Learning

arXiv:1312.5602 (cs)

[Submitted on 19 Dec 2013]

Title:Playing Atari with Deep Reinforcement Learning

Authors:Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, Martin Riedmiller

View PDF

Abstract:We present the first deep learning model to successfully learn control policies directly from high-dimensional sensory input using reinforcement learning. The model is a convolutional neural network, trained with a variant of Q-learning, whose input is raw pixels and whose output is a value function estimating future rewards. We apply our method to seven Atari 2600 games from the Arcade Learning Environment, with no adjustment of the architecture or learning algorithm. We find that it outperforms all previous approaches on six of the games and surpasses a human expert on three of them.

Comments:	NIPS Deep Learning Workshop 2013
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:1312.5602 [cs.LG]
	(or arXiv:1312.5602v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1312.5602

Submission history

From: Volodymyr Mnih [view email]
[v1] Thu, 19 Dec 2013 16:00:08 UTC (221 KB)

Computer Science > Machine Learning

Title:Playing Atari with Deep Reinforcement Learning

Submission history

Access Paper:

References & Citations

24 blog links

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Playing Atari with Deep Reinforcement Learning

Submission history

Access Paper:

References & Citations

24 blog links

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators