default search action

combined dblp search
author search
venue search
publication search

ask others

Arthur Guez

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[i24]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-02035
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-02035
Khimya Khetarpal, Zhaohan Daniel Guo, Bernardo Ávila Pires, Yunhao Tang, Clare Lyle, Mark Rowland, Nicolas Heess, Diana Borsa, Arthur Guez, Will Dabney:
A Unifying Framework for Action-Conditional Self-Predictive Reinforcement Learning. CoRR abs/2406.02035 (2024)
2023
[i23]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-10587
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-10587
Veronica Chelu, Tom Zahavy, Arthur Guez, Doina Precup, Sebastian Flennerhag:
Optimism and Adaptivity in Policy Optimization. CoRR abs/2306.10587 (2023)
2022
[c21]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/0004PMHPKG22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/0004PMHPKG22
Jongmin Lee, Cosmin Paduraru, Daniel J. Mankowitz, Nicolas Heess, Doina Precup, Kee-Eung Kim, Arthur Guez:
COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation. ICLR 2022
[c20]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/DanihelkaGSS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/DanihelkaGSS22
Ivo Danihelka, Arthur Guez, Julian Schrittwieser, David Silver:
Policy improvement by planning with Gumbel. ICLR 2022
[c19]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/GoyalFBWKBGMHKV22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/GoyalFBWKBGMHKV22
Anirudh Goyal, Abram L. Friesen, Andrea Banino, Theophane Weber, Nan Rosemary Ke, Adrià Puigdomènech Badia, Arthur Guez, Mehdi Mirza, Peter Conway Humphreys, Ksenia Konyushkova, Michal Valko, Simon Osindero, Timothy P. Lillicrap, Nicolas Heess, Charles Blundell:
Retrieval-Augmented Reinforcement Learning. ICML 2022: 7740-7765
[c18]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/HumphreysGTSWL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/HumphreysGTSWL22
Peter Conway Humphreys, Arthur Guez, Olivier Tieleman, Laurent Sifre, Theophane Weber, Timothy P. Lillicrap:
Large-Scale Retrieval for Reinforcement Learning. NeurIPS 2022
[i22]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-08417
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-08417
Anirudh Goyal, Abram L. Friesen, Andrea Banino, Theophane Weber, Nan Rosemary Ke, Adrià Puigdomènech Badia, Arthur Guez, Mehdi Mirza, Ksenia Konyushkova, Michal Valko, Simon Osindero, Timothy P. Lillicrap, Nicolas Heess, Charles Blundell:
Retrieval-Augmented Reinforcement Learning. CoRR abs/2202.08417 (2022)
[i21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-08957
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-08957
Jongmin Lee, Cosmin Paduraru, Daniel J. Mankowitz, Nicolas Heess, Doina Precup, Kee-Eung Kim, Arthur Guez:
COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation. CoRR abs/2204.08957 (2022)
[i20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-05314
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-05314
Peter Conway Humphreys, Arthur Guez, Olivier Tieleman, Laurent Sifre, Théophane Weber, Timothy P. Lillicrap:
Large-Scale Retrieval for Reinforcement Learning. CoRR abs/2206.05314 (2022)
2021
[c17]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/HamrickFBGVWABV21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/HamrickFBGVWABV21
Jessica B. Hamrick, Abram L. Friesen, Feryal M. P. Behbahani, Arthur Guez, Fabio Viola, Sims Witherspoon, Thomas Anthony, Lars Holger Buesing, Petar Velickovic, Theophane Weber:
On the role of planning in model-based deep reinforcement learning. ICLR 2021
[c16]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/HesselDVGSSWSH21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/HesselDVGSSWSH21
Matteo Hessel, Ivo Danihelka, Fabio Viola, Arthur Guez, Simon Schmitt, Laurent Sifre, Theophane Weber, David Silver, Hado van Hasselt:
Muesli: Combining Improvements in Policy Optimization. ICML 2021: 4214-4226
[c15]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/MesnardWVTSHDSH21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/MesnardWVTSHDSH21
Thomas Mesnard, Theophane Weber, Fabio Viola, Shantanu Thakoor, Alaa Saade, Anna Harutyunyan, Will Dabney, Thomas S. Stepleton, Nicolas Heess, Arthur Guez, Eric Moulines, Marcus Hutter, Lars Buesing, Rémi Munos:
Counterfactual Credit Assignment in Model-Free Reinforcement Learning. ICML 2021: 7654-7664
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2104-06159
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-06159
Matteo Hessel, Ivo Danihelka, Fabio Viola, Arthur Guez, Simon Schmitt, Laurent Sifre, Theophane Weber, David Silver, Hado van Hasselt:
Muesli: Combining Improvements in Policy Optimization. CoRR abs/2104.06159 (2021)
2020
[j5]
- view
  authority control:
- export record
  dblp key:
  - journals/nature/SchrittwieserAH20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/nature/SchrittwieserAH20
Julian Schrittwieser, Ioannis Antonoglou, Thomas Hubert, Karen Simonyan, Laurent Sifre, Simon Schmitt, Arthur Guez, Edward Lockhart, Demis Hassabis, Thore Graepel, Timothy P. Lillicrap, David Silver:
Mastering Atari, Go, chess and shogi by planning with a learned model. Nat. 588(7839): 604-609 (2020)
[c14]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/GuezVWBKPSH20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/GuezVWBKPSH20
Arthur Guez, Fabio Viola, Theophane Weber, Lars Buesing, Steven Kapturowski, Doina Precup, David Silver, Nicolas Heess:
Value-driven Hindsight Modelling. NeurIPS 2020
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-08329
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-08329
Arthur Guez, Fabio Viola, Théophane Weber, Lars Buesing, Steven Kapturowski, Doina Precup, David Silver, Nicolas Heess:
Value-driven Hindsight Modelling. CoRR abs/2002.08329 (2020)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2009-05524
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-05524
Mehdi Mirza, Andrew Jaegle, Jonathan J. Hunt, Arthur Guez, Saran Tunyasuvunakool, Alistair Muldal, Théophane Weber, Péter Karkus, Sébastien Racanière, Lars Buesing, Timothy P. Lillicrap, Nicolas Heess:
Physically Embedded Planning Problems: New Challenges for Reinforcement Learning. CoRR abs/2009.05524 (2020)
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-01298
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-01298
Péter Karkus, Mehdi Mirza, Arthur Guez, Andrew Jaegle, Timothy P. Lillicrap, Lars Buesing, Nicolas Heess, Theophane Weber:
Beyond Tabula-Rasa: a Modular Reinforcement Learning Approach for Physically Embedded 3D Sokoban. CoRR abs/2010.01298 (2020)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-04021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-04021
Jessica B. Hamrick, Abram L. Friesen, Feryal M. P. Behbahani, Arthur Guez, Fabio Viola, Sims Witherspoon, Thomas Anthony, Lars Buesing, Petar Velickovic, Théophane Weber:
On the role of planning in model-based deep reinforcement learning. CoRR abs/2011.04021 (2020)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-09464
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-09464
Thomas Mesnard, Théophane Weber, Fabio Viola, Shantanu Thakoor, Alaa Saade, Anna Harutyunyan, Will Dabney, Tom Stepleton, Nicolas Heess, Arthur Guez, Marcus Hutter, Lars Buesing, Rémi Munos:
Counterfactual Credit Assignment in Model-Free Reinforcement Learning. CoRR abs/2011.09464 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c13]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/BuesingWZHRGL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/BuesingWZHRGL19
Lars Buesing, Theophane Weber, Yori Zwols, Nicolas Heess, Sébastien Racanière, Arthur Guez, Jean-Baptiste Lespiau:
Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search. ICLR (Poster) 2019
[c12]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/GuezMGKRWRSOEWS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/GuezMGKRWRSOEWS19
Arthur Guez, Mehdi Mirza, Karol Gregor, Rishabh Kabra, Sébastien Racanière, Theophane Weber, David Raposo, Adam Santoro, Laurent Orseau, Tom Eccles, Greg Wayne, David Silver, Timothy P. Lillicrap:
An Investigation of Model-Free Planning. ICML 2019: 2464-2473
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1901-03559
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1901-03559
Arthur Guez, Mehdi Mirza, Karol Gregor, Rishabh Kabra, Sébastien Racanière, Théophane Weber, David Raposo, Adam Santoro, Laurent Orseau, Tom Eccles, Greg Wayne, David Silver, Timothy P. Lillicrap:
An investigation of model-free planning. CoRR abs/1901.03559 (2019)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1910-00528
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-00528
Shruti Mishra, Abbas Abdolmaleki, Arthur Guez, Piotr Trochim, Doina Precup:
Augmenting learning using symmetry in a biologically-inspired domain. CoRR abs/1910.00528 (2019)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1911-08265
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-08265
Julian Schrittwieser, Ioannis Antonoglou, Thomas Hubert, Karen Simonyan, Laurent Sifre, Simon Schmitt, Arthur Guez, Edward Lockhart, Demis Hassabis, Thore Graepel, Timothy P. Lillicrap, David Silver:
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model. CoRR abs/1911.08265 (2019)
2018
[c11]
- view
  - electronic edition @ mindmodeling.org (archived)
  - details & citations
- export record
  dblp key:
  - conf/cogsci/KruscheSGS18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cogsci/KruscheSGS18
Moritz Krusche, Eric Schulz, Arthur Guez, Maarten Speekenbrink:
Adaptive planning in human search. CogSci 2018
[c10]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/GuezWASVWMS18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/GuezWASVWMS18
Arthur Guez, Theophane Weber, Ioannis Antonoglou, Karen Simonyan, Oriol Vinyals, Daan Wierstra, Rémi Munos, David Silver:
Learning to Search with MCTSnets. ICML 2018: 1817-1826
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1802-04697
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1802-04697
Arthur Guez, Théophane Weber, Ioannis Antonoglou, Karen Simonyan, Oriol Vinyals, Daan Wierstra, Rémi Munos, David Silver:
Learning to Search with MCTSnets. CoRR abs/1802.04697 (2018)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1811-06272
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-06272
Lars Buesing, Theophane Weber, Yori Zwols, Sébastien Racanière, Arthur Guez, Jean-Baptiste Lespiau, Nicolas Heess:
Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search. CoRR abs/1811.06272 (2018)
2017
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/nature/SilverSSAHGHBLB17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/nature/SilverSSAHGHBLB17
David Silver, Julian Schrittwieser, Karen Simonyan, Ioannis Antonoglou, Aja Huang, Arthur Guez, Thomas Hubert, Lucas Baker, Matthew Lai, Adrian Bolton, Yutian Chen, Timothy P. Lillicrap, Fan Hui, Laurent Sifre, George van den Driessche, Thore Graepel, Demis Hassabis:
Mastering the game of Go without human knowledge. Nat. 550(7676): 354-359 (2017)
[c9]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/SilverHHSGHDRRB17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/SilverHHSGHDRRB17
David Silver, Hado van Hasselt, Matteo Hessel, Tom Schaul, Arthur Guez, Tim Harley, Gabriel Dulac-Arnold, David P. Reichert, Neil C. Rabinowitz, André Barreto, Thomas Degris:
The Predictron: End-To-End Learning and Planning. ICML 2017: 3191-3199
[c8]
- view
- export record
  dblp key:
  - conf/nips/RacaniereWRBGRB17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/RacaniereWRBGRB17
Sébastien Racanière, Theophane Weber, David P. Reichert, Lars Buesing, Arthur Guez, Danilo Jimenez Rezende, Adrià Puigdomènech Badia, Oriol Vinyals, Nicolas Heess, Yujia Li, Razvan Pascanu, Peter W. Battaglia, Demis Hassabis, David Silver, Daan Wierstra:
Imagination-Augmented Agents for Deep Reinforcement Learning. NIPS 2017: 5690-5701
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/WeberRRBGRBVHLP17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/WeberRRBGRBVHLP17
Theophane Weber, Sébastien Racanière, David P. Reichert, Lars Buesing, Arthur Guez, Danilo Jimenez Rezende, Adrià Puigdomènech Badia, Oriol Vinyals, Nicolas Heess, Yujia Li, Razvan Pascanu, Peter W. Battaglia, David Silver, Daan Wierstra:
Imagination-Augmented Agents for Deep Reinforcement Learning. CoRR abs/1707.06203 (2017)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1712-01815
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1712-01815
David Silver, Thomas Hubert, Julian Schrittwieser, Ioannis Antonoglou, Matthew Lai, Arthur Guez, Marc Lanctot, Laurent Sifre, Dharshan Kumaran, Thore Graepel, Timothy P. Lillicrap, Karen Simonyan, Demis Hassabis:
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm. CoRR abs/1712.01815 (2017)
2016
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/nature/SilverHMGSDSAPL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/nature/SilverHMGSDSAPL16
David Silver, Aja Huang, Chris J. Maddison, Arthur Guez, Laurent Sifre, George van den Driessche, Julian Schrittwieser, Ioannis Antonoglou, Vedavyas Panneershelvam, Marc Lanctot, Sander Dieleman, Dominik Grewe, John Nham, Nal Kalchbrenner, Ilya Sutskever, Timothy P. Lillicrap, Madeleine Leach, Koray Kavukcuoglu, Thore Graepel, Demis Hassabis:
Mastering the game of Go with deep neural networks and tree search. Nat. 529(7587): 484-489 (2016)
[c7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/BellemareOGTM16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/BellemareOGTM16
Marc G. Bellemare, Georg Ostrovski, Arthur Guez, Philip S. Thomas, Rémi Munos:
Increasing the Action Gap: New Operators for Reinforcement Learning. AAAI 2016: 1476-1483
[c6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/HasseltGS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/HasseltGS16
Hado van Hasselt, Arthur Guez, David Silver:
Deep Reinforcement Learning with Double Q-Learning. AAAI 2016: 2094-2100
[c5]
- view
- export record
  dblp key:
  - conf/nips/HasseltGHMS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/HasseltGHMS16
Hado van Hasselt, Arthur Guez, Matteo Hessel, Volodymyr Mnih, David Silver:
Learning values across many orders of magnitude. NIPS 2016: 4287-4295
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/HasseltGHS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/HasseltGHS16
Hado van Hasselt, Arthur Guez, Matteo Hessel, David Silver:
Learning functions across many orders of magnitudes. CoRR abs/1602.07714 (2016)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/SilverHHSGHDRRB16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/SilverHHSGHDRRB16
David Silver, Hado van Hasselt, Matteo Hessel, Tom Schaul, Arthur Guez, Tim Harley, Gabriel Dulac-Arnold, David P. Reichert, Neil C. Rabinowitz, André Barreto, Thomas Degris:
The Predictron: End-To-End Learning and Planning. CoRR abs/1612.08810 (2016)
2015
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/HasseltGS15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/HasseltGS15
Hado van Hasselt, Arthur Guez, David Silver:
Deep Reinforcement Learning with Double Q-learning. CoRR abs/1509.06461 (2015)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/BellemareOGTM15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/BellemareOGTM15
Marc G. Bellemare, Georg Ostrovski, Arthur Guez, Philip S. Thomas, Rémi Munos:
Increasing the Action Gap: New Operators for Reinforcement Learning. CoRR abs/1512.04860 (2015)
2014
[c4]
- view
- export record
  dblp key:
  - conf/nips/GuezHSD14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/GuezHSD14
Arthur Guez, Nicolas Heess, David Silver, Peter Dayan:
Bayes-Adaptive Simulation-based Search with Value Function Approximation. NIPS 2014: 451-459
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/GuezSD14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/GuezSD14
Arthur Guez, David Silver, Peter Dayan:
Better Optimism By Bayes: Adaptive Planning with Rich Models. CoRR abs/1402.1958 (2014)
2013
[j2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/jair/GuezSD13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jair/GuezSD13
Arthur Guez, David Silver, Peter Dayan:
Scalable and Efficient Bayes-Adaptive Reinforcement Learning Based on Monte-Carlo Tree Search. J. Artif. Intell. Res. 48: 841-883 (2013)
2012
[c3]
- view
- export record
  dblp key:
  - conf/nips/GuezSD12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/GuezSD12
Arthur Guez, David Silver, Peter Dayan:
Efficient Bayes-Adaptive Reinforcement Learning using Sample-Based Search. NIPS 2012: 1034-1042
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1205-3109
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1205-3109
Arthur Guez, David Silver, Peter Dayan:
Efficient Bayes-Adaptive Reinforcement Learning using Sample-Based Search. CoRR abs/1205.3109 (2012)
2010
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/icra/GuezP10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icra/GuezP10
Arthur Guez, Joelle Pineau:
Multi-tasking SLAM. ICRA 2010: 377-384

2000 – 2009

see FAQ

What is the meaning of the colors in the publication lists?

2009
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/ijns/PineauGVPA09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ijns/PineauGVPA09
Joelle Pineau, Arthur Guez, Robert D. Vincent, Gabriella Panuccio, Massimo Avoli:
Treating Epilepsy via Adaptive Neurostimulation: a Reinforcement Learning Approach. Int. J. Neural Syst. 19(4): 227-240 (2009)
2008
[c1]
- view
  - electronic edition @ aaai.org (archived)
  - details & citations
- export record
  dblp key:
  - conf/aaai/GuezVAP08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/GuezVAP08
Arthur Guez, Robert D. Vincent, Massimo Avoli, Joelle Pineau:
Adaptive Treatment of Epilepsy via Batch-mode Reinforcement Learning. AAAI 2008: 1671-1678

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.