default search action

combined dblp search
author search
venue search
publication search

ask others

Diana Borsa

Diana L. Borsa

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[i21]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-02035
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-02035
Khimya Khetarpal, Zhaohan Daniel Guo, Bernardo Ávila Pires, Yunhao Tang, Clare Lyle, Mark Rowland, Nicolas Heess, Diana Borsa, Arthur Guez, Will Dabney:
A Unifying Framework for Action-Conditional Self-Predictive Reinforcement Learning. CoRR abs/2406.02035 (2024)
2023
[c13]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/ChandakTGTMDB23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/ChandakTGTMDB23
Yash Chandak, Shantanu Thakoor, Zhaohan Daniel Guo, Yunhao Tang, Rémi Munos, Will Dabney, Diana L. Borsa:
Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition. ICML 2023: 4009-4034
[c12]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/MoskovitzHTBS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/MoskovitzHTBS23
Ted Moskovitz, Samo Hromadka, Ahmed Touati, Diana Borsa, Maneesh Sahani:
A State Representation for Diminishing Rewards. NeurIPS 2023
[i20]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-00654
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-00654
Yash Chandak, Shantanu Thakoor, Zhaohan Daniel Guo, Yunhao Tang, Rémi Munos, Will Dabney, Diana L. Borsa:
Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition. CoRR abs/2305.00654 (2023)
[i19]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-03710
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-03710
Ted Moskovitz, Samo Hromadka, Ahmed Touati, Diana Borsa, Maneesh Sahani:
A State Representation for Diminishing Rewards. CoRR abs/2309.03710 (2023)
2022
[c11]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/PislarSOBS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/PislarSOBS22
Miruna Pislar, David Szepesvari, Georg Ostrovski, Diana L. Borsa, Tom Schaul:
When should agents explore? ICLR 2022
[c10]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/FilosVMFBFBS0O22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/FilosVMFBFBS0O22
Angelos Filos, Eszter Vértes, Zita Marinho, Gregory Farquhar, Diana Borsa, Abram L. Friesen, Feryal M. P. Behbahani, Tom Schaul, André Barreto, Simon Osindero:
Model-Value Inconsistency as a Signal for Epistemic Uncertainty. ICML 2022: 6474-6498
[c9]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/ThakoorRBDM022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/ThakoorRBDM022
Shantanu Thakoor, Mark Rowland, Diana Borsa, Will Dabney, Rémi Munos, André Barreto:
Generalised Policy Improvement with Geometric Policy Composition. ICML 2022: 21272-21307
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2202-09699
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-09699
Veronica Chelu, Diana Borsa, Doina Precup, Hado van Hasselt:
Selective Credit Assignment. CoRR abs/2202.09699 (2022)
[i17]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-08736
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-08736
Shantanu Thakoor, Mark Rowland, Diana Borsa, Will Dabney, Rémi Munos, André Barreto:
Generalised Policy Improvement with Geometric Policy Composition. CoRR abs/2206.08736 (2022)
2021
[c8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/HasseltMHSBB21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/HasseltMHSBB21
Hado van Hasselt, Sephora Madjiheurem, Matteo Hessel, David Silver, André Barreto, Diana Borsa:
Expected Eligibility Traces. AAAI 2021: 9997-10005
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2105-05347
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2105-05347
Tom Schaul, Georg Ostrovski, Iurii Kemaev, Diana Borsa:
Return-based Scaling: Yet Another Normalisation Trick for Deep RL. CoRR abs/2105.05347 (2021)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2106-13105
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-13105
André Barreto, Diana Borsa, Shaobo Hou, Gheorghe Comanici, Eser Aygün, Philippe Hamel, Daniel Toyama, Jonathan J. Hunt, Shibl Mourad, David Silver, Doina Precup:
The Option Keyboard: Combining Skills in Reinforcement Learning. CoRR abs/2106.13105 (2021)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2108-11811
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-11811
Miruna Pislar, David Szepesvari, Georg Ostrovski, Diana Borsa, Tom Schaul:
When should agents explore? CoRR abs/2108.11811 (2021)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2112-04153
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-04153
Angelos Filos, Eszter Vértes, Zita Marinho, Gregory Farquhar, Diana Borsa, Abram L. Friesen, Feryal M. P. Behbahani, Tom Schaul, André Barreto, Simon Osindero:
Model-Value Inconsistency as a Signal for Epistemic Uncertainty. CoRR abs/2112.04153 (2021)
2020
[b1]
- view
- export record
  dblp key:
  - phd/ethos/Borsa20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/phd/ethos/Borsa20
Diana Borsa:
Reinforcement learning in persistent environments: representation learning and transfer. University College London, UK, 2020
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/pnas/BarretoHBSP20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pnas/BarretoHBSP20
André Barreto, Shaobo Hou, Diana Borsa, David Silver, Doina Precup:
Fast reinforcement learning with generalized policy updates. Proc. Natl. Acad. Sci. USA 117(48): 30079-30087 (2020)
[c7]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/aistats/RowlandHHBSMD20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aistats/RowlandHHBSMD20
Mark Rowland, Anna Harutyunyan, Hado van Hasselt, Diana Borsa, Tom Schaul, Rémi Munos, Will Dabney:
Conditional Importance Sampling for Off-Policy Learning. AISTATS 2020: 45-55
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2007-01839
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-01839
Hado van Hasselt, Sephora Madjiheurem, Matteo Hessel, David Silver, André Barreto, Diana Borsa:
Expected Eligibility Traces. CoRR abs/2007.01839 (2020)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2010-02255
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-02255
Sebastian Flennerhag, Jane X. Wang, Pablo Sprechmann, Francesco Visin, Alexandre Galashov, Steven Kapturowski, Diana L. Borsa, Nicolas Heess, André Barreto, Razvan Pascanu:
Temporal Difference Uncertainties as a Signal for Exploration. CoRR abs/2010.02255 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c6]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/aistats/HarutyunyanDBHM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aistats/HarutyunyanDBHM19
Anna Harutyunyan, Will Dabney, Diana Borsa, Nicolas Heess, Rémi Munos, Doina Precup:
The Termination Critic. AISTATS 2019: 2231-2240
[c5]
- view
  - electronic edition @ acm.org
  - no references & citations available
- export record
  dblp key:
  - conf/atal/BorsaHPLHMP19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/BorsaHPLHMP19
Diana Borsa, Nicolas Heess, Bilal Piot, Siqi Liu, Leonard Hasenclever, Rémi Munos, Olivier Pietquin:
Observational Learning by Reinforcement Learning. AAMAS 2019: 1117-1124
[c4]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/BorsaBQMHMSS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/BorsaBQMHMSS19
Diana Borsa, André Barreto, John Quan, Daniel J. Mankowitz, Hado van Hasselt, Rémi Munos, David Silver, Tom Schaul:
Universal Successor Features Approximators. ICLR (Poster) 2019
[c3]
- view
- export record
  dblp key:
  - conf/nips/BarretoBHCAHTHM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/BarretoBHCAHTHM19
André Barreto, Diana Borsa, Shaobo Hou, Gheorghe Comanici, Eser Aygün, Philippe Hamel, Daniel Toyama, Jonathan J. Hunt, Shibl Mourad, David Silver, Doina Precup:
The Option Keyboard: Combining Skills in Reinforcement Learning. NeurIPS 2019: 13031-13041
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1901-10964
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1901-10964
André Barreto, Diana Borsa, John Quan, Tom Schaul, David Silver, Matteo Hessel, Daniel J. Mankowitz, Augustin Zídek, Rémi Munos:
Transfer in Deep Reinforcement Learning Using Successor Features and Generalised Policy Improvement. CoRR abs/1901.10964 (2019)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1902-09996
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-09996
Anna Harutyunyan, Will Dabney, Diana Borsa, Nicolas Heess, Rémi Munos, Doina Precup:
The Termination Critic. CoRR abs/1902.09996 (2019)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1904-11455
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-11455
Tom Schaul, Diana Borsa, Joseph Modayil, Razvan Pascanu:
Ray Interference: a Source of Plateaus in Deep Reinforcement Learning. CoRR abs/1904.11455 (2019)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1907-03687
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-03687
Hado van Hasselt, John Quan, Matteo Hessel, Zhongwen Xu, Diana Borsa, André Barreto:
General non-linear Bellman equations. CoRR abs/1907.03687 (2019)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1910-07479
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-07479
Mark Rowland, Anna Harutyunyan, Hado van Hasselt, Diana Borsa, Tom Schaul, Rémi Munos, Will Dabney:
Conditional Importance Sampling for Off-Policy Learning. CoRR abs/1910.07479 (2019)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1912-06910
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1912-06910
Tom Schaul, Diana Borsa, David Ding, David Szepesvari, Georg Ostrovski, Will Dabney, Simon Osindero:
Adapting Behaviour for Learning Progress. CoRR abs/1912.06910 (2019)
2018
[c2]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/BarretoBQSSHMZM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/BarretoBQSSHMZM18
André Barreto, Diana Borsa, John Quan, Tom Schaul, David Silver, Matteo Hessel, Daniel J. Mankowitz, Augustin Zídek, Rémi Munos:
Transfer in Deep Reinforcement Learning Using Successor Features and Generalised Policy Improvement. ICML 2018: 510-519
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1812-07626
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1812-07626
Diana Borsa, André Barreto, John Quan, Daniel J. Mankowitz, Rémi Munos, Hado van Hasselt, David Silver, Tom Schaul:
Universal Successor Features Approximators. CoRR abs/1812.07626 (2018)
2017
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/BorsaPMP17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/BorsaPMP17
Diana Borsa, Bilal Piot, Rémi Munos, Olivier Pietquin:
Observational Learning by Reinforcement Learning. CoRR abs/1706.06617 (2017)
2016
[c1]
- view
  - electronic edition @ auai.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/uai/GauntBB16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/uai/GauntBB16
Alex Gaunt, Diana Borsa, Yoram Bachrach:
Training Neural Nets to Aggregate Crowdsourced Responses. UAI 2016
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/BorsaGS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/BorsaGS16
Diana Borsa, Thore Graepel, John Shawe-Taylor:
Learning Shared Representations in Multi-task Reinforcement Learning. CoRR abs/1603.02041 (2016)
2015
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/BorsaGG15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/BorsaGG15
Diana Borsa, Thore Graepel, Andrew D. Gordon:
The Wreath Process: A totally generative model of geometric shape based on nested symmetries. CoRR abs/1506.03041 (2015)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.