David Krueger 0001

David Scott Krueger

Person information

affiliation: University of Cambridge, UK
affiliation (former): University of Montréal, MILA, Canada

2020 – today

2024
[j2]
  - https://dblp.org/rec/journals/tmlr/Siddiqui0LD24
Shoaib Ahmed Siddiqui, David Krueger, Yann LeCun, Stéphane Deny:
Blockwise Self-Supervised Learning at Scale. Trans. Mach. Learn. Res. 2024 (2024)
[c23]
  - https://dblp.org/rec/conf/fat/ChanEKWHBBRKKHA24
Alan Chan, Carson Ezell, Max Kaufmann, Kevin Wei, Lewis Hammond, Herbie Bradley, Emma Bluemke, Nitarshan Rajkumar, David Krueger, Noam Kolt, Lennart Heim, Markus Anderljung:
Visibility into AI Agents. FAccT 2024: 958-973
[c22]
  - https://dblp.org/rec/conf/fat/CasperESKCBHWSH24
Stephen Casper, Carson Ezell, Charlotte Siegmann, Noam Kolt, Taylor Lynn Curtis, Benjamin Bucknall, Andreas A. Haupt, Kevin Wei, Jérémy Scheurer, Marius Hobbhahn, Lee Sharkey, Satyapriya Krishna, Marvin Von Hagen, Silas Alberti, Alan Chan, Qinyi Sun, Michael Gerovitch, David Bau, Max Tegmark, David Krueger, Dylan Hadfield-Menell:
Black-Box Access is Insufficient for Rigorous AI Audits. FAccT 2024: 2254-2272
[c21]
  - https://dblp.org/rec/conf/iclr/CosteAK024
Thomas Coste, Usman Anwar, Robert Kirk, David Krueger:
Reward Model Ensembles Help Mitigate Overoptimization. ICLR 2024
[c20]
  - https://dblp.org/rec/conf/iclr/JainKLDTRGK24
Samyak Jain, Robert Kirk, Ekdeep Singh Lubana, Robert P. Dick, Hidenori Tanaka, Tim Rocktäschel, Edward Grefenstette, David Scott Krueger:
Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks. ICLR 2024
[c19]
  - https://dblp.org/rec/conf/icml/KrasheninnikovK24
Dmitrii Krasheninnikov, Egor Krasheninnikov, Bruno Kacper Mlodozeniec, Tegan Maharaj, David Krueger:
Implicit meta-learning may lead language models to trust more reliable sources. ICML 2024
[i48]
  - https://dblp.org/rec/journals/corr/abs-2401-13138
Alan Chan, Carson Ezell, Max Kaufmann, Kevin Wei, Lewis Hammond, Herbie Bradley, Emma Bluemke, Nitarshan Rajkumar, David Krueger, Noam Kolt, Lennart Heim, Markus Anderljung:
Visibility into AI Agents. CoRR abs/2401.13138 (2024)
[i47]
  - https://dblp.org/rec/journals/corr/abs-2401-14446
Stephen Casper, Carson Ezell, Charlotte Siegmann, Noam Kolt, Taylor Lynn Curtis, Benjamin Bucknall, Andreas Alexander Haupt, Kevin Wei, Jérémy Scheurer, Marius Hobbhahn, Lee Sharkey, Satyapriya Krishna, Marvin Von Hagen, Silas Alberti, Alan Chan, Qinyi Sun, Michael Gerovitch, David Bau, Max Tegmark, David Krueger, Dylan Hadfield-Menell:
Black-Box Access is Insufficient for Rigorous AI Audits. CoRR abs/2401.14446 (2024)
[i46]
  - https://dblp.org/rec/journals/corr/abs-2403-01946
James Urquhart Allingham, Bruno Kacper Mlodozeniec, Shreyas Padhy, Javier Antorán, David Krueger, Richard E. Turner, Eric T. Nalisnick, José Miguel Hernández-Lobato:
A Generative Model of Symmetry Transformations. CoRR abs/2403.01946 (2024)
[i45]
  - https://dblp.org/rec/journals/corr/abs-2403-10462
Joshua Clymer, Nick Gabrieli, David Krueger, Thomas Larsen:
Safety Cases: How to Justify the Safety of Advanced AI Systems. CoRR abs/2403.10462 (2024)
[i44]
  - https://dblp.org/rec/journals/corr/abs-2404-09932
Usman Anwar, Abulhair Saparov, Javier Rando, Daniel Paleka, Miles Turpin, Peter Hase, Ekdeep Singh Lubana, Erik Jenner, Stephen Casper, Oliver Sourbut, Benjamin L. Edelman, Zhaowei Zhang, Mario Günther, Anton Korinek, José Hernández-Orallo, Lewis Hammond, Eric J. Bigelow, Alexander Pan, Lauro Langosco, Tomasz Korbak, Heidi Zhang, Ruiqi Zhong, Seán Ó hÉigeartaigh, Gabriel Recchia, Giulio Corsi, Alan Chan, Markus Anderljung, Lilian Edwards, Yoshua Bengio, Danqi Chen, Samuel Albanie, Tegan Maharaj, Jakob N. Foerster, Florian Tramèr, He He, Atoosa Kasirzadeh, Yejin Choi, David Krueger:
Foundational Challenges in Assuring Alignment and Safety of Large Language Models. CoRR abs/2404.09932 (2024)
[i43]
  - https://dblp.org/rec/journals/corr/abs-2405-19550
Ryan Greenblatt, Fabien Roger, Dmitrii Krasheninnikov, David Krueger:
Stress-Testing Capability Elicitation With Password-Locked Models. CoRR abs/2405.19550 (2024)
[i42]
  - https://dblp.org/rec/journals/corr/abs-2406-12137
Alan Chan, Noam Kolt, Peter Wills, Usman Anwar, Christian Schröder de Witt, Nitarshan Rajkumar, Lewis Hammond, David Krueger, Lennart Heim, Markus Anderljung:
IDs for AI Systems. CoRR abs/2406.12137 (2024)
[i41]
  - https://dblp.org/rec/journals/corr/abs-2406-15371
Akash R. Wasil, Joshua Clymer, David Krueger, Emily Dardaman, Simeon Campos, Evan R. Murphy:
Affirmative safety: An approach to risk management for high-risk AI. CoRR abs/2406.15371 (2024)
[i40]
  - https://dblp.org/rec/journals/corr/abs-2406-15753
Lukas Fluri, Leon Lang, Alessandro Abate, Patrick Forré, David Krueger, Joar Skalse:
The Perils of Optimizing Learned Reward Functions: Low Training Error Does Not Guarantee Low Regret. CoRR abs/2406.15753 (2024)
[i39]
  - https://dblp.org/rec/journals/corr/abs-2407-16286
Shoaib Ahmed Siddiqui, Xin Dong, Greg Heinrich, Thomas M. Breuel, Jan Kautz, David Krueger, Pavlo Molchanov:
A deeper look at depth pruning of LLMs. CoRR abs/2407.16286 (2024)
[i38]
  - https://dblp.org/rec/journals/corr/abs-2408-13221
Neel Alex, Shoaib Ahmed Siddiqui, Amartya Sanyal, David Krueger:
Protecting against simultaneous data poisoning attacks. CoRR abs/2408.13221 (2024)
[i37]
  - https://dblp.org/rec/journals/corr/abs-2409-05800
Jakub Vrábel, Ori Shem-Ur, Yaron Oz, David Krueger:
Input Space Mode Connectivity in Deep Neural Networks. CoRR abs/2409.05800 (2024)
[i36]
  - https://dblp.org/rec/journals/corr/abs-2410-03055
Shoaib Ahmed Siddiqui, Radhika Gaonkar, Boris Köpf, David Krueger, Andrew Paverd, Ahmed Salem, Shruti Tople, Lukas Wutschitz, Menglin Xia, Santiago Zanella Béguelin:
Permissive Information-Flow Analysis for Large Language Models. CoRR abs/2410.03055 (2024)
2023
[j1]
  - https://dblp.org/rec/journals/tmlr/CasperDSGSRFKLF23
Stephen Casper, Xander Davies, Claudia Shi, Thomas Krendl Gilbert, Jérémy Scheurer, Javier Rando, Rachel Freedman, Tomasz Korbak, David Lindner, Pedro Freire, Tony Tong Wang, Samuel Marks, Charbel-Raphaël Ségerie, Micah Carroll, Andi Peng, Phillip J. K. Christoffersen, Mehul Damani, Stewart Slocum, Usman Anwar, Anand Siththaranjan, Max Nadeau, Eric J. Michaud, Jacob Pfau, Dmitrii Krasheninnikov, Xin Chen, Lauro Langosco, Peter Hase, Erdem Biyik, Anca D. Dragan, David Krueger, Dorsa Sadigh, Dylan Hadfield-Menell:
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback. Trans. Mach. Learn. Res. 2023 (2023)
[c18]
  - https://dblp.org/rec/conf/eaamo/CarrollCAK23
Micah Carroll, Alan Chan, Henry Ashton, David Krueger:
Characterizing Manipulation from AI Systems. EAAMO 2023: 6:1-6:13
[c17]
  - https://dblp.org/rec/conf/fat/ChanSMPRKLHDCLM23
Alan Chan, Rebecca Salganik, Alva Markelius, Chris Pang, Nitarshan Rajkumar, Dmitrii Krasheninnikov, Lauro Langosco, Zhonghao He, Yawen Duan, Micah Carroll, Michelle Lin, Alex Mayhew, Katherine M. Collins, Maryam Molamohammadi, John Burden, Wanru Zhao, Shalaleh Rismani, Konstantinos Voudouris, Umang Bhatt, Adrian Weller, David Krueger, Tegan Maharaj:
Harms from Increasingly Agentic Algorithmic Systems. FAccT 2023: 651-666
[c16]
  - https://dblp.org/rec/conf/iclr/CaballeroGRK23
Ethan Caballero, Kshitij Gupta, Irina Rish, David Krueger:
Broken Neural Scaling Laws. ICLR 2023
[c15]
  - https://dblp.org/rec/conf/iclr/SiddiquiRMKH23
Shoaib Ahmed Siddiqui, Nitarshan Rajkumar, Tegan Maharaj, David Krueger, Sara Hooker:
Metadata Archaeology: Unearthing Data Subsets by Leveraging Training Dynamics. ICLR 2023
[c14]
  - https://dblp.org/rec/conf/icml/LubanaBDKT23
Ekdeep Singh Lubana, Eric J. Bigelow, Robert P. Dick, David Scott Krueger, Hidenori Tanaka:
Mechanistic Mode Connectivity. ICML 2023: 22965-23004
[c13]
  - https://dblp.org/rec/conf/nips/ChungAK23
Stephen Chung, Ivan Anokhin, David Krueger:
Thinker: Learning to Plan and Act. NeurIPS 2023
[c12]
  - https://dblp.org/rec/conf/unireps/WuLMKK23
Cindy Wu, Ekdeep Singh Lubana, Bruno Kacper Mlodozeniec, Robert Kirk, David Krueger:
What Mechanisms Does Knowledge Distillation Distill? UniReps 2023: 60-75
[i35]
  - https://dblp.org/rec/journals/corr/abs-2301-03652
Lev McKinney, Yawen Duan, David Krueger, Adam Gleave:
On The Fragility of Learned Reward Functions. CoRR abs/2301.03652 (2023)
[i34]
  - https://dblp.org/rec/journals/corr/abs-2302-01647
Shoaib Ahmed Siddiqui, David Krueger, Yann LeCun, Stéphane Deny:
Blockwise Self-Supervised Learning at Scale. CoRR abs/2302.01647 (2023)
[i33]
  - https://dblp.org/rec/journals/corr/abs-2302-10329
Alan Chan, Rebecca Salganik, Alva Markelius, Chris Pang, Nitarshan Rajkumar, Dmitrii Krasheninnikov, Lauro Langosco, Zhonghao He, Yawen Duan, Micah Carroll, Michelle Lin, Alex Mayhew, Katherine M. Collins, Maryam Molamohammadi, John Burden, Wanru Zhao, Shalaleh Rismani, Konstantinos Voudouris, Umang Bhatt, Adrian Weller, David Krueger, Tegan Maharaj:
Harms from Increasingly Agentic Algorithmic Systems. CoRR abs/2302.10329 (2023)
[i32]
  - https://dblp.org/rec/journals/corr/abs-2303-06173
Xander Davies, Lauro Langosco, David Krueger:
Unifying Grokking and Double Descent. CoRR abs/2303.06173 (2023)
[i31]
  - https://dblp.org/rec/journals/corr/abs-2303-09387
Micah Carroll, Alan Chan, Henry Ashton, David Krueger:
Characterizing Manipulation from AI Systems. CoRR abs/2303.09387 (2023)
[i30]
  - https://dblp.org/rec/journals/corr/abs-2304-09358
Shoaib Ahmed Siddiqui, David Krueger, Thomas M. Breuel:
Investigating the Nature of 3D Generalization in Deep Neural Networks. CoRR abs/2304.09358 (2023)
[i29]
  - https://dblp.org/rec/journals/corr/abs-2307-14993
Stephen Chung, Ivan Anokhin, David Krueger:
Thinker: Learning to Plan and Act. CoRR abs/2307.14993 (2023)
[i28]
  - https://dblp.org/rec/journals/corr/abs-2307-15217
Stephen Casper, Xander Davies, Claudia Shi, Thomas Krendl Gilbert, Jérémy Scheurer, Javier Rando, Rachel Freedman, Tomasz Korbak, David Lindner, Pedro Freire, Tony Tong Wang, Samuel Marks, Charbel-Raphaël Ségerie, Micah Carroll, Andi Peng, Phillip J. K. Christoffersen, Mehul Damani, Stewart Slocum, Usman Anwar, Anand Siththaranjan, Max Nadeau, Eric J. Michaud, Jacob Pfau, Dmitrii Krasheninnikov, Xin Chen, Lauro Langosco, Peter Hase, Erdem Biyik, Anca D. Dragan, David Krueger, Dorsa Sadigh, Dylan Hadfield-Menell:
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback. CoRR abs/2307.15217 (2023)
[i27]
  - https://dblp.org/rec/journals/corr/abs-2310-02743
Thomas Coste, Usman Anwar, Robert Kirk, David Krueger:
Reward Model Ensembles Help Mitigate Overoptimization. CoRR abs/2310.02743 (2023)
[i26]
  - https://dblp.org/rec/journals/corr/abs-2310-15047
Dmitrii Krasheninnikov, Egor Krasheninnikov, Bruno Mlodozeniec, David Krueger:
Meta- (out-of-context) learning in neural networks. CoRR abs/2310.15047 (2023)
[i25]
  - https://dblp.org/rec/journals/corr/abs-2310-17688
Yoshua Bengio, Geoffrey E. Hinton, Andrew Yao, Dawn Song, Pieter Abbeel, Yuval Noah Harari, Ya-Qin Zhang, Lan Xue, Shai Shalev-Shwartz, Gillian K. Hadfield, Jeff Clune, Tegan Maharaj, Frank Hutter, Atilim Günes Baydin, Sheila A. McIlraith, Qiqi Gao, Ashwin Acharya, David Krueger, Anca D. Dragan, Philip H. S. Torr, Stuart Russell, Daniel Kahneman, Jan Brauner, Sören Mindermann:
Managing AI Risks in an Era of Rapid Progress. CoRR abs/2310.17688 (2023)
[i24]
  - https://dblp.org/rec/journals/corr/abs-2311-12786
Samyak Jain, Robert Kirk, Ekdeep Singh Lubana, Robert P. Dick, Hidenori Tanaka, Edward Grefenstette, Tim Rocktäschel, David Scott Krueger:
Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks. CoRR abs/2311.12786 (2023)
[i23]
  - https://dblp.org/rec/journals/corr/abs-2312-14751
Alan Chan, Ben Bucknall, Herbie Bradley, David Krueger:
Hazards from Increasingly Accessible Fine-Tuning of Downloadable Foundation Models. CoRR abs/2312.14751 (2023)
2022
[c11]
  - https://dblp.org/rec/conf/icml/LangoscoKSPK22
Lauro Langosco di Langosco, Jack Koch, Lee D. Sharkey, Jacob Pfau, David Krueger:
Goal Misgeneralization in Deep Reinforcement Learning. ICML 2022: 12004-12019
[c10]
  - https://dblp.org/rec/conf/nips/SkalseHKK22
Joar Skalse, Nikolaus H. R. Howe, Dmitrii Krasheninnikov, David Krueger:
Defining and Characterizing Reward Gaming. NeurIPS 2022
[i22]
  - https://dblp.org/rec/journals/corr/abs-2209-10015
Shoaib Ahmed Siddiqui, Nitarshan Rajkumar, Tegan Maharaj, David Krueger, Sara Hooker:
Metadata Archaeology: Unearthing Data Subsets by Leveraging Training Dynamics. CoRR abs/2209.10015 (2022)
[i21]
  - https://dblp.org/rec/journals/corr/abs-2209-13085
Joar Skalse, Nikolaus H. R. Howe, Dmitrii Krasheninnikov, David Krueger:
Defining and Characterizing Reward Hacking. CoRR abs/2209.13085 (2022)
[i20]
  - https://dblp.org/rec/journals/corr/abs-2210-03150
Adam Ibrahim, Charles Guille-Escuret, Ioannis Mitliagkas, Irina Rish, David Krueger, Pouya Bashivan:
Towards Out-of-Distribution Adversarial Robustness. CoRR abs/2210.03150 (2022)
[i19]
  - https://dblp.org/rec/journals/corr/abs-2210-14891
Ethan Caballero, Kshitij Gupta, Irina Rish, David Krueger:
Broken Neural Scaling Laws. CoRR abs/2210.14891 (2022)
[i18]
  - https://dblp.org/rec/journals/corr/abs-2211-08422
Ekdeep Singh Lubana, Eric J. Bigelow, Robert P. Dick, David Scott Krueger, Hidenori Tanaka:
Mechanistic Mode Connectivity. CoRR abs/2211.08422 (2022)
[i17]
  - https://dblp.org/rec/journals/corr/abs-2211-14827
Alan Clark, Shoaib Ahmed Siddiqui, Robert Kirk, Usman Anwar, Stephen Chung, David Krueger:
Domain Generalization for Robust Model-Based Offline Reinforcement Learning. CoRR abs/2211.14827 (2022)
2021
[c9]
  - https://dblp.org/rec/conf/icml/KruegerCJ0BZPC21
David Krueger, Ethan Caballero, Jörn-Henrik Jacobsen, Amy Zhang, Jonathan Binas, Dinghuai Zhang, Rémi Le Priol, Aaron C. Courville:
Out-of-Distribution Generalization via Risk Extrapolation (REx). ICML 2021: 5815-5826
[i16]
  - https://dblp.org/rec/journals/corr/abs-2112-07773
Shahar Avin, Haydn Belfield, Miles Brundage, Gretchen Krueger, Jasmine Wang, Adrian Weller, Markus Anderljung, Igor Krawczuk, David Krueger, Jonathan Lebensold, Tegan Maharaj, Noa Zilberman:
Filling gaps in trustworthy development of AI. CoRR abs/2112.07773 (2021)
[i15]
  - https://dblp.org/rec/journals/corr/abs-2112-13734
Enoch Tetteh, Joseph D. Viviano, Yoshua Bengio, David Krueger, Joseph Paul Cohen:
Multi-Domain Balanced Sampling Improves Out-of-Distribution Generalization of Chest X-ray Pathology Prediction Models. CoRR abs/2112.13734 (2021)
2020
[i14]
  - https://dblp.org/rec/journals/corr/abs-2003-00688
David Krueger, Ethan Caballero, Jörn-Henrik Jacobsen, Amy Zhang, Jonathan Binas, Rémi Le Priol, Aaron C. Courville:
Out-of-Distribution Generalization via Risk Extrapolation (REx). CoRR abs/2003.00688 (2020)
[i13]
  - https://dblp.org/rec/journals/corr/abs-2004-07213
Miles Brundage, Shahar Avin, Jasmine Wang, Haydn Belfield, Gretchen Krueger, Gillian K. Hadfield, Heidy Khlaaf, Jingying Yang, Helen Toner, Ruth Fong, Tegan Maharaj, Pang Wei Koh, Sara Hooker, Jade Leung, Andrew Trask, Emma Bluemke, Jonathan Lebensold, Cullen O'Keefe, Mark Koren, Théo Ryffel, J. B. Rubinovitz, Tamay Besiroglu, Federica Carugati, Jack Clark, Peter Eckersley, Sarah de Haas, Maritza Johnson, Ben Laurie, Alex Ingerman, Igor Krawczuk, Amanda Askell, Rosario Cammarota, Andrew Lohn, David Krueger, Charlotte Stix, Peter Henderson, Logan Graham, Carina Prunkl, Bianca Martin, Elizabeth Seger, Noa Zilberman, Seán Ó hÉigeartaigh, Frens Kroeger, Girish Sastry, Rebecca Kagan, Adrian Weller, Brian Tse, Elizabeth Barnes, Allan Dafoe, Paul Scharre, Ariel Herbert-Voss, Martijn Rasser, Shagun Sodhani, Carrick Flynn, Thomas Krendl Gilbert, Lisa Dyer, Saif Khan, Yoshua Bengio, Markus Anderljung:
Toward Trustworthy AI Development: Mechanisms for Supporting Verifiable Claims. CoRR abs/2004.07213 (2020)
[i12]
  - https://dblp.org/rec/journals/corr/abs-2006-04948
Andrew Critch, David Krueger:
AI Research Considerations for Human Existential Safety (ARCHES). CoRR abs/2006.04948 (2020)
[i11]
  - https://dblp.org/rec/journals/corr/abs-2009-09153
David Krueger, Tegan Maharaj, Jan Leike:
Hidden Incentives for Auto-Induced Distributional Shift. CoRR abs/2009.09153 (2020)
[i10]
  - https://dblp.org/rec/journals/corr/abs-2011-06709
David Krueger, Jan Leike, Owain Evans, John Salvatier:
Active Reinforcement Learning: Observing Rewards at a Cost. CoRR abs/2011.06709 (2020)

2010 – 2019

2018
[c8]
  - https://dblp.org/rec/conf/icml/HuangKLC18
Chin-Wei Huang, David Krueger, Alexandre Lacoste, Aaron C. Courville:
Neural Autoregressive Flows. ICML 2018: 2083-2092
[i9]
  - https://dblp.org/rec/journals/corr/abs-1801-10308
Joel Ruben Antony Moniz, David Krueger:
Nested LSTMs. CoRR abs/1801.10308 (2018)
[i8]
  - https://dblp.org/rec/journals/corr/abs-1804-00779
Chin-Wei Huang, David Krueger, Alexandre Lacoste, Aaron C. Courville:
Neural Autoregressive Flows. CoRR abs/1804.00779 (2018)
[i7]
  - https://dblp.org/rec/journals/corr/abs-1806-07528
Alexandre Lacoste, Boris N. Oreshkin, Wonchang Chung, Thomas Boquet, Negar Rostamzadeh, David Krueger:
Uncertainty in Multitask Transfer Learning. CoRR abs/1806.07528 (2018)
[i6]
  - https://dblp.org/rec/journals/corr/abs-1811-07871
Jan Leike, David Krueger, Tom Everitt, Miljan Martic, Vishal Maini, Shane Legg:
Scalable agent alignment via reward modeling: a research direction. CoRR abs/1811.07871 (2018)
2017
[c7]
  - https://dblp.org/rec/conf/acml/MonizK17
Joel Ruben Antony Moniz, David Krueger:
Nested LSTMs. ACML 2017: 530-544
[c6]
  - https://dblp.org/rec/conf/iclr/KruegerBJAKMBFC17
David Krueger, Nicolas Ballas, Stanislaw Jastrzebski, Devansh Arpit, Maxinder S. Kanwal, Tegan Maharaj, Emmanuel Bengio, Asja Fischer, Aaron C. Courville:
Deep Nets Don't Learn via Memorization. ICLR (Workshop) 2017
[c5]
  - https://dblp.org/rec/conf/iclr/KruegerMKPBKGBC17
David Krueger, Tegan Maharaj, János Kramár, Mohammad Pezeshki, Nicolas Ballas, Nan Rosemary Ke, Anirudh Goyal, Yoshua Bengio, Aaron C. Courville, Christopher J. Pal:
Zoneout: Regularizing RNNs by Randomly Preserving Hidden Activations. ICLR (Poster) 2017
[c4]
  - https://dblp.org/rec/conf/icml/ArpitJBKBKMFCBL17
Devansh Arpit, Stanislaw Jastrzebski, Nicolas Ballas, David Krueger, Emmanuel Bengio, Maxinder S. Kanwal, Tegan Maharaj, Asja Fischer, Aaron C. Courville, Yoshua Bengio, Simon Lacoste-Julien:
A Closer Look at Memorization in Deep Networks. ICML 2017: 233-242
[i5]
  - https://dblp.org/rec/journals/corr/ArpitJBKBKMFCBL17
Devansh Arpit, Stanislaw Jastrzebski, Nicolas Ballas, David Krueger, Emmanuel Bengio, Maxinder S. Kanwal, Tegan Maharaj, Asja Fischer, Aaron C. Courville, Yoshua Bengio, Simon Lacoste-Julien:
A Closer Look at Memorization in Deep Networks. CoRR abs/1706.05394 (2017)
[i4]
  - https://dblp.org/rec/journals/corr/abs-1710-04759
David Krueger, Chin-Wei Huang, Riashat Islam, Ryan Turner, Alexandre Lacoste, Aaron C. Courville:
Bayesian Hypernetworks. CoRR abs/1710.04759 (2017)
[i3]
  - https://dblp.org/rec/journals/corr/abs-1712-05016
Alexandre Lacoste, Thomas Boquet, Negar Rostamzadeh, Boris N. Oreshkin, Wonchang Chung, David Krueger:
Deep Prior. CoRR abs/1712.05016 (2017)
2016
[c3]
  - https://dblp.org/rec/journals/corr/KruegerM15
David Krueger, Roland Memisevic:
Regularizing RNNs by Stabilizing Activations. ICLR 2016
[i2]
  - https://dblp.org/rec/journals/corr/KruegerMKPBKGBL16
David Krueger, Tegan Maharaj, János Kramár, Mohammad Pezeshki, Nicolas Ballas, Nan Rosemary Ke, Anirudh Goyal, Yoshua Bengio, Hugo Larochelle, Aaron C. Courville, Chris Pal:
Zoneout: Regularizing RNNs by Randomly Preserving Hidden Activations. CoRR abs/1606.01305 (2016)
2015
[c2]
  - https://dblp.org/rec/journals/corr/DinhKB14
Laurent Dinh, David Krueger, Yoshua Bengio:
NICE: Non-linear Independent Components Estimation. ICLR (Workshop) 2015
[c1]
  - https://dblp.org/rec/journals/corr/MemisevicKK14
Roland Memisevic, Kishore Reddy Konda, David Krueger:
Zero-bias autoencoders and the benefits of co-adapting features. ICLR (Poster) 2015
[i1]
  - https://dblp.org/rec/journals/corr/BachmanKP15
Philip Bachman, David Krueger, Doina Precup:
Testing Visual Attention in Dynamic Environments. CoRR abs/1510.08949 (2015)
