


default search action
Joe Benton
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
- [i10]Mrinank Sharma, Meg Tong, Jesse Mu, Jerry Wei, Jorrit Kruthoff, Scott Goodfriend, Euan Ong, Alwin Peng, Raj Agarwal, Cem Anil, Amanda Askell, Nathan Bailey, Joe Benton, Emma Bluemke, Samuel R. Bowman, Eric Christiansen, Hoagy Cunningham, Andy Dau, Anjali Gopal, Rob Gilson, Logan Graham, Logan Howard, Nimit Kalra, Taesung Lee, Kevin Lin, Peter Lofgren, Francesco Mosconi, Clare O'Hara, Catherine Olsson, Linda Petrini, Samir Rajani, Nikhil Saxena, Alex Silverstein, Tanya Singh, Theodore R. Sumers, Leonard Tang, Kevin K. Troy, Constantin Weisser, Ruiqi Zhong, Giulio Zhou, Jan Leike, Jared Kaplan, Ethan Perez:
Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming. CoRR abs/2501.18837 (2025) - 2024
- [j2]Joe Benton, George Deligiannidis, Arnaud Doucet:
Error Bounds for Flow Matching Methods. Trans. Mach. Learn. Res. 2024 (2024) - [c3]Joe Benton, Valentin De Bortoli, Arnaud Doucet, George Deligiannidis:
Nearly d-Linear Convergence Bounds for Diffusion Models via Stochastic Localization. ICLR 2024 - [c2]Cem Anil, Esin Durmus, Nina Panickssery, Mrinank Sharma, Joe Benton, Sandipan Kundu, Joshua Batson, Meg Tong, Jesse Mu, Daniel Ford, Francesco Mosconi, Rajashree Agrawal, Rylan Schaeffer, Naomi Bashkansky, Samuel Svenningsen, Mike Lambert, Ansh Radhakrishnan, Carson Denison, Evan Hubinger, Yuntao Bai, Trenton Bricken, Timothy Maxwell, Nicholas Schiefer, James Sully, Alex Tamkin, Tamera Lanham, Karina Nguyen, Tomek Korbak, Jared Kaplan, Deep Ganguli, Samuel R. Bowman, Ethan Perez, Roger B. Grosse, David Kristjanson Duvenaud:
Many-shot Jailbreaking. NeurIPS 2024 - [i9]Rylan Schaeffer, Dan Valentine, Luke Bailey, James Chua, Cristóbal Eyzaguirre, Zane Durante, Joe Benton, Brando Miranda, Henry Sleight, John Hughes, Rajashree Agrawal, Mrinank Sharma, Scott Emmons, Sanmi Koyejo, Ethan Perez:
When Do Universal Image Jailbreaks Transfer Between Vision-Language Models? CoRR abs/2407.15211 (2024) - [i8]Joe Benton, Misha Wagner, Eric Christiansen, Cem Anil, Ethan Perez, Jai Srivastav, Esin Durmus, Deep Ganguli, Shauna Kravec, Buck Shlegeris, Jared Kaplan, Holden Karnofsky, Evan Hubinger, Roger Grosse, Samuel R. Bowman, David Duvenaud:
Sabotage Evaluations for Frontier Models. CoRR abs/2410.21514 (2024) - 2023
- [j1]Kamélia Daudel, Joe Benton, Yuyang Shi, Arnaud Doucet:
Alpha-divergence Variational Inference Meets Importance Weighted Auto-Encoders: Methodology and Asymptotics. J. Mach. Learn. Res. 24: 243:1-243:83 (2023) - [i7]Joe Benton, George Deligiannidis
, Arnaud Doucet:
Error Bounds for Flow Matching Methods. CoRR abs/2305.16860 (2023) - [i6]Joe Benton, Valentin De Bortoli, Arnaud Doucet, George Deligiannidis
:
Linear Convergence Bounds for Diffusion Models via Stochastic Localization. CoRR abs/2308.03686 (2023) - [i5]Mingyang Deng, Lucas Tao, Joe Benton:
Measuring Feature Sparsity in Language Models. CoRR abs/2310.07837 (2023) - 2022
- [c1]Andrew Campbell, Joe Benton, Valentin De Bortoli, Thomas Rainforth, George Deligiannidis, Arnaud Doucet:
A Continuous Time Framework for Discrete Denoising Models. NeurIPS 2022 - [i4]Andrew Campbell, Joe Benton, Valentin De Bortoli, Tom Rainforth, George Deligiannidis
, Arnaud Doucet:
A Continuous Time Framework for Discrete Denoising Models. CoRR abs/2205.14987 (2022) - [i3]Adam Scherlis, Kshitij Sachan, Adam S. Jermyn, Joe Benton, Buck Shlegeris:
Polysemanticity and Capacity in Neural Networks. CoRR abs/2210.01892 (2022) - [i2]Kamélia Daudel, Joe Benton, Yuyang Shi, Arnaud Doucet:
Alpha-divergence Variational Inference Meets Importance Weighted Auto-Encoders: Methodology and Asymptotics. CoRR abs/2210.06226 (2022) - [i1]Joe Benton, Yuyang Shi, Valentin De Bortoli, George Deligiannidis
, Arnaud Doucet:
From Denoising Diffusions to Denoising Markov Models. CoRR abs/2211.03595 (2022)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-02-28 23:08 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint