Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 240 results for author: Russell, S

.
  1. arXiv:2407.19277  [pdf, other

    physics.bio-ph cond-mat.soft cond-mat.stat-mech

    Predicting the Progression of Cancerous Tumors in Mice: A Machine and Deep Learning Intuition

    Authors: Amit K Chattopadhyay, Aimee Pascaline N Unkundiye, Gillian Pearce, Steven Russell

    Abstract: The study explores Artificial Intelligence (AI) powered modeling to predict the evolution of cancer tumor cells in mice under different forms of treatment. The AI models are analyzed against varying ambient and systemic parameters, e.g. drug dosage, volume of the cancer cell mass, and time taken to destroy the cancer cell mass. The data required for the analysis have been synthetically extracted f… ▽ More

    Submitted 31 July, 2024; v1 submitted 27 July, 2024; originally announced July 2024.

    Comments: 7 figures, 24 pages

    Journal ref: Annals of Biostatistics and Biometric Applications 2024

  2. arXiv:2407.03459  [pdf, other

    cond-mat.mtrl-sci cond-mat.str-el

    Quantum decoherence by magnetic fluctuations in a candidate axion insulator

    Authors: Ruben Saatjian, Kohtaro Yamakawa, Ryan S. Russell, James G. Analytis, John W. Harter

    Abstract: In magnetic topological insulators, spontaneous time-reversal symmetry breaking by intrinsic magnetic order can open an energy gap in the topological surface spectrum. In the resulting state, exotic properties like axion electrodynamics, the quantum anomalous Hall effect, and other topological magnetoelectric responses are expected to emerge. A detailed understanding of the magnetic order and its… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  3. arXiv:2406.19501  [pdf, other

    cs.CL cs.LG

    Monitoring Latent World States in Language Models with Propositional Probes

    Authors: Jiahai Feng, Stuart Russell, Jacob Steinhardt

    Abstract: Language models are susceptible to bias, sycophancy, backdoors, and other tendencies that lead to unfaithful responses to the input context. Interpreting internal states of language models could help monitor and correct unfaithful behavior. We hypothesize that language models represent their input contexts in a latent world model, and seek to extract this latent world state from the activations. W… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  4. arXiv:2406.10026  [pdf

    physics.optics

    Retiming dynamics of harmonically modelocked laser solitons in a self-driven optomechanical lattice

    Authors: Xiaocong Wang, Benhai Wang, Wenbin He, Xintong Zhang, Qi Huang, Zhiyuan Huang, Xin Jiang, Philip St. J. Russell, Meng Pang

    Abstract: Harmonic mode-locking, realized actively or passively, is an effective technique for increasing the repetition rate of lasers, with important applications in optical sampling, laser micro-machining and frequency metrology. It is critically important to understand how a harmonically mode-locked pulse train responds to external perturbations and noise, so as to make sure that it is stable and resist… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  5. arXiv:2406.00877  [pdf, other

    cs.LG cs.AI

    Evidence of Learned Look-Ahead in a Chess-Playing Neural Network

    Authors: Erik Jenner, Shreyas Kapur, Vasil Georgiev, Cameron Allen, Scott Emmons, Stuart Russell

    Abstract: Do neural networks learn to implement algorithms such as look-ahead or search "in the wild"? Or do they rely purely on collections of simple heuristics? We present evidence of learned look-ahead in the policy network of Leela Chess Zero, the currently strongest neural chess engine. We find that Leela internally represents future optimal moves and that these representations are crucial for its fina… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: Project page: https://leela-interp.github.io/

  6. arXiv:2405.20519  [pdf, other

    cs.AI

    Diffusion On Syntax Trees For Program Synthesis

    Authors: Shreyas Kapur, Erik Jenner, Stuart Russell

    Abstract: Large language models generate code one token at a time. Their autoregressive generation process lacks the feedback of observing the program's output. Training LLMs to suggest edits directly can be challenging due to the scarcity of rich edit data. To address these problems, we propose neural diffusion models that operate on syntax trees of any context-free grammar. Similar to image diffusion mode… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: https://tree-diffusion.github.io

  7. arXiv:2405.17713  [pdf, other

    cs.AI cs.LG

    AI Alignment with Changing and Influenceable Reward Functions

    Authors: Micah Carroll, Davis Foote, Anand Siththaranjan, Stuart Russell, Anca Dragan

    Abstract: Existing AI alignment approaches assume that preferences are static, which is unrealistic: our preferences change, and may even be influenced by our interactions with AI systems themselves. To clarify the consequences of incorrectly assuming static preferences, we introduce Dynamic Reward Markov Decision Processes (DR-MDPs), which explicitly model preference changes and the AI's influence on them.… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: Accepted to ICML 2024

  8. arXiv:2405.06624  [pdf, other

    cs.AI

    Towards Guaranteed Safe AI: A Framework for Ensuring Robust and Reliable AI Systems

    Authors: David "davidad" Dalrymple, Joar Skalse, Yoshua Bengio, Stuart Russell, Max Tegmark, Sanjit Seshia, Steve Omohundro, Christian Szegedy, Ben Goldhaber, Nora Ammann, Alessandro Abate, Joe Halpern, Clark Barrett, Ding Zhao, Tan Zhi-Xuan, Jeannette Wing, Joshua Tenenbaum

    Abstract: Ensuring that AI systems reliably and robustly avoid harmful or dangerous behaviours is a crucial challenge, especially for AI systems with a high degree of autonomy and general intelligence, or systems used in safety-critical contexts. In this paper, we will introduce and define a family of approaches to AI safety, which we will refer to as guaranteed safe (GS) AI. The core feature of these appro… ▽ More

    Submitted 8 July, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

  9. arXiv:2405.04669  [pdf, other

    cs.LG cs.CL

    Towards a Theoretical Understanding of the 'Reversal Curse' via Training Dynamics

    Authors: Hanlin Zhu, Baihe Huang, Shaolun Zhang, Michael Jordan, Jiantao Jiao, Yuandong Tian, Stuart Russell

    Abstract: Auto-regressive large language models (LLMs) show impressive capacities to solve many complex reasoning tasks while struggling with some simple logical reasoning tasks such as inverse search: when trained on ''A is B'', LLM fails to directly conclude ''B is A'' during inference, which is known as the ''reversal curse'' (Berglund et al., 2023). In this paper, we theoretically analyze the reversal c… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 40 pages, 15 figures

  10. arXiv:2404.16182  [pdf

    physics.optics physics.app-ph

    Optomagnetic forces on YIG/YFeO3 microspheres levitated in chiral hollow-core photonic crystal fibre

    Authors: Soumya Chakraborty, Gordon K. L. Wong, Ferdi Oda, Vanessa Wachter, Silvia Viola Kusminskiy, Tadahiro Yokosawa, Sabine Hübner, Benjamin Apeleo Zubiri, Erdmann Spiecker, Monica Distaso, Philip St. J. Russell, Nicolas Y. Joly

    Abstract: We explore a magnetooptomechanical system consisting of a single magnetic microparticle optically levitated within the core of a helically twisted single-ring hollow-core photonic crystal fibre. We use newly-developed magnetic particles that have a core of antiferromagnetic yttrium-ortho-ferrite (YFeO3) and a shell of ferrimagnetic YIG (Y3Fe5O12) approximately 50 nm thick. Using a 632.8 nm probe b… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  11. arXiv:2404.12536  [pdf

    astro-ph.EP astro-ph.IM

    Asteroid (101955) Bennu in the Laboratory: Properties of the Sample Collected by OSIRIS-REx

    Authors: Dante S. Lauretta, Harold C. Connolly, Jr., Joseph E. Aebersold, Conel M. O. D. Alexander, Ronald-L. Ballouz, Jessica J. Barnes, Helena C. Bates, Carina A. Bennett, Laurinne Blanche, Erika H. Blumenfeld, Simon J. Clemett, George D. Cody, Daniella N. DellaGiustina, Jason P. Dworkin, Scott A. Eckley, Dionysis I. Foustoukos, Ian A. Franchi, Daniel P. Glavin, Richard C. Greenwood, Pierre Haenecour, Victoria E. Hamilton, Dolores H. Hill, Takahiro Hiroi, Kana Ishimaru, Fred Jourdan , et al. (28 additional authors not shown)

    Abstract: On 24 September 2023, the NASA OSIRIS-REx mission dropped a capsule to Earth containing approximately 120 g of pristine carbonaceous regolith from Bennu. We describe the delivery and initial allocation of this asteroid sample and introduce its bulk physical, chemical, and mineralogical properties from early analyses. The regolith is very dark overall, with higher-reflectance inclusions and particl… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 73 pages, 22 figures

  12. arXiv:2404.10271  [pdf, other

    cs.LG cs.AI cs.CL cs.CY cs.GT

    Social Choice Should Guide AI Alignment in Dealing with Diverse Human Feedback

    Authors: Vincent Conitzer, Rachel Freedman, Jobst Heitzig, Wesley H. Holliday, Bob M. Jacobs, Nathan Lambert, Milan Mossé, Eric Pacuit, Stuart Russell, Hailey Schoelkopf, Emanuel Tewolde, William S. Zwicker

    Abstract: Foundation models such as GPT-4 are fine-tuned to avoid unsafe or otherwise problematic behavior, such as helping to commit crimes or producing racist text. One approach to fine-tuning, called reinforcement learning from human feedback, learns from humans' expressed preferences over multiple outputs. Another approach is constitutional AI, in which the input from humans is a list of high-level prin… ▽ More

    Submitted 4 June, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

    Comments: 15 pages, 4 figures

    MSC Class: 68T01; 68T50; 91B14; 91B12 ACM Class: I.2.0; I.2.7; K.4.2; I.2.m; J.4

  13. arXiv:2403.19107  [pdf

    cs.CV cs.LG

    Synthetic Medical Imaging Generation with Generative Adversarial Networks For Plain Radiographs

    Authors: John R. McNulty, Lee Kho, Alexandria L. Case, Charlie Fornaca, Drew Johnston, David Slater, Joshua M. Abzug, Sybil A. Russell

    Abstract: In medical imaging, access to data is commonly limited due to patient privacy restrictions and the issue that it can be difficult to acquire enough data in the case of rare diseases.[1] The purpose of this investigation was to develop a reusable open-source synthetic image generation pipeline, the GAN Image Synthesis Tool (GIST), that is easy to use as well as easy to deploy. The pipeline helps to… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Report number: Public Release Case Number 22-3965

  14. arXiv:2403.08392  [pdf

    q-bio.TO

    Nonwoven Reinforced Photocurable Poly(glycerol sebacate)-Based Hydrogels

    Authors: Michael Phillips, Giuseppe Tronci, Christopher M. Pask, Stephen J. Russell

    Abstract: Implantable hydrogels should ideally possess mechanical properties matched to the surrounding tissues to enable adequate mechanical function while regeneration occurs. This can be challenging, especially when degradable systems with high water content and hydrolysable chemical bonds are required in anatomical sites under constant mechanical stimulation, e.g. a foot ulcer cavity. In these circumsta… ▽ More

    Submitted 17 March, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: 26 pages, 12 figures, 3 tables. Accepted in Polymers

  15. arXiv:2403.06003  [pdf, other

    cs.RO cs.AI cs.LG

    A Generalized Acquisition Function for Preference-based Reward Learning

    Authors: Evan Ellis, Gaurav R. Ghosal, Stuart J. Russell, Anca Dragan, Erdem Bıyık

    Abstract: Preference-based reward learning is a popular technique for teaching robots and autonomous systems how a human user wants them to perform a task. Previous works have shown that actively synthesizing preference queries to maximize information gain about the reward function parameters improves data efficiency. The information gain criterion focuses on precisely identifying all parameters of the rewa… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

  16. arXiv:2402.17747  [pdf, other

    cs.LG cs.AI stat.ML

    When Your AIs Deceive You: Challenges of Partial Observability in Reinforcement Learning from Human Feedback

    Authors: Leon Lang, Davis Foote, Stuart Russell, Anca Dragan, Erik Jenner, Scott Emmons

    Abstract: Past analyses of reinforcement learning from human feedback (RLHF) assume that the human evaluators fully observe the environment. What happens when human feedback is based only on partial observations? We formally define two failure cases: deceptive inflation and overjustification. Modeling the human as Boltzmann-rational w.r.t. a belief over trajectories, we prove conditions under which RLHF is… ▽ More

    Submitted 8 June, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

  17. arXiv:2402.08062  [pdf, ps, other

    cs.LG cs.AI

    Avoiding Catastrophe in Continuous Spaces by Asking for Help

    Authors: Benjamin Plaut, Hanlin Zhu, Stuart Russell

    Abstract: Most reinforcement learning algorithms with formal regret guarantees assume all mistakes are reversible and essentially rely on trying all possible behaviors. This approach leads to poor outcomes when some mistakes are irreparable or even catastrophic. We propose a variant of the contextual bandit problem where the goal is to minimize the chance of catastrophe. Specifically, we assume that the pay… ▽ More

    Submitted 26 May, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  18. arXiv:2401.09155  [pdf

    physics.optics

    Frequency conversion of vortex states by chiral forward Brillouin scattering in twisted photonic crystal fibre

    Authors: Xinglin Zeng, Philip St. J. Russell, Birgit Stiller

    Abstract: Optical vortex states-higher optical modes with helical phase progression and carrying orbital angular momentum-have been explored to increase the flexibility and capacity of optical fibres employed for example in mode-division-multiplexing, optical trapping and multimode imaging. A common requirement in such systems is high fidelity transfer of signals between different frequency bands and modes,… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

  19. arXiv:2312.12747  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    ALMANACS: A Simulatability Benchmark for Language Model Explainability

    Authors: Edmund Mills, Shiye Su, Stuart Russell, Scott Emmons

    Abstract: How do we measure the efficacy of language model explainability methods? While many explainability methods have been developed, they are typically evaluated on bespoke tasks, preventing an apples-to-apples comparison. To help fill this gap, we present ALMANACS, a language model explainability benchmark. ALMANACS scores explainability methods on simulatability, i.e., how well the explanations impro… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: Code is available at https://github.com/edmundmills/ALMANACS}{https://github.com/edmundmills/ALMANACS

  20. arXiv:2312.08369  [pdf, other

    stat.ML cs.AI cs.LG

    The Effective Horizon Explains Deep RL Performance in Stochastic Environments

    Authors: Cassidy Laidlaw, Banghua Zhu, Stuart Russell, Anca Dragan

    Abstract: Reinforcement learning (RL) theory has largely focused on proving minimax sample complexity bounds. These require strategic exploration algorithms that use relatively limited function classes for representing the policy or value function. Our goal is to explain why deep RL algorithms often perform well in practice, despite using random exploration and much more expressive function classes like neu… ▽ More

    Submitted 12 April, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Journal ref: ICLR 2024 (Spotlight)

  21. arXiv:2311.15862  [pdf, other

    physics.geo-ph

    Evidence for a kilometre-scale seismically slow layer atop the core-mantle boundary from normal modes

    Authors: Stuart Russell, Jessica C. E. Irving, Lisanne Jagt, Sanne Cottaar

    Abstract: Geodynamic modelling and seismic studies have highlighted the possibility that a thin layer of low seismic velocities, potentially molten, may sit atop the core-mantle boundary but has thus far eluded detection. In this study we employ normal modes, an independent data type to body waves, to assess the visibility of a seismically slow layer atop the core-mantle boundary to normal mode centre frequ… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

  22. arXiv:2311.01011  [pdf, other

    cs.LG cs.CR

    Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game

    Authors: Sam Toyer, Olivia Watkins, Ethan Adrian Mendes, Justin Svegliato, Luke Bailey, Tiffany Wang, Isaac Ong, Karim Elmaaroufi, Pieter Abbeel, Trevor Darrell, Alan Ritter, Stuart Russell

    Abstract: While Large Language Models (LLMs) are increasingly being used in real-world applications, they remain vulnerable to prompt injection attacks: malicious third party prompts that subvert the intent of the system designer. To help researchers study this problem, we present a dataset of over 126,000 prompt injection attacks and 46,000 prompt-based "defenses" against prompt injection, all created by p… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

  23. arXiv:2310.17688  [pdf, other

    cs.CY cs.AI cs.CL cs.LG

    Managing extreme AI risks amid rapid progress

    Authors: Yoshua Bengio, Geoffrey Hinton, Andrew Yao, Dawn Song, Pieter Abbeel, Trevor Darrell, Yuval Noah Harari, Ya-Qin Zhang, Lan Xue, Shai Shalev-Shwartz, Gillian Hadfield, Jeff Clune, Tegan Maharaj, Frank Hutter, Atılım Güneş Baydin, Sheila McIlraith, Qiqi Gao, Ashwin Acharya, David Krueger, Anca Dragan, Philip Torr, Stuart Russell, Daniel Kahneman, Jan Brauner, Sören Mindermann

    Abstract: Artificial Intelligence (AI) is progressing rapidly, and companies are shifting their focus to developing generalist AI systems that can autonomously act and pursue goals. Increases in capabilities and autonomy may soon massively amplify AI's impact, with risks that include large-scale social harms, malicious uses, and an irreversible loss of human control over autonomous AI systems. Although rese… ▽ More

    Submitted 22 May, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: Published in Science: https://www.science.org/doi/10.1126/science.adn0117

  24. arXiv:2310.15288  [pdf, other

    cs.AI cs.LG

    Active teacher selection for reinforcement learning from human feedback

    Authors: Rachel Freedman, Justin Svegliato, Kyle Wray, Stuart Russell

    Abstract: Reinforcement learning from human feedback (RLHF) enables machine learning systems to learn objectives from human feedback. A core limitation of these systems is their assumption that all feedback comes from a single human teacher, despite querying a range of distinct teachers. We propose the Hidden Utility Bandit (HUB) framework to model differences in teacher rationality, expertise, and costline… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

  25. arXiv:2310.01706  [pdf, other

    cs.LG

    On Representation Complexity of Model-based and Model-free Reinforcement Learning

    Authors: Hanlin Zhu, Baihe Huang, Stuart Russell

    Abstract: We study the representation complexity of model-based and model-free reinforcement learning (RL) in the context of circuit complexity. We prove theoretically that there exists a broad class of MDPs such that their underlying transition and reward functions can be represented by constant depth circuits with polynomial size, while the optimal $Q$-function suffers an exponential circuit complexity in… ▽ More

    Submitted 10 March, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: 23 pages, 9 figures, to be published in ICLR 2024

  26. arXiv:2309.00236  [pdf, other

    cs.LG cs.CL cs.CR

    Image Hijacks: Adversarial Images can Control Generative Models at Runtime

    Authors: Luke Bailey, Euan Ong, Stuart Russell, Scott Emmons

    Abstract: Are foundation models secure against malicious actors? In this work, we focus on the image input to a vision-language model (VLM). We discover image hijacks, adversarial images that control the behaviour of VLMs at inference time, and introduce the general Behaviour Matching algorithm for training image hijacks. From this, we derive the Prompt Matching method, allowing us to train hijacks matching… ▽ More

    Submitted 22 April, 2024; v1 submitted 31 August, 2023; originally announced September 2023.

    Comments: Project page at https://image-hijacks.github.io

  27. arXiv:2307.14745  [pdf, other

    cs.MA

    Using Multi-Agent MicroServices (MAMS) for Agent Based Modelling

    Authors: Martynas Jagutis, Sean Russell, Rem Collier

    Abstract: This paper demonstrates the use of the Multi-Agent MicroServices (MAMS) architectural style through a case study based around the development of a prototype traffic simulation in which agents model a population of individuals who travel from home to work and vice versa by car.

    Submitted 27 July, 2023; originally announced July 2023.

    Comments: 4 page demo paper accepted at EMAS. Paper has been extended from this version and submitted for publication in the formal proceedings

  28. arXiv:2306.09309  [pdf, other

    cs.AI cs.MA

    Who Needs to Know? Minimal Knowledge for Optimal Coordination

    Authors: Niklas Lauffer, Ameesh Shah, Micah Carroll, Michael Dennis, Stuart Russell

    Abstract: To optimally coordinate with others in cooperative games, it is often crucial to have information about one's collaborators: successful driving requires understanding which side of the road to drive on. However, not every feature of collaborators is strategically relevant: the fine-grained acceleration of drivers may be ignored while maintaining optimal coordination. We show that there is a well-d… ▽ More

    Submitted 13 July, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: To be published at ICML 2023

    ACM Class: I.2.6; I.2.11

  29. arXiv:2306.06924  [pdf, other

    cs.AI cs.CR cs.CY cs.LG

    TASRA: a Taxonomy and Analysis of Societal-Scale Risks from AI

    Authors: Andrew Critch, Stuart Russell

    Abstract: While several recent works have identified societal-scale and extinction-level risks to humanity arising from artificial intelligence, few have attempted an {\em exhaustive taxonomy} of such risks. Many exhaustive taxonomies are possible, and some are useful -- particularly if they reveal new risks or practical approaches to safety. This paper explores a taxonomy based on accountability: whose act… ▽ More

    Submitted 14 June, 2023; v1 submitted 12 June, 2023; originally announced June 2023.

    MSC Class: 68T01 ACM Class: I.2.0

  30. arXiv:2305.11220  [pdf, other

    quant-ph

    Protecting quantum modes in optical fibres

    Authors: M. A. T. Butt, P. Roth, G. K. L. Wong, M. H. Frosz, L. L. Sanchez-Soto, E. A. Anashkina, A. V. Andrianov, P. Banzer, P. S. J. Russell, G. Leuchs

    Abstract: Polarization-preserving fibers maintain the two polarization states of an orthogonal basis. Quantum communication, however, requires sending at least two nonorthogonal states and these cannot both be preserved. We present a new scheme that allows for using polarization encoding in a fiber not only in the discrete, but also in the continuous-variable regime. For the example of a helically twisted p… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: 7 pages, 4 figures, accepted in Phys. Rev. Applied

  31. Coronal Heating as Determined by the Solar Flare Frequency Distribution Obtained by Aggregating Case Studies

    Authors: James Paul Mason, Alexandra Werth, Colin G. West, Allison A. Youngblood, Donald L. Woodraska, Courtney Peck, Kevin Lacjak, Florian G. Frick, Moutamen Gabir, Reema A. Alsinan, Thomas Jacobsen, Mohammad Alrubaie, Kayla M. Chizmar, Benjamin P. Lau, Lizbeth Montoya Dominguez, David Price, Dylan R. Butler, Connor J. Biron, Nikita Feoktistov, Kai Dewey, N. E. Loomis, Michal Bodzianowski, Connor Kuybus, Henry Dietrick, Aubrey M. Wolfe , et al. (977 additional authors not shown)

    Abstract: Flare frequency distributions represent a key approach to addressing one of the largest problems in solar and stellar physics: determining the mechanism that counter-intuitively heats coronae to temperatures that are orders of magnitude hotter than the corresponding photospheres. It is widely accepted that the magnetic field is responsible for the heating, but there are two competing mechanisms th… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: 1,002 authors, 14 pages, 4 figures, 3 tables, published by The Astrophysical Journal on 2023-05-09, volume 948, page 71

  32. arXiv:2304.09853  [pdf, other

    cs.LG stat.ML

    Bridging RL Theory and Practice with the Effective Horizon

    Authors: Cassidy Laidlaw, Stuart Russell, Anca Dragan

    Abstract: Deep reinforcement learning (RL) works impressively in some environments and fails catastrophically in others. Ideally, RL theory should be able to provide an understanding of why this is, i.e. bounds predictive of practical performance. Unfortunately, current theory does not quite have this ability. We compare standard deep RL algorithms to prior sample complexity bounds by introducing a new data… ▽ More

    Submitted 11 January, 2024; v1 submitted 19 April, 2023; originally announced April 2023.

    Journal ref: NeurIPS 2023 (Oral)

  33. arXiv:2303.00894  [pdf, other

    cs.LG cs.AI

    Active Reward Learning from Multiple Teachers

    Authors: Peter Barnett, Rachel Freedman, Justin Svegliato, Stuart Russell

    Abstract: Reward learning algorithms utilize human feedback to infer a reward function, which is then used to train an AI system. This human feedback is often a preference comparison, in which the human teacher compares several samples of AI behavior and chooses which they believe best accomplishes the objective. While reward learning typically assumes that all feedback comes from a single teacher, in pract… ▽ More

    Submitted 1 March, 2023; originally announced March 2023.

  34. arXiv:2302.12564  [pdf

    cond-mat.mtrl-sci physics.optics

    Valleytronics in bulk MoS$_2$ by optical control of parity and time symmetries

    Authors: Igor Tyulnev, Álvaro Jiménez-Galán, Julita Poborska, Lenard Vamos, Rui F. Silva, Philip St. J. Russell, Francesco Tani, Olga Smirnova, Misha Ivanov, Jens Biegert

    Abstract: The valley degree of freedom of electrons in materials promises routes toward energy-efficient information storage with enticing prospects towards quantum information processing. Current challenges in utilizing valley polarization are symmetry conditions that require monolayer structures or specific material engineering, non-resonant optical control to avoid energy dissipation, and the ability to… ▽ More

    Submitted 24 February, 2023; originally announced February 2023.

    Comments: 4 figures

  35. arXiv:2301.04694  [pdf

    astro-ph.IM astro-ph.EP physics.geo-ph

    Science Priorities for the Extraction of the Solid MSR Samples from their Sample Tubes

    Authors: N. Dauphas, S. S. Russell, D. Beaty, F. Thiessen, J. Barnes, L. Bonal, J. Bridges, T. Bristow, J. Eiler, L. Ferriere, T. Fornaro, J. Gattacceca, B. Hoffman, E. J. Javaux, T. Kleine, H. Y. McSween, M. Prasad, L. Rampe, M. Schmidt, B. Schoene, K. L. Siebach, J. Stern, N. Tosca

    Abstract: Preservation of the chemical and structural integrity of samples that will be brought back from Mars is paramount to achieving the scientific objectives of MSR. Given our knowledge of the nature of the samples retrieved at Jezero by Perseverance, at least two options need to be tested for opening the sample tubes: (1) One or two radial cuts at the end of the tube to slide the sample out. (2) Two r… ▽ More

    Submitted 11 January, 2023; originally announced January 2023.

    Comments: 8 pages, 3 figures, 1 table, report NASA-ESA Mars Rock Team Report

  36. arXiv:2211.11972  [pdf, other

    cs.LG cs.AI

    imitation: Clean Imitation Learning Implementations

    Authors: Adam Gleave, Mohammad Taufeeque, Juan Rocamonde, Erik Jenner, Steven H. Wang, Sam Toyer, Maximilian Ernestus, Nora Belrose, Scott Emmons, Stuart Russell

    Abstract: imitation provides open-source implementations of imitation and reward learning algorithms in PyTorch. We include three inverse reinforcement learning (IRL) algorithms, three imitation learning algorithms and a preference comparison algorithm. The implementations have been benchmarked against previous results, and automated tests cover 98% of the code. Moreover, the algorithms are implemented in a… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

  37. arXiv:2211.00716  [pdf, ps, other

    cs.LG cs.AI math.OC math.ST stat.ML

    Optimal Conservative Offline RL with General Function Approximation via Augmented Lagrangian

    Authors: Paria Rashidinejad, Hanlin Zhu, Kunhe Yang, Stuart Russell, Jiantao Jiao

    Abstract: Offline reinforcement learning (RL), which refers to decision-making from a previously-collected dataset of interactions, has received significant attention over the past years. Much effort has focused on improving offline RL practicality by addressing the prevalent issue of partial data coverage through various forms of conservative policy learning. While the majority of algorithms do not have fi… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: 49 pages, 1 figure

  38. arXiv:2211.00241  [pdf, other

    cs.LG cs.AI cs.CR stat.ML

    Adversarial Policies Beat Superhuman Go AIs

    Authors: Tony T. Wang, Adam Gleave, Tom Tseng, Kellin Pelrine, Nora Belrose, Joseph Miller, Michael D. Dennis, Yawen Duan, Viktor Pogrebniak, Sergey Levine, Stuart Russell

    Abstract: We attack the state-of-the-art Go-playing AI system KataGo by training adversarial policies against it, achieving a >97% win rate against KataGo running at superhuman settings. Our adversaries do not win by playing Go well. Instead, they trick KataGo into making serious blunders. Our attack transfers zero-shot to other superhuman Go-playing AIs, and is comprehensible to the extent that human exper… ▽ More

    Submitted 13 July, 2023; v1 submitted 31 October, 2022; originally announced November 2022.

    Comments: Accepted to ICML 2023, see paper for changelog

    ACM Class: I.2.6

  39. arXiv:2208.07976  [pdf

    astro-ph.EP astro-ph.SR

    Presolar stardust in asteroid Ryugu

    Authors: Jens Barosch, Larry R. Nittler, Jianhua Wang, Conel M. O'D. Alexander, Bradley T. De Gregorio, Cécile Engrand, Yoko Kebukawa, Kazuhide Nagashima, Rhonda M. Stroud, Hikaru Yabuta, Yoshinari Abe, Jérôme Aléon, Sachiko Amari, Yuri Amelin, Ken-ichi Bajo, Laure Bejach, Martin Bizzarro, Lydie Bonal, Audrey Bouvier, Richard W. Carlson, Marc Chaussidon, Byeon-Gak Choi, George D. Cody, Emmanuel Dartois, Nicolas Dauphas , et al. (99 additional authors not shown)

    Abstract: We have conducted a NanoSIMS-based search for presolar material in samples recently returned from C-type asteroid Ryugu as part of JAXA's Hayabusa2 mission. We report the detection of all major presolar grain types with O- and C-anomalous isotopic compositions typically identified in carbonaceous chondrite meteorites: 1 silicate, 1 oxide, 1 O-anomalous supernova grain of ambiguous phase, 38 SiC, a… ▽ More

    Submitted 16 August, 2022; originally announced August 2022.

    Comments: 12 pages, 3 figures, 2 tables. Published in ApJL

    Journal ref: 2022, The Astrophysical Journal Letters, 935, L3 (12pp)

  40. arXiv:2208.07006  [pdf, ps, other

    cs.GT cs.LO cs.MA

    Cooperative and uncooperative institution designs: Surprises and problems in open-source game theory

    Authors: Andrew Critch, Michael Dennis, Stuart Russell

    Abstract: It is increasingly possible for real-world agents, such as software-based agents or human institutions, to view the internal programming of other such agents that they interact with. For instance, a company can read the bylaws of another company, or one software system can read the source code of another. Game-theoretic equilibria between the designers of such agents are called \emph{program equil… ▽ More

    Submitted 15 August, 2022; originally announced August 2022.

    Comments: 41 pages

    MSC Class: 93A14; 93A16; 91-08; 91A11; 91A35; 91A68; 91A44; 91B06; 91B41; 91B52 ACM Class: F.3.1; F.4.1; I.2.3; J.4

  41. Temporal self-compression and self-frequency shift of sub-microjoule pulses at 8 MHz repetition rate

    Authors: Francesco Tani, Jacob Lampen, Martin Butryn, Michael H. Frosz, Jie Jiang, Martin Fermann, Philip St. J. Russell

    Abstract: We combine soliton dynamics in gas-filled hollow-core photonic crystal fibers with a state-of-the-art fiber laser to realize a turn-key system producing few-fs pulses at 8 MHz repetition rate at pump energies as low as 220 nJ. Furthermore, by exploiting the soliton self-frequency shift in a second hydrogen-filled hollow-core fiber, we efficiently generate pulses as short as 22 fs, continuously tun… ▽ More

    Submitted 23 July, 2022; originally announced July 2022.

  42. arXiv:2207.03470  [pdf, other

    cs.GT cs.AI cs.LG cs.MA

    For Learning in Symmetric Teams, Local Optima are Global Nash Equilibria

    Authors: Scott Emmons, Caspar Oesterheld, Andrew Critch, Vincent Conitzer, Stuart Russell

    Abstract: Although it has been known since the 1970s that a globally optimal strategy profile in a common-payoff game is a Nash equilibrium, global optimality is a strict requirement that limits the result's applicability. In this work, we show that any locally optimal symmetric strategy profile is also a (global) Nash equilibrium. Furthermore, we show that this result is robust to perturbations to the comm… ▽ More

    Submitted 7 July, 2022; originally announced July 2022.

  43. arXiv:2205.15705  [pdf

    physics.optics nlin.PS physics.app-ph

    High-quality 8-fold self-compression of ultrashort near-UV pulses in Ar-filled ultrathin-walled photonic crystal fiber

    Authors: Jie Luan, Philip St. J. Russell, David Novoa

    Abstract: We demonstrate generation of 7.6 fs near-UV pulses centered at 400 nm via 8-fold soliton-effect self-compression in an Ar-filled hollow-core kagomé-style photonic crystal fiber with ultrathin core walls. Analytical calculations of the effective compression length and soliton order permit adjustment of the experimental parameters, and numerical modelling of the nonlinear pulse dynamics in the fiber… ▽ More

    Submitted 31 May, 2022; originally announced May 2022.

    Comments: 7 pages, 5 figures

  44. arXiv:2205.08229  [pdf, other

    physics.geo-ph

    A Re-examination of Ellipticity Corrections for Seismic Phases

    Authors: Stuart Russell, John F. Rudge, Jessica C. E. Irving, Sanne Cottaar

    Abstract: The Earth's ellipticity of figure has an effect on the travel times of seismic waves over teleseismic distances. Tables of ellipticity corrections and coefficients have been used by seismologists for several decades, however due to the increasing variety and complexity of seismic phases in use, current tables of ellipticity coefficients are now outmoded and incomplete. We present a Python package,… ▽ More

    Submitted 11 August, 2022; v1 submitted 17 May, 2022; originally announced May 2022.

    Comments: Main paper of 11 pages, 4 figures and 1 table plus a supplement of 12 pages and 1 table

  45. arXiv:2205.07886  [pdf, other

    cs.LG cs.AI

    An Empirical Investigation of Representation Learning for Imitation

    Authors: Xin Chen, Sam Toyer, Cody Wild, Scott Emmons, Ian Fischer, Kuang-Huei Lee, Neel Alex, Steven H Wang, Ping Luo, Stuart Russell, Pieter Abbeel, Rohin Shah

    Abstract: Imitation learning often needs a large demonstration set in order to handle the full range of situations that an agent might find itself in during deployment. However, collecting expert demonstrations can be expensive. Recent work in vision, reinforcement learning, and NLP has shown that auxiliary representation learning objectives can reduce the need for large amounts of expensive, task-specific… ▽ More

    Submitted 16 May, 2022; originally announced May 2022.

    Comments: Accepted to NeurIPS2021 Datasets and Benchmarks Track

  46. arXiv:2204.11971  [pdf

    physics.optics

    Optical vortex Brillouin laser

    Authors: Xinglin Zeng, Philip St. J. Russell, Yang Chen, Zheqi Wang, Gordon K. L. Wong, Paul Roth, Michael H. Frosz, Birgit Stiller

    Abstract: Optical vortices, which have been extensively studied over the last decades, offer an additional degree of freedom useful in many applications, such as optical tweezers and quantum control. Stimulated Brillouin scattering, providing a narrow linewidth and a strong nonlinear response, has been used to realise quasi-continuous wave (CW) lasers. Here, we report stable oscillation of optical vortices… ▽ More

    Submitted 25 April, 2022; originally announced April 2022.

  47. arXiv:2204.11966  [pdf, other

    cs.LG cs.IR

    Estimating and Penalizing Induced Preference Shifts in Recommender Systems

    Authors: Micah Carroll, Anca Dragan, Stuart Russell, Dylan Hadfield-Menell

    Abstract: The content that a recommender system (RS) shows to users influences them. Therefore, when choosing a recommender to deploy, one is implicitly also choosing to induce specific internal states in users. Even more, systems trained via long-horizon optimization will have direct incentives to manipulate users: in this work, we focus on the incentive to shift user preferences so they are easier to sati… ▽ More

    Submitted 14 July, 2022; v1 submitted 25 April, 2022; originally announced April 2022.

    Comments: Accepted to ICML 2022 (Spotlight)

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, PMLR 162:2686-2708, 2022

  48. arXiv:2203.12053  [pdf, other

    eess.AS cs.SD

    Upmixing via style transfer: a variational autoencoder for disentangling spatial images and musical content

    Authors: Haici Yang, Sanna Wager, Spencer Russell, Mike Luo, Minje Kim, Wontak Kim

    Abstract: In the stereo-to-multichannel upmixing problem for music, one of the main tasks is to set the directionality of the instrument sources in the multichannel rendering results. In this paper, we propose a modified variational autoencoder model that learns a latent space to describe the spatial images in multichannel music. We seek to disentangle the spatial images and music content, so the learned la… ▽ More

    Submitted 22 March, 2022; originally announced March 2022.

  49. arXiv:2203.07475  [pdf, other

    cs.LG cs.AI stat.ML

    Invariance in Policy Optimisation and Partial Identifiability in Reward Learning

    Authors: Joar Skalse, Matthew Farrugia-Roberts, Stuart Russell, Alessandro Abate, Adam Gleave

    Abstract: It is often very challenging to manually design reward functions for complex, real-world tasks. To solve this, one can instead use reward learning to infer a reward function from data. However, there are often multiple reward functions that fit the data equally well, even in the infinite-data limit. This means that the reward function is only partially identifiable. In this work, we formally chara… ▽ More

    Submitted 7 June, 2023; v1 submitted 14 March, 2022; originally announced March 2022.

    Comments: ICML 2023. 9 pages main paper, 26 pages total, 3 figures

    ACM Class: I.2.6

  50. arXiv:2203.03680  [pdf

    physics.optics physics.app-ph

    Nonreciprocal vortex isolator by stimulated Brillouin scattering in chiral photonic crystal fibre

    Authors: Xinglin Zeng, Philip St. J. Russell, Christian Wolff, Michael H. Frosz, Gordon K. L. Wong, Birgit Stiller

    Abstract: Optical non-reciprocity, which breaks the symmetry between forward and backward propagating optical waves, has become vital in photonic systems and enables many key devices, such as optical isolators, circulators and optical routers. Most conventional optical isolators involve magneto-optic materials, but devices based on optical nonlinearities, optomechanically induced transparency and stimulated… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.