
Showing 1–50 of 316 results for author: Chan, A

  1. arXiv:2407.14981  [pdf, other]

    cs.CY

    Open Problems in Technical AI Governance

    Authors: Anka Reuel, Ben Bucknall, Stephen Casper, Tim Fist, Lisa Soder, Onni Aarne, Lewis Hammond, Lujain Ibrahim, Alan Chan, Peter Wills, Markus Anderljung, Ben Garfinkel, Lennart Heim, Andrew Trask, Gabriel Mukobi, Rylan Schaeffer, Mauricio Baker, Sara Hooker, Irene Solaiman, Alexandra Sasha Luccioni, Nitarshan Rajkumar, Nicolas Moës, Jeffrey Ladish, Neel Guha, Jessica Newman, et al. (6 additional authors not shown)

    Abstract: AI progress is creating a growing range of risks and opportunities, but it is often unclear how they should be navigated. In many cases, the barriers and uncertainties faced are at least partly technical. Technical AI governance, referring to technical analysis and tools for supporting the effective governance of AI, seeks to address such challenges. It can help to (a) identify areas where interve…

    Submitted 20 July, 2024; originally announced July 2024.

    Comments: Ben Bucknall and Anka Reuel contributed equally and share the first author position

  2. arXiv:2407.04062  [pdf, other]

    physics.optics

    Single-mode emission by phase-delayed coupling between nano-lasers

    Authors: T. V. Raziman, Anna Fischer, Riccardo Nori, Anthony Chan, Wai Kit Ng, Dhruv Saxena, Ortwin Hess, Korneel Molkens, Ivo Tanghe, Pieter Geiregat, Dries Van Thourhout, Mauricio Barahona, Riccardo Sapienza

    Abstract: Near-field coupling between nanolasers enables collective high-power lasing but leads to complex spectral reshaping and multimode operation, limiting the emission brightness, spatial coherence and temporal stability. Many lasing architectures have been proposed to circumvent this limitation, based on symmetries, topology, or interference. We show that a much simpler and robust method exploiting ph…

    Submitted 4 July, 2024; originally announced July 2024.

  3. arXiv:2406.15621  [pdf, other]

    physics.ins-det

    On the 96-well plate coverglass tilt and curvature suppression in 96-camera imaging system

    Authors: Antony C Chan

    Abstract: The 96-eyes instrument is capable of computational extended depth of focus (eDOF) of up to +/- 30 micrometer in the phase channel, and conventional depth of field (DOF) of +/- 5 micrometer in the fluorescence channel. However, it requires minimal plate-to-plate cover glass depth variation to function. Plate depths are measured using a third-party plate scanner (Opera Phenix) grouped by plate types…

    Submitted 13 March, 2024; originally announced June 2024.

  4. arXiv:2406.12137  [pdf, other]

    cs.AI

    IDs for AI Systems

    Authors: Alan Chan, Noam Kolt, Peter Wills, Usman Anwar, Christian Schroeder de Witt, Nitarshan Rajkumar, Lewis Hammond, David Krueger, Lennart Heim, Markus Anderljung

    Abstract: AI systems are increasingly pervasive, yet information needed to decide whether and how to engage with them may not exist or be accessible. A user may not be able to verify whether a system has certain safety certifications. An investigator may not know whom to investigate when a system causes an incident. It may not be clear whom to contact to shut down a malfunctioning system. Across a number of…

    Submitted 18 July, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: Work-in-progress

  5. arXiv:2406.09630  [pdf, other]

    cs.CV cs.LG

    Muharaf: Manuscripts of Handwritten Arabic Dataset for Cursive Text Recognition

    Authors: Mehreen Saeed, Adrian Chan, Anupam Mijar, Joseph Moukarzel, Georges Habchi, Carlos Younes, Amin Elias, Chau-Wai Wong, Akram Khater

    Abstract: We present the Manuscripts of Handwritten Arabic (Muharaf) dataset, which is a machine learning dataset consisting of more than 1,600 historic handwritten page images transcribed by experts in archival Arabic. Each document image is accompanied by spatial polygonal coordinates of its text lines as well as basic page elements. This dataset was compiled to advance the state of the art in handwritten…

    Submitted 13 June, 2024; originally announced June 2024.

  6. arXiv:2406.09409  [pdf, other]

    cs.CV eess.IV

    CodedEvents: Optimal Point-Spread-Function Engineering for 3D-Tracking with Event Cameras

    Authors: Sachin Shah, Matthew Albert Chan, Haoming Cai, Jingxi Chen, Sakshum Kulshrestha, Chahat Deep Singh, Yiannis Aloimonos, Christopher Metzler

    Abstract: Point-spread-function (PSF) engineering is a well-established computational imaging technique that uses phase masks and other optical elements to embed extra information (e.g., depth) into the images captured by conventional CMOS image sensors. To date, however, PSF-engineering has not been applied to neuromorphic event cameras, a powerful new image sensing technology that responds to changes in t…

    Submitted 13 June, 2024; originally announced June 2024.

  7. arXiv:2406.08414  [pdf, other]

    cs.LG

    Discovering Preference Optimization Algorithms with and for Large Language Models

    Authors: Chris Lu, Samuel Holt, Claudio Fanconi, Alex J. Chan, Jakob Foerster, Mihaela van der Schaar, Robert Tjarko Lange

    Abstract: Offline preference optimization is a key method for enhancing and controlling the quality of Large Language Model (LLM) outputs. Typically, preference optimization is approached as an offline supervised learning task using manually-crafted convex loss functions. While these methods are based on theoretical insights, they are inherently constrained by human creativity, so the large search space of…

    Submitted 12 June, 2024; originally announced June 2024.

  8. arXiv:2405.19943  [pdf, other]

    cs.CV

    Multi-View People Detection in Large Scenes via Supervised View-Wise Contribution Weighting

    Authors: Qi Zhang, Yunfei Gong, Daijie Chen, Antoni B. Chan, Hui Huang

    Abstract: Recent deep learning-based multi-view people detection (MVD) methods have shown promising results on existing datasets. However, current methods are mainly trained and evaluated on small, single scenes with a limited number of multi-view frames and fixed camera views. As a result, these methods may not be practical for detecting people in larger, more complex scenes with severe occlusions and came…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: AAAI 2024

  9. arXiv:2405.08886  [pdf, other]

    cs.LG stat.ML

    The Pitfalls and Promise of Conformal Inference Under Adversarial Attacks

    Authors: Ziquan Liu, Yufei Cui, Yan Yan, Yi Xu, Xiangyang Ji, Xue Liu, Antoni B. Chan

    Abstract: In safety-critical applications such as medical imaging and autonomous driving, where decisions have profound implications for patient health and road safety, it is imperative to maintain both high adversarial robustness to protect against potential adversarial attacks and reliable uncertainty quantification in decision-making. With extensive research focused on enhancing adversarial robustness th…

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: ICML2024

  10. arXiv:2405.01644  [pdf]

    eess.IV cs.CV physics.med-ph

    A Classification-Based Adaptive Segmentation Pipeline: Feasibility Study Using Polycystic Liver Disease and Metastases from Colorectal Cancer CT Images

    Authors: Peilong Wang, Timothy L. Kline, Andy D. Missert, Cole J. Cook, Matthew R. Callstrom, Alex Chan, Robert P. Hartman, Zachary S. Kelm, Panagiotis Korfiatis

    Abstract: Automated segmentation tools often encounter accuracy and adaptability issues when applied to images of different pathology. The purpose of this study is to explore the feasibility of building a workflow to efficiently route images to specifically trained segmentation models. By implementing a deep learning classifier to automatically classify the images and route them to appropriate segmentation…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: J Digit Imaging. Inform. med. (2024)

  11. arXiv:2405.01641  [pdf, other]

    cond-mat.stat-mech hep-th nlin.CD quant-ph

    Spectral form factor in chaotic, localized, and integrable open quantum many-body systems

    Authors: Jiachen Li, Stephen Yan, Tomaž Prosen, Amos Chan

    Abstract: We numerically study the spectral statistics of open quantum many-body systems (OQMBS) as signatures of quantum chaos (or the lack thereof), using the dissipative spectral form factor (DSFF), a generalization of the spectral form factor to complex spectra. We show that the DSFF of chaotic OQMBS generically displays the $\textit{quadratic}$ ramp-plateau behaviour of the Ginibre ensemble from random…

    Submitted 2 May, 2024; originally announced May 2024.

  12. arXiv:2404.16654  [pdf, ps, other]

    quant-ph math.CO

    A generalization of quantum pair state transfer

    Authors: Sooyeong Kim, Hermie Monterde, Bahman Ahmadi, Ada Chan, Stephen Kirkland, Sarah Plosker

    Abstract: An $s$-pair state in a graph is a quantum state of the form $\mathbf{e}_u+s\mathbf{e}_v$, where $u$ and $v$ are vertices in the graph and $s$ is a non-zero complex number. If $s=-1$ (resp., $s=1$), then such a state is called a pair state (resp. plus state). In this paper, we develop the theory of perfect $s$-pair state transfer in continuous quantum walks, where the Hamiltonian is taken to be the…

    Submitted 28 July, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    MSC Class: 05C50; 81P45; 05C76; 15A18; 81Q10

  13. arXiv:2404.11895  [pdf, other]

    cs.CV

    FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models

    Authors: Wei Wu, Qingnan Fan, Shuai Qin, Hong Gu, Ruoyu Zhao, Antoni B. Chan

    Abstract: Precise image editing with text-to-image models has attracted increasing interest due to their remarkable generative capabilities and user-friendly nature. However, such attempts face the pivotal challenge of misalignment between the intended precise editing target regions and the broader area impacted by the guidance in practice. Despite excellent methods leveraging attention mechanisms that have…

    Submitted 18 April, 2024; originally announced April 2024.

  14. arXiv:2404.10057  [pdf, other]

    cond-mat.stat-mech hep-th math-ph quant-ph

    Universal distributions of overlaps from unitary dynamics in generic quantum many-body systems

    Authors: Alexios Christopoulos, Amos Chan, Andrea De Luca

    Abstract: We study the preparation of a quantum state using a circuit of depth $t$ from a factorized state of $N$ sites. We argue that in the appropriate scaling limit of large $t$ and $N$, the overlap between states evolved under generic many-body chaotic dynamics belongs to a family of universal distributions that generalizes the celebrated Porter-Thomas distribution. This is a consequence of a mapping in…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 15 pages, 6 figures

  15. arXiv:2404.09932  [pdf, other]

    cs.LG cs.AI cs.CL cs.CY

    Foundational Challenges in Assuring Alignment and Safety of Large Language Models

    Authors: Usman Anwar, Abulhair Saparov, Javier Rando, Daniel Paleka, Miles Turpin, Peter Hase, Ekdeep Singh Lubana, Erik Jenner, Stephen Casper, Oliver Sourbut, Benjamin L. Edelman, Zhaowei Zhang, Mario Günther, Anton Korinek, Jose Hernandez-Orallo, Lewis Hammond, Eric Bigelow, Alexander Pan, Lauro Langosco, Tomasz Korbak, Heidi Zhang, Ruiqi Zhong, Seán Ó hÉigeartaigh, Gabriel Recchia, Giulio Corsi, et al. (13 additional authors not shown)

    Abstract: This work identifies 18 foundational challenges in assuring the alignment and safety of large language models (LLMs). These challenges are organized into three different categories: scientific understanding of LLMs, development and deployment methods, and sociotechnical challenges. Based on the identified challenges, we pose $200+$ concrete research questions.

    Submitted 15 April, 2024; originally announced April 2024.

  16. arXiv:2404.09504  [pdf, other]

    cs.CV

    Learning Tracking Representations from Single Point Annotations

    Authors: Qiangqiang Wu, Antoni B. Chan

    Abstract: Existing deep trackers are typically trained with large-scale video frames with annotated bounding boxes. However, these bounding boxes are expensive and time-consuming to annotate, in particular for large-scale datasets. In this paper, we propose to learn tracking representations from single point annotations (i.e., 4.5x faster to annotate than the traditional bounding box) in a weakly supervised…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: Accepted to CVPR2024-L3DIVU

  17. arXiv:2403.16935  [pdf, other]

    quant-ph

    Measuring Spectral Form Factor in Many-Body Chaotic and Localized Phases of Quantum Processors

    Authors: Hang Dong, Pengfei Zhang, Ceren B. Dag, Yu Gao, Ning Wang, Jinfeng Deng, Xu Zhang, Jiachen Chen, Shibo Xu, Ke Wang, Yaozu Wu, Chuanyu Zhang, Feitong Jin, Xuhao Zhu, Aosai Zhang, Yiren Zou, Ziqi Tan, Zhengyi Cui, Zitian Zhu, Fanhao Shen, Tingting Li, Jiarun Zhong, Zehang Bao, Hekang Li, Zhen Wang, et al. (6 additional authors not shown)

    Abstract: The spectral form factor (SFF) captures universal spectral fluctuations as signatures of quantum chaos, and has been instrumental in advancing multiple frontiers of physics including the studies of black holes and quantum many-body systems. However, the measurement of SFF in many-body systems is challenging due to the difficulty in resolving level spacings that become exponentially small with incr…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 12 pages, 9 figures

  18. arXiv:2403.15218  [pdf, other]

    cs.CV cs.AI cs.LG

    Anytime, Anywhere, Anyone: Investigating the Feasibility of Segment Anything Model for Crowd-Sourcing Medical Image Annotations

    Authors: Pranav Kulkarni, Adway Kanhere, Dharmam Savani, Andrew Chan, Devina Chatterjee, Paul H. Yi, Vishwa S. Parekh

    Abstract: Curating annotations for medical image segmentation is a labor-intensive and time-consuming task that requires domain expertise, resulting in "narrowly" focused deep learning (DL) models with limited translational utility. Recently, foundation models like the Segment Anything Model (SAM) have revolutionized semantic segmentation with exceptional zero-shot generalizability across various domains, i…

    Submitted 22 March, 2024; originally announced March 2024.

  19. arXiv:2403.12046  [pdf, other]

    cs.CV

    GPT-4V(ision) Unsuitable for Clinical Care and Education: A Clinician-Evaluated Assessment

    Authors: Senthujan Senkaiahliyan, Augustin Toma, Jun Ma, An-Wen Chan, Andrew Ha, Kevin R. An, Hrishikesh Suresh, Barry Rubin, Bo Wang

    Abstract: OpenAI's large multimodal model, GPT-4V(ision), was recently developed for general image interpretation. However, less is known about its capabilities with medical image interpretation and diagnosis. Board-certified physicians and senior residents assessed GPT-4V's proficiency across a range of medical conditions using imaging modalities such as CT scans, MRIs, ECGs, and clinical photographs. Alth…

    Submitted 14 November, 2023; originally announced March 2024.

  20. arXiv:2403.10236  [pdf, other]

    cs.CV

    A Fixed-Point Approach to Unified Prompt-Based Counting

    Authors: Wei Lin, Antoni B. Chan

    Abstract: Existing class-agnostic counting models typically rely on a single type of prompt, e.g., box annotations. This paper aims to establish a comprehensive prompt-based counting framework capable of generating density maps for concerned objects indicated by various prompt types, such as box, point, and text. To achieve this goal, we begin by converting prompts from different modalities into prompt mask…

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: Accepted by AAAI 2024

  21. arXiv:2403.03949  [pdf, other]

    cs.RO cs.AI cs.LG

    Reconciling Reality through Simulation: A Real-to-Sim-to-Real Approach for Robust Manipulation

    Authors: Marcel Torne, Anthony Simeonov, Zechu Li, April Chan, Tao Chen, Abhishek Gupta, Pulkit Agrawal

    Abstract: Imitation learning methods need significant human supervision to learn policies robust to changes in object poses, physical disturbances, and visual distractors. Reinforcement learning, on the other hand, can explore the environment autonomously to learn robust behaviors but may require impractical amounts of unsafe real-world data collection. To learn performant, robust policies without the burde…

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: Project page: https://real-to-sim-to-real.github.io/RialTo/

  22. arXiv:2402.17514  [pdf, other]

    cs.CV

    Robust Unsupervised Crowd Counting and Localization with Adaptive Resolution SAM

    Authors: Jia Wan, Qiangqiang Wu, Wei Lin, Antoni B. Chan

    Abstract: The existing crowd counting models require extensive training data, which is time-consuming to annotate. To tackle this issue, we propose a simple yet effective crowd counting method by utilizing the Segment-Everything-Everywhere Model (SEEM), an adaptation of the Segment Anything Model (SAM), to generate pseudo-labels for training crowd counting models. However, our initial investigation rev…

    Submitted 27 February, 2024; originally announced February 2024.

  23. arXiv:2402.16939  [pdf, other]

    quant-ph cond-mat.stat-mech

    Projected state ensemble of a generic model of many-body quantum chaos

    Authors: Amos Chan, Andrea De Luca

    Abstract: The projected ensemble is based on the study of the quantum state of a subsystem $A$ conditioned on projective measurements in its complement. Recent studies have observed that a more refined measure of the thermalization of a chaotic quantum system can be defined on the basis of convergence of the projected ensemble to a quantum state design, i.e. a system thermalizes when it becomes indistinguis…

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: 23 pages, 7 figures. Submitted for the special issue of Journal of Physics A: Mathematical and Theoretical on Quantum-Circuit Models for Many-Body Physics Out of Equilibrium

  24. arXiv:2402.14261  [pdf, other]

    cs.SE cs.AI

    Copilot Evaluation Harness: Evaluating LLM-Guided Software Programming

    Authors: Anisha Agarwal, Aaron Chan, Shubham Chandel, Jinu Jang, Shaun Miller, Roshanak Zilouchian Moghaddam, Yevhen Mohylevskyy, Neel Sundaresan, Michele Tufano

    Abstract: The integration of Large Language Models (LLMs) into Development Environments (IDEs) has become a focal point in modern software development. LLMs such as OpenAI GPT-3.5/4 and Code Llama offer the potential to significantly augment developer productivity by serving as intelligent, chat-driven programming assistants. However, utilizing LLMs out of the box is unlikely to be optimal for any given sce…

    Submitted 21 February, 2024; originally announced February 2024.

  25. arXiv:2402.11590  [pdf, other]

    cs.HC

    Designing interactive data visualizations representing recovery progress for patients after stroke

    Authors: Alicia Ouskine, Adrian D. C. Chan, Fateme Rajabiyazdi

    Abstract: Stroke is one of the leading causes of disability worldwide. The efficacy of recovery is determined by a variety of factors, including patient adherence to rehabilitation programs. One way to increase patient adherence to their rehabilitation program is to show patients their progress that is visualized in a simple and intuitive way. We begin to gather preliminary information on Functional Capacit…

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: 2 pages

  26. arXiv:2402.05713  [pdf, other]

    cs.LG cs.AI cs.CV

    Hidden in Plain Sight: Undetectable Adversarial Bias Attacks on Vulnerable Patient Populations

    Authors: Pranav Kulkarni, Andrew Chan, Nithya Navarathna, Skylar Chan, Paul H. Yi, Vishwa S. Parekh

    Abstract: The proliferation of artificial intelligence (AI) in radiology has shed light on the risk of deep learning (DL) models exacerbating clinical biases towards vulnerable patient populations. While prior literature has focused on quantifying biases exhibited by trained DL models, demographically targeted adversarial bias attacks on DL models and its implication in the clinical environment remains an u…

    Submitted 7 April, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: 29 pages, 4 figures

  27. ReviewFlow: Intelligent Scaffolding to Support Academic Peer Reviewing

    Authors: Lu Sun, Aaron Chan, Yun Seo Chang, Steven P. Dow

    Abstract: Peer review is a cornerstone of science. Research communities conduct peer reviews to assess contributions and to improve the overall quality of science work. Every year, new community members are recruited as peer reviewers for the first time. How could technology help novices adhere to their community's practices and standards for peer reviewing? To better understand peer review practices and ch…

    Submitted 26 February, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: 19 pages, accepted at the 29th ACM Conference on Intelligent User Interfaces (IUI 2024)

  28. arXiv:2402.03478  [pdf, other]

    cs.LG cs.CV

    Hyper-Diffusion: Estimating Epistemic and Aleatoric Uncertainty with a Single Model

    Authors: Matthew A. Chan, Maria J. Molina, Christopher A. Metzler

    Abstract: Estimating and disentangling epistemic uncertainty (uncertainty that can be reduced with more training data) and aleatoric uncertainty (uncertainty that is inherent to the task at hand) is critically important when applying machine learning (ML) to high-stakes applications such as medical imaging and weather forecasting. Conditional diffusion models' breakthrough ability to accurately and efficien…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 10 pages, 7 figures

  29. arXiv:2402.00782  [pdf, other]

    cs.LG

    Dense Reward for Free in Reinforcement Learning from Human Feedback

    Authors: Alex J. Chan, Hao Sun, Samuel Holt, Mihaela van der Schaar

    Abstract: Reinforcement Learning from Human Feedback (RLHF) has been credited as the key advance that has allowed Large Language Models (LLMs) to effectively follow instructions and produce useful assistance. Classically, this involves generating completions from the LLM in response to a query before using a separate reward model to assign a score to the full completion. As an auto-regressive process, the L…

    Submitted 1 February, 2024; originally announced February 2024.

  30. arXiv:2401.15528  [pdf, other]

    cond-mat.mes-hall

    Dirac mass induced by optical gain and loss

    Authors: Letian Yu, Haoran Xue, Ruixiang Guo, Eng Aik Chan, Yun Yong Terh, Cesare Soci, Baile Zhang, Y. D. Chong

    Abstract: Mass is commonly regarded as an intrinsic property of matter, but modern physics reveals particle masses to have complex origins, such as the Higgs mechanism in high-energy physics. In crystal lattices such as graphene, relativistic Dirac particles can exist as low-energy quasiparticles with masses imparted by lattice symmetry-breaking perturbations. These mass-generating mechanisms all assume Her…

    Submitted 14 April, 2024; v1 submitted 27 January, 2024; originally announced January 2024.

  31. arXiv:2401.14446  [pdf, other]

    cs.CY cs.AI cs.CR

    Black-Box Access is Insufficient for Rigorous AI Audits

    Authors: Stephen Casper, Carson Ezell, Charlotte Siegmann, Noam Kolt, Taylor Lynn Curtis, Benjamin Bucknall, Andreas Haupt, Kevin Wei, Jérémy Scheurer, Marius Hobbhahn, Lee Sharkey, Satyapriya Krishna, Marvin Von Hagen, Silas Alberti, Alan Chan, Qinyi Sun, Michael Gerovitch, David Bau, Max Tegmark, David Krueger, Dylan Hadfield-Menell

    Abstract: External audits of AI systems are increasingly recognized as a key mechanism for AI governance. The effectiveness of an audit, however, depends on the degree of access granted to auditors. Recent audits of state-of-the-art AI systems have primarily relied on black-box access, in which auditors can only query the system and observe its outputs. However, white-box access to the system's inner workin…

    Submitted 29 May, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: FAccT 2024

    Journal ref: The 2024 ACM Conference on Fairness, Accountability, and Transparency (FAccT '24), June 3-6, 2024, Rio de Janeiro, Brazil

  32. arXiv:2401.13138  [pdf, other]

    cs.CY cs.AI

    Visibility into AI Agents

    Authors: Alan Chan, Carson Ezell, Max Kaufmann, Kevin Wei, Lewis Hammond, Herbie Bradley, Emma Bluemke, Nitarshan Rajkumar, David Krueger, Noam Kolt, Lennart Heim, Markus Anderljung

    Abstract: Increased delegation of commercial, scientific, governmental, and personal activities to AI agents -- systems capable of pursuing complex goals with limited supervision -- may exacerbate existing societal risks and introduce new risks. Understanding and mitigating these risks involves critically evaluating existing governance structures, revising and adapting these structures where needed, and ens…

    Submitted 17 May, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: Accepted to ACM Conference on Fairness, Accountability, and Transparency (ACM FAccT 2024)

  33. Error Propagation Analysis for Multithreaded Programs: An Empirical Approach

    Authors: Stefan Winter, Abraham Chan, Habib Saissi, Karthik Pattabiraman, Neeraj Suri

    Abstract: Fault injection is a technique to measure the robustness of a program to errors by introducing faults into the program under test. Following a fault injection experiment, Error Propagation Analysis (EPA) is deployed to understand how errors affect a program's execution. EPA typically compares the traces of a fault-free (golden) run with those from a faulty run of the program. While this suffices f…

    Submitted 27 December, 2023; originally announced December 2023.

    Comments: Extended version of conference paper, originally published in the proceedings of ICST'17 (see: https://ieeexplore.ieee.org/document/7927974)

  34. arXiv:2312.14751  [pdf, other]

    cs.LG cs.CY

    Hazards from Increasingly Accessible Fine-Tuning of Downloadable Foundation Models

    Authors: Alan Chan, Ben Bucknall, Herbie Bradley, David Krueger

    Abstract: Public release of the weights of pretrained foundation models, otherwise known as downloadable access \citep{solaiman_gradient_2023}, enables fine-tuning without the prohibitive expense of pretraining. Our work argues that increasingly accessible fine-tuning of downloadable models may increase hazards. First, we highlight research to improve the accessibility of fine-tuning. We split our discussio…

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: Accepted as a spotlight workshop paper at the Socially Responsible Language Modelling Research (SoLaR) workshop, held at NeurIPS 2023

  35. arXiv:2312.02401  [pdf, other]

    stat.ML cs.LG cs.SI

    Harmonizing Global Voices: Culturally-Aware Models for Enhanced Content Moderation

    Authors: Alex J. Chan, José Luis Redondo García, Fabrizio Silvestri, Colm O'Donnel, Konstantina Palla

    Abstract: Content moderation at scale faces the challenge of considering local cultural distinctions when assessing content. While global policies aim to maintain decision-making consistency and prevent arbitrary rule enforcement, they often overlook regional variations in interpreting natural language as expressed in content. In this study, we are looking into how moderation systems can tackle this issue b…

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: 12 pages, 8 Figures. Supplementary material

  36. arXiv:2311.14110  [pdf, other]

    cs.LG cs.AI

    When is Off-Policy Evaluation Useful? A Data-Centric Perspective

    Authors: Hao Sun, Alex J. Chan, Nabeel Seedat, Alihan Hüyük, Mihaela van der Schaar

    Abstract: Evaluating the value of a hypothetical target policy with only a logged dataset is important but challenging. On the one hand, it brings opportunities for safe policy improvement under high-stakes scenarios like clinical guidelines. On the other hand, such opportunities raise a need for precise off-policy evaluation (OPE). While previous work on OPE focused on improving the algorithm in value esti…

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: Off-Policy Evaluation, Data-Centric AI, Data-Centric Reinforcement Learning, Reinforcement Learning

  37. arXiv:2311.10441  [pdf]

    physics.optics

    Retrieving positions of closely packed sub-wavelength nanoparticles from their diffraction patterns

    Authors: Benquan Wang, Ruyi An, Eng Aik Chan, Giorgio Adamo, Jin-Kyu So, Yewen Li, Zexiang Shen, Bo An, Nikolay I. Zheludev

    Abstract: Distinguishing two objects or point sources located closer than the Rayleigh distance is impossible in conventional microscopy. Understandably, the task becomes increasingly harder with a growing number of particles placed in close proximity. It has been recently demonstrated that subwavelength nanoparticles in closely packed clusters can be counted by AI-enabled analysis of the diffraction patter…

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: 6 pages, 3 figures

  38. arXiv:2311.09227  [pdf, other]

    cs.CY cs.AI cs.SE

    Open-Sourcing Highly Capable Foundation Models: An evaluation of risks, benefits, and alternative methods for pursuing open-source objectives

    Authors: Elizabeth Seger, Noemi Dreksler, Richard Moulange, Emily Dardaman, Jonas Schuett, K. Wei, Christoph Winter, Mackenzie Arnold, Seán Ó hÉigeartaigh, Anton Korinek, Markus Anderljung, Ben Bucknall, Alan Chan, Eoghan Stafford, Leonie Koessler, Aviv Ovadya, Ben Garfinkel, Emma Bluemke, Michael Aird, Patrick Levermore, Julian Hazell, Abhishek Gupta

    Abstract: Recent decisions by leading AI labs to either open-source their models or to restrict access to their models have sparked debate about whether, and how, increasingly capable AI models should be shared. Open-sourcing in AI typically refers to making model architecture and weights freely and publicly accessible for anyone to modify, study, build on, and use. This offers advantages such as enabling ex…

    Submitted 29 September, 2023; originally announced November 2023.

    Comments: Official release at https://www.governance.ai/research-paper/open-sourcing-highly-capable-foundation-models

  39. arXiv:2311.07426  [pdf, other]

    cs.LG cs.CV cs.HC

    Optimising Human-AI Collaboration by Learning Convincing Explanations

    Authors: Alex J. Chan, Alihan Huyuk, Mihaela van der Schaar

    Abstract: Machine learning models are being increasingly deployed to take, or assist in taking, complicated and high-impact decisions, from quasi-autonomous vehicles to clinical decision support systems. This poses challenges, particularly when models have hard-to-detect failure modes and are able to take actions without oversight. In order to handle this challenge, we propose a method for a collaborative s…

    Submitted 13 November, 2023; originally announced November 2023.

  40. arXiv:2311.02805  [pdf, other]

    cs.CL

    Tailoring Self-Rationalizers with Multi-Reward Distillation

    Authors: Sahana Ramnath, Brihi Joshi, Skyler Hallinan, Ximing Lu, Liunian Harold Li, Aaron Chan, Jack Hessel, Yejin Choi, Xiang Ren

    Abstract: Large language models (LMs) are capable of generating free-text rationales to aid question answering. However, prior work 1) suggests that useful self-rationalization is emergent only at significant scales (e.g., 175B parameter GPT-3); and 2) focuses largely on downstream performance, ignoring the semantics of the rationales themselves, e.g., are they faithful, true, and helpful for humans? In thi…

    Submitted 22 May, 2024; v1 submitted 5 November, 2023; originally announced November 2023.

    Journal ref: The Twelfth International Conference on Learning Representations, 2024

  41. arXiv:2310.19967  [pdf]

    cs.LG

    Early detection of inflammatory arthritis to improve referrals using multimodal machine learning from blood testing, semi-structured and unstructured patient records

    Authors: Bing Wang, Weizi Li, Anthony Bradlow, Antoni T. Y. Chan, Eghosa Bazuaye

    Abstract: Early detection of inflammatory arthritis (IA) is critical to efficient and accurate hospital referral triage for timely treatment and preventing the deterioration of the IA disease course, especially under limited healthcare resources. The manual assessment process is the most common approach in practice for the early detection of IA, but it is extremely labor-intensive and inefficient. A large a…

    Submitted 3 November, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: Accepted in The 57th Hawaii International Conference on System Sciences, 3-6 Jan 2024, Hawaii

  42. arXiv:2310.14455  [pdf]

    cs.CY cs.AI

    An International Consortium for Evaluations of Societal-Scale Risks from Advanced AI

    Authors: Ross Gruetzemacher, Alan Chan, Kevin Frazier, Christy Manning, Štěpán Los, James Fox, José Hernández-Orallo, John Burden, Matija Franklin, Clíodhna Ní Ghuidhir, Mark Bailey, Daniel Eth, Toby Pilditch, Kyle Kilian

    Abstract: Given rapid progress toward advanced AI and risks from frontier AI systems (advanced AI systems pushing the boundaries of the AI capabilities frontier), the creation and implementation of AI governance and regulatory schemes deserves prioritization and substantial investment. However, the status quo is untenable and, frankly, dangerous. A regulatory gap has permitted AI labs to conduct research, d…

    Submitted 6 November, 2023; v1 submitted 22 October, 2023; originally announced October 2023.

    Comments: 50 pages, 2 figures; updated w/ a few minor revisions based on feedback from SoLaR Workshop reviewers (on 5 page version)

  43. arXiv:2310.08901  [pdf, other]

    cs.MA cs.AI cs.CL

    Welfare Diplomacy: Benchmarking Language Model Cooperation

    Authors: Gabriel Mukobi, Hannah Erlebach, Niklas Lauffer, Lewis Hammond, Alan Chan, Jesse Clifton

    Abstract: The growing capabilities and increasingly widespread deployment of AI systems necessitate robust benchmarks for measuring their cooperative capabilities. Unfortunately, most multi-agent benchmarks are either zero-sum or purely cooperative, providing limited opportunities for such measurements. We introduce a general-sum variant of the zero-sum board game Diplomacy -- called Welfare Diplomacy -- in…

    Submitted 13 October, 2023; originally announced October 2023.

  44. arXiv:2310.07989  [pdf, other]

    cond-mat.str-el

    Unconventional Magnetic Oscillations in Kagome Mott Insulators

    Authors: Guoxin Zheng, Yuan Zhu, Kuan-Wen Chen, Byungmin Kang, Dechen Zhang, Kaila Jenkins, Aaron Chan, Zhenyuan Zeng, Aini Xu, Oscar A. Valenzuela, Joanna Blawat, John Singleton, Patrick A. Lee, Shiliang Li, Lu Li

    Abstract: We apply a strong magnetic field to a kagome Mott insulator with antiferromagnetic interactions which does not show magnetic ordering down to low temperatures. We observe a plateau at magnetization 1/9 Bohr magneton per magnetic ion (Cu). Furthermore, in the vicinity of this plateau we observe sets of strong oscillations in the magnetic torque, reminiscent of quantum oscillations in metals. Such o…

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: 5 pages and 3 figures in the main text, 10 additional figures in the supplement

  45. arXiv:2310.06574  [pdf, other]

    cs.LG stat.AP stat.ML

    XAI for Early Crop Classification

    Authors: Ayshah Chan, Maja Schneider, Marco Körner

    Abstract: We propose an approach for early crop classification through identifying important timesteps with eXplainable AI (XAI) methods. Our approach consists of training a baseline crop classification model to carry out layer-wise relevance propagation (LRP) so that the salient time step can be identified. We chose a selected number of such important time indices to create the bounding region of the short…

    Submitted 10 October, 2023; originally announced October 2023.

  46. arXiv:2310.04743  [pdf, other]

    cs.CL

    Resprompt: Residual Connection Prompting Advances Multi-Step Reasoning in Large Language Models

    Authors: Song Jiang, Zahra Shakeri, Aaron Chan, Maziar Sanjabi, Hamed Firooz, Yinglong Xia, Bugra Akyildiz, Yizhou Sun, Jinchao Li, Qifan Wang, Asli Celikyilmaz

    Abstract: Chain-of-thought (CoT) prompting, which offers step-by-step problem-solving rationales, has impressively unlocked the reasoning potential of large language models (LLMs). Yet, the standard CoT is less effective in problems demanding multiple reasoning steps. This limitation arises from the complex reasoning process in multi-step problems: later stages often depend on the results of several steps e…

    Submitted 8 May, 2024; v1 submitted 7 October, 2023; originally announced October 2023.

    Comments: 29 pages

  47. arXiv:2309.15840  [pdf, other]

    cs.CL cs.AI cs.LG

    How to Catch an AI Liar: Lie Detection in Black-Box LLMs by Asking Unrelated Questions

    Authors: Lorenzo Pacchiardi, Alex J. Chan, Sören Mindermann, Ilan Moscovitz, Alexa Y. Pan, Yarin Gal, Owain Evans, Jan Brauner

    Abstract: Large language models (LLMs) can "lie", which we define as outputting false statements despite "knowing" the truth in a demonstrable sense. LLMs might "lie", for example, when instructed to output misinformation. Here, we develop a simple lie detector that requires neither access to the LLM's activations (black-box) nor ground-truth knowledge of the fact in question. The detector works by asking a…

    Submitted 26 September, 2023; originally announced September 2023.

  48. arXiv:2309.12325  [pdf]

    cs.CY cs.AI cs.CV cs.LG

    FUTURE-AI: International consensus guideline for trustworthy and deployable artificial intelligence in healthcare

    Authors: Karim Lekadir, Aasa Feragen, Abdul Joseph Fofanah, Alejandro F Frangi, Alena Buyx, Anais Emelie, Andrea Lara, Antonio R Porras, An-Wen Chan, Arcadi Navarro, Ben Glocker, Benard O Botwe, Bishesh Khanal, Brigit Beger, Carol C Wu, Celia Cintas, Curtis P Langlotz, Daniel Rueckert, Deogratias Mzurikwao, Dimitrios I Fotiadis, Doszhan Zhussupov, Enzo Ferrante, Erik Meijering, Eva Weicken, Fabio A González, et al. (95 additional authors not shown)

    Abstract: Despite major advances in artificial intelligence (AI) for medicine and healthcare, the deployment and adoption of AI technologies remain limited in real-world clinical practice. In recent years, concerns have been raised about the technical, clinical, ethical and legal risks associated with medical AI. To increase real world adoption, it is essential that medical AI tools are trusted and accepted…

    Submitted 8 July, 2024; v1 submitted 11 August, 2023; originally announced September 2023.

    ACM Class: I.2.0; I.4.0; I.5.0

  49. arXiv:2309.05171  [pdf, other]

    math.CO

    Bounds on Kemeny's constant of a graph and the Nordhaus-Gaddum problem

    Authors: Sooyeong Kim, Neal Madras, Ada Chan, Mark Kempton, Stephen Kirkland, Adam Knudson

    Abstract: We study Nordhaus-Gaddum problems for Kemeny's constant $\mathcal{K}(G)$ of a connected graph $G$. We prove bounds on $\min\{\mathcal{K}(G),\mathcal{K}(\overline{G})\}$ and the product $\mathcal{K}(G)\mathcal{K}(\overline{G})$ for various families of graphs. In particular, we show that if the maximum degree of a graph $G$ on $n$ vertices is $n-O(1)$ or $n-Ω(n)$, then…

    Submitted 10 September, 2023; originally announced September 2023.

    MSC Class: 05C09; 60J10; 05C81; 05C50; 05A19

  50. arXiv:2308.16179  [pdf, other]

    quant-ph cond-mat.stat-mech hep-th nlin.CD

    Out-of-time-order correlator, many-body quantum chaos, light-like generators, and singular values

    Authors: Ke Huang, Xiao Li, David A. Huse, Amos Chan

    Abstract: We study out-of-time-order correlators (OTOCs) of local operators in spatial-temporal invariant or random quantum circuits using light-like generators (LLG) -- many-body operators that exist in and act along the light-like directions. We demonstrate that the OTOC can be approximated by the leading singular value of the LLG, which, for the case of generic many-body chaotic circuits, is increasingly…

    Submitted 9 October, 2023; v1 submitted 30 August, 2023; originally announced August 2023.

    Comments: 6 + 15 pages, 3 + 11 figures. Comments are welcome. Updated on 2023-10-10