Showing 1–50 of 104 results for author: Xu, A

Searching in archive cs. Search in all archives.
  1. arXiv:2407.00075  [pdf, other]

    cs.AI cs.CL cs.CR cs.LG

    Logicbreaks: A Framework for Understanding Subversion of Rule-based Inference

    Authors: Anton Xue, Avishree Khare, Rajeev Alur, Surbhi Goel, Eric Wong

    Abstract: We study how to subvert language models from following the rules. We model rule-following as inference in propositional Horn logic, a mathematical system in which rules have the form "if $P$ and $Q$, then $R$" for some propositions $P$, $Q$, and $R$. We prove that although transformers can faithfully abide by such rules, maliciously crafted prompts can nevertheless mislead even theoretically const…

    Submitted 21 June, 2024; originally announced July 2024.

  2. arXiv:2406.18848  [pdf, other]

    cs.LG

    Temporally Multi-Scale Sparse Self-Attention for Physical Activity Data Imputation

    Authors: Hui Wei, Maxwell A. Xu, Colin Samplawski, James M. Rehg, Santosh Kumar, Benjamin M. Marlin

    Abstract: Wearable sensors enable health researchers to continuously collect data pertaining to the physiological state of individuals in real-world settings. However, such data can be subject to extensive missingness due to a complex combination of factors. In this work, we study the problem of imputation of missing step count data, one of the most ubiquitous forms of wearable sensor data. We construct a n…

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Accepted by Conference on Health, Inference, and Learning (CHIL) 2024

  3. arXiv:2406.07890  [pdf, other]

    eess.AS cs.CL cs.LG

    Exploring Speech Foundation Models for Speaker Diarization in Child-Adult Dyadic Interactions

    Authors: Anfeng Xu, Kevin Huang, Tiantian Feng, Lue Shen, Helen Tager-Flusberg, Shrikanth Narayanan

    Abstract: Speech foundation models, trained on vast datasets, have opened unique opportunities in addressing challenging low-resource speech understanding, such as child speech. In this work, we explore the capabilities of speech foundation models on child-adult speaker diarization. We show that exemplary foundation models can achieve 39.5% and 62.3% relative reductions in Diarization Error Rate and Speaker…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Interspeech 2024

  4. arXiv:2406.02864  [pdf, other]

    cs.CL cs.AI

    NUMCoT: Numerals and Units of Measurement in Chain-of-Thought Reasoning using Large Language Models

    Authors: Ancheng Xu, Minghuan Tan, Lei Wang, Min Yang, Ruifeng Xu

    Abstract: Numeral systems and units of measurement are two conjoined topics in activities of human beings and have mutual effects with the languages expressing them. Currently, the evaluation of Large Language Models (LLMs) often involves mathematical reasoning, yet little attention is given to how minor changes in numbers or units can drastically alter the complexity of problems and the performance of LLMs…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Findings of ACL 2024

  5. arXiv:2405.20685  [pdf, other]

    cs.LG cs.CV

    Enhancing Counterfactual Image Generation Using Mahalanobis Distance with Distribution Preferences in Feature Space

    Authors: Yukai Zhang, Ao Xu, Zihao Li, Tieru Wu

    Abstract: In the realm of Artificial Intelligence (AI), the importance of Explainable Artificial Intelligence (XAI) is increasingly recognized, particularly as AI models become more integral to our lives. One notable single-instance XAI approach is counterfactual explanation, which aids users in comprehending a model's decisions and offers guidance on altering these decisions. Specifically in the context of…

    Submitted 31 May, 2024; originally announced May 2024.

  6. arXiv:2405.20664  [pdf, other]

    cs.LG

    Weak Robust Compatibility Between Learning Algorithms and Counterfactual Explanation Generation Algorithms

    Authors: Ao Xu, Tieru Wu

    Abstract: Counterfactual explanation generation is a powerful method for Explainable Artificial Intelligence. It can help users understand why machine learning models make specific decisions, and how to change those decisions. Evaluating the robustness of counterfactual explanation algorithms is therefore crucial. Previous literature has widely studied the robustness based on the perturbation of input insta…

    Submitted 31 May, 2024; originally announced May 2024.

  7. arXiv:2405.05789  [pdf, other]

    cs.CR math.NA

    High-Performance Privacy-Preserving Matrix Completion for Trajectory Recovery

    Authors: Jiahao Guo, An-Bao Xu

    Abstract: Matrix completion has important applications in trajectory recovery and mobile social networks. However, sending raw data containing personal, sensitive information to cloud computing nodes may lead to privacy exposure issues. Privacy-preserving matrix completion is a useful approach to perform matrix completion while preserving privacy. In this paper, we propose a high-performance method for pr…

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: 7 pages, 10 figures

  8. arXiv:2403.19225  [pdf, other]

    cs.CV

    Efficient and Effective Weakly-Supervised Action Segmentation via Action-Transition-Aware Boundary Alignment

    Authors: Angchi Xu, Wei-Shi Zheng

    Abstract: Weakly-supervised action segmentation is the task of learning to partition a long video into several action segments, where training videos are only accompanied by transcripts (ordered lists of actions). Most existing methods need to infer pseudo segmentation for training by serial alignment between all frames and the transcript, which is time-consuming and hard to parallelize while training.…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024

  9. arXiv:2402.16877  [pdf, other]

    cs.IR cs.AI cs.CL cs.LG

    Large Language Model Augmented Exercise Retrieval for Personalized Language Learning

    Authors: Austin Xu, Will Monroe, Klinton Bicknell

    Abstract: We study the problem of zero-shot exercise retrieval in the context of online language learning, to give learners the ability to explicitly request personalized exercises via natural language. Using real-world data collected from language learners, we observe that vector similarity approaches poorly capture the relationship between exercise content and the language that learners use to express wha…

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: Presented at Learning Analytics and Knowledge 2024. 11 pages, 4 figures, 5 tables

  10. arXiv:2402.06635  [pdf, other]

    q-fin.ST cs.CE cs.LG

    Large (and Deep) Factor Models

    Authors: Bryan Kelly, Boris Kuznetsov, Semyon Malamud, Teng Andrea Xu

    Abstract: We open up the black box behind Deep Learning for portfolio optimization and prove that a sufficiently wide and arbitrarily deep neural network (DNN) trained to maximize the Sharpe ratio of the Stochastic Discount Factor (SDF) is equivalent to a large factor model (LFM): A linear factor pricing model that uses many non-linear characteristics. The nature of these characteristics depends on the arch…

    Submitted 20 January, 2024; originally announced February 2024.

  11. arXiv:2312.04752  [pdf, other]

    cs.LG physics.geo-ph

    A Test-Time Learning Approach to Reparameterize the Geophysical Inverse Problem with a Convolutional Neural Network

    Authors: Anran Xu, Lindsey J. Heagy

    Abstract: Regularization is critical in solving ill-posed geophysical inversion problems. Explicit regularization is often used, but there are opportunities to explore the implicit regularization effect inherent in a Neural Network structure. Researchers in Computer Vision (CV) have discovered that the Convolutional Neural Network (CNN) architecture inherently enforces a regularization that is adva…

    Submitted 7 December, 2023; originally announced December 2023.

  12. arXiv:2311.18042  [pdf, other]

    quant-ph cs.PL

    Compilation for Surface Code Quantum Computers

    Authors: Abtin Molavi, Amanda Xu, Swamit Tannu, Aws Albarghouthi

    Abstract: Practical applications of quantum computing depend on fault-tolerant devices with error correction. Today, the most promising approach is a class of error-correcting codes called surface codes. In this paper, we study the problem of compiling quantum circuits for quantum computers implementing surface codes. The problem involves (1) mapping circuit qubits to the device qubits and (2) routing execu…

    Submitted 29 November, 2023; originally announced November 2023.

  13. arXiv:2311.11019  [pdf, other]

    cs.CV cs.LG cs.MM

    Hyperbolic Space with Hierarchical Margin Boosts Fine-Grained Learning from Coarse Labels

    Authors: Shu-Lin Xu, Yifan Sun, Faen Zhang, Anqi Xu, Xiu-Shen Wei, Yi Yang

    Abstract: Learning fine-grained embeddings from coarse labels is a challenging task due to limited label granularity supervision, i.e., lacking the detailed distinctions required for fine-grained tasks. The task becomes even more demanding when attempting few-shot fine-grained recognition, which holds practical significance in various applications. To address these challenges, we propose a novel method that…

    Submitted 18 November, 2023; originally announced November 2023.

    Comments: Accepted by NeurIPS 2023

  14. arXiv:2311.02253  [pdf, other]

    cs.LG cs.AI

    Comparative Knowledge Distillation

    Authors: Alex Wilf, Alex Tianyi Xu, Paul Pu Liang, Alexander Obolenskiy, Daniel Fried, Louis-Philippe Morency

    Abstract: In the era of large scale pretrained models, Knowledge Distillation (KD) serves an important role in transferring the wisdom of computationally heavy teacher models to lightweight, efficient student models while preserving performance. Traditional KD paradigms, however, assume readily available access to teacher models for frequent inference -- a notion increasingly at odds with the realities of c…

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: arXiv admin note: text overlap with arXiv:2310.13011

  15. arXiv:2311.00519  [pdf, other]

    cs.LG

    REBAR: Retrieval-Based Reconstruction for Time-series Contrastive Learning

    Authors: Maxwell A. Xu, Alexander Moreno, Hui Wei, Benjamin M. Marlin, James M. Rehg

    Abstract: The success of self-supervised contrastive learning hinges on identifying positive data pairs, such that when they are pushed together in embedding space, the space encodes useful information for subsequent downstream tasks. Constructing positive pairs is non-trivial as the pairing must be similar enough to reflect a shared semantic meaning, but different enough to capture within-class variation.…

    Submitted 16 March, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: ICLR 2024 | Code available at: https://github.com/maxxu05/rebar

    Journal ref: The Eleventh International Conference on Learning Representations (2024)

  16. arXiv:2310.01867  [pdf, other]

    eess.AS cs.SD

    Audio-visual child-adult speaker classification in dyadic interactions

    Authors: Anfeng Xu, Kevin Huang, Tiantian Feng, Helen Tager-Flusberg, Shrikanth Narayanan

    Abstract: Interactions involving children span a wide range of important domains from learning to clinical diagnostic and therapeutic contexts. Automated analyses of such interactions are motivated by the need to seek accurate insights and offer scale and robustness across diverse and wide-ranging conditions. Identifying the speech segments belonging to the child is a critical step in such modeling. Convent…

    Submitted 9 October, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: In review for ICASSP 2024, 5 pages

  17. arXiv:2309.04626  [pdf, other]

    stat.ML cs.AI cs.IT cs.LG

    Perceptual adjustment queries and an inverted measurement paradigm for low-rank metric learning

    Authors: Austin Xu, Andrew D. McRae, Jingyan Wang, Mark A. Davenport, Ashwin Pananjady

    Abstract: We introduce a new type of query mechanism for collecting human feedback, called the perceptual adjustment query (PAQ). Being both informative and cognitively lightweight, the PAQ adopts an inverted measurement scheme, and combines advantages from both cardinal and ordinal queries. We showcase the PAQ in the metric learning problem, where we collect PAQ measurements to learn an unknown Mahalanobi…

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: 42 pages, 6 figures

  18. arXiv:2308.14052  [pdf, other]

    cs.CV

    MM-AU: Towards Multimodal Understanding of Advertisement Videos

    Authors: Digbalay Bose, Rajat Hebbar, Tiantian Feng, Krishna Somandepalli, Anfeng Xu, Shrikanth Narayanan

    Abstract: Advertisement videos (ads) play an integral part in the domain of Internet e-commerce as they amplify the reach of particular products to a broad audience or can serve as a medium to raise awareness about specific issues through concise narrative structures. The narrative structures of advertisements involve several elements like reasoning about the broad content (topic and the underlying message)…

    Submitted 27 August, 2023; originally announced August 2023.

    Comments: Accepted to ACM Multimedia 2023

  19. arXiv:2307.06848  [pdf, other]

    cs.NI cs.LG math.NA

    Tensor Completion via Leverage Sampling and Tensor QR Decomposition for Network Latency Estimation

    Authors: Jun Lei, Ji-Qian Zhao, Jing-Qi Wang, An-Bao Xu

    Abstract: In this paper, we consider network latency estimation, as latency is an important metric for network performance. However, large-scale network latency estimation requires a lot of computing time. Therefore, we propose a new method that is much faster and maintains high accuracy. The data structure of network nodes can form a matrix, and the tensor model can be formed by introducing the t…

    Submitted 27 June, 2023; originally announced July 2023.

    Comments: 20 pages, 7 figures

  20. arXiv:2307.05902  [pdf, other]

    cs.LG cs.AI

    Stability Guarantees for Feature Attributions with Multiplicative Smoothing

    Authors: Anton Xue, Rajeev Alur, Eric Wong

    Abstract: Explanation methods for machine learning models tend not to provide any formal guarantees and may not reflect the underlying decision-making process. In this work, we analyze stability as a property for reliable feature attribution methods. We prove that relaxed variants of stability are guaranteed if the model is sufficiently Lipschitz with respect to the masking of features. We develop a smoothi…

    Submitted 26 October, 2023; v1 submitted 12 July, 2023; originally announced July 2023.

  21. arXiv:2306.14251  [pdf, other]

    cs.RO

    Optimal and Stable Multi-Layer Object Rearrangement on a Tabletop

    Authors: Andy Xu, Kai Gao, Si Wei Feng, Jingjin Yu

    Abstract: Object rearrangement is a fundamental sub-task in accomplishing a great many physical tasks. As such, effectively executing rearrangement is an important skill for intelligent robots to master. In this study, we conduct the first algorithmic study on optimally solving the problem of Multi-layer Object Rearrangement on a Tabletop (MORT), in which one object may be relocated at a time, and an object…

    Submitted 30 June, 2023; v1 submitted 25 June, 2023; originally announced June 2023.

    Comments: Accepted by 2023 IROS - IEEE/RSJ International Conference on Intelligent Robots

  22. arXiv:2305.19215  [pdf, other]

    stat.ML cs.LG

    dotears: Scalable, consistent DAG estimation using observational and interventional data

    Authors: Albert Xue, Jingyou Rao, Sriram Sankararaman, Harold Pimentel

    Abstract: New biological assays like Perturb-seq link highly parallel CRISPR interventions to a high-dimensional transcriptomic readout, providing insight into gene regulatory networks. Causal gene regulatory networks can be represented by directed acyclic graphs (DAGs), but learning DAGs from observational data is complicated by lack of identifiability and a combinatorial solution space. Score-based structu…

    Submitted 20 February, 2024; v1 submitted 30 May, 2023; originally announced May 2023.

  23. arXiv:2305.15591  [pdf, other]

    cs.LG

    Lightweight Learner for Shared Knowledge Lifelong Learning

    Authors: Yunhao Ge, Yuecheng Li, Di Wu, Ao Xu, Adam M. Jones, Amanda Sofie Rios, Iordanis Fostiropoulos, Shixian Wen, Po-Hsuan Huang, Zachary William Murdock, Gozde Sahin, Shuo Ni, Kiran Lekkala, Sumedh Anand Sontakke, Laurent Itti

    Abstract: In Lifelong Learning (LL), agents continually learn as they encounter new conditions and tasks. Most current LL is limited to a single agent that learns tasks sequentially. Dedicated LL machinery is then deployed to mitigate the forgetting of old tasks as new tasks are learned. This is inherently slow. We propose a new Shared Knowledge Lifelong Learning (SKILL) challenge, which deploys a decentral…

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: Transactions on Machine Learning Research (TMLR) paper

  24. arXiv:2305.14802  [pdf, other]

    cs.CL

    Estimating Large Language Model Capabilities without Labeled Test Data

    Authors: Harvey Yiyun Fu, Qinyuan Ye, Albert Xu, Xiang Ren, Robin Jia

    Abstract: Large Language Models (LLMs) have the impressive ability to perform in-context learning (ICL) from only a few examples, but the success of ICL varies widely from task to task. Thus, it is important to quickly determine whether ICL is applicable to a new task, but directly evaluating ICL accuracy can be expensive in situations where test data is expensive to annotate -- the exact situations where I…

    Submitted 26 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Accepted to EMNLP 2023 Findings. Camera-ready version. Code: https://github.com/harvey-fin/icl-estimate

  25. arXiv:2305.14117  [pdf, other]

    eess.AS cs.LG

    Understanding Spoken Language Development of Children with ASD Using Pre-trained Speech Embeddings

    Authors: Anfeng Xu, Rajat Hebbar, Rimita Lahiri, Tiantian Feng, Lindsay Butler, Lue Shen, Helen Tager-Flusberg, Shrikanth Narayanan

    Abstract: Speech processing techniques are useful for analyzing speech and language development in children with Autism Spectrum Disorder (ASD), who are often varied and delayed in acquiring these skills. Early identification and intervention are crucial, but traditional assessment methodologies such as caregiver reports are not adequate for the requisite behavioral phenotyping. Natural Language Sample (NLS…

    Submitted 31 May, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted to Interspeech 2023, 5 pages

  26. arXiv:2304.11795  [pdf, ps, other]

    math.CO cs.DM

    Fractional eternal domination: securely distributing resources across a network

    Authors: Fnu Devvrit, Aaron Krim-Yee, Nithish Kumar, Gary MacGillivray, Ben Seamone, Virgélot Virgile, AnQi Xu

    Abstract: This paper initiates the study of fractional eternal domination in graphs, a natural relaxation of the well-studied eternal domination problem. We study the connections to flows and linear programming in order to obtain results on the complexity of determining the fractional eternal domination number of a graph $G$, which we denote $γ_{\,\textit{f}}^{\infty}(G)$. We study the behaviour of…

    Submitted 23 April, 2023; originally announced April 2023.

    Comments: 32 pages, including appendix

    MSC Class: 05C57; 05C69; 05C72; 05C21; 90C05; 91A24; 49N75

  27. arXiv:2303.09158  [pdf, other]

    cs.CV

    Facial Affect Recognition based on Transformer Encoder and Audiovisual Fusion for the ABAW5 Challenge

    Authors: Ziyang Zhang, Liuwei An, Zishun Cui, Ao Xu, Tengteng Dong, Yueqi Jiang, Jingyi Shi, Xin Liu, Xiao Sun, Meng Wang

    Abstract: In this paper, we present our solutions for the 5th Workshop and Competition on Affective Behavior Analysis in-the-wild (ABAW), which includes four sub-challenges of Valence-Arousal (VA) Estimation, Expression (Expr) Classification, Action Unit (AU) Detection and Emotional Reaction Intensity (ERI) Estimation. The 5th ABAW competition focuses on facial affect recognition utilizing different modalit…

    Submitted 20 March, 2023; v1 submitted 16 March, 2023; originally announced March 2023.

  28. arXiv:2302.03264  [pdf, other]

    cs.CV cs.LG

    Delving Deep into Simplicity Bias for Long-Tailed Image Recognition

    Authors: Xiu-Shen Wei, Xuhao Sun, Yang Shen, Anqi Xu, Peng Wang, Faen Zhang

    Abstract: Simplicity Bias (SB) is a phenomenon in which deep neural networks tend to rely on simpler predictive patterns and ignore some complex features when applied to supervised discriminative tasks. In this work, we investigate SB in long-tailed image recognition and find the tail classes suffer more severely from SB, which harms the generalization performance of such underrepresented classes. We…

    Submitted 7 February, 2023; originally announced February 2023.

  29. arXiv:2301.11414  [pdf, other]

    cs.LG math.OC

    A Simple Algorithm For Scaling Up Kernel Methods

    Authors: Teng Andrea Xu, Bryan Kelly, Semyon Malamud

    Abstract: The recent discovery of the equivalence between infinitely wide neural networks (NNs) in the lazy training regime and Neural Tangent Kernels (NTKs) (Jacot et al., 2018) has revived interest in kernel methods. However, conventional wisdom suggests kernel methods are unsuitable for large samples due to their computational complexity and memory requirements. We introduce a novel random feature regres…

    Submitted 30 January, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

  30. Autonomous Material Composite Morphing Wing

    Authors: Daniel Morton, Artemis Xu, Alberto Matute, Robert F. Shepherd

    Abstract: Aeronautics research has continually sought to achieve the adaptability and morphing performance of avian wings, but in practice, wings of all scales continue to use the same hinged control-surface embodiment. Recent research into compliant and bio-inspired mechanisms for morphing wings and control surfaces has indicated promising results, though often these are mechanically complex, or limited in…

    Submitted 18 January, 2023; originally announced January 2023.

  31. arXiv:2301.00074  [pdf, other]

    cs.CC cs.AI cs.DS cs.SC

    Matrix Multiplication: Verifying Strong Uniquely Solvable Puzzles

    Authors: Matthew Anderson, Zongliang Ji, Anthony Yang Xu

    Abstract: Cohn and Umans proposed a framework for developing fast matrix multiplication algorithms based on embedding the computation in certain group algebras. In subsequent work with Kleinberg and Szegedy, they connected this to the search for combinatorial objects called strong uniquely solvable puzzles (strong USPs). We begin a systematic computer-aided search for these objects. We develop and implemen…

    Submitted 30 December, 2022; originally announced January 2023.

    Comments: 35 pages, 7 figures, full version of SAT 2020 extended abstract

    ACM Class: F.2.1; I.2.8; G.4; I.3.2

  32. arXiv:2301.00032  [pdf, other]

    cs.LG cs.AI math.OC math.ST

    Bayesian Learning for Dynamic Inference

    Authors: Aolin Xu, Peng Guan

    Abstract: Traditional statistical inference is static, in the sense that the estimate of the quantity of interest does not affect the future evolution of the quantity. In some sequential estimation problems, however, the future values of the quantity to be estimated depend on the estimate of its current value. This type of estimation problem has been formulated as the dynamic inference problem. In this…

    Submitted 30 December, 2022; originally announced January 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2111.14746

  33. arXiv:2212.12645  [pdf, other]

    cs.CV cs.LG

    HandsOff: Labeled Dataset Generation With No Additional Human Annotations

    Authors: Austin Xu, Mariya I. Vasileva, Achal Dave, Arjun Seshadri

    Abstract: Recent work leverages the expressive power of generative adversarial networks (GANs) to generate labeled synthetic datasets. These dataset generation methods often require new annotations of synthetic images, which forces practitioners to seek out annotators, curate a set of synthetic images, and ensure the quality of generated labels. We introduce the HandsOff framework, a technique capable of pr…

    Submitted 30 March, 2023; v1 submitted 23 December, 2022; originally announced December 2022.

    Comments: 22 pages, 20 figures. CVPR 2023

  34. arXiv:2212.07514  [pdf, other]

    cs.LG cs.AI

    PulseImpute: A Novel Benchmark Task for Pulsative Physiological Signal Imputation

    Authors: Maxwell A. Xu, Alexander Moreno, Supriya Nagesh, V. Burak Aydemir, David W. Wetter, Santosh Kumar, James M. Rehg

    Abstract: The promise of Mobile Health (mHealth) is the ability to use wearable sensors to monitor participant physiology at high frequencies during daily life to enable temporally-precise health interventions. However, a major challenge is frequent missing data. Despite a rich imputation literature, existing techniques are ineffective for the pulsative signals which comprise many mHealth applications, and…

    Submitted 15 December, 2023; v1 submitted 14 December, 2022; originally announced December 2022.

    Comments: NeurIPS 2022 | Code available at: https://github.com/rehg-lab/pulseimpute | Data available at: https://doi.org/10.5281/zenodo.7129964

    Journal ref: Advances in Neural Information Processing Systems 35 (2022) 26874-26888

  35. arXiv:2211.15718  [pdf, other]

    cs.CL

    Contrastive Novelty-Augmented Learning: Anticipating Outliers with Large Language Models

    Authors: Albert Xu, Xiang Ren, Robin Jia

    Abstract: In many task settings, text classification models are likely to encounter examples from novel classes on which they cannot predict correctly. Selective prediction, in which models abstain on low-confidence examples, provides a possible solution, but existing models are often overly confident on unseen classes. To remedy this overconfidence, we introduce Contrastive Novelty-Augmented Learning (CoNA…

    Submitted 26 May, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

    Comments: ACL 2023

  36. arXiv:2211.09691  [pdf, other]

    cs.PL quant-ph

    Synthesizing Quantum-Circuit Optimizers

    Authors: Amanda Xu, Abtin Molavi, Lauren Pick, Swamit Tannu, Aws Albarghouthi

    Abstract: Near-term quantum computers are expected to work in an environment where each operation is noisy, with no error correction. Therefore, quantum-circuit optimizers are applied to minimize the number of noisy operations. Today, physicists are constantly experimenting with novel devices and architectures. For every new physical substrate and for every modification of a quantum computer, we need to mod…

    Submitted 10 May, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

    Comments: Full version of PLDI 2023 paper

  37. arXiv:2211.04020  [pdf, other]

    q-bio.QM cs.LG q-bio.GN q-bio.TO

    Generating counterfactual explanations of tumor spatial proteomes to discover effective strategies for enhancing immune infiltration

    Authors: Zitong Jerry Wang, Alexander M. Xu, Aman Bhargava, Matt W. Thomson

    Abstract: The tumor microenvironment (TME) significantly impacts cancer prognosis due to its immune composition. While therapies for altering the immune composition, including immunotherapies, have shown exciting results for treating hematological cancers, they are less effective for immunologically-cold, solid tumors. Spatial omics technologies capture the spatial organization of the TME with unprecedented…

    Submitted 13 October, 2023; v1 submitted 8 November, 2022; originally announced November 2022.

  38. arXiv:2211.03160  [pdf, ps, other]

    physics.flu-dyn cs.CE physics.comp-ph

    Multi-GPU thermal lattice Boltzmann simulations using OpenACC and MPI

    Authors: Ao Xu, Bo-Tao Li

    Abstract: We assess the performance of the hybrid Open Accelerator (OpenACC) and Message Passing Interface (MPI) approach for multi-graphics processing units (GPUs) accelerated thermal lattice Boltzmann (LB) simulation. OpenACC accelerates computation on a single GPU, and MPI synchronizes the information between multiple GPUs. With a single GPU, the two-dimensional (2D) simulation achieved 1.93 billio…

    Submitted 17 November, 2022; v1 submitted 6 November, 2022; originally announced November 2022.

    Comments: 31 pages, 12 figures

    Journal ref: International Journal of Heat and Mass Transfer 2022, 201, 123649

  39. arXiv:2210.11173  [pdf, other]

    cs.LG

    Mathematical Justification of Hard Negative Mining via Isometric Approximation Theorem

    Authors: Albert Xu, Jhih-Yi Hsieh, Bhaskar Vundurthy, Eliana Cohen, Howie Choset, Lu Li

    Abstract: In deep metric learning, the Triplet Loss has emerged as a popular method to learn many computer vision and natural language processing tasks such as facial recognition, object detection, and visual-semantic embeddings. One issue that plagues the Triplet Loss is network collapse, an undesirable phenomenon where the network projects the embeddings of all data onto a single point. Researchers predom…

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: 9 pages, 6 figures, submitted to AAAI 2023

  40. arXiv:2210.00637  [pdf, other]

    cs.LG cs.AI

    Benign Autoencoders

    Authors: Semyon Malamud, Teng Andrea Xu, Antoine Didisheim

    Abstract: Recent progress in Generative Artificial Intelligence (AI) relies on efficient data representations, often featuring encoder-decoder architectures. We formalize the mathematical problem of finding the optimal encoder-decoder pair and characterize its solution, which we name the "benign autoencoder" (BAE). We prove that BAE projects data onto a manifold whose dimension is the optimal compressibilit…

    Submitted 28 August, 2023; v1 submitted 2 October, 2022; originally announced October 2022.

    Comments: This paper replaces and subsumes arXiv:2110.08884

  41. arXiv:2208.13679  [pdf, other]

    cs.AR quant-ph

    Qubit Mapping and Routing via MaxSAT

    Authors: Abtin Molavi, Amanda Xu, Martin Diges, Lauren Pick, Swamit Tannu, Aws Albarghouthi

    Abstract: Near-term quantum computers will operate in a noisy environment, without error correction. A critical problem for near-term quantum computing is laying out a logical circuit onto a physical device with limited connectivity between qubits. This is known as the qubit mapping and routing (QMR) problem, an intractable combinatorial problem. It is important to solve QMR as optimally as possible to redu…

    Submitted 29 August, 2022; originally announced August 2022.

  42. arXiv:2206.06469  [pdf]

    cs.LG stat.ML

    Invariant Structure Learning for Better Generalization and Causal Explainability

    Authors: Yunhao Ge, Sercan Ö. Arik, Jinsung Yoon, Ao Xu, Laurent Itti, Tomas Pfister

    Abstract: Learning the causal structure behind data is invaluable for improving generalization and obtaining high-quality explanations. We propose a novel framework, Invariant Structure Learning (ISL), that is designed to improve causal structure discovery by utilizing generalization as an indication. ISL splits the data into different environments, and learns a structure that is invariant to the target acr…

    Submitted 13 June, 2022; originally announced June 2022.

    Comments: 16 pages (including Appendix), 4 figures

  43. arXiv:2206.03482  [pdf, other]

    cs.LG math.OC

    Chordal Sparsity for SDP-based Neural Network Verification

    Authors: Anton Xue, Lars Lindemann, Rajeev Alur

    Abstract: Neural networks are central to many emerging technologies, but verifying their correctness remains a major challenge. It is known that network outputs can be sensitive and fragile to even small input perturbations, thereby increasing the risk of unpredictable and undesirable behavior. Fast and accurate verification of neural networks is therefore critical to their widespread adoption, and in recen…

    Submitted 8 January, 2024; v1 submitted 7 June, 2022; originally announced June 2022.

  44. arXiv:2205.10940  [pdf, other]

    cs.RO cs.LG

    Toward smart composites: small-scale, untethered prediction and control for soft sensor/actuator systems

    Authors: Sarah Aguasvivas Manzano, Vani Sundaram, Artemis Xu, Khoi Ly, Mark Rentschler, Robert Shepherd, Nikolaus Correll

    Abstract: We present formulation and open-source tools to achieve in-material model predictive control of sensor/actuator systems using learned forward kinematics and on-device computation. Microcontroller units (MCUs) that compute the prediction and control task while colocated with the sensors and actuators enable in-material untethered behaviors. In this approach, small parameter size neural network mode…

    Submitted 22 August, 2022; v1 submitted 22 May, 2022; originally announced May 2022.

    Comments: Accepted for publication at the Journal of Composite Materials. Special Issue: Multifunctional Composites for Autonomic, Adaptive and Self-Sustaining Systems

  45. arXiv:2205.09665  [pdf, other]

    cs.CL

    Automated Crossword Solving

    Authors: Eric Wallace, Nicholas Tomlin, Albert Xu, Kevin Yang, Eshaan Pathak, Matthew Ginsberg, Dan Klein

    Abstract: We present the Berkeley Crossword Solver, a state-of-the-art approach for automatically solving crossword puzzles. Our system works by generating answer candidates for each crossword clue using neural question answering models and then combines loopy belief propagation with local search to find full puzzle solutions. Compared to existing approaches, our system improves exact puzzle accuracy from 7…

    Submitted 3 July, 2022; v1 submitted 19 May, 2022; originally announced May 2022.

    Comments: ACL 2022

  46. arXiv:2204.09381  [pdf, other]

    cs.SD cs.CL eess.AS

    Exploration strategies for articulatory synthesis of complex syllable onsets

    Authors: Daniel R. van Niekerk, Anqi Xu, Branislav Gerazov, Paul K. Krug, Peter Birkholz, Yi Xu

    Abstract: High-quality articulatory speech synthesis has many potential applications in speech science and technology. However, developing appropriate mappings from linguistic specification to articulatory gestures is difficult and time consuming. In this paper we construct an optimisation-based framework as a first step towards learning these mappings without manual intervention. We demonstrate the product…

    Submitted 30 June, 2022; v1 submitted 20 April, 2022; originally announced April 2022.

    Comments: Accepted at Interspeech 2022

  47. arXiv:2204.00846  [pdf, other]

    cs.LG

    Chordal Sparsity for Lipschitz Constant Estimation of Deep Neural Networks

    Authors: Anton Xue, Lars Lindemann, Alexander Robey, Hamed Hassani, George J. Pappas, Rajeev Alur

    Abstract: Lipschitz constants of neural networks allow for guarantees of robustness in image classification, safety in controller design, and generalizability beyond the training data. As calculating Lipschitz constants is NP-hard, techniques for estimating Lipschitz constants must navigate the trade-off between scalability and accuracy. In this work, we significantly push the scalability frontier of a semi…

    Submitted 8 January, 2024; v1 submitted 2 April, 2022; originally announced April 2022.

  48. arXiv:2203.15709  [pdf, other]

    cs.CV

    OakInk: A Large-scale Knowledge Repository for Understanding Hand-Object Interaction

    Authors: Lixin Yang, Kailin Li, Xinyu Zhan, Fei Wu, Anran Xu, Liu Liu, Cewu Lu

    Abstract: Learning how humans manipulate objects requires machines to acquire knowledge from two perspectives: one for understanding object affordances and the other for learning human's interactions based on the affordances. Even though these two knowledge bases are crucial, we find that current databases lack a comprehensive awareness of them. In this work, we propose a multi-modal and rich-annotated know…

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: Accepted by CVPR 2022

  49. arXiv:2203.10144  [pdf, other]

    cs.CV

    Closing the Generalization Gap of Cross-silo Federated Medical Image Segmentation

    Authors: An Xu, Wenqi Li, Pengfei Guo, Dong Yang, Holger Roth, Ali Hatamizadeh, Can Zhao, Daguang Xu, Heng Huang, Ziyue Xu

    Abstract: Cross-silo federated learning (FL) has attracted much attention in medical imaging analysis with deep learning in recent years as it can resolve the critical issues of insufficient data, data privacy, and training efficiency. However, there can be a generalization gap between the model trained from FL and the one from centralized training. This important issue comes from the non-iid data distribut…

    Submitted 23 February, 2023; v1 submitted 18 March, 2022; originally announced March 2022.

    Comments: CVPR 2022

  50. arXiv:2203.06338  [pdf, other]

    eess.IV cs.CV

    Auto-FedRL: Federated Hyperparameter Optimization for Multi-institutional Medical Image Segmentation

    Authors: Pengfei Guo, Dong Yang, Ali Hatamizadeh, An Xu, Ziyue Xu, Wenqi Li, Can Zhao, Daguang Xu, Stephanie Harmon, Evrim Turkbey, Baris Turkbey, Bradford Wood, Francesca Patella, Elvira Stellato, Gianpaolo Carrafiello, Vishal M. Patel, Holger R. Roth

    Abstract: Federated learning (FL) is a distributed machine learning technique that enables collaborative model training while avoiding explicit data sharing. The inherent privacy-preserving property of FL algorithms makes them especially attractive to the medical field. However, in case of heterogeneous client data distributions, standard FL methods are unstable and require intensive hyperparameter tuning t…

    Submitted 31 August, 2022; v1 submitted 11 March, 2022; originally announced March 2022.