Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 214 results for author: Cohen, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2410.10614  [pdf, other

    cs.CE cs.AI cs.CL q-fin.CP

    Modeling News Interactions and Influence for Financial Market Prediction

    Authors: Mengyu Wang, Shay B. Cohen, Tiejun Ma

    Abstract: The diffusion of financial news into market prices is a complex process, making it challenging to evaluate the connections between news events and market movements. This paper introduces FININ (Financial Interconnected News Influence Network), a novel market prediction model that captures not only the links between news and prices but also the interactions among news items themselves. FININ effect… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: Accepted by EMNLP 2024

  2. arXiv:2410.10336  [pdf, other

    cs.AI cs.CL cs.LG cs.SC

    CoMAT: Chain of Mathematically Annotated Thought Improves Mathematical Reasoning

    Authors: Joshua Ong Jun Leang, Aryo Pradipta Gema, Shay B. Cohen

    Abstract: Mathematical reasoning remains a significant challenge for large language models (LLMs), despite progress in prompting techniques such as Chain-of-Thought (CoT). We present Chain of Mathematically Annotated Thought (CoMAT), which enhances reasoning through two stages: Symbolic Conversion (converting natural language queries into symbolic form) and Reasoning Execution (deriving answers from symboli… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: 8 pages, 12 figures

  3. arXiv:2410.08811  [pdf, other

    cs.CR cs.AI cs.CL

    PoisonBench: Assessing Large Language Model Vulnerability to Data Poisoning

    Authors: Tingchen Fu, Mrinank Sharma, Philip Torr, Shay B. Cohen, David Krueger, Fazl Barez

    Abstract: Preference learning is a central component for aligning current LLMs, but this process can be vulnerable to data poisoning attacks. To address this concern, we introduce PoisonBench, a benchmark for evaluating large language models' susceptibility to data poisoning during preference learning. Data poisoning attacks can manipulate large language model responses to include hidden malicious content o… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

    Comments: Tingchen Fu and Fazl Barez are core research contributors

  4. arXiv:2409.19431  [pdf, ps, other

    stat.ML cs.IT cs.LG

    Generalization Error of the Tilted Empirical Risk

    Authors: Gholamali Aminian, Amir R. Asadi, Tian Li, Ahmad Beirami, Gesine Reinert, Samuel N. Cohen

    Abstract: The generalization error (risk) of a supervised statistical learning algorithm quantifies its prediction ability on previously unseen data. Inspired by exponential tilting, Li et al. (2021) proposed the tilted empirical risk as a non-linear risk metric for machine learning applications such as classification and regression problems. In this work, we examine the generalization error of the tilted e… ▽ More

    Submitted 17 October, 2024; v1 submitted 28 September, 2024; originally announced September 2024.

    Comments: New results are added

  5. arXiv:2409.08045  [pdf, other

    cs.CR cs.AI

    Unleashing Worms and Extracting Data: Escalating the Outcome of Attacks against RAG-based Inference in Scale and Severity Using Jailbreaking

    Authors: Stav Cohen, Ron Bitton, Ben Nassi

    Abstract: In this paper, we show that with the ability to jailbreak a GenAI model, attackers can escalate the outcome of attacks against RAG-based GenAI-powered applications in severity and scale. In the first part of the paper, we show that attackers can escalate RAG membership inference attacks and RAG entity extraction attacks to RAG documents extraction attacks, forcing a more severe outcome compared to… ▽ More

    Submitted 12 September, 2024; originally announced September 2024.

    Comments: for Github, see https://github.com/StavC/UnleashingWorms-ExtractingData

  6. arXiv:2408.11081  [pdf, other

    cs.SE cs.AI cs.CL cs.LG

    What can Large Language Models Capture about Code Functional Equivalence?

    Authors: Nickil Maveli, Antonio Vergari, Shay B. Cohen

    Abstract: Code-LLMs, LLMs pre-trained on large code corpora, have shown great progress in learning rich representations of the structure and syntax of code, successfully using it to generate or classify code fragments. At the same time, understanding if they are able to do so because they capture code semantics, and how well, is still an open question. In this paper, we tackle this problem by introducing Se… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: 37 pages

  7. arXiv:2408.10014  [pdf, other

    cs.DS

    Improved Distance (Sensitivity) Oracles with Subquadratic Space

    Authors: Davide Bilò, Shiri Chechik, Keerti Choudhary, Sarel Cohen, Tobias Friedrich, Martin Schirneck

    Abstract: A distance oracle (DO) with stretch $(α, β)$ for a graph $G$ is a data structure that, when queried with vertices $s$ and $t$, returns a value $\widehat{d}(s,t)$ such that $d(s,t) \le \widehat{d}(s,t) \le α\cdot d(s,t) + β$. An $f$-edge fault-tolerant distance sensitivity oracle ($f$-DSO) additionally receives a set $F$ of up to $f$ edges and estimates the $s$-$t$-distance in $G{-}F$. Our first co… ▽ More

    Submitted 20 August, 2024; v1 submitted 19 August, 2024; originally announced August 2024.

    Comments: An extended abstract of this work appeared at FOCS 2024

  8. arXiv:2408.05061  [pdf, other

    cs.CR cs.AI

    A Jailbroken GenAI Model Can Cause Substantial Harm: GenAI-powered Applications are Vulnerable to PromptWares

    Authors: Stav Cohen, Ron Bitton, Ben Nassi

    Abstract: In this paper we argue that a jailbroken GenAI model can cause substantial harm to GenAI-powered applications and facilitate PromptWare, a new type of attack that flips the GenAI model's behavior from serving an application to attacking it. PromptWare exploits user inputs to jailbreak a GenAI model to force/perform malicious activity within the context of a GenAI-powered application. First, we int… ▽ More

    Submitted 9 August, 2024; originally announced August 2024.

    Comments: Website, see https://sites.google.com/view/promptware

  9. arXiv:2408.03866  [pdf

    cs.DB cs.AI cs.LO

    Mapping the Provenance Ontology to Basic Formal Ontology

    Authors: Tim Prudhomme, Giacomo De Colle, Austin Liebers, Alec Sculley, Peihong, Xie, Sydney Cohen, John Beverley

    Abstract: The Provenance Ontology (PROV-O) is a World Wide Web Consortium (W3C) recommended ontology used to structure data about provenance across a wide variety of domains. Basic Formal Ontology (BFO) is a top-level ontology ISO/IEC standard used to structure a wide variety of ontologies, such as the OBO Foundry ontologies and the Common Core Ontologies (CCO). To enhance interoperability between these two… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

    Comments: 28 pages, 10 figures

  10. arXiv:2407.14436  [pdf, other

    cs.GT

    Integrated Resource Allocation and Strategy Synthesis in Safety Games on Graphs with Deception

    Authors: Abhishek N. Kulkarni, Matthew S. Cohen, Charles A. Kamhoua, Jie Fu

    Abstract: Deception plays a crucial role in strategic interactions with incomplete information. Motivated by security applications, we study a class of two-player turn-based deterministic games with one-sided incomplete information, in which player 1 (P1) aims to prevent player 2 (P2) from reaching a set of target states. In addition to actions, P1 can place two kinds of deception resources: "traps" and "fa… ▽ More

    Submitted 19 July, 2024; originally announced July 2024.

    Comments: 37 pages, 7 figures

  11. arXiv:2407.07543  [pdf, other

    cs.DS

    A New Approach for Approximating Directed Rooted Networks

    Authors: Sarel Cohen, Lior Kamma, Aikaterini Niklanovits

    Abstract: We consider the k-outconnected directed Steiner tree problem (k-DST). Given a directed edge-weighted graph $G=(V,E,w)$, where $V=\{r\}\cup S \cup T$, and an integer $k$, the goal is to find a minimum cost subgraph of $G$ in which there are $k$ edge-disjoint $rt$-paths for every terminal $t\in T$. The problem is know to be NP-hard. Furthermore, the question on whether a polynomial time, subpolynomi… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  12. arXiv:2407.03277  [pdf, other

    cs.CL

    Evaluating Automatic Metrics with Incremental Machine Translation Systems

    Authors: Guojun Wu, Shay B. Cohen, Rico Sennrich

    Abstract: We introduce a dataset comprising commercial machine translations, gathered weekly over six years across 12 translation directions. Since human A/B testing is commonly used, we assume commercial systems improve over time, which enables us to evaluate machine translation (MT) metrics based on their preference for more recent translations. Our study not only confirms several prior findings, such as… ▽ More

    Submitted 3 October, 2024; v1 submitted 3 July, 2024; originally announced July 2024.

  13. arXiv:2405.20838  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    einspace: Searching for Neural Architectures from Fundamental Operations

    Authors: Linus Ericsson, Miguel Espinosa, Chenhongyi Yang, Antreas Antoniou, Amos Storkey, Shay B. Cohen, Steven McDonagh, Elliot J. Crowley

    Abstract: Neural architecture search (NAS) finds high performing networks for a given task. Yet the results of NAS are fairly prosaic; they did not e.g. create a shift from convolutional structures to transformers. This is not least because the search spaces in NAS often aren't diverse enough to include such transformations a priori. Instead, for NAS to provide greater potential for fundamental design shift… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: Project page at https://linusericsson.github.io/einspace/

  14. arXiv:2405.09719  [pdf, other

    cs.CL cs.AI cs.LG

    Spectral Editing of Activations for Large Language Model Alignment

    Authors: Yifu Qiu, Zheng Zhao, Yftah Ziser, Anna Korhonen, Edoardo M. Ponti, Shay B. Cohen

    Abstract: Large language models (LLMs) often exhibit undesirable behaviours, such as generating untruthful or biased content. Editing their internal representations has been shown to be effective in mitigating such behaviours on top of the existing alignment methods. We propose a novel inference-time editing method, namely spectral editing of activations (SEA), to project the input representations into dire… ▽ More

    Submitted 25 May, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

  15. arXiv:2404.16123  [pdf, other

    cs.CV cs.AI cs.CL

    FairDeDup: Detecting and Mitigating Vision-Language Fairness Disparities in Semantic Dataset Deduplication

    Authors: Eric Slyman, Stefan Lee, Scott Cohen, Kushal Kafle

    Abstract: Recent dataset deduplication techniques have demonstrated that content-aware dataset pruning can dramatically reduce the cost of training Vision-Language Pretrained (VLP) models without significant performance losses compared to training on the original dataset. These results have been based on pruning commonly used image-caption datasets collected from the web -- datasets that are known to harbor… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: Conference paper at CVPR 2024. 6 pages, 8 figures. Project Page: https://ericslyman.com/fairdedup/

    ACM Class: I.4.10; I.2.7; E.0

  16. arXiv:2404.14715  [pdf, other

    cs.CV cs.CL

    FINEMATCH: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction

    Authors: Hang Hua, Jing Shi, Kushal Kafle, Simon Jenni, Daoan Zhang, John Collomosse, Scott Cohen, Jiebo Luo

    Abstract: Recent progress in large-scale pre-training has led to the development of advanced vision-language models (VLMs) with remarkable proficiency in comprehending and generating multimodal content. Despite the impressive ability to perform complex reasoning for VLMs, current models often struggle to effectively and precisely capture the compositional information on both the image and text sides. To add… ▽ More

    Submitted 19 July, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: ECCV 2024

  17. arXiv:2403.13312  [pdf, other

    cs.CL

    LeanReasoner: Boosting Complex Logical Reasoning with Lean

    Authors: Dongwei Jiang, Marcio Fonseca, Shay B. Cohen

    Abstract: Large language models (LLMs) often struggle with complex logical reasoning due to logical inconsistencies and the inherent difficulty of such reasoning. We use Lean, a theorem proving framework, to address these challenges. By formalizing logical reasoning problems into theorems within Lean, we can solve them by proving or disproving the corresponding theorems. This method reduces the risk of logi… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: Accepted to NAACL 2024 main conference

  18. arXiv:2403.10701  [pdf, other

    cs.CV

    IMPRINT: Generative Object Compositing by Learning Identity-Preserving Representation

    Authors: Yizhi Song, Zhifei Zhang, Zhe Lin, Scott Cohen, Brian Price, Jianming Zhang, Soo Ye Kim, He Zhang, Wei Xiong, Daniel Aliaga

    Abstract: Generative object compositing emerges as a promising new avenue for compositional image editing. However, the requirement of object identity preservation poses a significant challenge, limiting practical usage of most existing methods. In response, this paper introduces IMPRINT, a novel diffusion-based generative model trained with a two-stage learning framework that decouples learning of identity… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  19. arXiv:2403.08828  [pdf, other

    cs.HC cs.AI cs.RO

    People Attribute Purpose to Autonomous Vehicles When Explaining Their Behavior

    Authors: Balint Gyevnar, Stephanie Droop, Tadeg Quillien, Shay B. Cohen, Neil R. Bramley, Christopher G. Lucas, Stefano V. Albrecht

    Abstract: Cognitive science can help us understand which explanations people might expect, and in which format they frame these explanations, whether causal, counterfactual, or teleological (i.e., purpose-oriented). Understanding the relevance of these concepts is crucial for building good explainable AI (XAI) which offers recourse and actionability. Focusing on autonomous driving, a complex decision-making… ▽ More

    Submitted 30 April, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  20. arXiv:2403.02817  [pdf, other

    cs.CR

    Here Comes The AI Worm: Unleashing Zero-click Worms that Target GenAI-Powered Applications

    Authors: Stav Cohen, Ron Bitton, Ben Nassi

    Abstract: In the past year, numerous companies have incorporated Generative AI (GenAI) capabilities into new and existing applications, forming interconnected Generative AI (GenAI) ecosystems consisting of semi/fully autonomous agents powered by GenAI services. While ongoing research highlighted risks associated with the GenAI layer of agents (e.g., dialog poisoning, membership inference, prompt leaking, ja… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: Website: https://sites.google.com/view/compromptmized

  21. arXiv:2402.17783  [pdf, other

    eess.SP cs.AI cs.LG

    BagStacking: An Integrated Ensemble Learning Approach for Freezing of Gait Detection in Parkinson's Disease

    Authors: Seffi Cohen, Lior Rokach

    Abstract: This paper introduces BagStacking, a novel ensemble learning method designed to enhance the detection of Freezing of Gait (FOG) in Parkinson's Disease (PD) by using a lower-back sensor to track acceleration. Building on the principles of bagging and stacking, BagStacking aims to achieve the variance reduction benefit of bagging's bootstrap sampling while also learning sophisticated blending throug… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  22. arXiv:2402.15055  [pdf, other

    cs.CL cs.AI cs.LG

    Interpreting Context Look-ups in Transformers: Investigating Attention-MLP Interactions

    Authors: Clement Neo, Shay B. Cohen, Fazl Barez

    Abstract: In this paper, we investigate the interplay between attention heads and specialized "next-token" neurons in the Multilayer Perceptron that predict specific tokens. By prompting an LLM like GPT-4 to explain these model internals, we can elucidate attention mechanisms that activate certain next-token neurons. Our analysis identifies attention heads that recognize contexts relevant to predicting a pa… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: 15 pages, 11 figures

  23. arXiv:2402.10643  [pdf, other

    cs.CL cs.AI

    `Keep it Together': Enforcing Cohesion in Extractive Summaries by Simulating Human Memory

    Authors: Ronald Cardenas, Matthias Galle, Shay B. Cohen

    Abstract: Extractive summaries are usually presented as lists of sentences with no expected cohesion between them. In this paper, we aim to enforce cohesion whilst controlling for informativeness and redundancy in summaries, in cases where the input exhibits high redundancy. The pipeline controls for redundancy in long inputs as it is consumed, and balances informativeness and cohesion during sentence selec… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  24. arXiv:2402.07025  [pdf, other

    stat.ML cs.IT cs.LG

    Generalization Error of Graph Neural Networks in the Mean-field Regime

    Authors: Gholamali Aminian, Yixuan He, Gesine Reinert, Łukasz Szpruch, Samuel N. Cohen

    Abstract: This work provides a theoretical framework for assessing the generalization error of graph neural networks in the over-parameterized regime, where the number of parameters surpasses the quantity of data points. We explore two widely utilized types of graph neural networks: graph convolutional neural networks and message passing graph neural networks. Prior to this study, existing bounds on the gen… ▽ More

    Submitted 1 July, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

    Comments: Accepted in ICML 2024

  25. arXiv:2402.05534  [pdf, other

    cs.SI cs.DS

    Robust Parameter Fitting to Realistic Network Models via Iterative Stochastic Approximation

    Authors: Thomas Bläsius, Sarel Cohen, Philipp Fischbeck, Tobias Friedrich, Martin S. Krejca

    Abstract: Random graph models are widely used to understand network properties and graph algorithms. Key to such analyses are the different parameters of each model, which affect various network features, such as its size, clustering, or degree distribution. The exact effect of the parameters on these features is not well understood, mainly because we lack tools to thoroughly investigate this relation. More… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  26. arXiv:2401.10415  [pdf, other

    cs.CL cs.AI

    Can Large Language Model Summarizers Adapt to Diverse Scientific Communication Goals?

    Authors: Marcio Fonseca, Shay B. Cohen

    Abstract: In this work, we investigate the controllability of large language models (LLMs) on scientific summarization tasks. We identify key stylistic and content coverage factors that characterize different types of summaries such as paper reviews, abstracts, and lay summaries. By controlling stylistic features, we find that non-fine-tuned LLMs outperform humans in the MuP review generation task, both in… ▽ More

    Submitted 27 June, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: ACL 2024 camera ready

  27. arXiv:2401.01814  [pdf, other

    cs.AI

    Large Language Models Relearn Removed Concepts

    Authors: Michelle Lo, Shay B. Cohen, Fazl Barez

    Abstract: Advances in model editing through neuron pruning hold promise for removing undesirable concepts from large language models. However, it remains unclear whether models have the capacity to reacquire pruned concepts after editing. To investigate this, we evaluate concept relearning in models by tracking concept saliency and similarity in pruned neurons during retraining. Our findings reveal that mod… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

  28. arXiv:2312.11476  [pdf

    physics.geo-ph cs.LG

    The geometry of flow: Advancing predictions of river geometry with multi-model machine learning

    Authors: Shuyu Y Chang, Zahra Ghahremani, Laura Manuel, Mohammad Erfani, Chaopeng Shen, Sagy Cohen, Kimberly Van Meter, Jennifer L Pierce, Ehab A Meselhe, Erfan Goharian

    Abstract: Hydraulic geometry parameters describing river hydrogeomorphic is important for flood forecasting. Although well-established, power-law hydraulic geometry curves have been widely used to understand riverine systems and mapping flooding inundation worldwide for the past 70 years, we have become increasingly aware of the limitations of these approaches. In the present study, we have moved beyond the… ▽ More

    Submitted 27 November, 2023; originally announced December 2023.

    Comments: 30 pages, 10 figures

  29. arXiv:2312.03480  [pdf, other

    cs.CL

    AMR Parsing is Far from Solved: GrAPES, the Granular AMR Parsing Evaluation Suite

    Authors: Jonas Groschwitz, Shay B. Cohen, Lucia Donatelli, Meaghan Fowlie

    Abstract: We present the Granular AMR Parsing Evaluation Suite (GrAPES), a challenge set for Abstract Meaning Representation (AMR) parsing with accompanying evaluation metrics. AMR parsers now obtain high scores on the standard AMR evaluation metric Smatch, close to or even above reported inter-annotator agreement. But that does not mean that AMR parsing is solved; in fact, human evaluation in previous work… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: Accepted at EMNLP 2023. For the associated GitHub repository, see https://github.com/jgroschwitz/GrAPES

    ACM Class: J.5

  30. arXiv:2312.02156  [pdf, other

    cs.CV cs.AI

    Latent Feature-Guided Diffusion Models for Shadow Removal

    Authors: Kangfu Mei, Luis Figueroa, Zhe Lin, Zhihong Ding, Scott Cohen, Vishal M. Patel

    Abstract: Recovering textures under shadows has remained a challenging problem due to the difficulty of inferring shadow-free scenes from shadow images. In this paper, we propose the use of diffusion models as they offer a promising approach to gradually refine the details of shadow regions during the diffusion process. Our method improves this process by conditioning on a learned latent feature space that… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: project page see https://kfmei.page/shadow-diffusion/index.html

  31. arXiv:2311.12901  [pdf, other

    q-bio.QM cs.LG

    From Microbes to Methane: AI-Based Predictive Modeling of Feed Additive Efficacy in Dairy Cows

    Authors: Yaniv Altshuler, Tzruya Calvao Chebach, Shalom Cohen

    Abstract: In an era of increasing pressure to achieve sustainable agriculture, the optimization of livestock feed for enhancing yield and minimizing environmental impact is a paramount objective. This study presents a pioneering approach towards this goal, using rumen microbiome data to predict the efficacy of feed additives in dairy cattle. We collected an extensive dataset that includes methane emission… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: 51 pages, 24 figures, 11 tables, 93 references

  32. arXiv:2311.09467  [pdf, other

    cs.CL cs.AI

    Think While You Write: Hypothesis Verification Promotes Faithful Knowledge-to-Text Generation

    Authors: Yifu Qiu, Varun Embar, Shay B. Cohen, Benjamin Han

    Abstract: Knowledge-to-text generators often struggle to faithfully generate descriptions for the input facts: they may produce hallucinations that contradict the input, or describe facts not present in the input. To reduce hallucinations, we propose a decoding-only method, TWEAK (Think While Effectively Articulating Knowledge), which can be integrated with any generator without retraining. TWEAK treats the… ▽ More

    Submitted 3 April, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: NAACL 2024 (Findings)

  33. arXiv:2311.08704  [pdf, other

    cs.CL cs.AI

    Can Large Language Models Follow Concept Annotation Guidelines? A Case Study on Scientific and Financial Domains

    Authors: Marcio Fonseca, Shay B. Cohen

    Abstract: Although large language models (LLMs) exhibit remarkable capacity to leverage in-context demonstrations, it is still unclear to what extent they can learn new concepts or facts from ground-truth labels. To address this question, we examine the capacity of instruction-tuned LLMs to follow in-context concept guidelines for sentence labeling tasks. We design guidelines that present different types of… ▽ More

    Submitted 26 June, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: ACL 2024 camera ready

  34. arXiv:2311.08398  [pdf, other

    cs.CL cs.AI

    Are Large Language Models Temporally Grounded?

    Authors: Yifu Qiu, Zheng Zhao, Yftah Ziser, Anna Korhonen, Edoardo M. Ponti, Shay B. Cohen

    Abstract: Are Large language models (LLMs) temporally grounded? Since LLMs cannot perceive and interact with the environment, it is impossible to answer this question directly. Instead, we provide LLMs with textual narratives and probe them with respect to their common-sense knowledge of the structure and duration of events, their ability to order events along a timeline, and self-consistency within their t… ▽ More

    Submitted 16 November, 2023; v1 submitted 14 November, 2023; originally announced November 2023.

  35. arXiv:2310.15513  [pdf, other

    cs.CL

    A Joint Matrix Factorization Analysis of Multilingual Representations

    Authors: Zheng Zhao, Yftah Ziser, Bonnie Webber, Shay B. Cohen

    Abstract: We present an analysis tool based on joint matrix factorization for comparing latent representations of multilingual and monolingual models. An alternative to probing, this tool allows us to analyze multiple sets of representations in a joint manner. Using this tool, we study to what extent and how morphosyntactic features are reflected in the representations learned by multilingual pre-trained mo… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Accepted to Findings of EMNLP 2023

  36. arXiv:2308.12910  [pdf, other

    cs.CV

    SCoRD: Subject-Conditional Relation Detection with Text-Augmented Data

    Authors: Ziyan Yang, Kushal Kafle, Zhe Lin, Scott Cohen, Zhihong Ding, Vicente Ordonez

    Abstract: We propose Subject-Conditional Relation Detection SCoRD, where conditioned on an input subject, the goal is to predict all its relations to other objects in a scene along with their locations. Based on the Open Images dataset, we propose a challenging OIv6-SCoRD benchmark such that the training and testing splits have a distribution shift in terms of the occurrence statistics of $\langle$subject,… ▽ More

    Submitted 4 January, 2024; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: WACV 2024

  37. arXiv:2308.02927  [pdf, other

    cs.DC

    Subquadratic Multivalued Asynchronous Byzantine Agreement WHP

    Authors: Shir Cohen, Idit Keidar

    Abstract: There have been several reductions from multivalued consensus to binary consensus over the past 20 years. To the best of our knowledge, none of them solved it for Byzantine asynchronous settings. In this paper, we close this gap. Moreover, we do so in subquadratic communication, using newly developed subquadratic binary Byzantine Agreement techniques.

    Submitted 5 August, 2023; originally announced August 2023.

    Comments: arXiv admin note: text overlap with arXiv:2002.06545

  38. arXiv:2308.01999  [pdf, other

    quant-ph cs.PF cs.SE

    cuQuantum SDK: A High-Performance Library for Accelerating Quantum Science

    Authors: Harun Bayraktar, Ali Charara, David Clark, Saul Cohen, Timothy Costa, Yao-Lung L. Fang, Yang Gao, Jack Guan, John Gunnels, Azzam Haidar, Andreas Hehn, Markus Hohnerbach, Matthew Jones, Tom Lubowe, Dmitry Lyakh, Shinya Morino, Paul Springer, Sam Stanwyck, Igor Terentyev, Satya Varadhan, Jonathan Wong, Takuma Yamaguchi

    Abstract: We present the NVIDIA cuQuantum SDK, a state-of-the-art library of composable primitives for GPU-accelerated quantum circuit simulations. As the size of quantum devices continues to increase, making their classical simulation progressively more difficult, the availability of fast and scalable quantum circuit simulators becomes vital for quantum algorithm developers, as well as quantum hardware eng… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.

    Comments: paper accepted at QCE 2023, journal reference will be updated whenever available

    MSC Class: 68Q12; 68Q09; 81P68;

  39. arXiv:2307.11677  [pdf, ps, other

    cs.DS

    Improved Approximate Distance Oracles: Bypassing the Thorup-Zwick Bound in Dense Graphs

    Authors: Davide Bilò, Shiri Chechik, Keerti Choudhary, Sarel Cohen, Tobias Friedrich, Martin Schirneck

    Abstract: Despite extensive research on distance oracles, there are still large gaps between the best constructions for spanners and distance oracles. Notably, there exist sparse spanners with a multiplicative stretch of $1+\varepsilon$ plus some additive stretch. A fundamental open problem is whether such a bound is achievable for distance oracles as well. Specifically, can we construct a distance oracle w… ▽ More

    Submitted 21 July, 2023; originally announced July 2023.

  40. arXiv:2307.10810  [pdf, other

    cs.LG cs.AI

    On Combining Expert Demonstrations in Imitation Learning via Optimal Transport

    Authors: Ilana Sebag, Samuel Cohen, Marc Peter Deisenroth

    Abstract: Imitation learning (IL) seeks to teach agents specific tasks through expert demonstrations. One of the key approaches to IL is to define a distance between agent and expert and to find an agent policy that minimizes that distance. Optimal transport methods have been widely used in imitation learning as they provide ways to measure meaningful distances between agent and expert trajectories. However… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

    Journal ref: NeurIPS Workshop on Optimal Transport and Machine Learning, 2021

  41. arXiv:2306.15041  [pdf

    q-bio.QM cs.DB

    A Comparison of Neuroelectrophysiology Databases

    Authors: Priyanka Subash, Alex Gray, Misque Boswell, Samantha L. Cohen, Rachael Garner, Sana Salehi, Calvary Fisher, Samuel Hobel, Satrajit Ghosh, Yaroslav Halchenko, Benjamin Dichter, Russell A. Poldrack, Chris Markiewicz, Dora Hermes, Arnaud Delorme, Scott Makeig, Brendan Behan, Alana Sparks, Stephen R Arnott, Zhengjia Wang, John Magnotti, Michael S. Beauchamp, Nader Pouratian, Arthur W. Toga, Dominique Duncan

    Abstract: As data sharing has become more prevalent, three pillars - archives, standards, and analysis tools - have emerged as critical components in facilitating effective data sharing and collaboration. This paper compares four freely available intracranial neuroelectrophysiology data repositories: Data Archive for the BRAIN Initiative (DABI), Distributed Archives for Neurophysiology Data Integration (DAN… ▽ More

    Submitted 30 August, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: 22 pages, 6 figures, 5 tables

  42. arXiv:2306.11623  [pdf, ps, other

    stat.ML cs.LG math.ST

    Mean-field Analysis of Generalization Errors

    Authors: Gholamali Aminian, Samuel N. Cohen, Łukasz Szpruch

    Abstract: We propose a novel framework for exploring weak and $L_2$ generalization errors of algorithms through the lens of differential calculus on the space of probability measures. Specifically, we consider the KL-regularized empirical risk minimization problem and establish generic conditions under which the generalization error convergence rate, when training on a sample of size $n$, is… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: 49 pages

    MSC Class: 62B10; 60F99; 49N80; 46N30

  43. arXiv:2305.19911  [pdf, other

    cs.LG cs.CL

    Neuron to Graph: Interpreting Language Model Neurons at Scale

    Authors: Alex Foote, Neel Nanda, Esben Kran, Ioannis Konstas, Shay Cohen, Fazl Barez

    Abstract: Advances in Large Language Models (LLMs) have led to remarkable capabilities, yet their inner mechanisms remain largely unknown. To understand these models, we need to unravel the functions of individual neurons and their contribution to the network. This paper introduces a novel automated approach designed to scale interpretability techniques across a vast array of neurons within LLMs, to make th… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

  44. arXiv:2305.19734  [pdf, other

    cs.AI cs.CL cs.DB

    Knowledge Base Question Answering for Space Debris Queries

    Authors: Paul Darm, Antonio Valerio Miceli-Barone, Shay B. Cohen, Annalisa Riccardi

    Abstract: Space agencies execute complex satellite operations that need to be supported by the technical knowledge contained in their extensive information systems. Knowledge bases (KB) are an effective way of storing and accessing such information at scale. In this work we present a system, developed for the European Space Agency (ESA), that can answer complex natural language queries, to support engineers… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: 7 pages, ACL 2023 industry track

    ACM Class: I.2.7

  45. arXiv:2305.16947  [pdf, other

    cs.CL

    Sentence-Incremental Neural Coreference Resolution

    Authors: Matt Grenander, Shay B. Cohen, Mark Steedman

    Abstract: We propose a sentence-incremental neural coreference resolution system which incrementally builds clusters after marking mention boundaries in a shift-reduce method. The system is aimed at bridging two recent approaches at coreference resolution: (1) state-of-the-art non-incremental models that incur quadratic complexity in document length with high computational cost, and (2) memory network-based… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted at EMNLP 2022

  46. arXiv:2305.15507  [pdf, other

    cs.CL cs.AI

    The Larger They Are, the Harder They Fail: Language Models do not Recognize Identifier Swaps in Python

    Authors: Antonio Valerio Miceli-Barone, Fazl Barez, Ioannis Konstas, Shay B. Cohen

    Abstract: Large Language Models (LLMs) have successfully been applied to code generation tasks, raising the question of how well these models understand programming. Typical programming languages have invariances and equivariances in their semantics that human programmers intuitively understand and exploit, such as the (near) invariance to the renaming of identifiers. We show that LLMs not only fail to prop… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: 17 pages, 5 figure, ACL 2023

  47. arXiv:2305.13632  [pdf, other

    cs.CL cs.AI cs.LG

    Detecting and Mitigating Hallucinations in Multilingual Summarisation

    Authors: Yifu Qiu, Yftah Ziser, Anna Korhonen, Edoardo M. Ponti, Shay B. Cohen

    Abstract: Hallucinations pose a significant challenge to the reliability of neural models for abstractive summarisation. While automatically generated summaries may be fluent, they often lack faithfulness to the original document. This issue becomes even more pronounced in low-resource settings, such as cross-lingual transfer. With the existing faithful metrics focusing on English, even measuring the extent… ▽ More

    Submitted 26 October, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023

  48. Approximate Distance Sensitivity Oracles in Subquadratic Space

    Authors: Davide Bilò, Shiri Chechik, Keerti Choudhary, Sarel Cohen, Tobias Friedrich, Simon Krogmann, Martin Schirneck

    Abstract: An $f$-edge fault-tolerant distance sensitive oracle ($f$-DSO) with stretch $σ\ge 1$ is a data structure that preprocesses a given undirected, unweighted graph $G$ with $n$ vertices and $m$ edges, and a positive integer $f$. When queried with a pair of vertices $s, t$ and a set $F$ of at most $f$ edges, it returns a $σ$-approximation of the $s$-$t$-distance in $G-F$. We study $f$-DSOs that take… ▽ More

    Submitted 4 June, 2024; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: The is the arXiv version of the eponymous paper that appeared first at STOC 2023 and then was extended to a journal version, published in TheoretiCS

    Journal ref: TheoretiCS, Volume 3 (June 5, 2024) theoretics:11689

  49. arXiv:2305.08828  [pdf, other

    cs.CL

    PMIndiaSum: Multilingual and Cross-lingual Headline Summarization for Languages in India

    Authors: Ashok Urlana, Pinzhen Chen, Zheng Zhao, Shay B. Cohen, Manish Shrivastava, Barry Haddow

    Abstract: This paper introduces PMIndiaSum, a multilingual and massively parallel summarization corpus focused on languages in India. Our corpus provides a training and testing ground for four language families, 14 languages, and the largest to date with 196 language pairs. We detail our construction workflow including data acquisition, processing, and quality assurance. Furthermore, we publish benchmarks f… ▽ More

    Submitted 19 October, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: Findings of EMNLP 2023

    ACM Class: I.2.7

  50. arXiv:2305.07494  [pdf, other

    cs.GT cs.AI cs.DS

    Temporal Network Creation Games

    Authors: Davide Bilò, Sarel Cohen, Tobias Friedrich, Hans Gawendowicz, Nicolas Klodt, Pascal Lenzner, George Skretas

    Abstract: Most networks are not static objects, but instead they change over time. This observation has sparked rigorous research on temporal graphs within the last years. In temporal graphs, we have a fixed set of nodes and the connections between them are only available at certain time steps. This gives rise to a plethora of algorithmic problems on such graphs, most prominently the problem of finding temp… ▽ More

    Submitted 21 May, 2023; v1 submitted 12 May, 2023; originally announced May 2023.

    Comments: To appear at the 32nd International Joint Conference on Artificial Intelligence (IJCAI 2023), full version