Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 595 results for author: De, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.01722  [pdf

    cs.SE

    ToffA-DSPL: an approach of trade-off analysis for designing dynamic software product lines

    Authors: Michelle Larissa Luciano Carvalho, Paulo Cesar Masiero, Ismayle de Sousa Santos, Eduardo Santana de Almeida

    Abstract: Software engineers have adopted the Dynamic Software Product Lines (DSPL) engineering practices to develop Dynamically Adaptable Software (DAS). DAS is seen as a DSPL application and must cope with a large number of configurations of features, Non-functional Requirements (NFRs), and contexts. However, the accurate representation of the impact of features over NFRs and contexts for the identificati… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2407.00463  [pdf, other

    cs.LG cs.AI cs.CL cs.HC eess.AS

    Open-Source Conversational AI with SpeechBrain 1.0

    Authors: Mirco Ravanelli, Titouan Parcollet, Adel Moumen, Sylvain de Langen, Cem Subakan, Peter Plantinga, Yingzhi Wang, Pooneh Mousavi, Luca Della Libera, Artem Ploujnikov, Francesco Paissan, Davide Borra, Salah Zaiem, Zeyu Zhao, Shucong Zhang, Georgios Karakasidis, Sung-Lin Yeh, Aku Rouhe, Rudolf Braun, Florian Mai, Juan Zuluaga-Gomez, Seyed Mahed Mousavi, Andreas Nautsch, Xuechen Liu, Sangeet Sagar , et al. (5 additional authors not shown)

    Abstract: SpeechBrain is an open-source Conversational AI toolkit based on PyTorch, focused particularly on speech processing tasks such as speech recognition, speech enhancement, speaker recognition, text-to-speech, and much more. It promotes transparency and replicability by releasing both the pre-trained models and the complete "recipes" of code and algorithms required for training them. This paper prese… ▽ More

    Submitted 2 July, 2024; v1 submitted 29 June, 2024; originally announced July 2024.

    Comments: Submitted to JMLR (Machine Learning Open Source Software)

  3. arXiv:2407.00273  [pdf

    cs.SE

    Please do not go: understanding turnover of software engineers from different perspectives

    Authors: Michelle Larissa Luciano Carvalho, Paulo da Silva Cruz, Eduardo Santana de Almeida, Paulo Anselmo da Mota Silveira Neto, Rafael Prikladnicki

    Abstract: Turnover consists of moving into and out of professional employees in the company in a given period. Such a phenomenon significantly impacts the software industry since it generates knowledge loss, delays in the schedule, and increased costs in the final project. Despite the efforts made by researchers and professionals to minimize the turnover, more studies are needed to understand the motivation… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.

  4. arXiv:2406.19552  [pdf, other

    cs.CL cs.AI cs.LG

    Rethinking harmless refusals when fine-tuning foundation models

    Authors: Florin Pop, Judd Rosenblatt, Diogo Schwerz de Lucena, Michael Vaiana

    Abstract: In this paper, we investigate the degree to which fine-tuning in Large Language Models (LLMs) effectively mitigates versus merely conceals undesirable behavior. Through the lens of semi-realistic role-playing exercises designed to elicit such behaviors, we explore the response dynamics of LLMs post fine-tuning interventions. Our methodology involves prompting models for Chain-of-Thought (CoT) reas… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: ICLR 2024 AGI Workshop Poster

  5. arXiv:2406.16781  [pdf, other

    cs.CY

    A Carrying Capacity Calculator for Pedestrians Using OpenStreetMap Data: Application to Urban Tourism and Public Spaces

    Authors: Duarte Sampaio de Almeida, Rodrigo Simões, Fernando Brito e Abreu, Adriano Lopes, Inês Boavida-Portugal

    Abstract: Determining the carrying capacity of urban tourism destinations and public spaces is essential for sustainable management. This paper presents an online tool that calculates pedestrian carrying capacities for user-defined areas based on OpenStreetMap (OSM) data. The tool considers physical, real, and effective carrying capacities by incorporating parameters such as area per pedestrian, rotation fa… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    ACM Class: J.2; K.4

  6. arXiv:2406.13677  [pdf, other

    cs.CL cs.CY

    Leveraging Large Language Models to Measure Gender Bias in Gendered Languages

    Authors: Erik Derner, Sara Sansalvador de la Fuente, Yoan Gutiérrez, Paloma Moreda, Nuria Oliver

    Abstract: Gender bias in text corpora used in various natural language processing (NLP) contexts, such as for training large language models (LLMs), can lead to the perpetuation and amplification of societal inequalities. This is particularly pronounced in gendered languages like Spanish or French, where grammatical structures inherently encode gender, making the bias analysis more challenging. Existing met… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  7. arXiv:2406.12137  [pdf, other

    cs.AI

    IDs for AI Systems

    Authors: Alan Chan, Noam Kolt, Peter Wills, Usman Anwar, Christian Schroeder de Witt, Nitarshan Rajkumar, Lewis Hammond, David Krueger, Lennart Heim, Markus Anderljung

    Abstract: AI systems are increasingly pervasive, yet information needed to decide whether and how to engage with them may not exist or be accessible. A user may not be able to verify whether a system satisfies certain safety standards. An investigator may not know whom to investigate when a system causes an incident. A platform may find it difficult to penalize repeated negative interactions with the same s… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Under review

  8. arXiv:2406.07387  [pdf, ps, other

    cs.IT cs.LG eess.SP

    Machine Learning-Based Channel Prediction for RIS-assisted MIMO Systems With Channel Aging

    Authors: Nipuni Ginige, Arthur Sousa de Sena, Nurul Huda Mahmood, Nandana Rajatheva, Matti Latva-aho

    Abstract: Reconfigurable intelligent surfaces (RISs) have emerged as a promising technology to enhance the performance of sixth-generation (6G) and beyond communication systems. The passive nature of RISs and their large number of reflecting elements pose challenges to the channel estimation process. The associated complexity further escalates when the channel coefficients are fast-varying as in scenarios w… ▽ More

    Submitted 9 May, 2024; originally announced June 2024.

  9. arXiv:2406.06208  [pdf, other

    cs.SD eess.AS

    Quantifying the effect of speech pathology on automatic and human speaker verification

    Authors: Bence Mark Halpern, Thomas Tienkamp, Wen-Chin Huang, Lester Phillip Violeta, Teja Rebernik, Sebastiaan de Visscher, Max Witjes, Martijn Wieling, Defne Abur, Tomoki Toda

    Abstract: This study investigates how surgical intervention for speech pathology (specifically, as a result of oral cancer surgery) impacts the performance of an automatic speaker verification (ASV) system. Using two recently collected Dutch datasets with parallel pre and post-surgery audio from the same speaker, NKI-OC-VC and SPOKE, we assess the extent to which speech pathology influences ASV performance,… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 5 pages, 2 figures, 2 tables. Accepted to Interspeech 2024

    ACM Class: I.2.7

  10. arXiv:2406.03452  [pdf, other

    cs.CL

    Using Synchronic Definitions and Semantic Relations to Classify Semantic Change Types

    Authors: Pierluigi Cassotti, Stefano De Pascale, Nina Tahmasebi

    Abstract: There is abundant evidence of the fact that the way words change their meaning can be classified in different types of change, highlighting the relationship between the old and new meanings (among which generalization, specialization and co-hyponymy transfer). In this paper, we present a way of detecting these types of change by constructing a model that leverages information both from synchronic… ▽ More

    Submitted 11 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

  11. arXiv:2406.02765  [pdf, other

    cs.NE cs.LG

    Discovering Dynamic Symbolic Policies with Genetic Programming

    Authors: Sigur de Vries, Sander Keemink, Marcel van Gerven

    Abstract: Artificial intelligence (AI) techniques are increasingly being applied to solve control problems. However, control systems developed in AI are often black-box methods, in that it is not clear how and why they generate their outputs. A lack of transparency can be problematic for control tasks in particular, because it complicates the identification of biases or errors, which in turn negatively infl… ▽ More

    Submitted 21 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: 22 pages including references and appendix, 4 figures, 1 algorithm, 5 tables

  12. arXiv:2406.02619  [pdf, other

    cs.CR cs.LG

    Unelicitable Backdoors in Language Models via Cryptographic Transformer Circuits

    Authors: Andis Draguns, Andrew Gritsevskiy, Sumeet Ramesh Motwani, Charlie Rogers-Smith, Jeffrey Ladish, Christian Schroeder de Witt

    Abstract: The rapid proliferation of open-source language models significantly increases the risks of downstream backdoor attacks. These backdoors can introduce dangerous behaviours during model deployment and can evade detection by conventional cybersecurity monitoring systems. In this paper, we introduce a novel class of backdoors in autoregressive transformer models, that, in contrast to prior art, are u… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 10 pages, 5 figures

  13. Guarding isometric subgraphs and Cops and Robber in planar graphs

    Authors: Sebastián González Hermosillo de la Maza, Bojan Mohar

    Abstract: In the game of Cops and Robbers, one of the most useful results is that an isometric path in a graph can be guarded by one cop. In this paper, we introduce the concept of wide shadow in a subgraph, and use it to characterize all 1-guardable graphs. As an application, we show that 3 cops can capture a robber in any planar graph with the added restriction that at most two cops can move simultaneousl… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    MSC Class: 05C57; 05C10

    Journal ref: Europ. J. Combin. 119 (2024) 103809

  14. arXiv:2405.19540  [pdf, other

    cs.IT cs.CR

    Computing Low-Entropy Couplings for Large-Support Distributions

    Authors: Samuel Sokota, Dylan Sam, Christian Schroeder de Witt, Spencer Compton, Jakob Foerster, J. Zico Kolter

    Abstract: Minimum-entropy coupling (MEC) -- the process of finding a joint distribution with minimum entropy for given marginals -- has applications in areas such as causality and steganography. However, existing algorithms are either computationally intractable for large-support distributions or limited to specific distribution types and sensitive to hyperparameter choices. This work addresses these limita… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  15. arXiv:2405.12195  [pdf, other

    cs.SE

    Developers' Perceptions on the Impact of ChatGPT in Software Development: A Survey

    Authors: Thiago S. Vaillant, Felipe Deveza de Almeida, Paulo Anselmo M. S. Neto, Cuiyun Gao, Jan Bosch, Eduardo Santana de Almeida

    Abstract: As Large Language Models (LLMs), including ChatGPT and analogous systems, continue to advance, their robust natural language processing capabilities and diverse applications have garnered considerable attention. Nonetheless, despite the increasing acknowledgment of the convergence of Artificial Intelligence (AI) and Software Engineering (SE), there is a lack of studies involving the impact of this… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: 31 pages, 9 figures

    ACM Class: D.2.0

  16. arXiv:2405.10004  [pdf, other

    eess.IV cs.CV cs.LG

    ROCOv2: Radiology Objects in COntext Version 2, an Updated Multimodal Image Dataset

    Authors: Johannes Rückert, Louise Bloch, Raphael Brüngel, Ahmad Idrissi-Yaghir, Henning Schäfer, Cynthia S. Schmidt, Sven Koitka, Obioma Pelka, Asma Ben Abacha, Alba G. Seco de Herrera, Henning Müller, Peter A. Horn, Felix Nensa, Christoph M. Friedrich

    Abstract: Automated medical image analysis systems often require large amounts of training data with high quality labels, which are difficult and time consuming to generate. This paper introduces Radiology Object in COntext version 2 (ROCOv2), a multimodal dataset consisting of radiological images and associated medical concepts and captions extracted from the PMC Open Access subset. It is an updated versio… ▽ More

    Submitted 18 June, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: Accepted for Scientific Data

  17. arXiv:2405.08137  [pdf, ps, other

    physics.comp-ph cond-mat.mtrl-sci cs.LG physics.chem-ph

    LATTE: an atomic environment descriptor based on Cartesian tensor contractions

    Authors: Franco Pellegrini, Stefano de Gironcoli, Emine Küçükbenli

    Abstract: We propose a new descriptor for local atomic environments, to be used in combination with machine learning models for the construction of interatomic potentials. The Local Atomic Tensors Trainable Expansion (LATTE) allows for the efficient construction of a variable number of many-body terms with learnable parameters, resulting in a descriptor that is efficient, expressive, and can be scaled to su… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 7 pages, 1 figure

  18. arXiv:2405.06802  [pdf

    cs.CL cs.AI

    Summarizing Radiology Reports Findings into Impressions

    Authors: Raul Salles de Padua, Imran Qureshi

    Abstract: Patient hand-off and triage are two fundamental problems in health care. Often doctors must painstakingly summarize complex findings to efficiently communicate with specialists and quickly make decisions on which patients have the most urgent cases. In pursuit of these challenges, we present (1) a model with state-of-art radiology report summarization performance using (2) a novel method for augme… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: 10 pages, 6 figures

  19. arXiv:2405.06685  [pdf, other

    cs.CL

    Multigenre AI-powered Story Composition

    Authors: Edirlei Soares de Lima, Margot M. E. Neggers, Antonio L. Furtado

    Abstract: This paper shows how to construct genre patterns, whose purpose is to guide interactive story composition in a way that enforces thematic consistency. To start the discussion we argue, based on previous seminal works, for the existence of five fundamental genres, namely comedy, romance - in the sense of epic plots, flourishing since the twelfth century -, tragedy, satire, and mystery. To construct… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  20. arXiv:2405.05031  [pdf, other

    cs.CV

    Mitigating Bias Using Model-Agnostic Data Attribution

    Authors: Sander De Coninck, Wei-Cheng Wang, Sam Leroux, Pieter Simoens

    Abstract: Mitigating bias in machine learning models is a critical endeavor for ensuring fairness and equity. In this paper, we propose a novel approach to address bias by leveraging pixel image attributions to identify and regularize regions of images containing significant information about bias attributes. Our method utilizes a model-agnostic approach to extract pixel attributions by employing a convolut… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: Accepted to the 2024 IEEE CVPR Workshop on Fair, Data-efficient, and Trusted Computer Vision

  21. arXiv:2405.00794  [pdf, other

    cs.CV

    Coherent 3D Portrait Video Reconstruction via Triplane Fusion

    Authors: Shengze Wang, Xueting Li, Chao Liu, Matthew Chan, Michael Stengel, Josef Spjut, Henry Fuchs, Shalini De Mello, Koki Nagano

    Abstract: Recent breakthroughs in single-image 3D portrait reconstruction have enabled telepresence systems to stream 3D portrait videos from a single camera in real-time, potentially democratizing telepresence. However, per-frame 3D reconstruction exhibits temporal inconsistency and forgets the user's appearance. On the other hand, self-reenactment methods can render coherent 3D portraits by driving a pers… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  22. arXiv:2404.17047  [pdf, other

    cs.LG

    Near to Mid-term Risks and Opportunities of Open-Source Generative AI

    Authors: Francisco Eiras, Aleksandar Petrov, Bertie Vidgen, Christian Schroeder de Witt, Fabio Pizzati, Katherine Elkins, Supratik Mukhopadhyay, Adel Bibi, Botos Csaba, Fabro Steibel, Fazl Barez, Genevieve Smith, Gianluca Guadagni, Jon Chun, Jordi Cabot, Joseph Marvin Imperial, Juan A. Nolazco-Flores, Lori Landay, Matthew Jackson, Paul Röttger, Philip H. S. Torr, Trevor Darrell, Yong Suk Lee, Jakob Foerster

    Abstract: In the next few years, applications of Generative AI are expected to revolutionize a number of different areas, ranging from science & medicine to education. The potential for these seismic changes has triggered a lively debate about potential risks and resulted in calls for tighter regulation, in particular from some of the major tech companies who are leading in AI development. This regulation i… ▽ More

    Submitted 24 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: Accepted to ICML'24 as a position paper

  23. arXiv:2404.16722  [pdf, other

    cs.CC

    Clique Is Hard on Average for Sherali-Adams with Bounded Coefficients

    Authors: Susanna F. de Rezende, Aaron Potechin, Kilian Risse

    Abstract: We prove that Sherali-Adams with polynomially bounded coefficients requires proofs of size $n^{Ω(d)}$ to rule out the existence of an $n^{Θ(1)}$-clique in Erdős-Rényi random graphs whose maximum clique is of size $d\leq 2\log n$. This lower bound is tight up to the multiplicative constant in the exponent. We obtain this result by introducing a technique inspired by pseudo-calibration which may be… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: This is the full-length version of a paper with the title "Clique Is Hard on Average for Unary Sherali-Adams" that appeared in the Proceedings of the 64th Annual IEEE Symposium on Foundations of Computer Science (FOCS 2023)

    ACM Class: F.2.2; F.1.3; I.2.3; F.4.1

  24. arXiv:2404.16653  [pdf, other

    cs.CL cs.AI

    Análise de ambiguidade linguística em modelos de linguagem de grande escala (LLMs)

    Authors: Lavínia de Carvalho Moraes, Irene Cristina Silvério, Rafael Alexandre Sousa Marques, Bianca de Castro Anaia, Dandara Freitas de Paula, Maria Carolina Schincariol de Faria, Iury Cleveston, Alana de Santana Correia, Raquel Meister Ko Freitag

    Abstract: Linguistic ambiguity continues to represent a significant challenge for natural language processing (NLP) systems, notwithstanding the advancements in architectures such as Transformers and BERT. Inspired by the recent success of instructional models like ChatGPT and Gemini (In 2023, the artificial intelligence was called Bard.), this study aims to analyze and discuss linguistic ambiguity within t… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: in Portuguese language, 16 páginas, 5 páginas de apêndice e 4 imagens

  25. arXiv:2404.16244  [pdf, other

    cs.CY

    The Ethics of Advanced AI Assistants

    Authors: Iason Gabriel, Arianna Manzini, Geoff Keeling, Lisa Anne Hendricks, Verena Rieser, Hasan Iqbal, Nenad Tomašev, Ira Ktena, Zachary Kenton, Mikel Rodriguez, Seliem El-Sayed, Sasha Brown, Canfer Akbulut, Andrew Trask, Edward Hughes, A. Stevie Bergman, Renee Shelby, Nahema Marchal, Conor Griffin, Juan Mateos-Garcia, Laura Weidinger, Winnie Street, Benjamin Lange, Alex Ingerman, Alison Lentz , et al. (32 additional authors not shown)

    Abstract: This paper focuses on the opportunities and the ethical and societal risks posed by advanced AI assistants. We define advanced AI assistants as artificial agents with natural language interfaces, whose function is to plan and execute sequences of actions on behalf of a user, across one or more domains, in line with the user's expectations. The paper starts by considering the technology itself, pro… ▽ More

    Submitted 28 April, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

  26. arXiv:2404.11727  [pdf

    cs.CV

    Deep Learning for Video-Based Assessment of Endotracheal Intubation Skills

    Authors: Jean-Paul Ainam, Erim Yanik, Rahul Rahul, Taylor Kunkes, Lora Cavuoto, Brian Clemency, Kaori Tanaka, Matthew Hackett, Jack Norfleet, Suvranu De

    Abstract: Endotracheal intubation (ETI) is an emergency procedure performed in civilian and combat casualty care settings to establish an airway. Objective and automated assessment of ETI skills is essential for the training and certification of healthcare providers. However, the current approach is based on manual feedback by an expert, which is subjective, time- and resource-intensive, and is prone to poo… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  27. arXiv:2404.10889  [pdf

    cs.AI

    Cognitive-Motor Integration in Assessing Bimanual Motor Skills

    Authors: Erim Yanik, Xavier Intes, Suvranu De

    Abstract: Accurate assessment of bimanual motor skills is essential across various professions, yet, traditional methods often rely on subjective assessments or focus solely on motor actions, overlooking the integral role of cognitive processes. This study introduces a novel approach by leveraging deep neural networks (DNNs) to analyze and integrate both cognitive decision-making and motor execution. We tes… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: 12 pages, 3 figures, 2 tables

  28. arXiv:2404.09666  [pdf, other

    eess.IV cs.CV q-bio.QM

    Deformable MRI Sequence Registration for AI-based Prostate Cancer Diagnosis

    Authors: Alessa Hering, Sarah de Boer, Anindo Saha, Jasper J. Twilt, Mattias P. Heinrich, Derya Yakar, Maarten de Rooij, Henkjan Huisman, Joeran S. Bosma

    Abstract: The PI-CAI (Prostate Imaging: Cancer AI) challenge led to expert-level diagnostic algorithms for clinically significant prostate cancer detection. The algorithms receive biparametric MRI scans as input, which consist of T2-weighted and diffusion-weighted scans. These scans can be misaligned due to multiple factors in the scanning process. Image registration can alleviate this issue by predicting t… ▽ More

    Submitted 28 June, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

  29. arXiv:2404.09206  [pdf, other

    cs.CL

    DKE-Research at SemEval-2024 Task 2: Incorporating Data Augmentation with Generative Models and Biomedical Knowledge to Enhance Inference Robustness

    Authors: Yuqi Wang, Zeqiang Wang, Wei Wang, Qi Chen, Kaizhu Huang, Anh Nguyen, Suparna De

    Abstract: Safe and reliable natural language inference is critical for extracting insights from clinical trial reports but poses challenges due to biases in large pre-trained language models. This paper presents a novel data augmentation technique to improve model robustness for biomedical natural language inference in clinical trials. By generating synthetic examples through semantic perturbations and doma… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

  30. arXiv:2404.08488  [pdf

    cs.CL

    Thematic Analysis with Large Language Models: does it work with languages other than English? A targeted test in Italian

    Authors: Stefano De Paoli

    Abstract: This paper proposes a test to perform Thematic Analysis (TA) with Large Language Model (LLM) on data which is in a different language than English. While there has been initial promising work on using pre-trained LLMs for TA on data in English, we lack any tests on whether these models can reasonably perform the same analysis with good quality in other language. In this paper a test will be propos… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

  31. arXiv:2404.07839  [pdf, other

    cs.LG cs.AI cs.CL

    RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

    Authors: Aleksandar Botev, Soham De, Samuel L Smith, Anushan Fernando, George-Cristian Muraru, Ruba Haroun, Leonard Berrada, Razvan Pascanu, Pier Giuseppe Sessa, Robert Dadashi, Léonard Hussenot, Johan Ferret, Sertan Girgin, Olivier Bachem, Alek Andreev, Kathleen Kenealy, Thomas Mesnard, Cassidy Hardin, Surya Bhupatiraju, Shreya Pathak, Laurent Sifre, Morgane Rivière, Mihir Sanjay Kale, Juliette Love, Pouya Tafti , et al. (37 additional authors not shown)

    Abstract: We introduce RecurrentGemma, an open language model which uses Google's novel Griffin architecture. Griffin combines linear recurrences with local attention to achieve excellent performance on language. It has a fixed-sized state, which reduces memory use and enables efficient inference on long sequences. We provide a pre-trained model with 2B non-embedding parameters, and an instruction tuned var… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  32. arXiv:2404.07099  [pdf, other

    cs.LG cs.AI

    Rethinking Out-of-Distribution Detection for Reinforcement Learning: Advancing Methods for Evaluation and Detection

    Authors: Linas Nasvytis, Kai Sandbrink, Jakob Foerster, Tim Franzmeyer, Christian Schroeder de Witt

    Abstract: While reinforcement learning (RL) algorithms have been successfully applied across numerous sequential decision-making problems, their generalization to unforeseen testing environments remains a significant concern. In this paper, we study the problem of out-of-distribution (OOD) detection in RL, which focuses on identifying situations at test time that RL agents have not encountered in their trai… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: Accepted as a full paper to the 23rd International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2024)

  33. arXiv:2404.06976  [pdf, other

    cs.IR

    Quati: A Brazilian Portuguese Information Retrieval Dataset from Native Speakers

    Authors: Mirelle Bueno, Eduardo Seiti de Oliveira, Rodrigo Nogueira, Roberto A. Lotufo, Jayr Alencar Pereira

    Abstract: Despite Portuguese being one of the most spoken languages in the world, there is a lack of high-quality information retrieval datasets in that language. We present Quati, a dataset specifically designed for the Brazilian Portuguese language. It comprises a collection of queries formulated by native speakers and a curated set of documents sourced from a selection of high-quality Brazilian Portugues… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 22 pages

  34. arXiv:2404.02659  [pdf, other

    cs.CV cs.NE

    A Satellite Band Selection Framework for Amazon Forest Deforestation Detection Task

    Authors: Eduardo Neto, Fabio A. Faria, Amanda A. S. de Oliveira, Álvaro L. Fazenda

    Abstract: The conservation of tropical forests is a topic of significant social and ecological relevance due to their crucial role in the global ecosystem. Unfortunately, deforestation and degradation impact millions of hectares annually, necessitating government or private initiatives for effective forest monitoring. This study introduces a novel framework that employs the Univariate Marginal Distribution… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: 9 pages, 4 figures, paper accepted for presentation at GECCO 2024

  35. arXiv:2403.19935  [pdf, other

    cs.CV

    CP HDR: A feature point detection and description library for LDR and HDR images

    Authors: Artur Santos Nascimento, Valter Guilherme Silva de Souza, Daniel Oliveira Dantas, Beatriz Trinchão Andrade

    Abstract: In computer vision, characteristics refer to image regions with unique properties, such as corners, edges, textures, or areas with high contrast. These regions can be represented through feature points (FPs). FP detection and description are fundamental steps to many computer vision tasks. Most FP detection and description methods use low dynamic range (LDR) images, sufficient for most application… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    MSC Class: 68T45 ACM Class: I.4.0

  36. Loss Regularizing Robotic Terrain Classification

    Authors: Shakti Deo Kumar, Sudhanshu Tripathi, Krishna Ujjwal, Sarvada Sakshi Jha, Suddhasil De

    Abstract: Locomotion mechanics of legged robots are suitable when pacing through difficult terrains. Recognising terrains for such robots are important to fully yoke the versatility of their movements. Consequently, robotic terrain classification becomes significant to classify terrains in real time with high accuracy. The conventional classifiers suffer from overfitting problem, low accuracy problem, high… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: Preliminary draft of the work published in IEEE conference 2023

  37. arXiv:2403.09547  [pdf

    cs.SE cs.LG

    How do Machine Learning Projects use Continuous Integration Practices? An Empirical Study on GitHub Actions

    Authors: João Helis Bernardo, Daniel Alencar da Costa, Sérgio Queiroz de Medeiros, Uirá Kulesza

    Abstract: Continuous Integration (CI) is a well-established practice in traditional software development, but its nuances in the domain of Machine Learning (ML) projects remain relatively unexplored. Given the distinctive nature of ML development, understanding how CI practices are adopted in this context is crucial for tailoring effective approaches. In this study, we conduct a comprehensive analysis of 18… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: 10 pages, Mining Software Repositories, MSR 2024

  38. arXiv:2403.08295  [pdf, other

    cs.CL cs.AI

    Gemma: Open Models Based on Gemini Research and Technology

    Authors: Gemma Team, Thomas Mesnard, Cassidy Hardin, Robert Dadashi, Surya Bhupatiraju, Shreya Pathak, Laurent Sifre, Morgane Rivière, Mihir Sanjay Kale, Juliette Love, Pouya Tafti, Léonard Hussenot, Pier Giuseppe Sessa, Aakanksha Chowdhery, Adam Roberts, Aditya Barua, Alex Botev, Alex Castro-Ros, Ambrose Slone, Amélie Héliou, Andrea Tacchetti, Anna Bulanova, Antonia Paterson, Beth Tsai, Bobak Shahriari , et al. (83 additional authors not shown)

    Abstract: This work introduces Gemma, a family of lightweight, state-of-the art open models built from the research and technology used to create Gemini models. Gemma models demonstrate strong performance across academic benchmarks for language understanding, reasoning, and safety. We release two sizes of models (2 billion and 7 billion parameters), and provide both pretrained and fine-tuned checkpoints. Ge… ▽ More

    Submitted 16 April, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

  39. A Machine learning and Empirical Bayesian Approach for Predictive Buying in B2B E-commerce

    Authors: Tuhin Subhra De, Pranjal Singh, Alok Patel

    Abstract: In the context of developing nations like India, traditional business to business (B2B) commerce heavily relies on the establishment of robust relationships, trust, and credit arrangements between buyers and sellers. Consequently, ecommerce enterprises frequently. Established in 2016 with a vision to revolutionize trade in India through technology, Udaan is the countrys largest business to busines… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: Published at the 8th International Conference on Machine Learning and Soft Computing (ICMLSC 2024), Singapore

  40. arXiv:2403.07507  [pdf, other

    astro-ph.EP cs.LG

    Reconstructions of Jupiter's magnetic field using physics informed neural networks

    Authors: Philip W. Livermore, Leyuan Wu, Longwei Chen, Sjoerd A. L. de Ridder

    Abstract: Magnetic sounding using data collected from the Juno mission can be used to provide constraints on Jupiter's interior. However, inwards continuation of reconstructions assuming zero electrical conductivity and a representation in spherical harmonics are limited by the enhancement of noise at small scales. Here we describe new reconstructions of Jupiter's internal magnetic field based on physics-in… ▽ More

    Submitted 3 May, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  41. arXiv:2403.02330  [pdf, other

    cs.CV

    RegionGPT: Towards Region Understanding Vision Language Model

    Authors: Qiushan Guo, Shalini De Mello, Hongxu Yin, Wonmin Byeon, Ka Chun Cheung, Yizhou Yu, Ping Luo, Sifei Liu

    Abstract: Vision language models (VLMs) have experienced rapid advancements through the integration of large language models (LLMs) with image-text pairs, yet they struggle with detailed regional visual understanding due to limited spatial awareness of the vision encoder, and the use of coarse-grained training data that lacks detailed, region-specific captions. To address this, we introduce RegionGPT (short… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  42. arXiv:2402.19427  [pdf, other

    cs.LG cs.CL

    Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models

    Authors: Soham De, Samuel L. Smith, Anushan Fernando, Aleksandar Botev, George Cristian-Muraru, Albert Gu, Ruba Haroun, Leonard Berrada, Yutian Chen, Srivatsan Srinivasan, Guillaume Desjardins, Arnaud Doucet, David Budden, Yee Whye Teh, Razvan Pascanu, Nando De Freitas, Caglar Gulcehre

    Abstract: Recurrent neural networks (RNNs) have fast inference and scale efficiently on long sequences, but they are difficult to train and hard to scale. We propose Hawk, an RNN with gated linear recurrences, and Griffin, a hybrid model that mixes gated linear recurrences with local attention. Hawk exceeds the reported performance of Mamba on downstream tasks, while Griffin matches the performance of Llama… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: 25 pages, 11 figures

  43. arXiv:2402.12110  [pdf, other

    cs.CG cs.DS

    The Complexity of Geodesic Spanners using Steiner Points

    Authors: Sarita de Berg, Tim Ophelders, Irene Parada, Frank Staals, Jules Wulms

    Abstract: A geometric $t$-spanner $\mathcal{G}$ on a set $S$ of $n$ point sites in a metric space $P$ is a subgraph of the complete graph on $S$ such that for every pair of sites $p,q$ the distance in $\mathcal{G}$ is a most $t$ times the distance $d(p,q)$ in $P$. We call a connection between two sites in the spanner a link. In some settings, such as when $P$ is a simple polygon with $m$ vertices and a link… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 25 pages, 11 figures

  44. arXiv:2402.12028  [pdf, other

    cs.CG

    Exact solutions to the Weighted Region Problem

    Authors: Sarita de Berg, Guillermo Esteban, Rodrigo I. Silveira, Frank Staals

    Abstract: In this paper, we consider the Weighted Region Problem. In the Weighted Region Problem, the length of a path is defined as the sum of the weights of the subpaths within each region, where the weight of a subpath is its Euclidean length multiplied by a weight $ α\geq 0 $ depending on the region. We study a restricted version of the problem of determining shortest paths through a single weighted rec… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  45. arXiv:2402.07510  [pdf, other

    cs.AI cs.CR

    Secret Collusion Among Generative AI Agents

    Authors: Sumeet Ramesh Motwani, Mikhail Baranchuk, Martin Strohmeier, Vijay Bolina, Philip H. S. Torr, Lewis Hammond, Christian Schroeder de Witt

    Abstract: Recent capability increases in large language models (LLMs) open up applications in which teams of communicating generative AI agents solve joint tasks. This poses privacy and security challenges concerning the unauthorised sharing of information, or other unwanted forms of agent coordination. Modern steganographic techniques could render such dynamics hard to detect. In this paper, we comprehensi… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

  46. arXiv:2402.05106  [pdf, other

    cs.CV cs.AI cs.CL

    Image captioning for Brazilian Portuguese using GRIT model

    Authors: Rafael Silva de Alencar, William Alberto Cruz Castañeda, Marcellus Amadeus

    Abstract: This work presents the early development of a model of image captioning for the Brazilian Portuguese language. We used the GRIT (Grid - and Region-based Image captioning Transformer) model to accomplish this work. GRIT is a Transformer-only neural architecture that effectively utilizes two visual features to generate better captions. The GRIT method emerged as a proposal to be a more efficient way… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: text overlap with arXiv:2207.09666 by other authors

  47. arXiv:2402.01088  [pdf, other

    cs.GT cs.MA

    The Danger Of Arrogance: Welfare Equilibra As A Solution To Stackelberg Self-Play In Non-Coincidental Games

    Authors: Jake Levi, Chris Lu, Timon Willi, Christian Schroeder de Witt, Jakob Foerster

    Abstract: The increasing prevalence of multi-agent learning systems in society necessitates understanding how to learn effective and safe policies in general-sum multi-agent environments against a variety of opponents, including self-play. General-sum learning is difficult because of non-stationary opponents and misaligned incentives. Our first main contribution is to show that many recent approaches to gen… ▽ More

    Submitted 27 March, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: 31 pages, 23 figures

  48. arXiv:2402.00591  [pdf, other

    cs.AI

    Sandra -- A Neuro-Symbolic Reasoner Based On Descriptions And Situations

    Authors: Nicolas Lazzari, Stefano De Giorgis, Aldo Gangemi, Valentina Presutti

    Abstract: This paper presents sandra, a neuro-symbolic reasoner combining vectorial representations with deductive reasoning. Sandra builds a vector space constrained by an ontology and performs reasoning over it. The geometric nature of the reasoner allows its combination with neural networks, bridging the gap with symbolic knowledge representations. Sandra is based on the Description and Situation (DnS) o… ▽ More

    Submitted 25 March, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

  49. arXiv:2401.17821  [pdf, other

    cs.CV cs.HC

    Do Object Detection Localization Errors Affect Human Performance and Trust?

    Authors: Sven de Witte, Ombretta Strafforello, Jan van Gemert

    Abstract: Bounding boxes are often used to communicate automatic object detection results to humans, aiding humans in a multitude of tasks. We investigate the relationship between bounding box localization errors and human task performance. We use observer performance studies on a visual multi-object counting task to measure both human trust and performance with different levels of bounding box accuracy. Th… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

  50. arXiv:2401.15455  [pdf, other

    cs.CV cs.LG

    New Foggy Object Detecting Model

    Authors: Rahul Banavathu, Modem Veda Sree, Bollina Kavya Sri, Suddhasil De

    Abstract: Object detection in reduced visibility has become a prominent research area. The existing techniques are not accurate enough in recognizing objects under such circumstances. This paper introduces a new foggy object detection method through a two-staged architecture of region identification from input images and detecting objects in such regions. The paper confirms notable improvements of the propo… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.