
Showing 1–50 of 115 results for author: Silva, B

Searching in archive cs.
  1. arXiv:2406.19482  [pdf, other]

    cs.CL

    xTower: A Multilingual LLM for Explaining and Correcting Translation Errors

    Authors: Marcos Treviso, Nuno M. Guerreiro, Sweta Agrawal, Ricardo Rei, José Pombal, Tania Vaz, Helena Wu, Beatriz Silva, Daan van Stigt, André F. T. Martins

    Abstract: While machine translation (MT) systems are achieving increasingly strong performance on benchmarks, they often produce translations with errors and anomalies. Understanding these errors can potentially help improve the translation quality and user experience. This paper introduces xTower, an open large language model (LLM) built on top of TowerBase designed to provide free-text explanations for tr…

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2406.17915  [pdf, other]

    cs.CV cs.AI

    Semi-supervised classification of dental conditions in panoramic radiographs using large language model and instance segmentation: A real-world dataset evaluation

    Authors: Bernardo Silva, Jefferson Fontinele, Carolina Letícia Zilli Vieira, João Manuel R. S. Tavares, Patricia Ramos Cury, Luciano Oliveira

    Abstract: Dental panoramic radiographs offer vast diagnostic opportunities, but training supervised deep learning networks for automatic analysis of those radiology images is hampered by a shortage of labeled data. Here, a different perspective on this problem is introduced. A semi-supervised learning framework is proposed to classify thirteen dental conditions on panoramic radiographs, with a particular em…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 43 pages, 12 figures, 9 tables

  3. arXiv:2406.16241  [pdf, other]

    cs.LG stat.ME

    Position: Benchmarking is Limited in Reinforcement Learning Research

    Authors: Scott M. Jordan, Adam White, Bruno Castro da Silva, Martha White, Philip S. Thomas

    Abstract: Novel reinforcement learning algorithms, or improvements on existing ones, are commonly justified by evaluating their performance on benchmark environments and are compared to an ever-changing set of standard algorithms. However, despite numerous calls for improvements, experimental practices continue to produce misleading or unsupported claims. One reason for the ongoing substandard practices is…

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: 19 pages, 13 figures, The Forty-first International Conference on Machine Learning (ICML 2024)

  4. arXiv:2404.08555  [pdf, other]

    cs.LG cs.AI cs.CL

    RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs

    Authors: Shreyas Chaudhari, Pranjal Aggarwal, Vishvak Murahari, Tanmay Rajpurohit, Ashwin Kalyan, Karthik Narasimhan, Ameet Deshpande, Bruno Castro da Silva

    Abstract: State-of-the-art large language models (LLMs) have become indispensable tools for various tasks. However, training LLMs to serve as effective assistants for humans requires careful consideration. A promising approach is reinforcement learning from human feedback (RLHF), which leverages human feedback to update the model in accordance with human preferences and mitigate issues like toxicity and hal…

    Submitted 15 April, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

  5. arXiv:2404.00213  [pdf, other]

    cs.CL

    Injecting New Knowledge into Large Language Models via Supervised Fine-Tuning

    Authors: Nick Mecklenburg, Yiyou Lin, Xiaoxiao Li, Daniel Holstein, Leonardo Nunes, Sara Malvar, Bruno Silva, Ranveer Chandra, Vijay Aski, Pavan Kumar Reddy Yannam, Tolga Aktas, Todd Hendry

    Abstract: In recent years, Large Language Models (LLMs) have shown remarkable performance in generating human-like text, proving to be a valuable asset across various applications. However, adapting these models to incorporate new, out-of-domain knowledge remains a challenge, particularly for facts and events that occur after the model's knowledge cutoff date. This paper investigates the effectiveness of Su…

    Submitted 2 April, 2024; v1 submitted 29 March, 2024; originally announced April 2024.

    Comments: 16 pages; 7 figures. updated authors list

  6. Exploring Optical Flow Inclusion into nnU-Net Framework for Surgical Instrument Segmentation

    Authors: Marcos Fernández-Rodríguez, Bruno Silva, Sandro Queirós, Helena R. Torres, Bruno Oliveira, Pedro Morais, Lukas R. Buschle, Jorge Correia-Pinto, Estevão Lima, João L. Vilaça

    Abstract: Surgical instrument segmentation in laparoscopy is essential for computer-assisted surgical systems. Despite the Deep Learning progress in recent years, the dynamic setting of laparoscopic surgery still presents challenges for precise segmentation. The nnU-Net framework excelled in semantic segmentation analyzing single frames without temporal information. The framework's ease of use, including it…

    Submitted 15 March, 2024; originally announced March 2024.

    Journal ref: Proceedings Volume 12928, Medical Imaging 2024: Image-Guided Procedures, Robotic Interventions, and Modeling; 1292827 (2024)

  7. arXiv:2403.07201  [pdf]

    cs.LG cs.AI stat.AP

    A multi-cohort study on prediction of acute brain dysfunction states using selective state space models

    Authors: Brandon Silva, Miguel Contreras, Sabyasachi Bandyopadhyay, Yuanfang Ren, Ziyuan Guan, Jeremy Balch, Kia Khezeli, Tezcan Ozrazgat Baslanti, Ben Shickel, Azra Bihorac, Parisa Rashidi

    Abstract: Assessing acute brain dysfunction (ABD), including delirium and coma in the intensive care unit (ICU), is a critical challenge due to its prevalence and severe implications for patient outcomes. Current diagnostic methods rely on infrequent clinical observations, which can only determine a patient's ABD status after onset. Our research attempts to solve these problems by harnessing Electronic Heal…

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 22 pages, 8 figures, To be published

  8. arXiv:2403.06322  [pdf, other]

    cs.CV cs.AI

    Leveraging Computer Vision in the Intensive Care Unit (ICU) for Examining Visitation and Mobility

    Authors: Scott Siegel, Jiaqing Zhang, Sabyasachi Bandyopadhyay, Subhash Nerella, Brandon Silva, Tezcan Baslanti, Azra Bihorac, Parisa Rashidi

    Abstract: Despite the importance of closely monitoring patients in the Intensive Care Unit (ICU), many aspects are still assessed in a limited manner due to the time constraints imposed on healthcare providers. For example, although excessive visitations during rest hours can potentially exacerbate the risk of circadian rhythm disruption and delirium, it is not captured in the ICU. Likewise, while mobility…

    Submitted 10 March, 2024; originally announced March 2024.

  9. arXiv:2403.02043  [pdf, other]

    eess.IV cs.CV

    Iterative Occlusion-Aware Light Field Depth Estimation using 4D Geometrical Cues

    Authors: Rui Lourenço, Lucas Thomaz, Eduardo A. B. Silva, Sergio M. M. Faria

    Abstract: Light field cameras and multi-camera arrays have emerged as promising solutions for accurately estimating depth by passively capturing light information. This is possible because the 3D information of a scene is embedded in the 4D light field geometry. Commonly, depth estimation methods extract this information relying on gradient information, heuristic-based optimisation models, or learning-based…

    Submitted 4 March, 2024; originally announced March 2024.

  10. arXiv:2402.18814  [pdf, other]

    cs.IT

    New topological subsystem codes from semi-regular tessellations

    Authors: Eduardo Brandani da Silva, Evandro Mazetto Brizola

    Abstract: In this work, we present new constructions for topological subsystem codes using semi-regular Euclidean and hyperbolic tessellations. They give us new families of codes, and we also provide a new family of codes obtained through an already existing construction, due to Sarvepalli and Brown. We also prove new results that allow us to obtain the parameters of these new codes.

    Submitted 28 February, 2024; originally announced February 2024.

  11. arXiv:2401.08686  [pdf, other]

    cs.CV

    Attention Modules Improve Modern Image-Level Anomaly Detection: A DifferNet Case Study

    Authors: André Luiz B. Vieira e Silva, Francisco Simões, Danny Kowerko, Tobias Schlosser, Felipe Battisti, Veronica Teichrieb

    Abstract: Within (semi-)automated visual inspection, learning-based approaches for assessing visual defects, including deep neural networks, enable the processing of otherwise small defect patterns in pixel size on high-resolution imagery. The emergence of these often rarely occurring defect patterns explains the general need for labeled data corpora. To not only alleviate this issue but to furthermore adva…

    Submitted 12 January, 2024; originally announced January 2024.

    Comments: Accepted to CVPRW 2023: VISION'23 - 1st workshop on Vision-based InduStrial InspectiON (Extended Abstract). arXiv admin note: substantial text overlap with arXiv:2311.02747

  12. arXiv:2401.08406  [pdf, other]

    cs.CL cs.LG

    RAG vs Fine-tuning: Pipelines, Tradeoffs, and a Case Study on Agriculture

    Authors: Angels Balaguer, Vinamra Benara, Renato Luiz de Freitas Cunha, Roberto de M. Estevão Filho, Todd Hendry, Daniel Holstein, Jennifer Marsman, Nick Mecklenburg, Sara Malvar, Leonardo O. Nunes, Rafael Padilha, Morris Sharp, Bruno Silva, Swati Sharma, Vijay Aski, Ranveer Chandra

    Abstract: There are two common ways in which developers are incorporating proprietary and domain-specific data when building applications of Large Language Models (LLMs): Retrieval-Augmented Generation (RAG) and Fine-Tuning. RAG augments the prompt with the external data, while Fine-Tuning incorporates the additional knowledge into the model itself. However, the pros and cons of both approaches are not well…

    Submitted 30 January, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

  13. arXiv:2312.12972  [pdf, other]

    cs.LG

    From Past to Future: Rethinking Eligibility Traces

    Authors: Dhawal Gupta, Scott M. Jordan, Shreyas Chaudhari, Bo Liu, Philip S. Thomas, Bruno Castro da Silva

    Abstract: In this paper, we introduce a fresh perspective on the challenges of credit assignment and policy evaluation. First, we delve into the nuances of eligibility traces and explore instances where their updates may result in unexpected credit assignment to preceding states. From this investigation emerges the concept of a novel value function, which we refer to as the \emph{bidirectional value functio…

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: Accepted in The 38th Annual AAAI Conference on Artificial Intelligence

  14. arXiv:2311.02747  [pdf, other]

    cs.CV

    Attention Modules Improve Image-Level Anomaly Detection for Industrial Inspection: A DifferNet Case Study

    Authors: André Luiz Buarque Vieira e Silva, Francisco Simões, Danny Kowerko, Tobias Schlosser, Felipe Battisti, Veronica Teichrieb

    Abstract: Within (semi-)automated visual industrial inspection, learning-based approaches for assessing visual defects, including deep neural networks, enable the processing of otherwise small defect patterns in pixel size on high-resolution imagery. The emergence of these often rarely occurring defect patterns explains the general need for labeled data corpora. To alleviate this issue and advance the curre…

    Submitted 7 November, 2023; v1 submitted 5 November, 2023; originally announced November 2023.

    Comments: Accepted at WACV 2024

  15. arXiv:2311.02026  [pdf]

    cs.AI

    APRICOT-Mamba: Acuity Prediction in Intensive Care Unit (ICU): Development and Validation of a Stability, Transitions, and Life-Sustaining Therapies Prediction Model

    Authors: Miguel Contreras, Brandon Silva, Benjamin Shickel, Tezcan Ozrazgat-Baslanti, Yuanfang Ren, Ziyuan Guan, Jeremy Balch, Jiaqing Zhang, Sabyasachi Bandyopadhyay, Kia Khezeli, Azra Bihorac, Parisa Rashidi

    Abstract: The acuity state of patients in the intensive care unit (ICU) can quickly change from stable to unstable. Early detection of deteriorating conditions can result in providing timely interventions and improved survival rates. In this study, we propose APRICOT-M (Acuity Prediction in Intensive Care Unit-Mamba), a 150k-parameter state space-based neural network to predict acuity state, transitions, an…

    Submitted 8 March, 2024; v1 submitted 3 November, 2023; originally announced November 2023.

  16. InsPLAD: A Dataset and Benchmark for Power Line Asset Inspection in UAV Images

    Authors: André Luiz Buarque Vieira e Silva, Heitor de Castro Felix, Franscisco Paulo Magalhães Simões, Veronica Teichrieb, Michel Mozinho dos Santos, Hemir Santiago, Virginia Sgotti, Henrique Lott Neto

    Abstract: Power line maintenance and inspection are essential to avoid power supply interruptions, reducing its high social and financial impacts yearly. Automating power line visual inspections remains a relevant open problem for the industry due to the lack of public real-world datasets of power line components and their various defects to foster new research. This paper introduces InsPLAD, a Power Line A…

    Submitted 3 December, 2023; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: This is an original manuscript of an article published by Taylor & Francis in the International Journal of Remote Sensing on 29 Nov 2023, available online: https://doi.org/10.1080/01431161.2023.2283900

  17. arXiv:2310.19007  [pdf, other]

    cs.LG

    Behavior Alignment via Reward Function Optimization

    Authors: Dhawal Gupta, Yash Chandak, Scott M. Jordan, Philip S. Thomas, Bruno Castro da Silva

    Abstract: Designing reward functions for efficiently guiding reinforcement learning (RL) agents toward specific behaviors is a complex task. This is challenging since it requires the identification of reward structures that are not sparse and that avoid inadvertently inducing undesirable behaviors. Naively modifying the reward structure to offer denser and more frequent feedback can lead to unintended outco…

    Submitted 31 October, 2023; v1 submitted 29 October, 2023; originally announced October 2023.

    Comments: (Spotlight) Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023)

  18. arXiv:2310.06225  [pdf, other]

    cs.AI cs.LG

    GPT-4 as an Agronomist Assistant? Answering Agriculture Exams Using Large Language Models

    Authors: Bruno Silva, Leonardo Nunes, Roberto Estevão, Vijay Aski, Ranveer Chandra

    Abstract: Large language models (LLMs) have demonstrated remarkable capabilities in natural language understanding across various domains, including healthcare and finance. For some tasks, LLMs achieve similar or better performance than trained human beings, therefore it is reasonable to employ human exams (e.g., certification tests) to assess the performance of LLMs. We present a comprehensive evaluation o…

    Submitted 12 October, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

  19. arXiv:2310.05951  [pdf, other]

    cs.CV cs.AI

    Reducing the False Positive Rate Using Bayesian Inference in Autonomous Driving Perception

    Authors: Gledson Melotti, Johann J. S. Bastos, Bruno L. S. da Silva, Tiago Zanotelli, Cristiano Premebida

    Abstract: Object recognition is a crucial step in perception systems for autonomous and intelligent vehicles, as evidenced by the numerous research works in the topic. In this paper, object recognition is explored by using multisensory and multimodality approaches, with the intention of reducing the false positive rate (FPR). The reduction of the FPR becomes increasingly important in perception systems sinc…

    Submitted 22 October, 2023; v1 submitted 9 September, 2023; originally announced October 2023.

    Comments: This paper has been submitted to the journal Pattern Recognition Letters

  20. arXiv:2310.01129  [pdf, other]

    cs.CV

    Strength in Diversity: Multi-Branch Representation Learning for Vehicle Re-Identification

    Authors: Eurico Almeida, Bruno Silva, Jorge Batista

    Abstract: This paper presents an efficient and lightweight multi-branch deep architecture to improve vehicle re-identification (V-ReID). While most V-ReID work uses a combination of complex multi-branch architectures to extract robust and diversified embeddings towards re-identification, we advocate that simple and lightweight architectures can be designed to fulfill the Re-ID task without compromising perf…

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: Paper accepted in ITSC2023

  21. arXiv:2309.01122  [pdf]

    q-bio.QM cs.CE cs.LG

    AI driven B-cell Immunotherapy Design

    Authors: Bruna Moreira da Silva, David B. Ascher, Nicholas Geard, Douglas E. V. Pires

    Abstract: Antibodies, a prominent class of approved biologics, play a crucial role in detecting foreign antigens. The effectiveness of antigen neutralisation and elimination hinges upon the strength, sensitivity, and specificity of the paratope-epitope interaction, which demands resource-intensive experimental techniques for characterisation. In recent years, artificial intelligence and machine learning met…

    Submitted 3 September, 2023; originally announced September 2023.

  22. arXiv:2307.00067  [pdf]

    cs.AI cs.CY cs.LG

    Transformers in Healthcare: A Survey

    Authors: Subhash Nerella, Sabyasachi Bandyopadhyay, Jiaqing Zhang, Miguel Contreras, Scott Siegel, Aysegul Bumin, Brandon Silva, Jessica Sena, Benjamin Shickel, Azra Bihorac, Kia Khezeli, Parisa Rashidi

    Abstract: With Artificial Intelligence (AI) increasingly permeating various aspects of society, including healthcare, the adoption of the Transformers neural network architecture is rapidly changing many applications. Transformer is a type of deep learning architecture initially developed to solve general-purpose Natural Language Processing (NLP) tasks and has subsequently been adapted in many fields, inclu…

    Submitted 30 June, 2023; originally announced July 2023.

  23. arXiv:2306.12962  [pdf, other]

    eess.SY cs.LG math.DS physics.comp-ph

    PyKoopman: A Python Package for Data-Driven Approximation of the Koopman Operator

    Authors: Shaowu Pan, Eurika Kaiser, Brian M. de Silva, J. Nathan Kutz, Steven L. Brunton

    Abstract: PyKoopman is a Python package for the data-driven approximation of the Koopman operator associated with a dynamical system. The Koopman operator is a principled linear embedding of nonlinear dynamics and facilitates the prediction, estimation, and control of strongly nonlinear dynamics using linear systems theory. In particular, PyKoopman provides tools for data-driven system identification for un…

    Submitted 22 June, 2023; originally announced June 2023.

    Comments: 16 pages

  24. arXiv:2306.10121  [pdf, other]

    cs.LG cs.AI

    A Comprehensive Modeling Approach for Crop Yield Forecasts using AI-based Methods and Crop Simulation Models

    Authors: Renato Luiz de Freitas Cunha, Bruno Silva, Priscilla Barreira Avegliano

    Abstract: Numerous solutions for yield estimation are either based on data-driven models, or on crop-simulation models (CSMs). Researchers tend to build data-driven models using nationwide crop information databases provided by agencies such as the USDA. On the opposite side of the spectrum, CSMs require fine data that may be hard to generalize from a handful of fields. In this paper, we propose a comprehen…

    Submitted 16 June, 2023; originally announced June 2023.

  25. arXiv:2305.09838  [pdf, other]

    cs.LG cs.AI

    Coagent Networks: Generalized and Scaled

    Authors: James E. Kostas, Scott M. Jordan, Yash Chandak, Georgios Theocharous, Dhawal Gupta, Martha White, Bruno Castro da Silva, Philip S. Thomas

    Abstract: Coagent networks for reinforcement learning (RL) [Thomas and Barto, 2011] provide a powerful and flexible framework for deriving principled learning rules for arbitrary stochastic neural networks. The coagent framework offers an alternative to backpropagation-based deep learning (BDL) that overcomes some of backpropagation's main limitations. For example, coagent networks can compute different par…

    Submitted 16 May, 2023; originally announced May 2023.

  26. arXiv:2305.07511  [pdf, ps, other]

    cs.LG cs.AI cs.CY eess.IV

    eXplainable Artificial Intelligence on Medical Images: A Survey

    Authors: Matteus Vargas Simão da Silva, Rodrigo Reis Arrais, Jhessica Victoria Santos da Silva, Felipe Souza Tânios, Mateus Antonio Chinelatto, Natalia Backhaus Pereira, Renata De Paris, Lucas Cesar Ferreira Domingos, Rodrigo Dória Villaça, Vitor Lopes Fabris, Nayara Rossi Brito da Silva, Ana Claudia Akemi Matsuki de Faria, Jose Victor Nogueira Alves da Silva, Fabiana Cristina Queiroz de Oliveira Marucci, Francisco Alves de Souza Neto, Danilo Xavier Silva, Vitor Yukio Kondo, Claudio Filipi Gonçalves dos Santos

    Abstract: Over the last few years, the number of works about deep learning applied to the medical field has increased enormously. The necessity of a rigorous assessment of these models is required to explain these results to all people involved in medical exams. A recent field in the machine learning area is explainable artificial intelligence, also known as XAI, which targets to explain the results of such…

    Submitted 12 May, 2023; originally announced May 2023.

  27. arXiv:2304.09849  [pdf]

    cs.SE

    Perceptions of Task Interdependence in Software Development: An Industrial Case Study

    Authors: Mayara Benício de Barros Souza, Fabio Q. B. da Silva, Carolyn Seaman

    Abstract: Context: Task interdependence is a work design factor that expresses the mutual dependency between tasks that compose a whole work. In software development, task interdependencies are created by the technical dependencies between the components of the software system and by how the development tasks are allocated to individuals in a teamwork context. Despite its importance for individual and team…

    Submitted 19 April, 2023; originally announced April 2023.

    Comments: 11 pages

    MSC Class: D.2 SOFTWARE ENGINEERING (K.6.3)

  28. arXiv:2304.06992  [pdf, other]

    cs.RO

    Collaborative Ground-Aerial Multi-Robot System for Disaster Response Missions with a Low-Cost Drone Add-On for Off-the-Shelf Drones

    Authors: Shalutha Rajapakshe, Dilanka Wickramasinghe, Sahan Gurusinghe, Deepana Ishtaweera, Bhanuka Silva, Peshala Jayasekara, Nick Panitz, Paul Flick, Navinda Kottege

    Abstract: In disaster-stricken environments, it's vital to assess the damage quickly, analyse the stability of the environment, and allocate resources to the most vulnerable areas where victims might be present. These missions are difficult and dangerous to be conducted directly by humans. Using the complementary capabilities of both the ground and aerial robots, we investigate a collaborative approach of a…

    Submitted 14 April, 2023; originally announced April 2023.

  29. arXiv:2303.07305  [pdf]

    cs.LG cs.AI

    Transformer Models for Acute Brain Dysfunction Prediction

    Authors: Brandon Silva, Miguel Contreras, Tezcan Ozrazgat Baslanti, Yuanfang Ren, Guan Ziyuan, Kia Khezeli, Azra Bihorac, Parisa Rashidi

    Abstract: Acute brain dysfunctions (ABD), which include coma and delirium, are prevalent in the ICU, especially among older patients. The current approach in manual assessment of ABD by care providers may be sporadic and subjective. Hence, there exists a need for a data-driven robust system automating the assessment and prediction of ABD. In this work, we develop a machine learning system for real-time pred…

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: 15 pages, 6 figures, 6 tables

  30. arXiv:2303.00577  [pdf, ps, other]

    eess.SP cs.DC cs.NI

    Computing Functions Over-the-Air Using Digital Modulations

    Authors: Saeed Razavikia, Jose Mairton Barros da Silva Jr, Carlo Fischione

    Abstract: Over-the-air computation (AirComp) is a known technique in which wireless devices transmit values by analog amplitude modulation so that a function of these values is computed over the communication channel at a common receiver. The physical reason is the superposition properties of the electromagnetic waves, which naturally return sums of analog values. Consequently, the applications of AirComp a…

    Submitted 20 March, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: submitted version to the IEEE ICC conference

  31. arXiv:2302.03022  [pdf, other]

    cs.CV cs.RO eess.IV

    SurgT challenge: Benchmark of Soft-Tissue Trackers for Robotic Surgery

    Authors: Joao Cartucho, Alistair Weld, Samyakh Tukra, Haozheng Xu, Hiroki Matsuzaki, Taiyo Ishikawa, Minjun Kwon, Yong Eun Jang, Kwang-Ju Kim, Gwang Lee, Bizhe Bai, Lueder Kahrs, Lars Boecking, Simeon Allmendinger, Leopold Muller, Yitong Zhang, Yueming Jin, Sophia Bano, Francisco Vasconcelos, Wolfgang Reiter, Jonas Hajek, Bruno Silva, Estevao Lima, Joao L. Vilaca, Sandro Queiros, et al. (1 additional author not shown)

    Abstract: This paper introduces the "SurgT: Surgical Tracking" challenge which was organised in conjunction with MICCAI 2022. There were two purposes for the creation of this challenge: (1) the establishment of the first standardised benchmark for the research community to assess soft-tissue trackers; and (2) to encourage the development of unsupervised deep learning methods, given the lack of annotated da…

    Submitted 30 August, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

  32. arXiv:2301.10330  [pdf, other]

    cs.LG cs.AI

    Off-Policy Evaluation for Action-Dependent Non-Stationary Environments

    Authors: Yash Chandak, Shiv Shankar, Nathaniel D. Bastian, Bruno Castro da Silva, Emma Brunskil, Philip S. Thomas

    Abstract: Methods for sequential decision-making are often built upon a foundational assumption that the underlying decision process is stationary. This limits the application of such methods because real-world problems are often subject to changes due to external factors (passive non-stationarity), changes induced by interactions with the system itself (active non-stationarity), or both (hybrid non-station…

    Submitted 24 January, 2023; originally announced January 2023.

    Comments: Accepted at Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS 2022)

  33. Sample-Efficient Multi-Objective Learning via Generalized Policy Improvement Prioritization

    Authors: Lucas N. Alegre, Ana L. C. Bazzan, Diederik M. Roijers, Ann Nowé, Bruno C. da Silva

    Abstract: Multi-objective reinforcement learning (MORL) algorithms tackle sequential decision problems where agents may have different preferences over (possibly conflicting) reward functions. Such algorithms often learn a set of policies (each optimized for a particular agent preference) that can later be used to solve problems with novel preferences. We introduce a novel algorithm that uses Generalized Po…

    Submitted 23 March, 2023; v1 submitted 18 January, 2023; originally announced January 2023.

    Comments: Accepted to AAMAS 2023

  34. arXiv:2212.13950  [pdf, other]

    cs.IT eess.SP

    Mixed Coherent and Non-Coherent Transmission for Multi-CPU Cell-Free Systems

    Authors: Roberto P. Antonioli, Iran M. Braga Jr., Gabor Fodor, Yuri C. B. Silva, Walter C. Freitas Jr

    Abstract: Existing works on cell-free systems consider either coherent or non-coherent downlink data transmission and a network deployment with a single central processing unit (CPU). While it is known that coherent transmission outperforms noncoherent transmission when assuming unlimited fronthaul links, the former requires a perfect timing synchronization, which is practically not viable over a large netw…

    Submitted 28 December, 2022; originally announced December 2022.

    Comments: Submitted for possible publication in IEEE conference

  35. arXiv:2211.04152  [pdf, other]

    cs.LG eess.SP math.OC

    Federated Learning Using Three-Operator ADMM

    Authors: Shashi Kant, José Mairton B. da Silva Jr., Gabor Fodor, Bo Göransson, Mats Bengtsson, Carlo Fischione

    Abstract: Federated learning (FL) has emerged as an instance of distributed machine learning paradigm that avoids the transmission of data generated on the users' side. Although data are not transmitted, edge devices have to deal with limited communication bandwidths, data heterogeneity, and straggler effects due to the limited computational resources of users' devices. A prominent approach to overcome such…

    Submitted 25 March, 2024; v1 submitted 8 November, 2022; originally announced November 2022.

    Comments: accepted to IEEE Journal of Selected Topics in Signal Processing, 2022

  36. arXiv:2210.17469  [pdf, ps, other]

    cs.LG cs.DC eess.SP

    Blind Asynchronous Over-the-Air Federated Edge Learning

    Authors: Saeed Razavikia, Jaume Anguera Peris, Jose Mairton B. da Silva Jr, Carlo Fischione

    Abstract: Federated Edge Learning (FEEL) is a distributed machine learning technique where each device contributes to training a global inference model by independently performing local computations with their data. More recently, FEEL has been merged with over-the-air computation (OAC), where the global model is calculated over the air by leveraging the superposition of analog signals. However, when implem…

    Submitted 31 October, 2022; originally announced October 2022.

  37. arXiv:2208.14501  [pdf, other]

    cs.LG cs.AI cs.RO eess.SY

    Model-Based Reinforcement Learning with SINDy

    Authors: Rushiv Arora, Bruno Castro da Silva, Eliot Moss

    Abstract: We draw on the latest advancements in the physics community to propose a novel method for discovering the governing non-linear dynamics of physical systems in reinforcement learning (RL). We establish that this method is capable of discovering the underlying dynamics using significantly fewer trajectories (as little as one rollout with $\leq 30$ time steps) than state of the art model learning alg…

    Submitted 30 August, 2022; originally announced August 2022.

    Comments: 8 pages, 1 figure, 1 table, 1 algorithm, presented at the Decision Awareness in Reinforcement Learning workshop held at the International Conference on Machine Learning, 22 July 2022, Baltimore MD, USA

  38. arXiv:2208.11848  [pdf, other]

    cs.CR cs.LG

    On Differential Privacy for Federated Learning in Wireless Systems with Multiple Base Stations

    Authors: Nima Tavangaran, Mingzhe Chen, Zhaohui Yang, José Mairton B. Da Silva Jr., H. Vincent Poor

    Abstract: In this work, we consider a federated learning model in a wireless system with multiple base stations and inter-cell interference. We apply a differential private scheme to transmit information from users to their corresponding base station during the learning phase. We show the convergence behavior of the learning process by deriving an upper bound on its optimality gap. Furthermore, we define an…

    Submitted 24 August, 2022; originally announced August 2022.

  39. arXiv:2208.11744  [pdf, other]

    cs.LG cs.AI cs.CY

    Enforcing Delayed-Impact Fairness Guarantees

    Authors: Aline Weber, Blossom Metevier, Yuriy Brun, Philip S. Thomas, Bruno Castro da Silva

    Abstract: Recent research has shown that seemingly fair machine learning models, when used to inform decisions that have an impact on peoples' lives or well-being (e.g., applications involving education, employment, and lending), can inadvertently increase social inequality in the long term. This is because prior fairness-aware algorithms only consider static fairness constraints, such as equal opportunity…

    Submitted 24 August, 2022; originally announced August 2022.

    Comments: 24 pages, 5 figures

  40. Learning constitutive models from microstructural simulations via a non-intrusive reduced basis method: Extension to geometrical parameterizations

    Authors: Theron Guo, Francesco A. B. Silva, Ondřej Rokoš, Karen Veroy

    Abstract: Understanding structure-property relations is essential to optimally design materials for specific applications. Two-scale simulations are often employed to analyze the effect of the microstructure on a component's macroscopic properties. However, they are typically computationally expensive and infeasible in multi-query contexts such as optimization and material design. To make such analyses amen…

    Submitted 24 October, 2022; v1 submitted 27 June, 2022; originally announced June 2022.

    Journal ref: Computer Methods in Applied Mechanics and Engineering 401 (2022): 115636

  41. arXiv:2206.11326  [pdf, other]

    cs.LG cs.AI

    Optimistic Linear Support and Successor Features as a Basis for Optimal Policy Transfer

    Authors: Lucas N. Alegre, Ana L. C. Bazzan, Bruno C. da Silva

    Abstract: In many real-world applications, reinforcement learning (RL) agents might have to solve multiple tasks, each one typically modeled via a reward function. If reward functions are expressed linearly, and the agent has previously learned a set of policies for different tasks, successor features (SFs) can be exploited to combine such policies and identify reasonable solutions for new problems. However…

    Submitted 22 June, 2022; originally announced June 2022.

    Comments: Proceedings of the 39th International Conference on Machine Learning (ICML'22)

  42. arXiv:2205.05032  [pdf, other]

    cs.DB cs.DL q-bio.PE

    Brazilian COVID-19 data streaming

    Authors: Nívea B. da Silva, Luis Iván O. Valencia, Fábio M. H. S. Filho, Andressa C. S. Ferreira, Felipe A. C. Pereira, Guilherme L. de Oliveira, Paloma F. Oliveira, Moreno S. Rodrigues, Pablo I. P. Ramos, Juliane F. Oliveira

    Abstract: We collected individualized (unidentifiable) and aggregated openly available data from various sources related to suspected/confirmed SARS-CoV-2 infections, vaccinations, non-pharmaceutical government interventions, human mobility, and levels of population inequality in Brazil. In addition, a data structure allowing real-time data collection, curation, integration, and extract-transform-load proce…

    Submitted 10 May, 2022; originally announced May 2022.

    Comments: 12 pages, 6 figures, 2 tables

  43. arXiv:2203.15856  [pdf, other]

    cs.CV

    OdontoAI: A human-in-the-loop labeled data set and an online platform to boost research on dental panoramic radiographs

    Authors: Bernardo Silva, Laís Pinheiro, Brenda Sobrinho, Fernanda Lima, Bruna Sobrinho, Kalyf Abdalla, Matheus Pithon, Patrícia Cury, Luciano Oliveira

    Abstract: Deep learning has remarkably advanced in the last few years, supported by large labeled data sets. These data sets are precious yet scarce because of the time-consuming labeling procedures, discouraging researchers from producing them. This scarcity is especially true in dentistry, where deep learning applications are still in an embryonic stage. Motivated by this background, we address in this st…

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: 45 pages, 11 figures, journal preprint

  44. arXiv:2112.13819  [pdf, other]

    cs.RO

    Trajectory Planning for Hybrid Unmanned Aerial Underwater Vehicles with Smooth Media Transition

    Authors: Pedro Miranda Pinheiro, Armando Alves Neto, Ricardo Bedin Grando, Cesar Bastos da Silva, Vivian Misaki Aoki, Dayana Cardoso, Alexandre Campos Horn, Paulo Lilles Jorge Drews-Jr

    Abstract: In the last decade, a great effort has been employed in the study of Hybrid Unmanned Aerial Underwater Vehicles, robots that can easily fly and dive into the water with different levels of mechanical adaptation. However, most of this literature is concentrated on physical design, practical issues of construction, and, more recently, low-level control strategies. Little has been done in the context…

    Submitted 27 December, 2021; originally announced December 2021.

    Comments: Accepted to the Journal of Intelligent & Robotic Systems

  45. arXiv:2112.13687  [pdf]

    cs.LG

    Predição de Incidência de Lesão por Pressão em Pacientes de UTI usando Aprendizado de Máquina

    Authors: Henrique P. Silva, Arthur D. Reys, Daniel S. Severo, Dominique H. Ruther, Flávio A. O. B. Silva, Maria C. S. S. Guimarães, Roberto Z. A. Pinto, Saulo D. S. Pedro, Túlio P. Navarro, Danilo Silva

    Abstract: Pressure ulcers have high prevalence in ICU patients but are preventable if identified in initial stages. In practice, the Braden scale is used to classify high-risk patients. This paper investigates the use of machine learning in electronic health records data for this task, by using data available in MIMIC-III v1.4. Two main contributions are made: a new approach for evaluating models that consi…

    Submitted 23 December, 2021; originally announced December 2021.

    Comments: 3 pages, 1 figure, in Portuguese, accepted at XVIII Congresso Brasileiro de Informática em Saúde (CBIS 2021)

  46. arXiv:2111.08481  [pdf, other]

    eess.SY cs.LG physics.flu-dyn

    PySINDy: A comprehensive Python package for robust sparse system identification

    Authors: Alan A. Kaptanoglu, Brian M. de Silva, Urban Fasel, Kadierdan Kaheman, Andy J. Goldschmidt, Jared L. Callaham, Charles B. Delahunt, Zachary G. Nicolaou, Kathleen Champion, Jean-Christophe Loiseau, J. Nathan Kutz, Steven L. Brunton

    Abstract: Automated data-driven modeling, the process of directly discovering the governing equations of a system from data, is increasingly being used across the scientific community. PySINDy is a Python package that provides tools for applying the sparse identification of nonlinear dynamics (SINDy) approach to data-driven model discovery. In this major update to PySINDy, we implement several advanced feat…

    Submitted 25 January, 2022; v1 submitted 12 November, 2021; originally announced November 2021.

  47. arXiv:2110.15013  [pdf, other]

    math.DS cs.LG math-ph physics.comp-ph stat.ML

    Deeptime: a Python library for machine learning dynamical models from time series data

    Authors: Moritz Hoffmann, Martin Scherer, Tim Hempel, Andreas Mardt, Brian de Silva, Brooke E. Husic, Stefan Klus, Hao Wu, Nathan Kutz, Steven L. Brunton, Frank Noé

    Abstract: Generation and analysis of time-series data is relevant to many quantitative fields ranging from economics to fluid mechanics. In the physical sciences, structures such as metastable and coherent sets, slow relaxation processes, collective variables dominant transition pathways or manifolds and channels of probability flow can be of great importance for understanding and characterizing the kinetic…

    Submitted 11 December, 2021; v1 submitted 28 October, 2021; originally announced October 2021.

    Journal ref: Machine Learning: Science and Technology, Volume 3, Number 1, 2021

  48. arXiv:2108.07743  [pdf, other]

    cs.LG

    Incremental cluster validity index-guided online learning for performance and robustness to presentation order

    Authors: Leonardo Enzo Brito da Silva, Nagasharath Rayapati, Donald C. Wunsch II

    Abstract: In streaming data applications incoming samples are processed and discarded, therefore, intelligent decision-making is crucial for the performance of lifelong learning systems. In addition, the order in which samples arrive may heavily affect the performance of online (and offline) incremental learners. The recently introduced incremental cluster validity indices (iCVIs) provide valuable aid in ad…

    Submitted 17 August, 2021; originally announced August 2021.

  49. arXiv:2107.00187  [pdf, other]

    cs.DC cs.AI

    Context-aware Execution Migration Tool for Data Science Jupyter Notebooks on Hybrid Clouds

    Authors: Renato L. F. Cunha, Lucas V. Real, Renan Souza, Bruno Silva, Marco A. S. Netto

    Abstract: Interactive computing notebooks, such as Jupyter notebooks, have become a popular tool for developing and improving data-driven models. Such notebooks tend to be executed either in the user's own machine or in a cloud environment, having drawbacks and benefits in both approaches. This paper presents a solution developed as a Jupyter extension that automatically selects which cells, as well as in w…

    Submitted 30 June, 2021; originally announced July 2021.

    Comments: 10 pages

  50. arXiv:2106.11447  [pdf, other]

    eess.IV cs.CV cs.LG

    Encoder-Decoder Architectures for Clinically Relevant Coronary Artery Segmentation

    Authors: João Lourenço Silva, Miguel Nobre Menezes, Tiago Rodrigues, Beatriz Silva, Fausto J. Pinto, Arlindo L. Oliveira

    Abstract: Coronary X-ray angiography is a crucial clinical procedure for the diagnosis and treatment of coronary artery disease, which accounts for roughly 16% of global deaths every year. However, the images acquired in these procedures have low resolution and poor contrast, making lesion detection and assessment challenging. Accurate coronary artery segmentation not only helps mitigate these problems, but…

    Submitted 21 June, 2021; originally announced June 2021.