Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 141 results for author: Oliveira, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.17915  [pdf, other

    cs.CV cs.AI

    Semi-supervised classification of dental conditions in panoramic radiographs using large language model and instance segmentation: A real-world dataset evaluation

    Authors: Bernardo Silva, Jefferson Fontinele, Carolina Letícia Zilli Vieira, João Manuel R. S. Tavares, Patricia Ramos Cury, Luciano Oliveira

    Abstract: Dental panoramic radiographs offer vast diagnostic opportunities, but training supervised deep learning networks for automatic analysis of those radiology images is hampered by a shortage of labeled data. Here, a different perspective on this problem is introduced. A semi-supervised learning framework is proposed to classify thirteen dental conditions on panoramic radiographs, with a particular em… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 43 pages, 12 figures, 9 tables

  2. arXiv:2406.17526  [pdf, other

    cs.CL cs.IR

    LumberChunker: Long-Form Narrative Document Segmentation

    Authors: André V. Duarte, João Marques, Miguel Graça, Miguel Freire, Lei Li, Arlindo L. Oliveira

    Abstract: Modern NLP tasks increasingly rely on dense retrieval methods to access up-to-date and relevant contextual information. We are motivated by the premise that retrieval benefits from segments that can vary in size such that a content's semantic independence is better captured. We propose LumberChunker, a method leveraging an LLM to dynamically segment documents, which iteratively prompts the LLM to… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    ACM Class: I.2

  3. arXiv:2406.06700  [pdf, other

    cs.LG cs.AI

    Forget Sharpness: Perturbed Forgetting of Model Biases Within SAM Dynamics

    Authors: Ankit Vani, Frederick Tung, Gabriel L. Oliveira, Hossein Sharifi-Noghabi

    Abstract: Despite attaining high empirical generalization, the sharpness of models trained with sharpness-aware minimization (SAM) do not always correlate with generalization error. Instead of viewing SAM as minimizing sharpness to improve generalization, our paper considers a new perspective based on SAM's training dynamics. We propose that perturbations in SAM perform perturbed forgetting, where they disc… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Published as a conference paper at ICML 2024. 9 pages main, 15 pages total including references and appendix

  4. arXiv:2405.18435  [pdf, other

    eess.IV cs.CV

    QUBIQ: Uncertainty Quantification for Biomedical Image Segmentation Challenge

    Authors: Hongwei Bran Li, Fernando Navarro, Ivan Ezhov, Amirhossein Bayat, Dhritiman Das, Florian Kofler, Suprosanna Shit, Diana Waldmannstetter, Johannes C. Paetzold, Xiaobin Hu, Benedikt Wiestler, Lucas Zimmer, Tamaz Amiranashvili, Chinmay Prabhakar, Christoph Berger, Jonas Weidner, Michelle Alonso-Basant, Arif Rashid, Ujjwal Baid, Wesam Adel, Deniz Ali, Bhakti Baheti, Yingbin Bai, Ishaan Bhatt, Sabri Can Cetindag , et al. (55 additional authors not shown)

    Abstract: Uncertainty in medical image segmentation tasks, especially inter-rater variability, arising from differences in interpretations and annotations by various experts, presents a significant challenge in achieving consistent and reliable image segmentation. This variability not only reflects the inherent complexity and subjective nature of medical image interpretation but also directly impacts the de… ▽ More

    Submitted 24 June, 2024; v1 submitted 19 March, 2024; originally announced May 2024.

    Comments: initial technical report

  5. arXiv:2404.01446  [pdf, other

    cs.CV cs.AI

    Finding Regions of Interest in Whole Slide Images Using Multiple Instance Learning

    Authors: Martim Afonso, Praphulla M. S. Bhawsar, Monjoy Saha, Jonas S. Almeida, Arlindo L. Oliveira

    Abstract: Whole Slide Images (WSI), obtained by high-resolution digital scanning of microscope slides at multiple scales, are the cornerstone of modern Digital Pathology. However, they represent a particular challenge to AI-based/AI-mediated analysis because pathology labeling is typically done at slide-level, instead of tile-level. It is not just that medical diagnostics is recorded at the specimen level,… ▽ More

    Submitted 11 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

  6. arXiv:2403.17816  [pdf, other

    cs.CL

    Graph Language Model (GLM): A new graph-based approach to detect social instabilities

    Authors: Wallyson Lemes de Oliveira, Vahid Shamsaddini, Ali Ghofrani, Rahul Singh Inda, Jithendra Sai Veeramaneni, Étienne Voutaz

    Abstract: This scientific report presents a novel methodology for the early prediction of important political events using News datasets. The methodology leverages natural language processing, graph theory, clique analysis, and semantic relationships to uncover hidden predictive signals within the data. Initially, we designed a preliminary version of the method and tested it on a few events. This analysis r… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  7. arXiv:2402.10889  [pdf, other

    cs.CR

    Evaluation of EAP Usage for Authenticating Eduroam Users in 5G Networks

    Authors: Leonardo Azalim de Oliveira, Edelberto Franco Silva

    Abstract: The fifth generation of the telecommunication networks (5G) established the service-oriented paradigm on the mobile networks. In this new context, the 5G Core component has become extremely flexible so, in addition to serving mobile networks, it can also be used to connect devices from the so-called non-3GPP networks, which contains technologies such as WiFi. The implementation of this connectivit… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    ACM Class: C.2.0

  8. arXiv:2402.09910  [pdf, other

    cs.CL cs.LG

    DE-COP: Detecting Copyrighted Content in Language Models Training Data

    Authors: André V. Duarte, Xuandong Zhao, Arlindo L. Oliveira, Lei Li

    Abstract: How can we detect if copyrighted content was used in the training process of a language model, considering that the training data is typically undisclosed? We are motivated by the premise that a language model is likely to identify verbatim excerpts from its training text. We propose DE-COP, a method to determine whether a piece of copyrighted content was included in training. DE-COP's core approa… ▽ More

    Submitted 25 June, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    ACM Class: I.2

  9. arXiv:2311.16121  [pdf, other

    cs.CV cs.GR

    Real-Time Neural Materials using Block-Compressed Features

    Authors: Clément Weinreich, Louis de Oliveira, Antoine Houdard, Georges Nader

    Abstract: Neural materials typically consist of a collection of neural features along with a decoder network. The main challenge in integrating such models in real-time rendering pipelines lies in the large size required to store their features in GPU memory and the complexity of evaluating the network efficiently. We present a neural material model whose features and decoder are specifically designed to be… ▽ More

    Submitted 17 February, 2024; v1 submitted 26 October, 2023; originally announced November 2023.

    Comments: Eurographics 2024

  10. arXiv:2311.08547  [pdf, other

    cs.AI

    DeepThought: An Architecture for Autonomous Self-motivated Systems

    Authors: Arlindo L. Oliveira, Tiago Domingos, Mário Figueiredo, Pedro U. Lima

    Abstract: The ability of large language models (LLMs) to engage in credible dialogues with humans, taking into account the training data and the context of the conversation, has raised discussions about their ability to exhibit intrinsic motivations, agency, or even some degree of consciousness. We argue that the internal architecture of LLMs and their finite and volatile state cannot support any of these p… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    ACM Class: I.2

  11. arXiv:2311.02891  [pdf, other

    cs.LG

    AdaFlood: Adaptive Flood Regularization

    Authors: Wonho Bae, Yi Ren, Mohamad Osama Ahmed, Frederick Tung, Danica J. Sutherland, Gabriel L. Oliveira

    Abstract: Although neural networks are conventionally optimized towards zero training loss, it has been recently learned that targeting a non-zero training loss threshold, referred to as a flood level, often enables better test time generalization. Current approaches, however, apply the same constant flood level to all training samples, which inherently assumes all the samples have the same difficulty. We p… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

  12. arXiv:2311.00211  [pdf, ps, other

    cs.SE

    Anachronic Tertiary Studies in Software Engineering: An Exploratory Quaternary Study

    Authors: Valdemar Vicente Graciano Neto, Célia Laís Rodrigues, Fernando Kenji Kamei, Juliano Lopes de Oliveira, Eliomar Araújo de Lima, Mohamad Kassab, Roberto Oliveira

    Abstract: Systematic literature reviews tentativelydescribe the state of the art in a given research area. However, the continuous publication of new primary and secondary studies following the release of a tertiary study can make the communication of results not integrally representative in regards to the advances achieved by that time. Consequently, using such a study as a reference within specific bodies… ▽ More

    Submitted 31 October, 2023; originally announced November 2023.

    Comments: 8 pages, not peer-reviewed yet

  13. arXiv:2310.15328  [pdf

    eess.IV cs.CV cs.LG

    DeepVox and SAVE-CT: a contrast- and dose-independent 3D deep learning approach for thoracic aorta segmentation and aneurysm prediction using computed tomography scans

    Authors: Matheus del-Valle, Lariza Laura de Oliveira, Henrique Cursino Vieira, Henrique Min Ho Lee, Lucas Lembrança Pinheiro, Maria Fernanda Portugal, Newton Shydeo Brandão Miyoshi, Nelson Wolosker

    Abstract: Thoracic aortic aneurysm (TAA) is a fatal disease which potentially leads to dissection or rupture through progressive enlargement of the aorta. It is usually asymptomatic and screening recommendation are limited. The gold-standard evaluation is performed by computed tomography angiography (CTA) and radiologists time-consuming assessment. Scans for other indications could help on this screening, h… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: 23 pages, 4 figures, 7 tables

    ACM Class: I.2; I.4

  14. arXiv:2310.10575  [pdf, other

    cs.CV q-bio.NC

    Matching the Neuronal Representations of V1 is Necessary to Improve Robustness in CNNs with V1-like Front-ends

    Authors: Ruxandra Barbulescu, Tiago Marques, Arlindo L. Oliveira

    Abstract: While some convolutional neural networks (CNNs) have achieved great success in object recognition, they struggle to identify objects in images corrupted with different types of common noise patterns. Recently, it was shown that simulating computations in early visual areas at the front of CNNs leads to improvements in robustness to image corruptions. Here, we further explore this result and show t… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  15. arXiv:2307.10018  [pdf, other

    cs.RO cs.AI

    RobôCIn Small Size League Extended Team Description Paper for RoboCup 2023

    Authors: Aline Lima de Oliveira, Cauê Addae da Silva Gomes, Cecília Virginia Santos da Silva, Charles Matheus de Sousa Alves, Danilo Andrade Martins de Souza, Driele Pires Ferreira Araújo Xavier, Edgleyson Pereira da Silva, Felipe Bezerra Martins, Lucas Henrique Cavalcanti Santos, Lucas Dias Maciel, Matheus Paixão Gumercindo dos Santos, Matheus Lafayette Vasconcelos, Matheus Vinícius Teotonio do Nascimento Andrade, João Guilherme Oliveira Carvalho de Melo, João Pedro Souza Pereira de Moura, José Ronald da Silva, José Victor Silva Cruz, Pedro Henrique Santana de Morais, Pedro Paulo Salman de Oliveira, Riei Joaquim Matos Rodrigues, Roberto Costa Fernandes, Ryan Vinicius Santos Morais, Tamara Mayara Ramos Teobaldo, Washington Igor dos Santos Silva, Edna Natividade Silva Barros

    Abstract: RobôCIn has participated in RoboCup Small Size League since 2019, won its first world title in 2022 (Division B), and is currently a three-times Latin-American champion. This paper presents our improvements to defend the Small Size League (SSL) division B title in RoboCup 2023 in Bordeaux, France. This paper aims to share some of the academic research that our team developed over the past year. Ou… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

  16. arXiv:2307.02300  [pdf, other

    cs.LG cs.IR

    Improving Address Matching using Siamese Transformer Networks

    Authors: André V. Duarte, Arlindo L. Oliveira

    Abstract: Matching addresses is a critical task for companies and post offices involved in the processing and delivery of packages. The ramifications of incorrectly delivering a package to the wrong recipient are numerous, ranging from harm to the company's reputation to economic and environmental costs. This research introduces a deep learning-based model designed to increase the efficiency of address matc… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: To be published in the 22nd EPIA Conference on Artificial Intelligence, EPIA 2023, Faial Island - Azores, Portugal, 5-8 September 2023, Proceedings

    ACM Class: I.2

  17. Vehicle Occurrence-based Parking Space Detection

    Authors: Paulo R. Lisboa de Almeida, Jeovane Honório Alves, Luiz S. Oliveira, Andre Gustavo Hochuli, João V. Fröhlich, Rodrigo A. Krauel

    Abstract: Smart-parking solutions use sensors, cameras, and data analysis to improve parking efficiency and reduce traffic congestion. Computer vision-based methods have been used extensively in recent years to tackle the problem of parking lot management, but most of the works assume that the parking spots are manually labeled, impacting the cost and feasibility of deployment. To fill this gap, this work p… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: Accepted for presentation at the 2023 IEEE International Conference on Systems, Man, and Cybernetics (SMC 2023)

  18. arXiv:2303.05401  [pdf, other

    cs.CL cs.LG cs.SI

    Early Warning Signals of Social Instabilities in Twitter Data

    Authors: Vahid Shamsaddini, Henry Kirveslahti, Raphael Reinauer, Wallyson Lemes de Oliveira, Matteo Caorsi, Etienne Voutaz

    Abstract: The goal of this project is to create and study novel techniques to identify early warning signals for socially disruptive events, like riots, wars, or revolutions using only publicly available data on social media. Such techniques need to be robust enough to work on real-time data: to achieve this goal we propose a topological approach together with more standard BERT models. Indeed, topology-bas… ▽ More

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: 22 pages

    MSC Class: 68

  19. arXiv:2301.12023  [pdf, other

    cs.LG

    Meta Temporal Point Processes

    Authors: Wonho Bae, Mohamed Osama Ahmed, Frederick Tung, Gabriel L. Oliveira

    Abstract: A temporal point process (TPP) is a stochastic process where its realization is a sequence of discrete events in time. Recent work in TPPs model the process using a neural network in a supervised learning framework, where a training set is a collection of all the sequences. In this work, we propose to train TPPs in a meta learning framework, where each sequence is treated as a different task, via… ▽ More

    Submitted 27 January, 2023; originally announced January 2023.

    Comments: Accepted to ICLR2023

  20. arXiv:2301.10608  [pdf, other

    cs.CV cs.LG

    Connecting metrics for shape-texture knowledge in computer vision

    Authors: Tiago Oliveira, Tiago Marques, Arlindo L. Oliveira

    Abstract: Modern artificial neural networks, including convolutional neural networks and vision transformers, have mastered several computer vision tasks, including object recognition. However, there are many significant differences between the behavior and robustness of these systems and of the human visual system. Deep neural networks remain brittle and susceptible to many changes in the image that do not… ▽ More

    Submitted 25 January, 2023; originally announced January 2023.

    Comments: 7 pages, 3 figures

  21. arXiv:2212.08568  [pdf, other

    cs.CV cs.LG

    Biomedical image analysis competitions: The state of current participation practice

    Authors: Matthias Eisenmann, Annika Reinke, Vivienn Weru, Minu Dietlinde Tizabi, Fabian Isensee, Tim J. Adler, Patrick Godau, Veronika Cheplygina, Michal Kozubek, Sharib Ali, Anubha Gupta, Jan Kybic, Alison Noble, Carlos Ortiz de Solórzano, Samiksha Pachade, Caroline Petitjean, Daniel Sage, Donglai Wei, Elizabeth Wilden, Deepak Alapatt, Vincent Andrearczyk, Ujjwal Baid, Spyridon Bakas, Niranjan Balu, Sophia Bano , et al. (331 additional authors not shown)

    Abstract: The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis,… ▽ More

    Submitted 12 September, 2023; v1 submitted 16 December, 2022; originally announced December 2022.

  22. arXiv:2210.11327  [pdf, other

    cs.LG stat.ML

    Improving Data Quality with Training Dynamics of Gradient Boosting Decision Trees

    Authors: Moacir Antonelli Ponti, Lucas de Angelis Oliveira, Mathias Esteban, Valentina Garcia, Juan Martín Román, Luis Argerich

    Abstract: Real world datasets contain incorrectly labeled instances that hamper the performance of the model and, in particular, the ability to generalize out of distribution. Also, each example might have different contribution towards learning. This motivates studies to better understanding of the role of data instances with respect to their contribution in good metrics in models. In this paper we propose… ▽ More

    Submitted 22 February, 2024; v1 submitted 20 October, 2022; originally announced October 2022.

  23. arXiv:2209.10901  [pdf, other

    cs.LG

    Pretraining the Vision Transformer using self-supervised methods for vision based Deep Reinforcement Learning

    Authors: Manuel Goulão, Arlindo L. Oliveira

    Abstract: The Vision Transformer architecture has shown to be competitive in the computer vision (CV) space where it has dethroned convolution-based networks in several benchmarks. Nevertheless, convolutional neural networks (CNN) remain the preferential architecture for the representation module in reinforcement learning. In this work, we study pretraining a Vision Transformer using several state-of-the-ar… ▽ More

    Submitted 18 July, 2023; v1 submitted 22 September, 2022; originally announced September 2022.

  24. arXiv:2209.07928  [pdf, other

    cs.AI cs.CL eess.SY

    The BLue Amazon Brain (BLAB): A Modular Architecture of Services about the Brazilian Maritime Territory

    Authors: Paulo Pirozelli, Ais B. R. Castro, Ana Luiza C. de Oliveira, André S. Oliveira, Flávio N. Cação, Igor C. Silveira, João G. M. Campos, Laura C. Motheo, Leticia F. Figueiredo, Lucas F. A. O. Pellicer, Marcelo A. José, Marcos M. José, Pedro de M. Ligabue, Ricardo S. Grava, Rodrigo M. Tavares, Vinícius B. Matos, Yan V. Sym, Anna H. R. Costa, Anarosa A. F. Brandão, Denis D. Mauá, Fabio G. Cozman, Sarajane M. Peres

    Abstract: We describe the first steps in the development of an artificial agent focused on the Brazilian maritime territory, a large region within the South Atlantic also known as the Blue Amazon. The "BLue Amazon Brain" (BLAB) integrates a number of services aimed at disseminating information about this region and its importance, functioning as a tool for environmental awareness. The main service provided… ▽ More

    Submitted 6 September, 2022; originally announced September 2022.

    Journal ref: AI: Modeling Oceans and Climate Change (IJCAI-ECAI), 2022

  25. arXiv:2208.14375  [pdf

    eess.IV cs.CV cs.LG cs.NE

    Automated recognition of the pericardium contour on processed CT images using genetic algorithms

    Authors: E. O. Rodrigues, L. O. Rodrigues, L. S. N. Oliveira, A. Conci, P. Liatsis

    Abstract: This work proposes the use of Genetic Algorithms (GA) in tracing and recognizing the pericardium contour of the human heart using Computed Tomography (CT) images. We assume that each slice of the pericardium can be modelled by an ellipse, the parameters of which need to be optimally determined. An optimal ellipse would be one that closely follows the pericardium contour and, consequently, separate… ▽ More

    Submitted 30 August, 2022; originally announced August 2022.

    Journal ref: Computers in Biology and Medicine, Volume 87, 2017, Pages 38-45, ISSN 0010-4825

  26. arXiv:2206.08537  [pdf, ps, other

    cs.CV cs.LG

    Large-Margin Representation Learning for Texture Classification

    Authors: Jonathan de Matos, Luiz Eduardo Soares de Oliveira, Alceu de Souza Britto Junior, Alessandro Lameiras Koerich

    Abstract: This paper presents a novel approach combining convolutional layers (CLs) and large-margin metric learning for training supervised models on small datasets for texture classification. The core of such an approach is a loss function that computes the distances between instances of interest and support vectors. The objective is to update the weights of CLs iteratively to learn a representation with… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: 7 pages

  27. Fast & Furious: Modelling Malware Detection as Evolving Data Streams

    Authors: Fabrício Ceschin, Marcus Botacin, Heitor Murilo Gomes, Felipe Pinagé, Luiz S. Oliveira, André Grégio

    Abstract: Malware is a major threat to computer systems and imposes many challenges to cyber security. Targeted threats, such as ransomware, cause millions of dollars in losses every year. The constant increase of malware infections has been motivating popular antiviruses (AVs) to develop dedicated detection strategies, which include meticulously crafted machine learning (ML) pipelines. However, malware dev… ▽ More

    Submitted 15 August, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

  28. arXiv:2205.05032  [pdf, other

    cs.DB cs.DL q-bio.PE

    Brazilian COVID-19 data streaming

    Authors: Nívea B. da Silva, Luis Iván O. Valencia, Fábio M. H. S. Filho, Andressa C. S. Ferreira, Felipe A. C. Pereira, Guilherme L. de Oliveira, Paloma F. Oliveira, Moreno S. Rodrigues, Pablo I. P. Ramos, Juliane F. Oliveira

    Abstract: We collected individualized (unidentifiable) and aggregated openly available data from various sources related to suspected/confirmed SARS-CoV-2 infections, vaccinations, non-pharmaceutical government interventions, human mobility, and levels of population inequality in Brazil. In addition, a data structure allowing real-time data collection, curation, integration, and extract-transform-load proce… ▽ More

    Submitted 10 May, 2022; originally announced May 2022.

    Comments: 12 pages, 6 figures, 2 tables

  29. arXiv:2203.15856  [pdf, other

    cs.CV

    OdontoAI: A human-in-the-loop labeled data set and an online platform to boost research on dental panoramic radiographs

    Authors: Bernardo Silva, Laís Pinheiro, Brenda Sobrinho, Fernanda Lima, Bruna Sobrinho, Kalyf Abdalla, Matheus Pithon, Patrícia Cury, Luciano Oliveira

    Abstract: Deep learning has remarkably advanced in the last few years, supported by large labeled data sets. These data sets are precious yet scarce because of the time-consuming labeling procedures, discouraging researchers from producing them. This scarcity is especially true in dentistry, where deep learning applications are still in an embryonic stage. Motivated by this background, we address in this st… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: 45 pages, 11 figures, journal preprint

  30. arXiv:2202.08176  [pdf, other

    cs.LG cs.AI

    Bias and unfairness in machine learning models: a systematic literature review

    Authors: Tiago Palma Pagano, Rafael Bessa Loureiro, Fernanda Vitória Nascimento Lisboa, Gustavo Oliveira Ramos Cruz, Rodrigo Matos Peixoto, Guilherme Aragão de Sousa Guimarães, Lucas Lisboa dos Santos, Maira Matos Araujo, Marco Cruz, Ewerton Lopes Silva de Oliveira, Ingrid Winkler, Erick Giovani Sperandio Nascimento

    Abstract: One of the difficulties of artificial intelligence is to ensure that model decisions are fair and free of bias. In research, datasets, metrics, techniques, and tools are applied to detect and mitigate algorithmic unfairness and bias. This study aims to examine existing knowledge on bias and unfairness in Machine Learning models, identifying mitigation methods, fairness metrics, and supporting tool… ▽ More

    Submitted 3 November, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

  31. arXiv:2201.02874  [pdf, ps, other

    cs.LG cs.AI

    Assessing Policy, Loss and Planning Combinations in Reinforcement Learning using a New Modular Architecture

    Authors: Tiago Gaspar Oliveira, Arlindo L. Oliveira

    Abstract: The model-based reinforcement learning paradigm, which uses planning algorithms and neural network models, has recently achieved unprecedented results in diverse applications, leading to what is now known as deep reinforcement learning. These agents are quite complex and involve multiple components, factors that can create challenges for research. In this work, we propose a new modular software ar… ▽ More

    Submitted 8 January, 2022; originally announced January 2022.

    MSC Class: 49L20 ACM Class: I.2.6; I.2.8

  32. arXiv:2112.12748  [pdf, other

    cs.CV cs.AI cs.LG

    Assessing the Impact of Attention and Self-Attention Mechanisms on the Classification of Skin Lesions

    Authors: Rafael Pedro, Arlindo L. Oliveira

    Abstract: Attention mechanisms have raised significant interest in the research community, since they promise significant improvements in the performance of neural network architectures. However, in any specific problem, we still lack a principled way to choose specific mechanisms and hyper-parameters that lead to guaranteed improvements. More recently, self-attention has been proposed and widely used in tr… ▽ More

    Submitted 23 December, 2021; originally announced December 2021.

    ACM Class: I.5.4

  33. arXiv:2112.06735  [pdf, other

    cond-mat.stat-mech cs.LG math.AT

    Unsupervised machine learning approaches to the $q$-state Potts model

    Authors: Andrea Tirelli, Danyella O. Carvalho, Lucas A. Oliveira, J. P. Lima, Natanael C. Costa, Raimundo R. dos Santos

    Abstract: In this paper with study phase transitions of the $q$-state Potts model, through a number of unsupervised machine learning techniques, namely Principal Component Analysis (PCA), $k$-means clustering, Uniform Manifold Approximation and Projection (UMAP), and Topological Data Analysis (TDA). Even though in all cases we are able to retrieve the correct critical temperatures $T_c(q)$, for $q = 3, 4$ a… ▽ More

    Submitted 18 March, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

    Comments: Added computation of critical exponents; exposition improved

  34. arXiv:2111.00673  [pdf, other

    cs.IT

    Decoding of Polar Codes Based on Q-Learning-Driven Belief Propagation

    Authors: L. M. Oliveira, R. M. Oliveira, R. C. de Lamare

    Abstract: This paper presents an enhanced belief propagation (BP) decoding algorithm and a reinforcement learning-based BP decoding algorithm for polar codes. The enhanced BP algorithm weighs each Processing Element (PE) input based on their signals and Euclidean distances using a heuristic metric. The proposed reinforcement learning-based BP decoding strategy relies on reweighting the messages and consists… ▽ More

    Submitted 31 October, 2021; originally announced November 2021.

    Comments: 15 pages, 5 figures

  35. arXiv:2110.15731  [pdf, other

    cs.CL cs.SD eess.AS

    CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese

    Authors: Arnaldo Candido Junior, Edresson Casanova, Anderson Soares, Frederico Santos de Oliveira, Lucas Oliveira, Ricardo Corso Fernandes Junior, Daniel Peixoto Pinto da Silva, Fernando Gorgulho Fayet, Bruno Baldissera Carlotto, Lucas Rafael Stefanel Gris, Sandra Maria Aluísio

    Abstract: Automatic Speech recognition (ASR) is a complex and challenging task. In recent years, there have been significant advances in the area. In particular, for the Brazilian Portuguese (BP) language, there were about 376 hours public available for ASR task until the second half of 2020. With the release of new datasets in early 2021, this number increased to 574 hours. The existing resources, however,… ▽ More

    Submitted 18 November, 2021; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: This paper is under consideration at Language Resources and Evaluation (LREV)

  36. arXiv:2109.12622  [pdf, other

    cs.CV cs.AI cs.LG

    Using Soft Labels to Model Uncertainty in Medical Image Segmentation

    Authors: João Lourenço Silva, Arlindo L. Oliveira

    Abstract: Medical image segmentation is inherently uncertain. For a given image, there may be multiple plausible segmentation hypotheses, and physicians will often disagree on lesion and organ boundaries. To be suited to real-world application, automatic segmentation systems must be able to capture this uncertainty and variability. Thus far, this has been addressed by building deep learning models that, thr… ▽ More

    Submitted 26 September, 2021; originally announced September 2021.

    Comments: 8 pages, 1 figure, 3 tables

  37. arXiv:2108.02840  [pdf, other

    cs.CV

    Attention-based fusion of semantic boundary and non-boundary information to improve semantic segmentation

    Authors: Jefferson Fontinele, Gabriel Lefundes, Luciano Oliveira

    Abstract: This paper introduces a method for image semantic segmentation grounded on a novel fusion scheme, which takes place inside a deep convolutional neural network. The main goal of our proposal is to explore object boundary information to improve the overall segmentation performance. Unlike previous works that combine boundary and segmentation features, or those that use boundary information to regula… ▽ More

    Submitted 5 August, 2021; originally announced August 2021.

  38. An Efficient Multi-objective Evolutionary Approach for Solving the Operation of Multi-Reservoir System Scheduling in Hydro-Power Plants

    Authors: C. G. Marcelino, G. M. C. Leite, C. A. D. M Delgado, L. B. de Oliveira, E. F. Wanner, S. Jiménez-Fernández, S. Salcedo-Sanz

    Abstract: This paper tackles the short-term hydro-power unit commitment problem in a multi-reservoir system - a cascade-based operation scenario. For this, we propose a new mathematical modelling in which the goal is to maximize the total energy production of the hydro-power plant in a sub-daily operation, and, simultaneously, to maximize the total water content (volume) of reservoirs. For solving the probl… ▽ More

    Submitted 28 July, 2021; v1 submitted 20 July, 2021; originally announced July 2021.

    Comments: Accepted Manuscript version (after peer review, and editor-author communications). https://doi.org/10.1016/j.eswa.2021.115638

    Journal ref: Expert Systems With Applications (2021)

  39. arXiv:2107.06762  [pdf, other

    q-bio.NC cs.LG q-bio.QM

    Modelling Neuronal Behaviour with Time Series Regression: Recurrent Neural Networks on C. Elegans Data

    Authors: Gonçalo Mestre, Ruxandra Barbulescu, Arlindo L. Oliveira, L. Miguel Silveira

    Abstract: Given the inner complexity of the human nervous system, insight into the dynamics of brain activity can be gained from understanding smaller and simpler organisms, such as the nematode C. Elegans. The behavioural and structural biology of these organisms is well-known, making them prime candidates for benchmarking modelling and simulation techniques. In these complex neuronal collections, classica… ▽ More

    Submitted 1 July, 2021; originally announced July 2021.

  40. arXiv:2107.05682  [pdf, other

    cs.LG cs.AI math.OC math.ST stat.ML

    Least-Squares Linear Dilation-Erosion Regressor Trained using a Convex-Concave Procedure

    Authors: Angelica Lourenço Oliveira, Marcos Eduardo Valle

    Abstract: This paper presents a hybrid morphological neural network for regression tasks called linear dilation-erosion regressor ($\ell$-DER). An $\ell$-DER is given by a convex combination of the composition of linear and morphological operators. They yield continuous piecewise linear functions and, thus, are universal approximators. Besides introducing the $\ell$-DER model, we formulate their training as… ▽ More

    Submitted 6 September, 2022; v1 submitted 12 July, 2021; originally announced July 2021.

    Comments: 15 pages

    Journal ref: BRACIS 2022

  41. arXiv:2106.11447  [pdf, other

    eess.IV cs.CV cs.LG

    Encoder-Decoder Architectures for Clinically Relevant Coronary Artery Segmentation

    Authors: João Lourenço Silva, Miguel Nobre Menezes, Tiago Rodrigues, Beatriz Silva, Fausto J. Pinto, Arlindo L. Oliveira

    Abstract: Coronary X-ray angiography is a crucial clinical procedure for the diagnosis and treatment of coronary artery disease, which accounts for roughly 16% of global deaths every year. However, the images acquired in these procedures have low resolution and poor contrast, making lesion detection and assessment challenging. Accurate coronary artery segmentation not only helps mitigate these problems, but… ▽ More

    Submitted 21 June, 2021; originally announced June 2021.

  42. Heterogeneous Multi-task Learning with Expert Diversity

    Authors: Raquel Aoki, Frederick Tung, Gabriel L. Oliveira

    Abstract: Predicting multiple heterogeneous biological and medical targets is a challenge for traditional deep learning models. In contrast to single-task learning, in which a separate model is trained for each target, multi-task learning (MTL) optimizes a single model to predict multiple related targets simultaneously. To address this challenge, we propose the Multi-gate Mixture-of-Experts with Exclusivity… ▽ More

    Submitted 27 May, 2022; v1 submitted 19 June, 2021; originally announced June 2021.

    Comments: 10 pages, 7 figures, BIOKDD, IEEE/ACM

    Journal ref: IEEE/ACM Transactions on Computational Biology and Bioinformatics (2022)

  43. arXiv:2106.02484  [pdf, other

    cs.CR cs.AI

    NeuraCrypt: Hiding Private Health Data via Random Neural Networks for Public Training

    Authors: Adam Yala, Homa Esfahanizadeh, Rafael G. L. D' Oliveira, Ken R. Duffy, Manya Ghobadi, Tommi S. Jaakkola, Vinod Vaikuntanathan, Regina Barzilay, Muriel Medard

    Abstract: Balancing the needs of data privacy and predictive utility is a central challenge for machine learning in healthcare. In particular, privacy concerns have led to a dearth of public datasets, complicated the construction of multi-hospital cohorts and limited the utilization of external machine learning resources. To remedy this, new methods are required to enable data owners, such as hospitals, to… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

  44. arXiv:2105.02752  [pdf, other

    cs.LG cs.NE

    Modeling the geospatial evolution of COVID-19 using spatio-temporal convolutional sequence-to-sequence neural networks

    Authors: Mário Cardoso, André Cavalheiro, Alexandre Borges, Ana F. Duarte, Amílcar Soares, Maria João Pereira, Nuno J. Nunes, Leonardo Azevedo, Arlindo L. Oliveira

    Abstract: Europe was hit hard by the COVID-19 pandemic and Portugal was one of the most affected countries, having suffered three waves in the first twelve months. Approximately between Jan 19th and Feb 5th 2021 Portugal was the country in the world with the largest incidence rate, with 14-days incidence rates per 100,000 inhabitants in excess of 1000. Despite its importance, accurate prediction of the geos… ▽ More

    Submitted 6 May, 2021; originally announced May 2021.

    Comments: 10 pages, 8 figures

    MSC Class: 92-10 ACM Class: I.2.6

  45. arXiv:2104.14847  [pdf, other

    cs.LG stat.ML

    Active WeaSuL: Improving Weak Supervision with Active Learning

    Authors: Samantha Biegel, Rafah El-Khatib, Luiz Otavio Vilas Boas Oliveira, Max Baak, Nanne Aben

    Abstract: The availability of labelled data is one of the main limitations in machine learning. We can alleviate this using weak supervision: a framework that uses expert-defined rules $\boldsymbolλ$ to estimate probabilistic labels $p(y|\boldsymbolλ)$ for the entire data set. These rules, however, are dependent on what experts know about the problem, and hence may be inaccurate or may fail to capture impor… ▽ More

    Submitted 30 April, 2021; originally announced April 2021.

    Comments: Accepted to the ICLR 2021 Workshop on Weakly Supervised Learning

  46. arXiv:2103.02969  [pdf, other

    eess.IV cs.CV cs.LG

    Automated Detection of Coronary Artery Stenosis in X-ray Angiography using Deep Neural Networks

    Authors: Dinis L. Rodrigues, Miguel Nobre Menezes, Fausto J. Pinto, Arlindo L. Oliveira

    Abstract: Coronary artery disease leading up to stenosis, the partial or total blocking of coronary arteries, is a severe condition that affects millions of patients each year. Automated identification and classification of stenosis severity from minimally invasive procedures would be of great clinical value, but existing methods do not match the accuracy of experienced cardiologists, due to the complexity… ▽ More

    Submitted 4 March, 2021; originally announced March 2021.

    Comments: 10 pages, 4 Figures

  47. Information Ranking Using Optimum-Path Forest

    Authors: Nathalia Q. Ascenção, Luis C. S. Afonso, Danilo Colombo, Luciano Oliveira, João P. Papa

    Abstract: The task of learning to rank has been widely studied by the machine learning community, mainly due to its use and great importance in information retrieval, data mining, and natural language processing. Therefore, ranking accurately and learning to rank are crucial tasks. Context-Based Information Retrieval systems have been of great importance to reduce the effort of finding relevant data. Such s… ▽ More

    Submitted 15 February, 2021; originally announced February 2021.

  48. Exploring the Public Reaction to COVID-19 News on Social Media in Portugal

    Authors: Luciana Oliveira, Arminda Sequeira, Adriana Oliveira, Paulino Silva, Anabela Mesquita

    Abstract: The outburst and proliferation of the COVID-19 pandemic, together with the subsequent social distancing measures, have raised massive challenges in almost all domains of public and private life around the globe. The stay-at-home movement has pushed the news audiences into social networks, which, in turn, has become the most prolific field for receiving and sharing news updates, as well as for publ… ▽ More

    Submitted 15 February, 2021; originally announced February 2021.

    Comments: 10 pages, 5 Figures, 3 tables

    Journal ref: Proceedings of the 8th European Conference on Social Media - ECSM 2021 (pp. 167-176)

  49. arXiv:2102.03889  [pdf, other

    cs.CV

    Machine Learning Methods for Histopathological Image Analysis: A Review

    Authors: Jonathan de Matos, Steve Tsham Mpinda Ataky, Alceu de Souza Britto Jr., Luiz Eduardo Soares de Oliveira, Alessandro Lameiras Koerich

    Abstract: Histopathological images (HIs) are the gold standard for evaluating some types of tumors for cancer diagnosis. The analysis of such images is not only time and resource consuming, but also very challenging even for experienced pathologists, resulting in inter- and intra-observer disagreements. One of the ways of accelerating such an analysis is to use computer-aided diagnosis (CAD) systems. In thi… ▽ More

    Submitted 7 February, 2021; originally announced February 2021.

    Comments: 45 pages. arXiv admin note: text overlap with arXiv:1904.07900

  50. Context, input and process as critical elements for successful Emergency Remote Learning

    Authors: Luciana Oliveira, Anabela Mesquita, Arminda Sequeira, Adriana Oliveira, Paulino Silva

    Abstract: In Spring 2020, the world moved from traditional classes to what was coined as ERL (Emergency Remote Teaching, Learning, Instruction), posing real challenges to all actors involved, requiring an immediate, unprecedented, and unplanned devising of mitigation strategies. The impacts of this transition cannot, however, be studied only at the educational level, as it consists of a broader social shift… ▽ More

    Submitted 6 January, 2021; originally announced January 2021.

    Comments: 10 pages, 1 figure, 1 table

    MSC Class: K.4.0 ACM Class: K.4.0

    Journal ref: Trends and Applications in Information Systems and Technologies. WorldCIST 2021. Advances in Intelligent Systems and Computing, vol 1367. Springer, Cham