Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 83 results for author: Ribeiro, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.13945  [pdf, other

    cs.CL

    FANTAstic SEquences and Where to Find Them: Faithful and Efficient API Call Generation through State-tracked Constrained Decoding and Reranking

    Authors: Zhuoer Wang, Leonardo F. R. Ribeiro, Alexandros Papangelis, Rohan Mukherjee, Tzu-Yen Wang, Xinyan Zhao, Arijit Biswas, James Caverlee, Angeliki Metallinou

    Abstract: API call generation is the cornerstone of large language models' tool-using ability that provides access to the larger world. However, existing supervised and in-context learning approaches suffer from high training costs, poor data efficiency, and generated API calls that can be unfaithful to the API documentation and the user's request. To address these limitations, we propose an output-side opt… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

  2. arXiv:2406.03592  [pdf, other

    cs.CL cs.AI

    Measuring Retrieval Complexity in Question Answering Systems

    Authors: Matteo Gabburo, Nicolaas Paul Jedema, Siddhant Garg, Leonardo F. R. Ribeiro, Alessandro Moschitti

    Abstract: In this paper, we investigate which questions are challenging for retrieval-based Question Answering (QA). We (i) propose retrieval complexity (RC), a novel metric conditioned on the completeness of retrieved documents, which measures the difficulty of answering questions, and (ii) propose an unsupervised pipeline to measure RC given an arbitrary retrieval system. Our proposed pipeline measures RC… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024 (findings)

  3. arXiv:2405.12785  [pdf, other

    cs.AI

    Artificial Intelligence Approaches for Predictive Maintenance in the Steel Industry: A Survey

    Authors: Jakub Jakubowski, Natalia Wojak-Strzelecka, Rita P. Ribeiro, Sepideh Pashami, Szymon Bobek, Joao Gama, Grzegorz J Nalepa

    Abstract: Predictive Maintenance (PdM) emerged as one of the pillars of Industry 4.0, and became crucial for enhancing operational efficiency, allowing to minimize downtime, extend lifespan of equipment, and prevent failures. A wide range of PdM tasks can be performed using Artificial Intelligence (AI) methods, which often use data generated from industrial sensors. The steel industry, which is an important… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: Preprint submitted to Engineering Applications of Artificial Intelligence

  4. arXiv:2405.05809  [pdf

    cs.LG cs.AI cs.CY

    Aequitas Flow: Streamlining Fair ML Experimentation

    Authors: Sérgio Jesus, Pedro Saleiro, Inês Oliveira e Silva, Beatriz M. Jorge, Rita P. Ribeiro, João Gama, Pedro Bizarro, Rayid Ghani

    Abstract: Aequitas Flow is an open-source framework for end-to-end Fair Machine Learning (ML) experimentation in Python. This package fills the existing integration gaps in other Fair ML packages of complete and accessible experimentation. It provides a pipeline for fairness-aware model training, hyperparameter optimization, and evaluation, enabling rapid and simple experiments and result analysis. Aimed at… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  5. A Multilevel Strategy to Improve People Tracking in a Real-World Scenario

    Authors: Cristiano B. de Oliveira, Joao C. Neves, Rafael O. Ribeiro, David Menotti

    Abstract: The Palácio do Planalto, office of the President of Brazil, was invaded by protesters on January 8, 2023. Surveillance videos taken from inside the building were subsequently released by the Brazilian Supreme Court for public scrutiny. We used segments of such footage to create the UFPR-Planalto801 dataset for people tracking and re-identification in a real-world scenario. This dataset consists of… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: Accepted for presentation at the International Conference on Computer Vision Theory and Applications (VISAPP) 2024

    Journal ref: Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP, 2024

  6. arXiv:2404.14455  [pdf, other

    cs.LG cs.AI

    A Neuro-Symbolic Explainer for Rare Events: A Case Study on Predictive Maintenance

    Authors: João Gama, Rita P. Ribeiro, Saulo Mastelini, Narjes Davarid, Bruno Veloso

    Abstract: Predictive Maintenance applications are increasingly complex, with interactions between many components. Black box models are popular approaches based on deep learning techniques due to their predictive accuracy. This paper proposes a neural-symbolic architecture that uses an online rule-learning algorithm to explain when the black box model predicts failures. The proposed system solves two proble… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: 26 pages

  7. arXiv:2404.01790  [pdf, other

    cs.CV cs.LG

    Super-Resolution Analysis for Landfill Waste Classification

    Authors: Matias Molina, Rita P. Ribeiro, Bruno Veloso, João Gama

    Abstract: Illegal landfills are a critical issue due to their environmental, economic, and public health impacts. This study leverages aerial imagery for environmental crime monitoring. While advances in artificial intelligence and computer vision hold promise, the challenge lies in training models with high-resolution literature datasets and adapting them to open-access low-resolution images. Considering t… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: This article has been accepted by the Symposium on Intelligent Data Analysis (IDA 2024)

  8. arXiv:2404.01701  [pdf, other

    cs.CL

    On the Role of Summary Content Units in Text Summarization Evaluation

    Authors: Marcel Nawrath, Agnieszka Nowak, Tristan Ratz, Danilo C. Walenta, Juri Opitz, Leonardo F. R. Ribeiro, João Sedoc, Daniel Deutsch, Simon Mille, Yixin Liu, Lining Zhang, Sebastian Gehrmann, Saad Mahamood, Miruna Clinciu, Khyathi Chandu, Yufang Hou

    Abstract: At the heart of the Pyramid evaluation method for text summarization lie human written summary content units (SCUs). These SCUs are concise sentences that decompose a summary into small facts. Such SCUs can be used to judge the quality of a candidate summary, possibly partially automated via natural language inference (NLI) systems. Interestingly, with the aim to fully automate the Pyramid evaluat… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 10 Pages, 3 Figures, 3 Tables, camera ready version accepted at NAACL 2024

  9. Logic-based Explanations for Linear Support Vector Classifiers with Reject Option

    Authors: Francisco Mateus Rocha Filho, Thiago Alves Rocha, Reginaldo Pereira Fernandes Ribeiro, Ajalmar Rêgo da Rocha Neto

    Abstract: Support Vector Classifier (SVC) is a well-known Machine Learning (ML) model for linear classification problems. It can be used in conjunction with a reject option strategy to reject instances that are hard to correctly classify and delegate them to a specialist. This further increases the confidence of the model. Given this, obtaining an explanation of the cause of rejection is important to not bl… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: 16 pages, submitted to BRACIS 2023 (Brazilian Conference on Intelligent Systems), accepted version published in Intelligent Systems, LNCS, vol 14195

    ACM Class: I.2.4; I.2.6

  10. arXiv:2402.03488  [pdf, other

    cs.LO cs.PL

    Redex -> Coq: towards a theory of decidability of Redex's reduction semantics

    Authors: Mallku Soldevila, Rodrigo Ribeiro, Beta Ziliani

    Abstract: We propose the first steps in the development of a tool to automate the translation of Redex models into a (hopefully) semantically equivalent model in Coq, and to provide tactics to help in the certification of fundamental properties of such models. The work is heavily based on a model of Redex's semantics developed by Klein et al. By means of a simple generalization of the matching problem in Re… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  11. arXiv:2401.18060  [pdf, ps, other

    math.NT cs.DM math.CO

    Rarity of the infinite chains in the tree of numerical semigroups

    Authors: Maria Bras-Amorós, Mariana Rosas Ribeiro

    Abstract: We prove that, for each fixed genus, the portion of semigroups of that genus belonging to infinite chains in the semigroup tree approaches 0 as the genus grows to infinite. This means that most numerical semigroups have a finite number of descendants in the semigroup tree. This problem has been open since 2009.

    Submitted 31 January, 2024; originally announced January 2024.

    MSC Class: 68W30; 06F05; 20M14; 05A99

  12. arXiv:2311.18734  [pdf, other

    math.PR cs.DS physics.chem-ph physics.soc-ph

    Structural results for the Tree Builder Random Walk

    Authors: Janos Engländer, Giulio Iacobelli, Gábor Pete, Rodrigo Ribeiro

    Abstract: We study the Tree Builder Random Walk: a randomly growing tree, built by a walker as she is walking around the tree. Namely, at each time $n$, she adds a leaf to her current vertex with probability $p_n=n^{-γ}$, $γ\in (2/3,1]$, then moves to a uniform random neighbor on the possibly modified tree. We show that the tree process at its growth times, after a random finite number of steps, can be coup… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: 34 pages, 4 figures

    MSC Class: 05C81; 05C80; 60F99; 60J05

  13. arXiv:2310.10623  [pdf, other

    cs.CL cs.AI cs.LG

    Generating Summaries with Controllable Readability Levels

    Authors: Leonardo F. R. Ribeiro, Mohit Bansal, Markus Dreyer

    Abstract: Readability refers to how easily a reader can understand a written text. Several factors affect the readability level, such as the complexity of the text, its subject matter, and the reader's background knowledge. Generating summaries based on different readability levels is critical for enabling knowledge consumption by diverse audiences. However, current text generation approaches lack refined c… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted as an EMNLP 2023 main paper

  14. arXiv:2310.09916  [pdf, other

    cs.RO

    Socially reactive navigation models for mobile robots in dynamic environments

    Authors: Ricarte Ribeiro, Plinio Moreno

    Abstract: The objective of this work is to expand upon previous works, considering socially acceptable behaviours within robot navigation and interaction, and allow a robot to closely approach static and dynamic individuals or groups. The space models developed in this dissertation are adaptive, that is, capable of changing over time to accommodate the changing circumstances often existent within a social e… ▽ More

    Submitted 15 October, 2023; originally announced October 2023.

  15. arXiv:2309.04292  [pdf, other

    cs.CL cs.AI

    Fuzzy Fingerprinting Transformer Language-Models for Emotion Recognition in Conversations

    Authors: Patrícia Pereira, Rui Ribeiro, Helena Moniz, Luisa Coheur, Joao Paulo Carvalho

    Abstract: Fuzzy Fingerprints have been successfully used as an interpretable text classification technique, but, like most other techniques, have been largely surpassed in performance by Large Pre-trained Language Models, such as BERT or RoBERTa. These models deliver state-of-the-art results in several Natural Language Processing tasks, namely Emotion Recognition in Conversations (ERC), but suffer from the… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: FUZZ-IEEE 2023

  16. Enhancing Network Slicing Architectures with Machine Learning, Security, Sustainability and Experimental Networks Integration

    Authors: Joberto S. B. Martins, Tereza C. Carvalho, Rodrigo Moreira, Cristiano Both, Adnei Donatti, João H. Corrêa, José A. Suruagy, Sand L. Corrêa, Antonio J. G. Abelem, Moisés R. N. Ribeiro, Jose-Marcos Nogueira, Luiz C. S. Magalhães, Juliano Wickboldt, Tiago Ferreto, Ricardo Mello, Rafael Pasquini, Marcos Schwarz, Leobino N. Sampaio, Daniel F. Macedo, José F. de Rezende, Kleber V. Cardoso, Flávio O. Silva

    Abstract: Network Slicing (NS) is an essential technique extensively used in 5G networks computing strategies, mobile edge computing, mobile cloud computing, and verticals like the Internet of Vehicles and industrial IoT, among others. NS is foreseen as one of the leading enablers for 6G futuristic and highly demanding applications since it allows the optimization and customization of scarce and disputed re… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

    Comments: 10 pages, 11 figures

    ACM Class: I.2.1; C.2.1; C.2.3

    Journal ref: IEEE ACCESS 2023

  17. Reconstructing Spatiotemporal Data with C-VAEs

    Authors: Tiago F. R. Ribeiro, Fernando Silva, Rogério Luís de C. Costa

    Abstract: The continuous representation of spatiotemporal data commonly relies on using abstract data types, such as \textit{moving regions}, to represent entities whose shape and position continuously change over time. Creating this representation from discrete snapshots of real-world entities requires using interpolation methods to compute in-between data representations and estimate the position and shap… ▽ More

    Submitted 28 August, 2023; v1 submitted 12 July, 2023; originally announced July 2023.

    Comments: Update acknowledgments to include published article information

    Journal ref: Advances in Databases and Information Systems 13985 (2023) 59-73

  18. arXiv:2306.07688  [pdf, other

    cs.RO

    Mobility Strategy of Multi-Limbed Climbing Robots for Asteroid Exploration

    Authors: Warley F. R. Ribeiro, Kentaro Uno, Masazumi Imai, Koki Murase, Barış Can Yalçın, Matteo El Hariry, Miguel A. Olivares-Mendez, Kazuya Yoshida

    Abstract: Mobility on asteroids by multi-limbed climbing robots is expected to achieve our exploration goals in such challenging environments. We propose a mobility strategy to improve the locomotion safety of climbing robots in such harsh environments that picture extremely low gravity and highly uneven terrain. Our method plans the gait by decoupling the base and limbs' movements and adjusting the main bo… ▽ More

    Submitted 22 June, 2023; v1 submitted 13 June, 2023; originally announced June 2023.

    Comments: Paper accepted for presentation at the CLAWAR 2023 (26th International Conference on Climbing and Walking Robots and the Support Technologies for Mobile Machines) (Updated references formatting)

  19. arXiv:2306.05120  [pdf, other

    cs.AI

    Explainable Predictive Maintenance

    Authors: Sepideh Pashami, Slawomir Nowaczyk, Yuantao Fan, Jakub Jakubowski, Nuno Paiva, Narjes Davari, Szymon Bobek, Samaneh Jamshidi, Hamid Sarmadi, Abdallah Alabdallah, Rita P. Ribeiro, Bruno Veloso, Moamar Sayed-Mouchaweh, Lala Rajaoarisoa, Grzegorz J. Nalepa, João Gama

    Abstract: Explainable Artificial Intelligence (XAI) fills the role of a critical interface fostering interactions between sophisticated intelligent systems and diverse individuals, including data scientists, domain experts, end-users, and more. It aids in deciphering the intricate internal mechanisms of ``black box'' Machine Learning (ML), rendering the reasons behind their decisions more understandable. Ho… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: 51 pages, 9 figures

    ACM Class: I.2.1

  20. arXiv:2305.07716  [pdf, other

    cs.RO cs.AI

    Learning to Reason over Scene Graphs: A Case Study of Finetuning GPT-2 into a Robot Language Model for Grounded Task Planning

    Authors: Georgia Chalvatzaki, Ali Younes, Daljeet Nandha, An Le, Leonardo F. R. Ribeiro, Iryna Gurevych

    Abstract: Long-horizon task planning is essential for the development of intelligent assistive and service robots. In this work, we investigate the applicability of a smaller class of large language models (LLMs), specifically GPT-2, in robotic task planning by learning to decompose tasks into subgoal specifications for a planner to execute sequentially. Our method grounds the input of the LLM on the domain… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

    Comments: 21 pages, 6 figures

  21. arXiv:2305.00352  [pdf, other

    cs.CV stat.AP

    Embedding Aggregation for Forensic Facial Comparison

    Authors: Rafael Oliveira Ribeiro, João C. R. Neves, Arnout C. C. Ruifrok, Flavio de Barros Vidal

    Abstract: In forensic facial comparison, questioned-source images are usually captured in uncontrolled environments, with non-uniform lighting, and from non-cooperative subjects. The poor quality of such material usually compromises their value as evidence in legal matters. On the other hand, in forensic casework, multiple images of the person of interest are usually available. In this paper, we propose to… ▽ More

    Submitted 29 April, 2023; originally announced May 2023.

    Comments: 13 pages, 8 figures, submitted to Forensic Science International

    ACM Class: I.4; I.5

  22. arXiv:2304.06634  [pdf, other

    cs.CL cs.AI cs.LG cs.SI

    PGTask: Introducing the Task of Profile Generation from Dialogues

    Authors: Rui Ribeiro, Joao P. Carvalho, Luísa Coheur

    Abstract: Recent approaches have attempted to personalize dialogue systems by leveraging profile information into models. However, this knowledge is scarce and difficult to obtain, which makes the extraction/generation of profile information from dialogues a fundamental asset. To surpass this limitation, we introduce the Profile Generation Task (PGTask). We contribute with a new dataset for this problem, co… ▽ More

    Submitted 26 August, 2023; v1 submitted 13 April, 2023; originally announced April 2023.

    Comments: Accepted at SIGDIAL 2023, 4 pages, 2 figures

  23. arXiv:2303.16151  [pdf, other

    q-fin.ST cs.LG econ.EM stat.ML

    Forecasting Large Realized Covariance Matrices: The Benefits of Factor Models and Shrinkage

    Authors: Rafael Alves, Diego S. de Brito, Marcelo C. Medeiros, Ruy M. Ribeiro

    Abstract: We propose a model to forecast large realized covariance matrices of returns, applying it to the constituents of the S\&P 500 daily. To address the curse of dimensionality, we decompose the return covariance matrix using standard firm-level factors (e.g., size, value, and profitability) and use sectoral restrictions in the residual covariance matrix. This restricted model is then estimated using v… ▽ More

    Submitted 22 March, 2023; originally announced March 2023.

  24. arXiv:2301.07996  [pdf, other

    cs.RO

    RAMP: Reaction-Aware Motion Planning of Multi-Legged Robots for Locomotion in Microgravity

    Authors: Warley F. R. Ribeiro, Kentaro Uno, Masazumi Imai, Koki Murase, Kazuya Yoshida

    Abstract: Robotic mobility in microgravity is necessary to expand human utilization and exploration of outer space. Bio-inspired multi-legged robots are a possible solution for safe and precise locomotion. However, a dynamic motion of a robot in microgravity can lead to failures due to gripper detachment caused by excessive motion reactions. We propose a novel Reaction-Aware Motion Planning (RAMP) to improv… ▽ More

    Submitted 19 January, 2023; originally announced January 2023.

    Comments: Submitted version of paper accepted for presentation at the 2023 IEEE International Conference on Robotics and Automation (ICRA)

  25. arXiv:2211.13358  [pdf, other

    cs.LG

    Turning the Tables: Biased, Imbalanced, Dynamic Tabular Datasets for ML Evaluation

    Authors: Sérgio Jesus, José Pombal, Duarte Alves, André Cruz, Pedro Saleiro, Rita P. Ribeiro, João Gama, Pedro Bizarro

    Abstract: Evaluating new techniques on realistic datasets plays a crucial role in the development of ML research and its broader adoption by practitioners. In recent years, there has been a significant increase of publicly available unstructured data resources for computer vision and NLP tasks. However, tabular data -- which is prevalent in many high-stakes domains -- has been lagging behind. To bridge this… ▽ More

    Submitted 28 November, 2022; v1 submitted 23 November, 2022; originally announced November 2022.

    Comments: Accepted at NeurIPS 2022. https://openreview.net/forum?id=UrAYT2QwOX8

  26. arXiv:2211.05100  [pdf, other

    cs.CL

    BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

    Authors: BigScience Workshop, :, Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilić, Daniel Hesslow, Roman Castagné, Alexandra Sasha Luccioni, François Yvon, Matthias Gallé, Jonathan Tow, Alexander M. Rush, Stella Biderman, Albert Webson, Pawan Sasanka Ammanamanchi, Thomas Wang, Benoît Sagot, Niklas Muennighoff, Albert Villanova del Moral, Olatunji Ruwase, Rachel Bawden, Stas Bekman, Angelina McMillan-Major , et al. (369 additional authors not shown)

    Abstract: Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access… ▽ More

    Submitted 27 June, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

  27. arXiv:2210.10695  [pdf, other

    cs.IR cs.CL

    Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking

    Authors: Tim Baumgärtner, Leonardo F. R. Ribeiro, Nils Reimers, Iryna Gurevych

    Abstract: Pairing a lexical retriever with a neural re-ranking model has set state-of-the-art performance on large-scale information retrieval datasets. This pipeline covers scenarios like question answering or navigational queries, however, for information-seeking scenarios, users often provide information on whether a document is relevant to their query in form of clicks or explicit feedback. Therefore, i… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: Accepted at EMNLP 2022

  28. arXiv:2210.06496  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    SUMBot: Summarizing Context in Open-Domain Dialogue Systems

    Authors: Rui Ribeiro, Luísa Coheur

    Abstract: In this paper, we investigate the problem of including relevant information as context in open-domain dialogue systems. Most models struggle to identify and incorporate important knowledge from dialogues and simply use the entire turns as context, which increases the size of the input fed to the model with unnecessary information. Additionally, due to the input size limitation of a few hundred tok… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: 4 pages, 3 figures, accepted at IberSPEECH 2022

  29. Face Super-Resolution Using Stochastic Differential Equations

    Authors: Marcelo dos Santos, Rayson Laroca, Rafael O. Ribeiro, João Neves, Hugo Proença, David Menotti

    Abstract: Diffusion models have proven effective for various applications such as images, audio and graph generation. Other important applications are image super-resolution and the solution of inverse problems. More recently, some works have used stochastic differential equations (SDEs) to generalize diffusion models to continuous time. In this work, we introduce SDEs to generate super-resolution face imag… ▽ More

    Submitted 24 September, 2022; originally announced September 2022.

    Comments: Accepted for presentation at the Conference on Graphics, Patterns and Images (SIBGRAPI) 2022

  30. arXiv:2208.09316  [pdf, other

    cs.CL

    UKP-SQuARE v2: Explainability and Adversarial Attacks for Trustworthy QA

    Authors: Rachneet Sachdeva, Haritz Puerto, Tim Baumgärtner, Sewin Tariverdian, Hao Zhang, Kexin Wang, Hossain Shaikh Saadi, Leonardo F. R. Ribeiro, Iryna Gurevych

    Abstract: Question Answering (QA) systems are increasingly deployed in applications where they support real-world decisions. However, state-of-the-art models rely on deep neural networks, which are difficult to interpret by humans. Inherently interpretable models or post hoc explainability methods can help users to comprehend how a model arrives at its prediction and, if successful, increase their trust in… ▽ More

    Submitted 20 October, 2022; v1 submitted 19 August, 2022; originally announced August 2022.

    Comments: Accepted at AACL 2022 as Demo Paper

  31. arXiv:2207.05466  [pdf, other

    cs.LG cs.AI

    A Benchmark dataset for predictive maintenance

    Authors: Bruno Veloso, João Gama, Rita P. Ribeiro, Pedro M. Pereira

    Abstract: The paper describes the MetroPT data set, an outcome of a eXplainable Predictive Maintenance (XPM) project with an urban metro public transportation service in Porto, Portugal. The data was collected in 2022 that aimed to evaluate machine learning methods for online anomaly detection and failure prediction. By capturing several analogic sensor signals (pressure, temperature, current consumption),… ▽ More

    Submitted 18 July, 2022; v1 submitted 12 July, 2022; originally announced July 2022.

  32. arXiv:2206.12294  [pdf

    cs.AI cs.CL

    Learning Rhetorical Structure Theory-based descriptions of observed behaviour

    Authors: Luis Botelho, Luis Nunes, Ricardo Ribeiro, Rui J. Lopes

    Abstract: In a previous paper, we have proposed a set of concepts, axiom schemata and algorithms that can be used by agents to learn to describe their behaviour, goals, capabilities, and environment. The current paper proposes a new set of concepts, axiom schemata and algorithms that allow the agent to learn new descriptions of an observed behaviour (e.g., perplexing actions), of its actor (e.g., undesired… ▽ More

    Submitted 24 June, 2022; originally announced June 2022.

  33. arXiv:2206.11249  [pdf, other

    cs.CL cs.AI cs.LG

    GEMv2: Multilingual NLG Benchmarking in a Single Line of Code

    Authors: Sebastian Gehrmann, Abhik Bhattacharjee, Abinaya Mahendiran, Alex Wang, Alexandros Papangelis, Aman Madaan, Angelina McMillan-Major, Anna Shvets, Ashish Upadhyay, Bingsheng Yao, Bryan Wilie, Chandra Bhagavatula, Chaobin You, Craig Thomson, Cristina Garbacea, Dakuo Wang, Daniel Deutsch, Deyi Xiong, Di Jin, Dimitra Gkatzia, Dragomir Radev, Elizabeth Clark, Esin Durmus, Faisal Ladhak, Filip Ginter , et al. (52 additional authors not shown)

    Abstract: Evaluation in machine learning is usually informed by past choices, for example which datasets or metrics to use. This standardization enables the comparison on equal footing using leaderboards, but the evaluation choices become sub-optimal as better alternatives arise. This problem is especially pertinent in natural language generation which requires ever-improving suites of datasets, metrics, an… ▽ More

    Submitted 24 June, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

  34. arXiv:2206.09991  [pdf, other

    cs.LG cs.AI stat.ML

    Model Optimization in Imbalanced Regression

    Authors: Aníbal Silva, Rita P. Ribeiro, Nuno Moniz

    Abstract: Imbalanced domain learning aims to produce accurate models in predicting instances that, though underrepresented, are of utmost importance for the domain. Research in this field has been mainly focused on classification tasks. Comparatively, the number of studies carried out in the context of regression tasks is negligible. One of the main reasons for this is the lack of loss functions capable of… ▽ More

    Submitted 15 August, 2022; v1 submitted 20 June, 2022; originally announced June 2022.

  35. arXiv:2204.06508  [pdf, other

    cs.CL cs.AI cs.LG

    FactGraph: Evaluating Factuality in Summarization with Semantic Graph Representations

    Authors: Leonardo F. R. Ribeiro, Mengwen Liu, Iryna Gurevych, Markus Dreyer, Mohit Bansal

    Abstract: Despite recent improvements in abstractive summarization, most current approaches generate summaries that are not factually consistent with the source document, severely restricting their trust and usage in real-world applications. Recent works have shown promising improvements in factuality error identification using text or dependency arc entailments; however, they do not consider the entire sem… ▽ More

    Submitted 19 July, 2022; v1 submitted 13 April, 2022; originally announced April 2022.

    Comments: NAACL 2022 (15 pages)

  36. arXiv:2203.13693  [pdf, other

    cs.CL cs.IR

    UKP-SQUARE: An Online Platform for Question Answering Research

    Authors: Tim Baumgärtner, Kexin Wang, Rachneet Sachdeva, Max Eichler, Gregor Geigle, Clifton Poth, Hannah Sterz, Haritz Puerto, Leonardo F. R. Ribeiro, Jonas Pfeiffer, Nils Reimers, Gözde Gül Şahin, Iryna Gurevych

    Abstract: Recent advances in NLP and information retrieval have given rise to a diverse set of question answering tasks that are of different formats (e.g., extractive, abstractive), require different model architectures (e.g., generative, discriminative), and setups (e.g., with or without retrieval). Despite having a large number of powerful, specialized QA pipelines (which we refer to as Skills) that cons… ▽ More

    Submitted 28 March, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

    Comments: Accepted at ACL 2022 Demo Track

  37. arXiv:2202.03040  [pdf, other

    cs.CL

    Towards Learning Through Open-Domain Dialog

    Authors: Eugénio Ribeiro, Ricardo Ribeiro, David Martins de Matos

    Abstract: The development of artificial agents able to learn through dialog without domain restrictions has the potential to allow machines to learn how to perform tasks in a similar manner to humans and change how we relate to them. However, research in this area is practically nonexistent. In this paper, we identify the modifications required for a dialog system to be able to learn from the dialog and pro… ▽ More

    Submitted 7 February, 2022; originally announced February 2022.

    Comments: 6 pages, 1 figure

    ACM Class: H.1.2; H.3.1; I.2.7

  38. Question rewriting? Assessing its importance for conversational question answering

    Authors: Gonçalo Raposo, Rui Ribeiro, Bruno Martins, Luísa Coheur

    Abstract: In conversational question answering, systems must correctly interpret the interconnected interactions and generate knowledgeable answers, which may require the retrieval of relevant information from a background repository. Recent approaches to this problem leverage neural language models, although different alternatives can be considered in terms of modules for (a) representing user questions in… ▽ More

    Submitted 14 April, 2022; v1 submitted 22 January, 2022; originally announced January 2022.

    Comments: Submitted manuscript (not anonymized) accepted to the 44th European Conference on Information Retrieval (ECIR) 2022. This preprint has not undergone peer review (when applicable) or any post-submission improvements or corrections. The Version of Record of this contribution is published in Advances in Information Retrieval, and is available online at https://doi.org/10.1007/978-3-030-99739-7_23

  39. arXiv:2109.03808  [pdf, other

    cs.CL

    Smelting Gold and Silver for Improved Multilingual AMR-to-Text Generation

    Authors: Leonardo F. R. Ribeiro, Jonas Pfeiffer, Yue Zhang, Iryna Gurevych

    Abstract: Recent work on multilingual AMR-to-text generation has exclusively focused on data augmentation strategies that utilize silver AMR. However, this assumes a high quality of generated AMRs, potentially limiting the transferability to the target task. In this paper, we investigate different techniques for automatically generating AMR annotations, where we aim to study which source of information yiel… ▽ More

    Submitted 8 September, 2021; originally announced September 2021.

    Comments: Accepted as a conference paper to EMNLP 2021

  40. arXiv:2103.09120  [pdf, other

    cs.CL

    Structural Adapters in Pretrained Language Models for AMR-to-text Generation

    Authors: Leonardo F. R. Ribeiro, Yue Zhang, Iryna Gurevych

    Abstract: Pretrained language models (PLM) have recently advanced graph-to-text generation, where the input graph is linearized into a sequence and fed into the PLM to obtain its representation. However, efficiently encoding the graph structure in PLMs is challenging because such models were pretrained on natural language, and modeling structured data may lead to catastrophic forgetting of distributional kn… ▽ More

    Submitted 8 September, 2021; v1 submitted 16 March, 2021; originally announced March 2021.

    Comments: Accepted as a long conference paper to EMNLP 2021

  41. arXiv:2102.11146  [pdf, other

    cs.CL

    Domain Adaptation in Dialogue Systems using Transfer and Meta-Learning

    Authors: Rui Ribeiro, Alberto Abad, José Lopes

    Abstract: Current generative-based dialogue systems are data-hungry and fail to adapt to new unseen domains when only a small amount of target data is available. Additionally, in real-world applications, most domains are underrepresented, so there is a need to create a system capable of generalizing to these domains using minimal data. In this paper, we propose a method that adapts to unseen domains by comb… ▽ More

    Submitted 22 February, 2021; originally announced February 2021.

    Comments: 5 pages, 2 figures, accepted at IberSPEECH 2020

  42. arXiv:2101.10892  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Online Body Schema Adaptation through Cost-Sensitive Active Learning

    Authors: Gonçalo Cunha, Pedro Vicente, Alexandre Bernardino, Ricardo Ribeiro, Plínio Moreno

    Abstract: Humanoid robots have complex bodies and kinematic chains with several Degrees-of-Freedom (DoF) which are difficult to model. Learning the parameters of a kinematic model can be achieved by observing the position of the robot links during prospective motions and minimising the prediction errors. This work proposes a movement efficient approach for estimating online the body-schema of a humanoid rob… ▽ More

    Submitted 10 February, 2022; v1 submitted 26 January, 2021; originally announced January 2021.

    Comments: 6 pages, 7 figures

  43. arXiv:2101.06733  [pdf, other

    cs.SE

    Profiling Software Developers with Process Mining and N-Gram Language Models

    Authors: João Caldeira, Fernando Brito e Abreu, Jorge Cardoso, Ricardo Ribeiro, Claudia Werner

    Abstract: Context: Profiling developers is challenging since many factors, such as their skills, experience, development environment and behaviors, may influence a detailed analysis and the delivery of coherent interpretations. Objective: We aim at profiling software developers by mining their software development process. To do so, we performed a controlled experiment where, in the realm of a Python prog… ▽ More

    Submitted 17 January, 2021; originally announced January 2021.

  44. Business-Driven Technical Debt Prioritization: An Industrial Case Study

    Authors: Rodrigo Rebouças de Almeida, Rafael do Nascimento Ribeiro, Christoph Treude, Uirá Kulesza

    Abstract: Incorporating the business perspective into prioritizing technical debt is essential to contribute to decision making in industry. In this paper, we evolve and evaluate a business-driven approach for technical debt prioritization. The approach was evaluated during a five-month industrial case study with business and technical stakeholders' active participation. The results show that the approach c… ▽ More

    Submitted 21 March, 2021; v1 submitted 19 October, 2020; originally announced October 2020.

    Comments: Accepted for publication at the International Conference on Technical Debt 2021 (https://2021.techdebtconf.org/)

    Journal ref: 2021 IEEE/ACM International Conference on Technical Debt (TechDebt)

  45. arXiv:2008.07356  [pdf, ps, other

    cs.AI eess.SY

    Estimating action plans for smart poultry houses

    Authors: Darlan Felipe Klotz, Richardson Ribeiro, Fabrício Enembreck, Gustavo Denardin, Marco Barbosa, Dalcimar Casanova, Marcelo Teixeira

    Abstract: In poultry farming, the systematic choice, update, and implementation of periodic (t) action plans define the feed conversion rate (FCR[t]), which is an acceptable measure for successful production. Appropriate action plans provide tailored resources for broilers, allowing them to grow within the so-called thermal comfort zone, without wast or lack of resources. Although the implementation of an a… ▽ More

    Submitted 17 August, 2020; originally announced August 2020.

    Comments: To be submitted to the journal Expert Systems with Applications

    MSC Class: 68T35 (Primary) 93C40 (Secondary) ACM Class: C.3; C.5; I.2; J.6

  46. arXiv:2007.08426  [pdf, other

    cs.CL

    Investigating Pretrained Language Models for Graph-to-Text Generation

    Authors: Leonardo F. R. Ribeiro, Martin Schmitt, Hinrich Schütze, Iryna Gurevych

    Abstract: Graph-to-text generation aims to generate fluent texts from graph-based data. In this paper, we investigate two recently proposed pretrained language models (PLMs) and analyze the impact of different task-adaptive pretraining strategies for PLMs in graph-to-text generation. We present a study across three graph domains: meaning representations, Wikipedia knowledge graphs (KGs) and scientific KGs.… ▽ More

    Submitted 27 September, 2021; v1 submitted 16 July, 2020; originally announced July 2020.

    Comments: Accepted as a long paper to NLP4ConvAI, EMNLP2021

  47. arXiv:2006.16035  [pdf

    cs.LG cs.AI cs.FL eess.SY stat.ML

    Concept and the implementation of a tool to convert industry 4.0 environments modeled as FSM to an OpenAI Gym wrapper

    Authors: Kallil M. C. Zielinski, Marcelo Teixeira, Richardson Ribeiro, Dalcimar Casanova

    Abstract: Industry 4.0 systems have a high demand for optimization in their tasks, whether to minimize cost, maximize production, or even synchronize their actuators to finish or speed up the manufacture of a product. Those challenges make industrial environments a suitable scenario to apply all modern reinforcement learning (RL) concepts. The main difficulty, however, is the lack of that industrial environ… ▽ More

    Submitted 29 June, 2020; originally announced June 2020.

  48. arXiv:2006.09242  [pdf, other

    cs.CL

    Modeling Graph Structure via Relative Position for Text Generation from Knowledge Graphs

    Authors: Martin Schmitt, Leonardo F. R. Ribeiro, Philipp Dufter, Iryna Gurevych, Hinrich Schütze

    Abstract: We present Graformer, a novel Transformer-based encoder-decoder architecture for graph-to-text generation. With our novel graph self-attention, the encoding of a node relies on all nodes in the input graph - not only direct neighbors - facilitating the detection of global patterns. We represent the relation between two nodes as the length of the shortest path between them. Graformer learns to weig… ▽ More

    Submitted 27 April, 2021; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: Accepted as a long paper at TextGraphs 2021

  49. arXiv:2005.11787  [pdf, ps, other

    cs.CL

    Common Sense or World Knowledge? Investigating Adapter-Based Knowledge Injection into Pretrained Transformers

    Authors: Anne Lauscher, Olga Majewska, Leonardo F. R. Ribeiro, Iryna Gurevych, Nikolai Rozanov, Goran Glavaš

    Abstract: Following the major success of neural language models (LMs) such as BERT or GPT-2 on a variety of language understanding tasks, recent work focused on injecting (structured) knowledge from external resources into these models. While on the one hand, joint pretraining (i.e., training from scratch, adding objectives based on external knowledge to the primary LM objective) may be prohibitively comput… ▽ More

    Submitted 11 October, 2020; v1 submitted 24 May, 2020; originally announced May 2020.

    Comments: EMNLP 2020 - DeeLIO, ECML 2020 - DECODEML, 5 pages, 4 tables, 3 references

  50. arXiv:2003.03556  [pdf, other

    cs.CL

    Automatic Recognition of the General-Purpose Communicative Functions defined by the ISO 24617-2 Standard for Dialog Act Annotation

    Authors: Eugénio Ribeiro, Ricardo Ribeiro, David Martins de Matos

    Abstract: ISO 24617-2, the standard for dialog act annotation, defines a hierarchically organized set of general-purpose communicative functions. The automatic recognition of these functions, although practically unexplored, is relevant for a dialog system, since they provide cues regarding the intention behind the segments and how they should be interpreted. We explore the recognition of general-purpose co… ▽ More

    Submitted 16 January, 2021; v1 submitted 7 March, 2020; originally announced March 2020.

    Comments: 30 pages, 4 figures, 9 tables

    ACM Class: H.1.2; H.3.1; I.2.7