Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 125 results for author: Freitas, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.18626  [pdf, other

    q-bio.QM cs.AI cs.CL

    An LLM-based Knowledge Synthesis and Scientific Reasoning Framework for Biomedical Discovery

    Authors: Oskar Wysocki, Magdalena Wysocka, Danilo Carvalho, Alex Teodor Bogatu, Danilo Miranda Gusicuma, Maxime Delmas, Harriet Unsworth, Andre Freitas

    Abstract: We present BioLunar, developed using the Lunar framework, as a tool for supporting biological analyses, with a particular emphasis on molecular-level evidence enrichment for biomarker discovery in oncology. The platform integrates Large Language Models (LLMs) to facilitate complex scientific reasoning across distributed evidence spaces, enhancing the capability for harmonizing and reasoning over h… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: accepted for ACL 2024 System Demonstration Track

  2. arXiv:2406.17837  [pdf, other

    cs.LG cs.AI

    Transformer Normalisation Layers and the Independence of Semantic Subspaces

    Authors: Stephen Menary, Samuel Kaski, Andre Freitas

    Abstract: Recent works have shown that transformers can solve contextual reasoning tasks by internally executing computational graphs called circuits. Circuits often use attention to logically match information from subspaces of the representation, e.g. using position-in-sequence to identify the previous token. In this work, we consider a semantic subspace to be any independent subspace of the latent repres… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  3. arXiv:2406.09898  [pdf, other

    cs.LG

    Positive-Unlabelled Learning for Identifying New Candidate Dietary Restriction-related Genes among Ageing-related Genes

    Authors: Jorge Paz-Ruza, Alex A. Freitas, Amparo Alonso-Betanzos, Bertha Guijarro-Berdiñas

    Abstract: Dietary Restriction (DR) is one of the most popular anti-ageing interventions, prompting exhaustive research into genes associated with its mechanisms. Recently, Machine Learning (ML) has been explored to identify potential DR-related genes among ageing-related genes, aiming to minimize costly wet lab experiments needed to expand our knowledge on DR. However, to train a model from positive (DR-rel… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  4. arXiv:2405.17723  [pdf, other

    cs.DB

    TableDC: Deep Clustering for Tabular Data

    Authors: Hafiz Tayyab Rauf, Andre Freitas, Norman W. Paton

    Abstract: Deep clustering (DC), a fusion of deep representation learning and clustering, has recently demonstrated positive results in data science, particularly text processing and computer vision. However, joint optimization of feature learning and data distribution in the multi-dimensional space is domain-specific, so existing DC methods struggle to generalize to other application domains (such as data i… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  5. arXiv:2405.01379  [pdf, other

    cs.CL

    Verification and Refinement of Natural Language Explanations through LLM-Symbolic Theorem Proving

    Authors: Xin Quan, Marco Valentino, Louise A. Dennis, André Freitas

    Abstract: Natural language explanations have become a proxy for evaluating explainable and multi-step Natural Language Inference (NLI) models. However, assessing the validity of explanations for NLI is challenging as it typically involves the crowd-sourcing of apposite datasets, a process that is time-consuming and prone to logical errors. To address existing limitations, this paper investigates the verific… ▽ More

    Submitted 7 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

  6. arXiv:2405.00402  [pdf, other

    cs.CL

    Self-Refine Instruction-Tuning for Aligning Reasoning in Language Models

    Authors: Leonardo Ranaldi, Andrè Freitas

    Abstract: The alignments of reasoning abilities between smaller and larger Language Models are largely conducted via Supervised Fine-Tuning (SFT) using demonstrations generated from robust Large Language Models (LLMs). Although these approaches deliver more performant models, they do not show sufficiently strong generalization ability as the training only relies on the provided demonstrations. In this pap… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  7. arXiv:2404.18384  [pdf, other

    cs.CL

    Exploring the Limits of Fine-grained LLM-based Physics Inference via Premise Removal Interventions

    Authors: Jordan Meadows, Tamsin James, Andre Freitas

    Abstract: Language models can hallucinate when performing complex and detailed mathematical reasoning. Physics provides a rich domain for assessing mathematical reasoning capabilities where physical context imbues the use of symbols which needs to satisfy complex semantics (\textit{e.g.,} units, tensorial order), leading to instances where inference may be algebraically coherent, yet unphysical. In this wor… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

  8. arXiv:2404.04963  [pdf, other

    cs.CL cs.AI

    SemEval-2024 Task 2: Safe Biomedical Natural Language Inference for Clinical Trials

    Authors: Mael Jullien, Marco Valentino, André Freitas

    Abstract: Large Language Models (LLMs) are at the forefront of NLP achievements but fall short in dealing with shortcut learning, factual inconsistency, and vulnerability to adversarial inputs.These shortcomings are especially critical in medical contexts, where they can misrepresent actual model capabilities. Addressing this, we present SemEval-2024 Task 2: Safe Biomedical Natural Language Inference for Cl… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  9. arXiv:2404.02625  [pdf, other

    cs.CL cs.AI cs.LG

    A Differentiable Integer Linear Programming Solver for Explanation-Based Natural Language Inference

    Authors: Mokanarangan Thayaparan, Marco Valentino, André Freitas

    Abstract: Integer Linear Programming (ILP) has been proposed as a formalism for encoding precise structural and semantic constraints for Natural Language Inference (NLI). However, traditional ILP frameworks are non-differentiable, posing critical challenges for the integration of continuous language representations based on deep learning. In this paper, we introduce a novel approach, named Diff-Comb Explain… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: Accepted to LREC-COLING 2024 - Camera Ready. arXiv admin note: substantial text overlap with arXiv:2208.03339

  10. arXiv:2404.02622  [pdf, other

    cs.CL

    Estimating the Causal Effects of Natural Logic Features in Transformer-Based NLI Models

    Authors: Julia Rozanova, Marco Valentino, André Freitas

    Abstract: Rigorous evaluation of the causal effects of semantic features on language model predictions can be hard to achieve for natural language reasoning problems. However, this is such a desirable form of analysis from both an interpretability and model evaluation perspective, that it is valuable to investigate specific patterns of reasoning with enough structure and regularity to identify and quantify… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: Accepted to LREC-COLING 2024 - Camera Ready. arXiv admin note: substantial text overlap with arXiv:2305.08572

  11. arXiv:2402.10767  [pdf, other

    cs.CL cs.AI

    Inference to the Best Explanation in Large Language Models

    Authors: Dhairya Dalal, Marco Valentino, André Freitas, Paul Buitelaar

    Abstract: While Large Language Models (LLMs) have found success in real-world applications, their underlying explanatory process is still poorly understood. This paper proposes IBE-Eval, a framework inspired by philosophical accounts on Inference to the Best Explanation (IBE) to advance the interpretation and evaluation of LLMs' explanations. IBE-Eval estimates the plausibility of natural language explanati… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    ACM Class: I.2.7

  12. arXiv:2402.00745  [pdf, other

    cs.CL

    Enhancing Ethical Explanations of Large Language Models through Iterative Symbolic Refinement

    Authors: Xin Quan, Marco Valentino, Louise A. Dennis, André Freitas

    Abstract: An increasing amount of research in Natural Language Inference (NLI) focuses on the application and evaluation of Large Language Models (LLMs) and their reasoning capabilities. Despite their success, however, LLMs are still prone to factual errors and inconsistencies in their explanations, offering limited control and interpretability for inference in complex domains. In this paper, we focus on et… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: Camera-ready for EACL 2024

  13. arXiv:2402.00723  [pdf, other

    cs.CL

    Improving Semantic Control in Discrete Latent Spaces with Transformer Quantized Variational Autoencoders

    Authors: Yingji Zhang, Danilo S. Carvalho, Marco Valentino, Ian Pratt-Hartmann, Andre Freitas

    Abstract: Achieving precise semantic control over the latent spaces of Variational AutoEncoders (VAEs) holds significant value for downstream tasks in NLP as the underlying generative mechanisms could be better localised, explained and improved upon. Recent research, however, has struggled to achieve consistent results, primarily due to the inevitable loss of semantic information in the variational bottlene… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

  14. arXiv:2401.06452  [pdf, other

    cs.LG

    Automated Machine Learning for Positive-Unlabelled Learning

    Authors: Jack D. Saunders, Alex A. Freitas

    Abstract: Positive-Unlabelled (PU) learning is a growing field of machine learning that aims to learn classifiers from data consisting of labelled positive and unlabelled instances, which can be in reality positive or negative, but whose label is unknown. An extensive number of methods have been proposed to address PU learning over the last two decades, so many so that selecting an optimal method for a give… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

    Comments: 36 pages, 4 figures

  15. arXiv:2312.13208  [pdf, other

    cs.CL

    LlaMaVAE: Guiding Large Language Model Generation via Continuous Latent Sentence Spaces

    Authors: Yingji Zhang, Danilo S. Carvalho, Ian Pratt-Hartmann, André Freitas

    Abstract: Deep generative neural networks, such as Variational AutoEncoders (VAEs), offer an opportunity to better understand and control language models from the perspective of sentence-level latent spaces. To combine the controllability of VAE latent spaces with the state-of-the-art performance of recent large language models (LLMs), we present in this work LlaMaVAE, which combines expressive encoder and… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

  16. arXiv:2311.08579  [pdf, other

    cs.CL

    Graph-Induced Syntactic-Semantic Spaces in Transformer-Based Variational AutoEncoders

    Authors: Yingji Zhang, Marco Valentino, Danilo S. Carvalho, Ian Pratt-Hartmann, André Freitas

    Abstract: The injection of syntactic information in Variational AutoEncoders (VAEs) has been shown to result in an overall improvement of performances and generalisation. An effective strategy to achieve such a goal is to separate the encoding of distributional semantic features and syntactic structures into heterogeneous latent spaces via multi-task learning or dual encoder architectures. However, existing… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

  17. arXiv:2311.06364  [pdf, other

    cs.CL

    Relation Extraction in underexplored biomedical domains: A diversity-optimised sampling and synthetic data generation approach

    Authors: Maxime Delmas, Magdalena Wysocka, André Freitas

    Abstract: The sparsity of labelled data is an obstacle to the development of Relation Extraction models and the completion of databases in various biomedical areas. While being of high interest in drug-discovery, the natural-products literature, reporting the identification of potential bioactive compounds from organisms, is a concrete example of such an overlooked topic. To mark the start of this new task,… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

  18. arXiv:2311.05330  [pdf, other

    stat.AP cs.CY stat.ME

    A Bayesian framework for measuring association and its application to emotional dynamics in Web discourse

    Authors: Henrique S. Xavier, Diogo Cortiz, Mateus Silvestrin, Ana Luísa Freitas, Letícia Yumi Nakao Morello, Fernanda Naomi Pantaleão, Gabriel Gaudencio do Rêgo

    Abstract: This paper introduces a Bayesian framework designed to measure the degree of association between categorical random variables. The method is grounded in the formal definition of variable independence and is implemented using Markov Chain Monte Carlo (MCMC) techniques. Unlike commonly employed techniques in Association Rule Learning, this approach enables a clear and precise estimation of confidenc… ▽ More

    Submitted 11 March, 2024; v1 submitted 9 November, 2023; originally announced November 2023.

    Comments: 9 pages, 2 tables, 4 figures. Accepted for publication at the Beyond Facts workshop of the Web Conference 2024

  19. Detecting Relevant Information in High-Volume Chat Logs: Keyphrase Extraction for Grooming and Drug Dealing Forensic Analysis

    Authors: Jeovane Honório Alves, Horácio A. C. G. Pedroso, Rafael Honorio Venetikides, Joel E. M. Köster, Luiz Rodrigo Grochocki, Cinthia O. A. Freitas, Jean Paul Barddal

    Abstract: The growing use of digital communication platforms has given rise to various criminal activities, such as grooming and drug dealing, which pose significant challenges to law enforcement and forensic experts. This paper presents a supervised keyphrase extraction approach to detect relevant information in high-volume chat logs involving grooming and drug dealing for forensic analysis. The proposed m… ▽ More

    Submitted 14 September, 2023; originally announced November 2023.

    Comments: Accepted for presentation at the 22nd IEEE International Conference on Machine Learning and Applications (ICMLA) 2023

  20. arXiv:2311.01230  [pdf, other

    cs.LG cs.AI cs.SC

    Multi-Operational Mathematical Derivations in Latent Space

    Authors: Marco Valentino, Jordan Meadows, Lan Zhang, André Freitas

    Abstract: This paper investigates the possibility of approximating multiple mathematical operations in latent space for expression derivation. To this end, we introduce different multi-operational representation paradigms, modelling mathematical operations as explicit geometric transformations. By leveraging a symbolic engine, we construct a large-scale dataset comprising 1.7M derivation steps stemming from… ▽ More

    Submitted 3 April, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: Accepted to NAACL 2024 - Camera Ready

  21. arXiv:2310.02752  [pdf, ps, other

    cs.LG

    Fair Feature Selection: A Comparison of Multi-Objective Genetic Algorithms

    Authors: James Brookhouse, Alex Freitas

    Abstract: Machine learning classifiers are widely used to make decisions with a major impact on people's lives (e.g. accepting or denying a loan, hiring decisions, etc). In such applications,the learned classifiers need to be both accurate and fair with respect to different groups of people, with different values of variables such as sex and race. This paper focuses on fair feature selection for classificat… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: 10 pages, 1 figure, 3 tables

  22. arXiv:2308.14186  [pdf, other

    cs.CL cs.AI

    Empowering Cross-lingual Abilities of Instruction-tuned Large Language Models by Translation-following demonstrations

    Authors: Leonardo Ranaldi, Giulia Pucci, Andre Freitas

    Abstract: The language ability of Large Language Models (LLMs) is often unbalanced towards English because of the imbalance in the distribution of the pre-training data. This disparity is demanded in further fine-tuning and affecting the cross-lingual abilities of LLMs. In this paper, we propose to empower Instructiontuned LLMs (It-LLMs) in languages other than English by building semantic alignment between… ▽ More

    Submitted 27 August, 2023; originally announced August 2023.

  23. arXiv:2308.03581  [pdf, other

    cs.CL

    Towards Controllable Natural Language Inference through Lexical Inference Types

    Authors: Yingji Zhang, Danilo S. Carvalho, Ian Pratt-Hartmann, Andre Freitas

    Abstract: Explainable natural language inference aims to provide a mechanism to produce explanatory (abductive) inference chains which ground claims to their supporting premises. A recent corpus called EntailmentBank strives to advance this task by explaining the answer to a question using an entailment tree \cite{dalvi2021explaining}. They employ the T5 model to directly generate the tree, which can explai… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

  24. arXiv:2308.00425  [pdf, ps, other

    cs.CL cs.AI

    Discourse-Aware Text Simplification: From Complex Sentences to Linked Propositions

    Authors: Christina Niklaus, Matthias Cetto, André Freitas, Siegfried Handschuh

    Abstract: Sentences that present a complex syntax act as a major stumbling block for downstream Natural Language Processing applications whose predictive quality deteriorates with sentence length and complexity. The task of Text Simplification (TS) may remedy this situation. It aims to modify sentences in order to make them easier to process, using a set of rewriting operations, such as reordering, deletion… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

  25. arXiv:2307.09998  [pdf, other

    cs.CL math.HO

    Generating Mathematical Derivations with Large Language Models

    Authors: Jordan Meadows, Marco Valentino, Andre Freitas

    Abstract: The derivation of mathematical results in specialised fields, using Large Language Models (LLMs), is an emerging research direction that can help identify models' limitations, and potentially support mathematical discovery. In this paper, we leverage a symbolic engine to generate derivations of equations at scale, and investigate the capabilities of LLMs when deriving goal equations from premises.… ▽ More

    Submitted 8 August, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: 10 pages

  26. arXiv:2305.17819  [pdf, other

    cs.CL cs.AI

    Large Language Models, scientific knowledge and factuality: A systematic analysis in antibiotic discovery

    Authors: Magdalena Wysocka, Oskar Wysocki, Maxime Delmas, Vincent Mutel, Andre Freitas

    Abstract: Inferring over and extracting information from Large Language Models (LLMs) trained on a large corpus of scientific literature can potentially drive a new era in biomedical research, reducing the barriers for accessing existing medical evidence. This work examines the potential of LLMs for dialoguing with biomedical background knowledge, using the context of antibiotic discovery. The systematic an… ▽ More

    Submitted 5 December, 2023; v1 submitted 28 May, 2023; originally announced May 2023.

    Comments: 28 pages, 3 figures

  27. arXiv:2305.13494  [pdf, other

    cs.DB

    Deep Clustering for Data Cleaning and Integration

    Authors: Hafiz Tayyab Rauf, Andre Freitas, Norman W. Paton

    Abstract: Deep Learning (DL) techniques now constitute the state-of-the-art for important problems in areas such as text and image processing, and there have been impactful results that deploy DL in several data management tasks. Deep Clustering (DC) has recently emerged as a sub-discipline of DL, in which data representations are learned in tandem with clustering, with a view to automatically identifying t… ▽ More

    Submitted 22 September, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: The following enhancements have been carried out in the updated version of the manuscript: *Evaluated each data integration problem on additional datasets. *Added more DC and SC methods to the evaluation *Discussed algorithmic-specific observations

  28. arXiv:2305.12563  [pdf, other

    cs.CL cs.LG

    A Symbolic Framework for Evaluating Mathematical Reasoning and Generalisation with Transformers

    Authors: Jordan Meadows, Marco Valentino, Damien Teney, Andre Freitas

    Abstract: This paper proposes a methodology for generating and perturbing detailed derivations of equations at scale, aided by a symbolic engine, to evaluate the generalisability of Transformers to out-of-distribution mathematical reasoning problems. Instantiating the framework in the context of sequence classification tasks, we compare the capabilities of GPT-4, GPT-3.5, and a canon of fine-tuned BERT mode… ▽ More

    Submitted 8 April, 2024; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: NAACL 2024

  29. arXiv:2305.11391  [pdf, other

    cs.AI cs.LG

    A Survey of Safety and Trustworthiness of Large Language Models through the Lens of Verification and Validation

    Authors: Xiaowei Huang, Wenjie Ruan, Wei Huang, Gaojie Jin, Yi Dong, Changshun Wu, Saddek Bensalem, Ronghui Mu, Yi Qi, Xingyu Zhao, Kaiwen Cai, Yanghao Zhang, Sihao Wu, Peipei Xu, Dengyu Wu, Andre Freitas, Mustafa A. Mustafa

    Abstract: Large Language Models (LLMs) have exploded a new heatwave of AI for their ability to engage end-users in human-level conversations with detailed and articulate answers across many knowledge domains. In response to their fast adoption in many industrial applications, this survey concerns their safety and trustworthiness. First, we review known vulnerabilities and limitations of the LLMs, categorisi… ▽ More

    Submitted 27 August, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

  30. arXiv:2305.08572  [pdf, other

    cs.CL

    Estimating the Causal Effects of Natural Logic Features in Neural NLI Models

    Authors: Julia Rozanova, Marco Valentino, Andre Freitas

    Abstract: Rigorous evaluation of the causal effects of semantic features on language model predictions can be hard to achieve for natural language reasoning problems. However, this is such a desirable form of analysis from both an interpretability and model evaluation perspective, that it is valuable to zone in on specific patterns of reasoning with enough structure and regularity to be able to identify and… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

  31. arXiv:2305.07303  [pdf, other

    cs.CL cs.LG

    Multi-Relational Hyperbolic Word Embeddings from Natural Language Definitions

    Authors: Marco Valentino, Danilo S. Carvalho, André Freitas

    Abstract: Natural language definitions possess a recursive, self-explanatory semantic structure that can support representation learning methods able to preserve explicit conceptual relations and constraints in the latent space. This paper presents a multi-relational model that explicitly leverages such a structure to derive word embeddings from definitions. By automatically extracting the relations linking… ▽ More

    Submitted 16 February, 2024; v1 submitted 12 May, 2023; originally announced May 2023.

    Comments: Accepted at the 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024), camera-ready

  32. arXiv:2305.04675  [pdf, other

    nucl-th cs.LG

    Predicting nuclear masses with product-unit networks

    Authors: Babette Dellen, Uwe Jaekel, Paulo S. A. Freitas, John W. Clark

    Abstract: Accurate estimation of nuclear masses and their prediction beyond the experimentally explored domains of the nuclear landscape are crucial to an understanding of the fundamental origin of nuclear properties and to many applications of nuclear science, most notably in quantifying the $r$-process of stellar nucleosynthesis. Neural networks have been applied with some success to the prediction of nuc… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

  33. arXiv:2305.03598  [pdf, other

    cs.CL cs.AI cs.LG

    NLI4CT: Multi-Evidence Natural Language Inference for Clinical Trial Reports

    Authors: Maël Jullien, Marco Valentino, Hannah Frost, Paul O'Regan, Donal Landers, André Freitas

    Abstract: How can we interpret and retrieve medical evidence to support clinical decisions? Clinical trial reports (CTR) amassed over the years contain indispensable information for the development of personalized medicine. However, it is practically infeasible to manually inspect over 400,000+ clinical trial reports in order to find the best evidence for experimental treatments. Natural Language Inference… ▽ More

    Submitted 28 October, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023 Camera-ready, 15 pages

  34. arXiv:2305.02993  [pdf, other

    cs.CL cs.AI cs.LG

    SemEval-2023 Task 7: Multi-Evidence Natural Language Inference for Clinical Trial Data

    Authors: Maël Jullien, Marco Valentino, Hannah Frost, Paul O'Regan, Donal Landers, André Freitas

    Abstract: This paper describes the results of SemEval 2023 task 7 -- Multi-Evidence Natural Language Inference for Clinical Trial Data (NLI4CT) -- consisting of 2 tasks, a Natural Language Inference (NLI) task, and an evidence selection task on clinical trial data. The proposed challenges require multi-hop biomedical and numerical reasoning, which are of significant importance to the development of systems… ▽ More

    Submitted 11 May, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

  35. arXiv:2305.01713  [pdf, other

    cs.CL cs.AI

    Learning Disentangled Semantic Spaces of Explanations via Invertible Neural Networks

    Authors: Yingji Zhang, Danilo S. Carvalho, André Freitas

    Abstract: Disentangled latent spaces usually have better semantic separability and geometrical properties, which leads to better interpretability and more controllable data generation. While this has been well investigated in Computer Vision, in tasks such as image disentanglement, in the NLP domain sentence disentanglement is still comparatively under-investigated. Most previous work have concentrated on d… ▽ More

    Submitted 11 June, 2024; v1 submitted 2 May, 2023; originally announced May 2023.

    Comments: ACL 2024

  36. arXiv:2304.10346  [pdf, other

    cs.CL

    Interventional Probing in High Dimensions: An NLI Case Study

    Authors: Julia Rozanova, Marco Valentino, Lucas Cordeiro, Andre Freitas

    Abstract: Probing strategies have been shown to detect the presence of various linguistic features in large language models; in particular, semantic features intermediate to the "natural logic" fragment of the Natural Language Inference task (NLI). In the case of natural logic, the relation between the intermediate features and the entailment label is explicitly known: as such, this provides a ripe setting… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

  37. arXiv:2303.03235  [pdf, other

    cs.HC cs.AI

    On the Visualisation of Argumentation Graphs to Support Text Interpretation

    Authors: Hanadi Mardah, Oskar Wysocki, Markel Vigo, Andre Freitas

    Abstract: The recent evolution in Natural Language Processing (NLP) methods, in particular in the field of argumentation mining, has the potential to transform the way we interact with text, supporting the interpretation and analysis of complex discourse and debates. Can a graphic visualisation of complex argumentation enable a more critical interpretation of the arguments? This study focuses on analysing t… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: 35 pages

    ACM Class: I.2.7; I.2.4

  38. arXiv:2302.04785  [pdf, other

    cs.AI cs.CE cs.CY

    Analysis of business process automation as linear time-invariant system network

    Authors: Mauricio Jacobo-Romero, Danilo S. Carvalho, Andre Freitas

    Abstract: In this work, we examined Business Process (BP) production as a signal; this novel approach explores a BP workflow as a linear time-invariant (LTI) system. We analysed BP productivity in the frequency domain; this standpoint examines how labour and capital act as BP input signals and how their fundamental frequencies affect BP production. Our research also proposes a simulation framework of a BP i… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

    Comments: 16 pages, 24 figures

  39. arXiv:2212.04310  [pdf, other

    cs.CL

    Montague semantics and modifier consistency measurement in neural language models

    Authors: Danilo S. Carvalho, Edoardo Manino, Julia Rozanova, Lucas Cordeiro, André Freitas

    Abstract: In recent years, distributional language representation models have demonstrated great practical success. At the same time, the need for interpretability has elicited questions on their intrinsic properties and capabilities. Crucially, distributional models are often inconsistent when dealing with compositional phenomena in natural language, which has significant implications for their safety and… ▽ More

    Submitted 3 April, 2023; v1 submitted 10 October, 2022; originally announced December 2022.

  40. arXiv:2210.06230  [pdf, other

    cs.CL cs.AI

    Formal Semantic Geometry over Transformer-based Variational AutoEncoder

    Authors: Yingji Zhang, Danilo S. Carvalho, Ian Pratt-Hartmann, André Freitas

    Abstract: Formal/symbolic semantics can provide canonical, rigid controllability and interpretability to sentence representations due to their \textit{localisation} or \textit{composition} property. How can we deliver such property to the current distributional sentence representations to control and interpret the generation of language models (LMs)? In this work, we theoretically frame the sentence semanti… ▽ More

    Submitted 11 June, 2024; v1 submitted 12 October, 2022; originally announced October 2022.

  41. arXiv:2210.02898  [pdf, other

    cs.CL cs.AI

    Learning Disentangled Representations for Natural Language Definitions

    Authors: Danilo S. Carvalho, Giangiacomo Mercatali, Yingji Zhang, Andre Freitas

    Abstract: Disentangling the encodings of neural models is a fundamental aspect for improving interpretability, semantic control and downstream task performance in Natural Language Processing. Currently, most disentanglement methods are unsupervised or rely on synthetic datasets with known generative factors. We argue that recurrent syntactic and semantic regularities in textual data can be used to provide t… ▽ More

    Submitted 15 February, 2023; v1 submitted 22 September, 2022; originally announced October 2022.

    Comments: Findings of EACL 2023

  42. arXiv:2210.01252  [pdf, other

    cs.AI eess.SY

    Estimating productivity gains in digital automation

    Authors: Mauricio Jacobo-Romero, Danilo S. Carvalho, André Freitas

    Abstract: This paper proposes a novel productivity estimation model to evaluate the effects of adopting Artificial Intelligence (AI) components in a production chain. Our model provides evidence to address the "AI's" Solow's Paradox. We provide (i) theoretical and empirical evidence to explain Solow's dichotomy; (ii) a data-driven model to estimate and asses productivity variations; (iii) a methodology unde… ▽ More

    Submitted 8 October, 2022; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: 11 pages and 9 figures

  43. arXiv:2208.03339  [pdf, other

    cs.AI cs.CL

    Going Beyond Approximation: Encoding Constraints for Explainable Multi-hop Inference via Differentiable Combinatorial Solvers

    Authors: Mokanarangan Thayaparan, Marco Valentino, André Freitas

    Abstract: Integer Linear Programming (ILP) provides a viable mechanism to encode explicit and controllable assumptions about explainable multi-hop inference with natural language. However, an ILP formulation is non-differentiable and cannot be integrated into broader deep learning architectures. Recently, Thayaparan et al. (2021a) proposed a novel methodology to integrate ILP with Transformers to achieve en… ▽ More

    Submitted 5 August, 2022; originally announced August 2022.

  44. arXiv:2208.01376  [pdf, other

    cs.CL cs.AI

    Active entailment encoding for explanation tree construction using parsimonious generation of hard negatives

    Authors: Alex Bogatu, Zili Zhou, Dónal Landers, André Freitas

    Abstract: Entailment trees have been proposed to simulate the human reasoning process of explanation generation in the context of open--domain textual question answering. However, in practice, manually constructing these explanation trees proves a laborious process that requires active human involvement. Given the complexity of capturing the line of reasoning from question to the answer or from claim to pre… ▽ More

    Submitted 2 August, 2022; originally announced August 2022.

  45. arXiv:2207.00812  [pdf, other

    q-bio.QM cs.AI cs.LG

    A systematic review of biologically-informed deep learning models for cancer: fundamental trends for encoding and interpreting oncology data

    Authors: Magdalena Wysocka, Oskar Wysocki, Marie Zufferey, Dónal Landers, André Freitas

    Abstract: There is an increasing interest in the use of Deep Learning (DL) based methods as a supporting analytical framework in oncology. However, most direct applications of DL will deliver models with limited transparency and explainability, which constrain their deployment in biomedical settings. This systematic review discusses DL models used to support inference in cancer biology with a particular emp… ▽ More

    Submitted 24 January, 2023; v1 submitted 2 July, 2022; originally announced July 2022.

    Comments: 25 pages, 5 figures

  46. arXiv:2206.10612  [pdf, other

    q-bio.QM cs.LG

    Metareview-informed Explainable Cytokine Storm Detection during CAR-T cell Therapy

    Authors: Alex Bogatu, Magdalena Wysocka, Oskar Wysocki, Holly Butterworth, Donal Landers, Elaine Kilgour, Andre Freitas

    Abstract: Cytokine release syndrome (CRS), also known as cytokine storm, is one of the most consequential adverse effects of chimeric antigen receptor therapies that have shown promising results in cancer treatment. When emerging, CRS could be identified by the analysis of specific cytokine and chemokine profiles that tend to exhibit similarities across patients. In this paper, we exploit these similarities… ▽ More

    Submitted 20 June, 2022; originally announced June 2022.

  47. arXiv:2206.02423  [pdf

    cs.LG stat.ML

    Evaluating the Predictive Performance of Positive-Unlabelled Classifiers: a brief critical review and practical recommendations for improvement

    Authors: Jack D. Saunders, Alex, A. Freitas

    Abstract: Positive-Unlabelled (PU) learning is a growing area of machine learning that aims to learn classifiers from data consisting of labelled positive and unlabelled instances. Whilst much work has been done proposing methods for PU learning, little has been written on the subject of evaluating these methods. Many popular standard classification metrics cannot be precisely calculated due to the absence… ▽ More

    Submitted 6 June, 2022; originally announced June 2022.

  48. arXiv:2205.15231  [pdf, other

    cs.CL

    A Survey in Mathematical Language Processing

    Authors: Jordan Meadows, Andre Freitas

    Abstract: Informal mathematical text underpins real-world quantitative reasoning and communication. Developing sophisticated methods of retrieval and abstraction from this dual modality is crucial in the pursuit of the vision of automating discovery in quantitative science and mathematics. We track the development of informal mathematical language processing approaches across five strategic sub-areas in rec… ▽ More

    Submitted 8 April, 2024; v1 submitted 30 May, 2022; originally announced May 2022.

    Comments: TACL 2023 (Introduction to Mathematical Language Processing...)

  49. arXiv:2205.01809  [pdf, other

    cs.AI cs.CL

    Scientific Explanation and Natural Language: A Unified Epistemological-Linguistic Perspective for Explainable AI

    Authors: Marco Valentino, André Freitas

    Abstract: A fundamental research goal for Explainable AI (XAI) is to build models that are capable of reasoning through the generation of natural language explanations. However, the methodologies to design and evaluate explanation-based inference models are still poorly informed by theoretical accounts on the nature of explanation. As an attempt to provide an epistemologically grounded characterisation for… ▽ More

    Submitted 5 May, 2022; v1 submitted 3 May, 2022; originally announced May 2022.

  50. arXiv:2204.12316  [pdf, other

    cs.CL

    Systematicity, Compositionality and Transitivity of Deep NLP Models: a Metamorphic Testing Perspective

    Authors: Edoardo Manino, Julia Rozanova, Danilo Carvalho, Andre Freitas, Lucas Cordeiro

    Abstract: Metamorphic testing has recently been used to check the safety of neural NLP models. Its main advantage is that it does not rely on a ground truth to generate test cases. However, existing studies are mostly concerned with robustness-like metamorphic relations, limiting the scope of linguistic properties they can test. We propose three new classes of metamorphic relations, which address the proper… ▽ More

    Submitted 26 April, 2022; originally announced April 2022.

    Comments: Findings of the Association for Computational Linguistics 2022