
Showing 1–50 of 113 results for author: Augenstein, I

Searching in archive cs. Search in all archives.
  1. arXiv:2407.17023  [pdf, other]

    cs.CL cs.AI

    From Internal Conflict to Contextual Adaptation of Language Models

    Authors: Sara Vera Marjanović, Haeun Yu, Pepa Atanasova, Maria Maistro, Christina Lioma, Isabelle Augenstein

    Abstract: Knowledge-intensive language understanding tasks require Language Models (LMs) to integrate relevant context, mitigating their inherent weaknesses, such as incomplete or outdated knowledge. Nevertheless, studies indicate that LMs often ignore the provided context as it can conflict with the pre-existing LM's memory learned during pre-training. Moreover, conflicting knowledge can already be present…

    Submitted 24 July, 2024; originally announced July 2024.

    Comments: 22 pages, 15 figures

    MSC Class: 68T50 ACM Class: I.2.7

  2. arXiv:2406.19238  [pdf, other]

    cs.CL cs.CY cs.LG

    Revealing Fine-Grained Values and Opinions in Large Language Models

    Authors: Dustin Wright, Arnav Arora, Nadav Borenstein, Srishti Yadav, Serge Belongie, Isabelle Augenstein

    Abstract: Uncovering latent values and opinions in large language models (LLMs) can help identify biases and mitigate potential harm. Recently, this has been approached by presenting LLMs with survey questions and quantifying their stances towards morally and politically charged statements. However, the stances generated by LLMs can vary greatly depending on how they are prompted, and there are many ways to…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 28 pages, 20 figures, 7 tables

  3. arXiv:2406.17753  [pdf, other]

    cs.CL cs.AI

    Measuring and Benchmarking Large Language Models' Capabilities to Generate Persuasive Language

    Authors: Amalie Brogaard Pauli, Isabelle Augenstein, Ira Assent

    Abstract: We are exposed to much information trying to influence us, such as teaser messages, debates, politically framed news, and propaganda - all of which use persuasive language. With the recent interest in Large Language Models (LLMs), we study the ability of LLMs to produce persuasive text. As opposed to prior work which focuses on particular domains or types of persuasion, we conduct a general study…

    Submitted 25 June, 2024; originally announced June 2024.

  4. arXiv:2406.15085  [pdf, other]

    cs.CL

    A Unified Framework for Input Feature Attribution Analysis

    Authors: Jingyi Sun, Pepa Atanasova, Isabelle Augenstein

    Abstract: Explaining the decision-making process of machine learning models is crucial for ensuring their reliability and fairness. One popular explanation form highlights key input features, such as i) tokens (e.g., Shapley Values and Integrated Gradients), ii) interactions between tokens (e.g., Bivariate Shapley and Attention-based methods), or iii) interactions between spans of the input (e.g., Louvain S…

    Submitted 21 June, 2024; originally announced June 2024.

    MSC Class: 68T50 ACM Class: I.2.7

  5. arXiv:2406.14425  [pdf, other]

    cs.CL cs.AI cs.LG

    SynDARin: Synthesising Datasets for Automated Reasoning in Low-Resource Languages

    Authors: Gayane Ghazaryan, Erik Arakelyan, Pasquale Minervini, Isabelle Augenstein

    Abstract: Question Answering (QA) datasets have been instrumental in developing and evaluating Large Language Model (LLM) capabilities. However, such datasets are scarce for languages other than English due to the cost and difficulties of collection and manual annotation. This means that producing novel models and measuring the performance of multilingual LLMs in low-resource languages is challenging. To mi…

    Submitted 25 June, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

  6. arXiv:2406.04289  [pdf, other]

    cs.CL

    What Languages are Easy to Language-Model? A Perspective from Learning Probabilistic Regular Languages

    Authors: Nadav Borenstein, Anej Svete, Robin Chan, Josef Valvoda, Franz Nowak, Isabelle Augenstein, Eleanor Chodroff, Ryan Cotterell

    Abstract: What can large language models learn? By definition, language models (LM) are distributions over strings. Therefore, an intuitive way of addressing the above question is to formalize it as a matter of learnability of classes of distributions over strings. While prior work in this direction focused on assessing the theoretical limits, in contrast, we seek to understand the empirical learnability. U…

    Submitted 10 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024

  7. arXiv:2404.18655  [pdf, other]

    cs.CL cs.AI

    Revealing the Parametric Knowledge of Language Models: A Unified Framework for Attribution Methods

    Authors: Haeun Yu, Pepa Atanasova, Isabelle Augenstein

    Abstract: Language Models (LMs) acquire parametric knowledge from their training process, embedding it within their weights. The increasing scalability of LMs, however, poses significant challenges for understanding a model's inner workings and further for updating or correcting this embedded knowledge without the significant cost of retraining. This underscores the importance of unveiling exactly what know…

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 14 pages, 6 figures

    MSC Class: 68T50 ACM Class: I.2.7

  8. arXiv:2402.14177  [pdf, other]

    cs.SI cs.CY

    Investigating Human Values in Online Communities

    Authors: Nadav Borenstein, Arnav Arora, Lucie-Aimée Kaffee, Isabelle Augenstein

    Abstract: Human values play a vital role as an analytical tool in social sciences, enabling the study of diverse dimensions within society as a whole and among individual communities. This paper addresses the limitations of traditional survey-based studies of human values by proposing a computational application of Schwartz's values framework to Reddit, a platform organized into distinct online communities.…

    Submitted 17 June, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

  9. arXiv:2402.13006  [pdf, other]

    cs.LG cs.CL

    Investigating the Impact of Model Instability on Explanations and Uncertainty

    Authors: Sara Vera Marjanović, Isabelle Augenstein, Christina Lioma

    Abstract: Explainable AI methods facilitate the understanding of model behaviour, yet, small, imperceptible perturbations to inputs can vastly distort explanations. As these explanations are typically evaluated holistically, before model deployment, it is difficult to assess when a particular explanation is trustworthy. Some studies have tried to create confidence estimators for explanations, but none have…

    Submitted 4 June, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  10. arXiv:2402.12431  [pdf, other]

    cs.CL

    Understanding Fine-grained Distortions in Reports of Scientific Findings

    Authors: Amelie Wührl, Dustin Wright, Roman Klinger, Isabelle Augenstein

    Abstract: Distorted science communication harms individuals and society as it can lead to unhealthy behavior change and decrease trust in scientific institutions. Given the rapidly increasing volume of science communication in recent years, a fine-grained understanding of how findings from scientific publications are reported to the general public, and methods to detect distortions from the original work au…

    Submitted 19 February, 2024; originally announced February 2024.

  11. arXiv:2401.14440  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG

    Semantic Sensitivities and Inconsistent Predictions: Measuring the Fragility of NLI Models

    Authors: Erik Arakelyan, Zhaoqi Liu, Isabelle Augenstein

    Abstract: Recent studies of the emergent capabilities of transformer-based Natural Language Understanding (NLU) models have indicated that they have an understanding of lexical and compositional semantics. We provide evidence that suggests these claims should be taken with a grain of salt: we find that state-of-the-art Natural Language Inference (NLI) models are sensitive towards minor semantics preserving…

    Submitted 31 January, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: EACL 2024

  12. arXiv:2311.18567  [pdf, other]

    cs.CL

    Grammatical Gender's Influence on Distributional Semantics: A Causal Perspective

    Authors: Karolina Stańczak, Kevin Du, Adina Williams, Isabelle Augenstein, Ryan Cotterell

    Abstract: How much meaning influences gender assignment across languages is an active area of research in modern linguistics and cognitive science. We can view current approaches as aiming to determine where gender assignment falls on a spectrum, from being fully arbitrarily determined to being largely semantically determined. For the latter case, there is a formulation of the neo-Whorfian hypothesis, which…

    Submitted 30 November, 2023; originally announced November 2023.

  13. arXiv:2311.17627  [pdf, other]

    cs.SI cs.CY

    Invisible Women in Digital Diplomacy: A Multidimensional Framework for Online Gender Bias Against Women Ambassadors Worldwide

    Authors: Yevgeniy Golovchenko, Karolina Stańczak, Rebecca Adler-Nissen, Patrice Wangen, Isabelle Augenstein

    Abstract: Despite mounting evidence that women in foreign policy often bear the brunt of online hostility, the extent of online gender bias against diplomats remains unexplored. This paper offers the first global analysis of the treatment of women diplomats on social media. Introducing a multidimensional and multilingual methodology for studying online gender bias, it focuses on three critical elements: gen…

    Submitted 29 November, 2023; originally announced November 2023.

  14. arXiv:2311.09090  [pdf, other]

    cs.CL

    Social Bias Probing: Fairness Benchmarking for Language Models

    Authors: Marta Marchiori Manerba, Karolina Stańczak, Riccardo Guidotti, Isabelle Augenstein

    Abstract: While the impact of social biases in language models has been recognized, prior methods for bias evaluation have been limited to binary association tests on small datasets, limiting our understanding of bias complexities. This paper proposes a novel framework for probing language models for social biases by assessing disparate treatment, which involves treating individuals differently according to…

    Submitted 22 June, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  15. arXiv:2311.09000  [pdf, other]

    cs.CL

    Factcheck-Bench: Fine-Grained Evaluation Benchmark for Automatic Fact-checkers

    Authors: Yuxia Wang, Revanth Gangi Reddy, Zain Muhammad Mujahid, Arnav Arora, Aleksandr Rubashevskii, Jiahui Geng, Osama Mohammed Afzal, Liangming Pan, Nadav Borenstein, Aditya Pillai, Isabelle Augenstein, Iryna Gurevych, Preslav Nakov

    Abstract: The increased use of large language models (LLMs) across a variety of real-world applications calls for mechanisms to verify the factual accuracy of their outputs. In this work, we present a holistic end-to-end solution for annotating the factuality of LLM-generated responses, which encompasses a multi-stage annotation scheme designed to yield detailed labels concerning the verifiability and factu…

    Submitted 16 April, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: 30 pages, 13 figures

  16. arXiv:2311.01270  [pdf, other]

    cs.CL cs.CY

    People Make Better Edits: Measuring the Efficacy of LLM-Generated Counterfactually Augmented Data for Harmful Language Detection

    Authors: Indira Sen, Dennis Assenmacher, Mattia Samory, Isabelle Augenstein, Wil van der Aalst, Claudia Wagner

    Abstract: NLP models are used in a variety of critical social computing tasks, such as detecting sexist, racist, or otherwise hateful content. Therefore, it is imperative that these models are robust to spurious features. Past work has attempted to tackle such spurious features using training data augmentation, including Counterfactually Augmented Data (CADs). CADs introduce minimal changes to existing trai…

    Submitted 25 February, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: Preprint of EMNLP'23 paper

  17. arXiv:2310.18343  [pdf, other]

    cs.CL

    PHD: Pixel-Based Language Modeling of Historical Documents

    Authors: Nadav Borenstein, Phillip Rust, Desmond Elliott, Isabelle Augenstein

    Abstract: The digitisation of historical documents has provided historians with unprecedented research opportunities. Yet, the conventional approach to analysing historical documents involves converting them from images to text using OCR, a process that overlooks the potential benefits of treating them as images and introduces high levels of noise. To bridge this gap, we take advantage of recent advancement…

    Submitted 4 November, 2023; v1 submitted 22 October, 2023; originally announced October 2023.

    Comments: Accepted to the main conference of EMNLP 2023

  18. arXiv:2310.13506  [pdf, other]

    cs.CL cs.AI

    Explaining Interactions Between Text Spans

    Authors: Sagnik Ray Choudhury, Pepa Atanasova, Isabelle Augenstein

    Abstract: Reasoning over spans of tokens from different parts of the input is essential for natural language understanding (NLU) tasks such as fact-checking (FC), machine reading comprehension (MRC) or natural language inference (NLI). However, existing highlight-based explanations primarily focus on identifying individual important tokens or interactions only between adjacent tokens or tuples of tokens. Mo…

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: code: https://github.com/copenlu/spanex , dataset: https://huggingface.co/datasets/copenlu/spanex. Accepted EMNLP 2023

    ACM Class: I.2.7

  19. arXiv:2310.05779  [pdf, other]

    cs.LG

    Why Should This Article Be Deleted? Transparent Stance Detection in Multilingual Wikipedia Editor Discussions

    Authors: Lucie-Aimée Kaffee, Arnav Arora, Isabelle Augenstein

    Abstract: The moderation of content on online platforms is usually non-transparent. On Wikipedia, however, this discussion is carried out publicly and the editors are encouraged to use the content moderation policies as explanations for making moderation decisions. Currently, only a few comments explicitly mention those policies -- 20% of the English ones, but as few as 2% of the German and Turkish comments…

    Submitted 23 October, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: This submission has been accepted to 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)

  20. arXiv:2310.05189  [pdf, ps, other]

    cs.CL cs.AI cs.LG

    Factuality Challenges in the Era of Large Language Models

    Authors: Isabelle Augenstein, Timothy Baldwin, Meeyoung Cha, Tanmoy Chakraborty, Giovanni Luca Ciampaglia, David Corney, Renee DiResta, Emilio Ferrara, Scott Hale, Alon Halevy, Eduard Hovy, Heng Ji, Filippo Menczer, Ruben Miguez, Preslav Nakov, Dietram Scheufele, Shivam Sharma, Giovanni Zagni

    Abstract: The emergence of tools based on Large Language Models (LLMs), such as OpenAI's ChatGPT, Microsoft's Bing Chat, and Google's Bard, has garnered immense public attention. These incredibly useful, natural-sounding tools mark significant advances in natural language generation, yet they exhibit a propensity to generate false, erroneous, or misleading content -- commonly referred to as "hallucinations.…

    Submitted 9 October, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

    Comments: Our article offers a comprehensive examination of the challenges and risks associated with Large Language Models (LLMs), focusing on their potential impact on the veracity of information in today's digital landscape

  21. arXiv:2306.00765  [pdf, other]

    cs.CL cs.AI cs.IR stat.CO stat.ML

    Topic-Guided Sampling For Data-Efficient Multi-Domain Stance Detection

    Authors: Erik Arakelyan, Arnav Arora, Isabelle Augenstein

    Abstract: Stance Detection is concerned with identifying the attitudes expressed by an author towards a target of interest. This task spans a variety of domains ranging from social media opinion identification to detecting the stance for a legal claim. However, the framing of the task varies within these domains, in terms of the data collection protocol, the label dictionary and the number of available anno…

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: ACL 2023 (Oral)

  22. arXiv:2305.18029  [pdf, other]

    cs.CL cs.AI

    Faithfulness Tests for Natural Language Explanations

    Authors: Pepa Atanasova, Oana-Maria Camburu, Christina Lioma, Thomas Lukasiewicz, Jakob Grue Simonsen, Isabelle Augenstein

    Abstract: Explanations of neural models aim to reveal a model's decision-making process for its predictions. However, recent work shows that current methods giving explanations such as saliency maps or counterfactuals can be misleading, as they are prone to present reasons that are unfaithful to the model's inner workings. This work explores the challenging question of evaluating the faithfulness of natural…

    Submitted 30 June, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: Short paper, ACL 2023

    MSC Class: 68T50 ACM Class: I.2.7

    Journal ref: The 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023)

  23. arXiv:2305.12376  [pdf, other]

    cs.CL cs.CY cs.LG

    Measuring Intersectional Biases in Historical Documents

    Authors: Nadav Borenstein, Karolina Stańczak, Thea Rolskov, Natália da Silva Perez, Natacha Klein Käfer, Isabelle Augenstein

    Abstract: Data-driven analyses of biases in historical texts can help illuminate the origin and development of biases prevailing in modern society. However, digitised historical documents pose a challenge for NLP practitioners as these corpora suffer from errors introduced by optical character recognition (OCR) and are written in an archaic language. In this paper, we investigate the continuities and tran…

    Submitted 21 May, 2023; originally announced May 2023.

    Comments: Accepted to Findings of ACL2023

  24. arXiv:2305.10928  [pdf, other]

    cs.CL cs.LG

    Multilingual Event Extraction from Historical Newspaper Adverts

    Authors: Nadav Borenstein, Natalia da Silva Perez, Isabelle Augenstein

    Abstract: NLP methods can aid historians in analyzing textual materials in greater volumes than manually feasible. Developing such methods poses substantial challenges though. First, acquiring large, annotated historical datasets is difficult, as only domain experts can reliably label them. Second, most available off-the-shelf NLP models are trained on modern language texts, rendering them significantly les…

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: Accepted to the main track of ACL2023

  25. arXiv:2304.08315  [pdf, other]

    cs.CL cs.AI

    Thorny Roses: Investigating the Dual Use Dilemma in Natural Language Processing

    Authors: Lucie-Aimée Kaffee, Arnav Arora, Zeerak Talat, Isabelle Augenstein

    Abstract: Dual use, the intentional, harmful reuse of technology and scientific artefacts, is a problem yet to be well-defined within the context of Natural Language Processing (NLP). However, as NLP technologies continue to advance and become increasingly widespread in society, their inner workings have become increasingly opaque. Therefore, understanding dual use concerns and potential ways of limiting th…

    Submitted 30 October, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

  26. arXiv:2304.05783  [pdf, other]

    cs.CL

    Measuring Gender Bias in West Slavic Language Models

    Authors: Sandra Martinková, Karolina Stańczak, Isabelle Augenstein

    Abstract: Pre-trained language models have been known to perpetuate biases from the underlying datasets to downstream tasks. However, these findings are predominantly based on monolingual language models for English, whereas there are few investigative studies of biases encoded in language models for languages beyond English. In this paper, we fill this gap by analysing gender bias in West Slavic language m…

    Submitted 25 May, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

  27. arXiv:2302.02500  [pdf, other]

    cs.CL

    TempEL: Linking Dynamically Evolving and Newly Emerging Entities

    Authors: Klim Zaporojets, Lucie-Aimee Kaffee, Johannes Deleu, Thomas Demeester, Chris Develder, Isabelle Augenstein

    Abstract: In our continuously evolving world, entities change over time and new, previously non-existing or unknown, entities appear. We study how this evolutionary scenario impacts the performance on a well established entity linking (EL) task. For that study, we introduce TempEL, an entity linking dataset that consists of time-stratified English Wikipedia snapshots from 2013 to 2022, from which we collect…

    Submitted 5 February, 2023; originally announced February 2023.

  28. arXiv:2301.12313  [pdf, other]

    cs.LG cs.AI cs.LO cs.NE

    Adapting Neural Link Predictors for Data-Efficient Complex Query Answering

    Authors: Erik Arakelyan, Pasquale Minervini, Daniel Daza, Michael Cochez, Isabelle Augenstein

    Abstract: Answering complex queries on incomplete knowledge graphs is a challenging task where a model needs to answer complex logical queries in the presence of missing knowledge. Prior work in the literature has proposed to address this problem by designing architectures trained end-to-end for the complex query answering task with a reasoning process that is hard to interpret while requiring data and reso…

    Submitted 11 July, 2023; v1 submitted 28 January, 2023; originally announced January 2023.

  29. arXiv:2212.09409  [pdf, other]

    cs.CL

    Multi-View Knowledge Distillation from Crowd Annotations for Out-of-Domain Generalization

    Authors: Dustin Wright, Isabelle Augenstein

    Abstract: Selecting an effective training signal for tasks in natural language processing is difficult: expert annotations are expensive, and crowd-sourced annotations may not be reliable. At the same time, recent work in NLP has demonstrated that learning from a distribution over labels acquired from crowd annotations can be effective. However, there are many ways to acquire such a distribution, and the pe…

    Submitted 23 May, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: 14 pages, 4 figures, 1 table

  30. arXiv:2210.14037  [pdf, other]

    cs.LG cs.CL

    Revisiting Softmax for Uncertainty Approximation in Text Classification

    Authors: Andreas Nugaard Holm, Dustin Wright, Isabelle Augenstein

    Abstract: Uncertainty approximation in text classification is an important area with applications in domain adaptation and interpretability. One of the most widely used uncertainty approximation methods is Monte Carlo (MC) Dropout, which is computationally expensive as it requires multiple forward passes through the model. A cheaper alternative is to simply use the softmax based on a single forward pass wit…

    Submitted 19 July, 2023; v1 submitted 25 October, 2022; originally announced October 2022.

  31. arXiv:2210.13001  [pdf, other]

    cs.CL cs.CY cs.LG

    Modeling Information Change in Science Communication with Semantically Matched Paraphrases

    Authors: Dustin Wright, Jiaxin Pei, David Jurgens, Isabelle Augenstein

    Abstract: Whether the media faithfully communicate scientific information has long been a core issue to the science community. Automatically identifying paraphrased scientific findings could enable large-scale tracking and analysis of information changes in the science communication process, but this requires systems to understand the similarity between scientific information across multiple domains. To thi…

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: In EMNLP 2022; 25 pages; 11 figures; 6 tables

  32. arXiv:2209.07430  [pdf, other]

    cs.CL

    Machine Reading, Fast and Slow: When Do Models "Understand" Language?

    Authors: Sagnik Ray Choudhury, Anna Rogers, Isabelle Augenstein

    Abstract: Two of the most fundamental challenges in Natural Language Understanding (NLU) at present are: (a) how to establish whether deep learning-based models score highly on NLU benchmarks for the 'right' reasons; and (b) to understand what those reasons would even be. We investigate the behavior of reading comprehension models with respect to two linguistic 'skills': coreference resolution and compariso…

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: Accepted COLING 2022

  33. arXiv:2205.04238  [pdf, other]

    cs.CL

    Counterfactually Augmented Data and Unintended Bias: The Case of Sexism and Hate Speech Detection

    Authors: Indira Sen, Mattia Samory, Claudia Wagner, Isabelle Augenstein

    Abstract: Counterfactually Augmented Data (CAD) aims to improve out-of-domain generalizability, an indicator of model robustness. The improvement is credited with promoting core features of the construct over spurious artifacts that happen to correlate with it. Yet, over-relying on core features may lead to unintended model bias. Especially, construct-driven CAD -- perturbations of core features -- may indu…

    Submitted 9 May, 2022; originally announced May 2022.

    Comments: Accepted to NAACL'22 as a short paper

  34. arXiv:2205.02023  [pdf, other]

    cs.CL

    Same Neurons, Different Languages: Probing Morphosyntax in Multilingual Pre-trained Models

    Authors: Karolina Stańczak, Edoardo Ponti, Lucas Torroba Hennigen, Ryan Cotterell, Isabelle Augenstein

    Abstract: The success of multilingual pre-trained models is underpinned by their ability to learn representations shared by multiple languages even in absence of any explicit supervision. However, it remains unclear how these models learn to generalise across languages. In this work, we conjecture that multilingual pre-trained models can derive language-universal abstractions about grammar. In particular, w…

    Submitted 8 May, 2022; v1 submitted 4 May, 2022; originally announced May 2022.

    Comments: Accepted at NAACL 2022 (Main Conference)

  35. arXiv:2204.02007  [pdf, other]

    cs.CL cs.LG

    Fact Checking with Insufficient Evidence

    Authors: Pepa Atanasova, Jakob Grue Simonsen, Christina Lioma, Isabelle Augenstein

    Abstract: Automating the fact checking (FC) process relies on information obtained from external sources. In this work, we posit that it is crucial for FC models to make veracity predictions only when there is sufficient evidence and otherwise indicate when it is not enough. To this end, we are the first to study what information FC models consider sufficient by introducing a novel task and advancing it wit…

    Submitted 5 April, 2022; originally announced April 2022.

    Comments: 14 pages

    MSC Class: cs.CL

  36. arXiv:2203.13722  [pdf, other]

    cs.CL

    Probing Pre-Trained Language Models for Cross-Cultural Differences in Values

    Authors: Arnav Arora, Lucie-Aimée Kaffee, Isabelle Augenstein

    Abstract: Language embeds information about social, cultural, and political values people hold. Prior work has explored social and potentially harmful biases encoded in Pre-Trained Language models (PTLMs). However, there has been no systematic study investigating how values embedded in these models vary across cultures. In this paper, we introduce probes to study which values across cultures are embedded in…

    Submitted 6 April, 2023; v1 submitted 25 March, 2022; originally announced March 2022.

  37. arXiv:2203.12990  [pdf, other]

    cs.CL

    Generating Scientific Claims for Zero-Shot Scientific Fact Checking

    Authors: Dustin Wright, David Wadden, Kyle Lo, Bailey Kuehl, Arman Cohan, Isabelle Augenstein, Lucy Lu Wang

    Abstract: Automated scientific fact checking is difficult due to the complexity of scientific language and a lack of significant amounts of training data, as annotation requires domain expertise. To address this challenge, we propose scientific claim generation, the task of generating one or more atomic and verifiable claims from scientific sentences, and demonstrate its usefulness in zero-shot fact checkin…

    Submitted 24 March, 2022; originally announced March 2022.

    Comments: Accepted to ACL 2022; 13 pages, 3 figures, 8 tables

  38. arXiv:2202.06671  [pdf, other]

    cs.CL

    Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings

    Authors: Malte Ostendorff, Nils Rethmeier, Isabelle Augenstein, Bela Gipp, Georg Rehm

    Abstract: Learning scientific document representations can be substantially improved through contrastive learning objectives, where the challenge lies in creating positive and negative training samples that encode the desired similarity semantics. Prior work relies on discrete citation relations to generate contrast samples. However, discrete citations enforce a hard cut-off to similarity. This is counter-i…

    Submitted 19 October, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: Accepted to EMNLP 2022

  39. arXiv:2201.08214  [pdf, other]

    cs.CL

    A Latent-Variable Model for Intrinsic Probing

    Authors: Karolina Stańczak, Lucas Torroba Hennigen, Adina Williams, Ryan Cotterell, Isabelle Augenstein

    Abstract: The success of pre-trained contextualized representations has prompted researchers to analyze them for the presence of linguistic information. Indeed, it is natural to assume that these pre-trained representations do encode some level of linguistic knowledge as they have brought about large empirical improvements on a wide variety of NLP tasks, which suggests they are learning true linguistic gene…

    Submitted 11 July, 2024; v1 submitted 20 January, 2022; originally announced January 2022.

  40. arXiv:2112.14168  [pdf, other]

    cs.CL cs.CY

    A Survey on Gender Bias in Natural Language Processing

    Authors: Karolina Stanczak, Isabelle Augenstein

    Abstract: Language can be used as a means of reproducing and enforcing harmful stereotypes and biases and has been analysed as such in numerous research. In this paper, we present a survey of 304 papers on gender bias in natural language processing. We analyse definitions of gender and its categories within social sciences and connect them to formal definitions of gender bias in NLP research. We survey lexi…

    Submitted 28 December, 2021; originally announced December 2021.

  41. Quantifying Gender Biases Towards Politicians on Reddit

    Authors: Sara Marjanovic, Karolina Stańczak, Isabelle Augenstein

    Abstract: Despite attempts to increase gender parity in politics, global efforts have struggled to ensure equal female representation. This is likely tied to implicit gender biases against women in authority. In this work, we present a comprehensive study of gender biases that appear in online political discussion. To this end, we collect 10 million comments on Reddit in conversations about male and female…

    Submitted 7 September, 2022; v1 submitted 22 December, 2021; originally announced December 2021.

  42. arXiv:2112.06924  [pdf, other]

    cs.CL cs.LG

    Generating Fluent Fact Checking Explanations with Unsupervised Post-Editing

    Authors: Shailza Jolly, Pepa Atanasova, Isabelle Augenstein

    Abstract: Fact-checking systems have become important tools to verify fake and misguiding news. These systems become more trustworthy when human-readable explanations accompany the veracity labels. However, manual collection of such explanations is expensive and time-consuming. Recent works frame explanation generation as extractive summarization, and propose to automatically select a sufficient subset of t…

    Submitted 13 December, 2021; originally announced December 2021.

  43. arXiv:2109.07102  [pdf, other]

    cs.CL

    Can Edge Probing Tasks Reveal Linguistic Knowledge in QA Models?

    Authors: Sagnik Ray Choudhury, Nikita Bhutani, Isabelle Augenstein

    Abstract: There have been many efforts to try to understand what grammatical knowledge (e.g., ability to understand the part of speech of a token) is encoded in large pre-trained language models (LM). This is done through `Edge Probing' (EP) tests: supervised classification tasks to predict the grammatical properties of a span (whether it has a particular part of speech) using only the token representations…

    Submitted 7 September, 2022; v1 submitted 15 September, 2021; originally announced September 2021.

    Comments: Accepted at COLING 2022

  44. arXiv:2109.07022  [pdf, other]

    cs.CY

    How Does Counterfactually Augmented Data Impact Models for Social Computing Constructs?

    Authors: Indira Sen, Mattia Samory, Fabian Floeck, Claudia Wagner, Isabelle Augenstein

    Abstract: As NLP models are increasingly deployed in socially situated settings such as online abusive content detection, it is crucial to ensure that these models are robust. One way of improving model robustness is to generate counterfactually augmented data (CAD) for training models that can better learn to distinguish between core features and data artifacts. While models trained on this type of data ha…

    Submitted 14 September, 2021; originally announced September 2021.

    Comments: Preprint of a paper accepted to EMNLP 2021

  45. arXiv:2109.06050  [pdf, other]

    cs.CL cs.LG

    Few-Shot Cross-Lingual Stance Detection with Sentiment-Based Pre-Training

    Authors: Momchil Hardalov, Arnav Arora, Preslav Nakov, Isabelle Augenstein

    Abstract: The goal of stance detection is to determine the viewpoint expressed in a piece of text towards a target. These viewpoints or contexts are often expressed in many different languages depending on the user and the platform, which can be a local news outlet, a social media platform, a news forum, etc. Most research in stance detection, however, has been limited to working with a single language and…

    Submitted 21 December, 2021; v1 submitted 13 September, 2021; originally announced September 2021.

    Comments: Accepted to AAAI 2022 (Preprint version)

  46. arXiv:2109.03756  [pdf, other]

    cs.LG

    Diagnostics-Guided Explanation Generation

    Authors: Pepa Atanasova, Jakob Grue Simonsen, Christina Lioma, Isabelle Augenstein

    Abstract: Explanations shed light on a machine learning model's rationales and can aid in identifying deficiencies in its reasoning process. Explanation generation models are typically trained in a supervised way given human explanations. When such annotations are not available, explanations are often selected as those portions of the input that maximise a downstream task's performance, which corresponds to…

    Submitted 8 September, 2021; originally announced September 2021.

    ACM Class: I.2.7

  47. arXiv:2108.13493  [pdf, other]

    cs.CL cs.LG

    Semi-Supervised Exaggeration Detection of Health Science Press Releases

    Authors: Dustin Wright, Isabelle Augenstein

    Abstract: Public trust in science depends on honest and factual communication of scientific papers. However, recent studies have demonstrated a tendency of news media to misrepresent scientific papers by exaggerating their findings. Given this, we present a formalization of and study into the problem of exaggeration detection in science communication. While there are an abundance of scientific papers and po…

    Submitted 30 August, 2021; originally announced August 2021.

    Comments: Accepted to EMNLP 2021; 13 pages, 6 figures, 9 tables

  48. arXiv:2108.10274  [pdf, other]

    cs.CL stat.ML

    Towards Explainable Fact Checking

    Authors: Isabelle Augenstein

    Abstract: The past decade has seen a substantial rise in the amount of mis- and disinformation online, from targeted disinformation campaigns to influence politics, to the unintentional spreading of misinformation about public health. This development has spurred research in the area of automatic fact checking, from approaches to detect check-worthy claims and determining the stance of tweets towards claims…

    Submitted 8 December, 2021; v1 submitted 23 August, 2021; originally announced August 2021.

    Comments: Thesis presented to the University of Copenhagen Faculty of Science in partial fulfillment of the requirements for the degree of Doctor Scientiarum (Dr. Scient.)

  49. arXiv:2107.12708  [pdf, other]

    cs.CL cs.AI

    QA Dataset Explosion: A Taxonomy of NLP Resources for Question Answering and Reading Comprehension

    Authors: Anna Rogers, Matt Gardner, Isabelle Augenstein

    Abstract: Alongside huge volumes of research on deep learning models in NLP in the recent years, there has been also much work on benchmark datasets needed to track modeling progress. Question answering and reading comprehension have been particularly prolific in this regard, with over 80 new datasets appearing in the past two years. This study is the largest survey of the field to date. We provide an overv…

    Submitted 19 September, 2022; v1 submitted 27 July, 2021; originally announced July 2021.

    Comments: Published in ACM Comput. Surv (2022). This version differs from the final version in that section 7 ("Languages") is in the main paper rather than the supplementary materials

  50. arXiv:2106.01087  [pdf, other]

    cs.CL

    Is Sparse Attention more Interpretable?

    Authors: Clara Meister, Stefan Lazov, Isabelle Augenstein, Ryan Cotterell

    Abstract: Sparse attention has been claimed to increase model interpretability under the assumption that it highlights influential inputs. Yet the attention distribution is typically over representations internal to the model rather than the inputs themselves, suggesting this assumption may not have merit. We build on the recent work exploring the interpretability of attention; we design a set of experiment…

    Submitted 8 June, 2021; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: ACL 2021

    Journal ref: Proceedings of ACL-IJCNLP 2021