Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–9 of 9 results for author: Wanner, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.15238  [pdf, other

    cs.CL cs.CY

    GPT-HateCheck: Can LLMs Write Better Functional Tests for Hate Speech Detection?

    Authors: Yiping Jin, Leo Wanner, Alexander Shvets

    Abstract: Online hate detection suffers from biases incurred in data sampling, annotation, and model pre-training. Therefore, measuring the averaged performance over all examples in held-out test data is inadequate. Instead, we must identify specific model weaknesses and be informed when it is more likely to fail. A recent proposal in this direction is HateCheck, a suite for testing fine-grained model funct… ▽ More

    Submitted 27 May, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: LREC-COLING 2024. Content Warning: This paper contains model outputs that are offensive in nature

  2. User Identity Linkage in Social Media Using Linguistic and Social Interaction Features

    Authors: Despoina Chatzakou, Juan Soler-Company, Theodora Tsikrika, Leo Wanner, Stefanos Vrochidis, Ioannis Kompatsiaris

    Abstract: Social media users often hold several accounts in their effort to multiply the spread of their thoughts, ideas, and viewpoints. In the particular case of objectionable content, users tend to create multiple accounts to bypass the combating measures enforced by social media platforms and thus retain their online identity even if some of their accounts are suspended. User identity linkage aims to re… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

  3. arXiv:2305.02637  [pdf, other

    cs.CL cs.AI cs.CY

    Towards Weakly-Supervised Hate Speech Classification Across Datasets

    Authors: Yiping Jin, Leo Wanner, Vishakha Laxman Kadam, Alexander Shvets

    Abstract: As pointed out by several scholars, current research on hate speech (HS) recognition is characterized by unsystematic data creation strategies and diverging annotation schemata. Subsequently, supervised-learning models tend to generalize poorly to datasets they were not trained on, and the performance of the models trained on datasets labeled using different HS taxonomies cannot be compared. To ea… ▽ More

    Submitted 27 May, 2024; v1 submitted 4 May, 2023; originally announced May 2023.

    Comments: WOAH 7@ACL 2023

  4. arXiv:2305.01633  [pdf, other

    cs.CL

    Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP

    Authors: Anya Belz, Craig Thomson, Ehud Reiter, Gavin Abercrombie, Jose M. Alonso-Moral, Mohammad Arvan, Anouck Braggaar, Mark Cieliebak, Elizabeth Clark, Kees van Deemter, Tanvi Dinkar, Ondřej Dušek, Steffen Eger, Qixiang Fang, Mingqi Gao, Albert Gatt, Dimitra Gkatzia, Javier González-Corbelle, Dirk Hovy, Manuela Hürlimann, Takumi Ito, John D. Kelleher, Filip Klubicka, Emiel Krahmer, Huiyuan Lai , et al. (17 additional authors not shown)

    Abstract: We report our efforts in identifying a set of previous human evaluations in NLP that would be suitable for a coordinated study examining what makes human evaluations in NLP more/less reproducible. We present our results and findings, which include that just 13\% of papers had (i) sufficiently low barriers to reproduction, and (ii) enough obtainable information, to be considered for reproduction, a… ▽ More

    Submitted 7 August, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

    Comments: 5 pages plus appendix, 4 tables, 1 figure. To appear at "Workshop on Insights from Negative Results in NLP" (co-located with EACL2023). Updated author list and acknowledgements

    MSC Class: 68 ACM Class: I.2.7

  5. arXiv:2205.11456  [pdf, other

    cs.CL

    Multilingual Extraction and Categorization of Lexical Collocations with Graph-aware Transformers

    Authors: Luis Espinosa-Anke, Alexander Shvets, Alireza Mohammadshahi, James Henderson, Leo Wanner

    Abstract: Recognizing and categorizing lexical collocations in context is useful for language learning, dictionary compilation and downstream NLP. However, it is a challenging task due to the varying degrees of frozenness lexical collocations exhibit. In this paper, we put forward a sequence tagging BERT-based model enhanced with a graph-aware transformer architecture, which we evaluate on the task of collo… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: Accepted to *SEM2022

  6. arXiv:2109.03160  [pdf, other

    cs.CL

    How much pretraining data do language models need to learn syntax?

    Authors: Laura Pérez-Mayos, Miguel Ballesteros, Leo Wanner

    Abstract: Transformers-based pretrained language models achieve outstanding results in many well-known NLU benchmarks. However, while pretraining methods are very convenient, they are expensive in terms of time and resources. This calls for a study of the impact of pretraining data size on the knowledge of the models. We explore this impact on the syntactic capabilities of RoBERTa, using models trained on i… ▽ More

    Submitted 9 September, 2021; v1 submitted 7 September, 2021; originally announced September 2021.

    Comments: To be published in proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021)

  7. arXiv:2105.04688  [pdf, other

    cs.CL

    Assessing the Syntactic Capabilities of Transformer-based Multilingual Language Models

    Authors: Laura Pérez-Mayos, Alba Táboas García, Simon Mille, Leo Wanner

    Abstract: Multilingual Transformer-based language models, usually pretrained on more than 100 languages, have been shown to achieve outstanding results in a wide range of cross-lingual transfer tasks. However, it remains unknown whether the optimization for different languages conditions the capacity of the models to generalize over syntactic structures, and how languages with syntactic phenomena of differe… ▽ More

    Submitted 10 May, 2021; originally announced May 2021.

    Comments: To be published in Findings of ACL 2021

  8. arXiv:2101.11492  [pdf, other

    cs.CL

    On the Evolution of Syntactic Information Encoded by BERT's Contextualized Representations

    Authors: Laura Pérez-Mayos, Roberto Carlini, Miguel Ballesteros, Leo Wanner

    Abstract: The adaptation of pretrained language models to solve supervised tasks has become a baseline in NLP, and many recent works have focused on studying how linguistic information is encoded in the pretrained sentence representations. Among other information, it has been shown that entire syntax trees are implicitly embedded in the geometry of such models. As these models are often fine-tuned, it becom… ▽ More

    Submitted 10 February, 2021; v1 submitted 27 January, 2021; originally announced January 2021.

  9. arXiv:2008.11295  [pdf, other

    cs.CL

    Concept Extraction Using Pointer-Generator Networks

    Authors: Alexander Shvets, Leo Wanner

    Abstract: Concept extraction is crucial for a number of downstream applications. However, surprisingly enough, straightforward single token/nominal chunk-concept alignment or dictionary lookup techniques such as DBpedia Spotlight still prevail. We propose a generic open-domain OOV-oriented extractive model that is based on distant supervision of a pointer-generator network leveraging bidirectional LSTMs and… ▽ More

    Submitted 25 August, 2020; originally announced August 2020.

    Comments: Contribution to the Proceedings of the 22nd International Conference on Knowledge Engineering and Knowledge Management (EKAW 2020). A link to the final authenticated publication will be added once it is available online. Keywords: Open-domain discourse texts, Concept extraction, Pointer-generator neural network, Distant supervision