Showing 1–26 of 26 results for author: Navigli, R

Searching in archive cs.
  1. arXiv:2408.13831  [pdf, other]

    cs.CL cs.AI

    Guardians of the Machine Translation Meta-Evaluation: Sentinel Metrics Fall In!

    Authors: Stefano Perrella, Lorenzo Proietti, Alessandro Scirè, Edoardo Barba, Roberto Navigli

    Abstract: Annually, at the Conference on Machine Translation (WMT), the Metrics Shared Task organizers conduct the meta-evaluation of Machine Translation (MT) metrics, ranking them according to their correlation with human judgments. Their results guide researchers toward enhancing the next generation of metrics and MT systems. With the recent introduction of neural metrics, the field has witnessed notable…

    Submitted 25 August, 2024; originally announced August 2024.

    Comments: Presented at ACL 2024 Main Conference. 29 pages

  2. arXiv:2408.09794  [pdf, other]

    cs.AI cs.CL

    AutoML-guided Fusion of Entity and LLM-based representations

    Authors: Boshko Koloski, Senja Pollak, Roberto Navigli, Blaž Škrlj

    Abstract: Large semantic knowledge bases are grounded in factual knowledge. However, recent approaches to dense text representations (embeddings) do not efficiently exploit these resources. Dense and robust representations of documents are essential for effectively solving downstream classification and retrieval tasks. This work demonstrates that injecting embedded information from knowledge bases can augme…

    Submitted 19 August, 2024; originally announced August 2024.

  3. arXiv:2408.00103  [pdf, other]

    cs.CL cs.AI

    ReLiK: Retrieve and LinK, Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget

    Authors: Riccardo Orlando, Pere-Lluis Huguet-Cabot, Edoardo Barba, Roberto Navigli

    Abstract: Entity Linking (EL) and Relation Extraction (RE) are fundamental tasks in Natural Language Processing, serving as critical components in a wide range of applications. In this paper, we propose ReLiK, a Retriever-Reader architecture for both EL and RE, where, given an input text, the Retriever module undertakes the identification of candidate entities or relations that could potentially appear with…

    Submitted 31 July, 2024; originally announced August 2024.

    Comments: To be presented at ACL 2024

  4. arXiv:2407.21489  [pdf, other]

    cs.CL cs.AI

    Maverick: Efficient and Accurate Coreference Resolution Defying Recent Trends

    Authors: Giuliano Martinelli, Edoardo Barba, Roberto Navigli

    Abstract: Large autoregressive generative models have emerged as the cornerstone for achieving the highest performance across several Natural Language Processing tasks. However, the urge to attain superior results has, at times, led to the premature replacement of carefully designed task-specific approaches without exhaustive experimentation. The Coreference Resolution task is no exception; all recent state…

    Submitted 31 July, 2024; originally announced July 2024.

    Comments: Accepted at main conference of ACL 2024. 15 pages

  5. arXiv:2404.08676  [pdf, other]

    cs.CL cs.CY cs.LG

    ALERT: A Comprehensive Benchmark for Assessing Large Language Models' Safety through Red Teaming

    Authors: Simone Tedeschi, Felix Friedrich, Patrick Schramowski, Kristian Kersting, Roberto Navigli, Huu Nguyen, Bo Li

    Abstract: When building Large Language Models (LLMs), it is paramount to bear safety in mind and protect them with guardrails. Indeed, LLMs should never generate content promoting or normalizing harmful, illegal, or unethical behavior that may contribute to harm to individuals or society. This principle applies to both normal and adversarial use. In response, we introduce ALERT, a large-scale benchmark to a…

    Submitted 24 June, 2024; v1 submitted 6 April, 2024; originally announced April 2024.

    Comments: 17 pages, preprint

    MSC Class: I.2

  6. arXiv:2404.00399  [pdf, other]

    cs.CL cs.AI cs.LG

    Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

    Authors: Taishi Nakamura, Mayank Mishra, Simone Tedeschi, Yekun Chai, Jason T Stillerman, Felix Friedrich, Prateek Yadav, Tanmay Laud, Vu Minh Chien, Terry Yue Zhuo, Diganta Misra, Ben Bogin, Xuan-Son Vu, Marzena Karpinska, Arnav Varma Dantuluri, Wojciech Kusa, Tommaso Furlanello, Rio Yokota, Niklas Muennighoff, Suhas Pai, Tosin Adewumi, Veronika Laippala, Xiaozhe Yao, Adalberto Junior, Alpay Ariyak, et al. (20 additional authors not shown)

    Abstract: Pretrained language models underpin several AI applications, but their high computational cost for training limits accessibility. Initiatives such as BLOOM and StarCoder aim to democratize access to pretrained models for collaborative community development. However, such existing models face challenges: limited multilingual capabilities, continual pretraining causing catastrophic forgetting, where…

    Submitted 23 April, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

    Comments: Preprint

  7. arXiv:2403.02270  [pdf, other]

    cs.CL

    FENICE: Factuality Evaluation of summarization based on Natural language Inference and Claim Extraction

    Authors: Alessandro Scirè, Karim Ghonim, Roberto Navigli

    Abstract: Recent advancements in text summarization, particularly with the advent of Large Language Models (LLMs), have shown remarkable performance. However, a notable challenge persists as a substantial number of automatically-generated summaries exhibit factual inconsistencies, such as hallucinations. In response to this issue, various approaches for the evaluation of consistency for summarization have e…

    Submitted 31 August, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: ACL 2024 camera ready. Code and data at https://github.com/Babelscape/FENICE

  8. arXiv:2310.14050  [pdf, other]

    cs.CL

    Code-Switching with Word Senses for Pretraining in Neural Machine Translation

    Authors: Vivek Iyer, Edoardo Barba, Alexandra Birch, Jeff Z. Pan, Roberto Navigli

    Abstract: Lexical ambiguity is a significant and pervasive challenge in Neural Machine Translation (NMT), with many state-of-the-art (SOTA) NMT systems struggling to handle polysemous words (Campolungo et al., 2022). The same holds for the NMT pretraining paradigm of denoising synthetic "code-switched" text (Pan et al., 2021; Iyer et al., 2023), where word senses are ignored in the noising stage -- leading…

    Submitted 21 October, 2023; originally announced October 2023.

    Comments: EMNLP (Findings) 2023 Long Paper

  9. arXiv:2307.01870  [pdf, other]

    cs.CL cs.AI

    Exploring Non-Verbal Predicates in Semantic Role Labeling: Challenges and Opportunities

    Authors: Riccardo Orlando, Simone Conia, Roberto Navigli

    Abstract: Although we have witnessed impressive progress in Semantic Role Labeling (SRL), most of the research in the area is carried out assuming that the majority of predicates are verbs. Conversely, predicates can also be expressed using other parts of speech, e.g., nouns and adjectives. However, non-verbal predicates appear in the benchmarks we commonly use to measure progress in SRL less frequently tha…

    Submitted 4 July, 2023; originally announced July 2023.

    Comments: Accepted at Findings of ACL 2023

  10. arXiv:2306.13467  [pdf, other]

    cs.CL cs.AI

    Incorporating Graph Information in Transformer-based AMR Parsing

    Authors: Pavlo Vasylenko, Pere-Lluís Huguet Cabot, Abelardo Carlos Martínez Lorenzo, Roberto Navigli

    Abstract: Abstract Meaning Representation (AMR) is a Semantic Parsing formalism that aims at providing a semantic graph abstraction representing a given text. Current approaches are based on autoregressive language models such as BART or T5, fine-tuned through Teacher Forcing to obtain a linearized version of the AMR graph from a sentence. In this paper, we present LeakDistill, a model and method that explo…

    Submitted 23 June, 2023; originally announced June 2023.

    Comments: ACL 2023. Please cite authors correctly using both lastnames ("Martínez Lorenzo", "Huguet Cabot")

  11. arXiv:2306.10786  [pdf, other]

    cs.CL cs.AI

    AMRs Assemble! Learning to Ensemble with Autoregressive Models for AMR Parsing

    Authors: Abelardo Carlos Martínez Lorenzo, Pere-Lluís Huguet Cabot, Roberto Navigli

    Abstract: In this paper, we examine the current state-of-the-art in AMR parsing, which relies on ensemble strategies by merging multiple graph predictions. Our analysis reveals that the present models often violate AMR structural constraints. To address this issue, we develop a validation method, and show how ensemble models can exploit SMATCH metric weaknesses to obtain higher scores, but sometimes result…

    Submitted 19 June, 2023; originally announced June 2023.

    Comments: ACL 2023. Please cite authors correctly using both lastnames ("Martínez Lorenzo", "Huguet Cabot")

  12. arXiv:2306.09802  [pdf, other]

    cs.CL

    RED$^{\rm FM}$: a Filtered and Multilingual Relation Extraction Dataset

    Authors: Pere-Lluís Huguet Cabot, Simone Tedeschi, Axel-Cyrille Ngonga Ngomo, Roberto Navigli

    Abstract: Relation Extraction (RE) is a task that identifies relationships between entities in a text, enabling the acquisition of relational facts and bridging the gap between natural language and structured knowledge. However, current RE models often rely on small datasets with low coverage of relation types, particularly when working with languages other than English. In this paper, we address the above…

    Submitted 19 June, 2023; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: ACL 2023. Please cite authors correctly using both lastnames ("Huguet Cabot", "Ngonga Ngomo")

  13. arXiv:2306.04334  [pdf, other]

    cs.CL

    Echoes from Alexandria: A Large Resource for Multilingual Book Summarization

    Authors: Alessandro Scirè, Simone Conia, Simone Ciciliano, Roberto Navigli

    Abstract: In recent years, research in text summarization has mainly focused on the news domain, where texts are typically short and have strong layout features. The task of full-book summarization presents additional challenges which are hard to tackle with current resources, due to their limited size and availability in English only. To overcome these limitations, we present "Echoes from Alexandria", or i…

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: 9 pages, long paper at ACL 2023

  14. arXiv:2305.08414  [pdf, other]

    cs.CL cs.AI

    What's the Meaning of Superhuman Performance in Today's NLU?

    Authors: Simone Tedeschi, Johan Bos, Thierry Declerck, Jan Hajic, Daniel Hershcovich, Eduard H. Hovy, Alexander Koller, Simon Krek, Steven Schockaert, Rico Sennrich, Ekaterina Shutova, Roberto Navigli

    Abstract: In the last five years, there has been a significant focus in Natural Language Processing (NLP) on developing larger Pretrained Language Models (PLMs) and introducing benchmarks such as SuperGLUE and SQuAD to measure their abilities in language understanding, reasoning, and reading comprehension. These PLMs have achieved impressive results on these benchmarks, even surpassing human performance in…

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: 9 pages, long paper at ACL 2023 proceedings

  15. arXiv:2212.01094  [pdf, other]

    cs.CL cs.AI cs.LG

    Semantic Role Labeling Meets Definition Modeling: Using Natural Language to Describe Predicate-Argument Structures

    Authors: Simone Conia, Edoardo Barba, Alessandro Scirè, Roberto Navigli

    Abstract: One of the common traits of past and present approaches for Semantic Role Labeling (SRL) is that they rely upon discrete labels drawn from a predefined linguistic inventory to classify predicate senses and their arguments. However, we argue this need not be the case. In this paper, we present an approach that leverages Definition Modeling to introduce a generalized formulation of SRL as the task o…

    Submitted 2 December, 2022; originally announced December 2022.

  16. arXiv:2210.12846  [pdf, other]

    cs.CL

    EUREKA: EUphemism Recognition Enhanced through Knn-based methods and Augmentation

    Authors: Sedrick Scott Keh, Rohit K. Bharadwaj, Emmy Liu, Simone Tedeschi, Varun Gangal, Roberto Navigli

    Abstract: We introduce EUREKA, an ensemble-based approach for performing automatic euphemism detection. We (1) identify and correct potentially mislabelled rows in the dataset, (2) curate an expanded corpus called EuphAug, (3) leverage model representations of Potentially Euphemistic Terms (PETs), and (4) explore using representations of semantically close sentences to aid in classification. Using our augme…

    Submitted 23 October, 2022; originally announced October 2022.

    Comments: Accepted to EMNLP 2022 Figurative Language Workshop; first place for Euphemism Detection Shared Task. Code at https://github.com/sedrickkeh/EUREKA

  17. arXiv:2210.06164  [pdf, other]

    cs.CL cs.IR

    Focusing on Context is NICE: Improving Overshadowed Entity Disambiguation

    Authors: Vera Provatorova, Simone Tedeschi, Svitlana Vakulenko, Roberto Navigli, Evangelos Kanoulas

    Abstract: Entity disambiguation (ED) is the task of mapping an ambiguous entity mention to the corresponding entry in a structured knowledge base. Previous research showed that entity overshadowing is a significant challenge for existing ED models: when presented with an ambiguous entity mention, the models are much more likely to rank a more frequent yet less contextually relevant entity at the top. Here,…

    Submitted 12 October, 2022; originally announced October 2022.

  18. arXiv:2210.05648  [pdf, other]

    cs.CL cs.AI cs.LG

    Entity Disambiguation with Entity Definitions

    Authors: Luigi Procopio, Simone Conia, Edoardo Barba, Roberto Navigli

    Abstract: Local models have recently attained astounding performances in Entity Disambiguation (ED), with generative and extractive formulations being the most promising research directions. However, previous works limited their studies to using, as the textual representation of each candidate, only its Wikipedia title. Although certainly effective, this strategy presents a few critical issues, especially w…

    Submitted 11 October, 2022; originally announced October 2022.

  19. arXiv:2206.07587  [pdf, other]

    cs.CL

    Cross-lingual AMR Aligner: Paying Attention to Cross-Attention

    Authors: Abelardo Carlos Martínez Lorenzo, Pere-Lluís Huguet Cabot, Roberto Navigli

    Abstract: This paper introduces a novel aligner for Abstract Meaning Representation (AMR) graphs that can scale cross-lingually, and is thus capable of aligning units and spans in sentences of different languages. Our approach leverages modern Transformer-based parsers, which inherently encode alignment information in their cross-attention weights, allowing us to extract this information during parsing. Thi…

    Submitted 19 June, 2023; v1 submitted 15 June, 2022; originally announced June 2022.

    Comments: ACL 2023. Please cite authors correctly using both lastnames ("Martínez Lorenzo", "Huguet Cabot")

  20. arXiv:2003.02320  [pdf, other]

    cs.AI cs.DB cs.LG

    Knowledge Graphs

    Authors: Aidan Hogan, Eva Blomqvist, Michael Cochez, Claudia d'Amato, Gerard de Melo, Claudio Gutierrez, José Emilio Labra Gayo, Sabrina Kirrane, Sebastian Neumaier, Axel Polleres, Roberto Navigli, Axel-Cyrille Ngonga Ngomo, Sabbir M. Rashid, Anisa Rula, Lukas Schmelzeisen, Juan Sequeda, Steffen Staab, Antoine Zimmermann

    Abstract: In this paper we provide a comprehensive introduction to knowledge graphs, which have recently garnered significant attention from both industry and academia in scenarios that require exploiting diverse, dynamic, large-scale collections of data. After some opening remarks, we motivate and contrast various graph-based data models and query languages that are used for knowledge graphs. We discuss th…

    Submitted 11 September, 2021; v1 submitted 4 March, 2020; originally announced March 2020.

    Comments: Revision from v5: Correcting errata from previous version for entailment/models, and some other minor typos

    Journal ref: ACM Comput. Surv. 54(4): 71:1-71:37 (2021)

  21. arXiv:1805.04685  [pdf, other]

    cs.CL

    Huge Automatically Extracted Training Sets for Multilingual Word Sense Disambiguation

    Authors: Tommaso Pasini, Francesco Maria Elia, Roberto Navigli

    Abstract: We release to the community six large-scale sense-annotated datasets in multiple languages to pave the way for supervised multilingual Word Sense Disambiguation. Our datasets cover all the nouns in the English WordNet and their translations in other languages for a total of millions of sense-tagged sentences. Experiments prove that these corpora can be effectively used as training sets for supervis…

    Submitted 12 May, 2018; originally announced May 2018.

  22. Towards a Seamless Integration of Word Senses into Downstream NLP Applications

    Authors: Mohammad Taher Pilehvar, Jose Camacho-Collados, Roberto Navigli, Nigel Collier

    Abstract: Lexical ambiguity can impede NLP systems from accurate understanding of semantics. Despite its potential benefits, the integration of sense-level information into NLP systems has remained understudied. By incorporating a novel disambiguation algorithm into a state-of-the-art classification model, we create a pipeline to integrate sense-level information into downstream NLP applications. We show th…

    Submitted 18 October, 2017; originally announced October 2017.

    Comments: ACL 2017

    Journal ref: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Vancouver, Canada (2017), pages 1857-1869

  23. arXiv:1612.02703  [pdf, other]

    cs.CL

    Embedding Words and Senses Together via Joint Knowledge-Enhanced Training

    Authors: Massimiliano Mancini, Jose Camacho-Collados, Ignacio Iacobacci, Roberto Navigli

    Abstract: Word embeddings are widely used in Natural Language Processing, mainly due to their success in capturing semantic information from massive corpora. However, their creation process does not allow the different meanings of a word to be automatically separated, as it conflates them into a single vector. We address this issue by proposing a new model which learns word and sense embeddings jointly. Our…

    Submitted 21 June, 2017; v1 submitted 8 December, 2016; originally announced December 2016.

    Comments: Accepted in CoNLL 2017. 12 pages

  24. arXiv:1608.06718  [pdf, other]

    cs.CL

    A Large-Scale Multilingual Disambiguation of Glosses

    Authors: José Camacho Collados, Claudio Delli Bovi, Alessandro Raganato, Roberto Navigli

    Abstract: Linking concepts and named entities to knowledge bases has become a crucial Natural Language Understanding task. In this respect, recent works have shown the key advantage of exploiting textual definitions in various Natural Language Processing applications. However, to date there are no reliable large-scale corpora of sense-annotated textual definitions available to the research community. In thi…

    Submitted 24 August, 2016; originally announced August 2016.

    Comments: Accepted in LREC 2016

    Journal ref: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC), 2016, pages 1701-1708, Portoroz, Slovenia

  25. arXiv:1608.00841  [pdf, ps, other]

    cs.CL

    Semantic Representations of Word Senses and Concepts

    Authors: José Camacho-Collados, Ignacio Iacobacci, Roberto Navigli, Mohammad Taher Pilehvar

    Abstract: Representing the semantics of linguistic items in a machine-interpretable form has been a major goal of Natural Language Processing since its earliest days. Among the range of different linguistic items, words have attracted the most research attention. However, word representations have an important limitation: they conflate different meanings of a word into a single vector. Representations of wo…

    Submitted 2 August, 2016; originally announced August 2016.

  26. The CQC Algorithm: Cycling in Graphs to Semantically Enrich and Enhance a Bilingual Dictionary

    Authors: Tiziano Flati, Roberto Navigli

    Abstract: Bilingual machine-readable dictionaries are knowledge resources useful in many automatic tasks. However, compared to monolingual computational lexicons like WordNet, bilingual dictionaries typically provide a lower amount of structured information, such as lexical and semantic relations, and often do not cover the entire range of possible translations for a word of interest. In this paper we prese…

    Submitted 18 January, 2014; originally announced February 2014.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 43, pages 135-171, 2012