Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–6 of 6 results for author: Filice, S

.
  1. arXiv:2404.02422  [pdf, other

    cs.CL cs.LG

    Enhancing Low-Resource LLMs Classification with PEFT and Synthetic Data

    Authors: Parth Patwa, Simone Filice, Zhiyu Chen, Giuseppe Castellucci, Oleg Rokhlenko, Shervin Malmasi

    Abstract: Large Language Models (LLMs) operating in 0-shot or few-shot settings achieve competitive results in Text Classification tasks. In-Context Learning (ICL) typically achieves better accuracy than the 0-shot setting, but it pays in terms of efficiency, due to the longer input prompt. In this paper, we propose a strategy to make LLMs as efficient as 0-shot text classifiers, while getting comparable or… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted at LREC-COLING 2024

  2. The Power of Noise: Redefining Retrieval for RAG Systems

    Authors: Florin Cuconasu, Giovanni Trappolini, Federico Siciliano, Simone Filice, Cesare Campagnano, Yoelle Maarek, Nicola Tonellotto, Fabrizio Silvestri

    Abstract: Retrieval-Augmented Generation (RAG) has recently emerged as a method to extend beyond the pre-trained knowledge of Large Language Models by augmenting the original prompt with relevant passages or documents retrieved by an Information Retrieval (IR) system. RAG has become increasingly important for Generative AI solutions, especially in enterprise settings or in any domain in which knowledge is c… ▽ More

    Submitted 1 May, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

  3. arXiv:2311.12534  [pdf, other

    cs.CL

    Evaluation Metrics of Language Generation Models for Synthetic Traffic Generation Tasks

    Authors: Simone Filice, Jason Ingyu Choi, Giuseppe Castellucci, Eugene Agichtein, Oleg Rokhlenko

    Abstract: Many Natural Language Generation (NLG) tasks aim to generate a single output text given an input prompt. Other settings require the generation of multiple texts, e.g., for Synthetic Traffic Generation (STG). This generation task is crucial for training and evaluating QA systems as well as conversational agents, where the goal is to generate multiple questions or utterances resembling the linguisti… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  4. arXiv:2305.14793  [pdf, other

    cs.CL

    Faithful Low-Resource Data-to-Text Generation through Cycle Training

    Authors: Zhuoer Wang, Marcus Collins, Nikhita Vedula, Simone Filice, Shervin Malmasi, Oleg Rokhlenko

    Abstract: Methods to generate text from structured data have advanced significantly in recent years, primarily due to fine-tuning of pre-trained language models on large datasets. However, such models can fail to produce output faithful to the input data, particularly on out-of-domain data. Sufficient annotated data is often not available for specific domains, leading us to seek an unsupervised approach to… ▽ More

    Submitted 11 July, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: 19 pages, 4 figures, ACL 2023

  5. arXiv:2302.11074  [pdf, other

    cs.CL cs.AI cs.LG

    Preventing Catastrophic Forgetting in Continual Learning of New Natural Language Tasks

    Authors: Sudipta Kar, Giuseppe Castellucci, Simone Filice, Shervin Malmasi, Oleg Rokhlenko

    Abstract: Multi-Task Learning (MTL) is widely-accepted in Natural Language Processing as a standard technique for learning multiple related tasks in one model. Training an MTL model requires having the training data for all tasks available at the same time. As systems usually evolve over time, (e.g., to support new functionalities), adding a new task to an existing MTL model usually requires retraining the… ▽ More

    Submitted 21 February, 2023; originally announced February 2023.

    Comments: KDD 2022

  6. arXiv:1911.08755  [pdf, ps, other

    cs.CL cs.AI cs.IR cs.LO

    Global Thread-Level Inference for Comment Classification in Community Question Answering

    Authors: Shafiq Joty, Alberto Barrón-Cedeño, Giovanni Da San Martino, Simone Filice, Lluís Màrquez, Alessandro Moschitti, Preslav Nakov

    Abstract: Community question answering, a recent evolution of question answering in the Web context, allows a user to quickly consult the opinion of a number of people on a particular topic, thus taking advantage of the wisdom of the crowd. Here we try to help the user by deciding automatically which answers are good and which are bad for a given question. In particular, we focus on exploiting the output st… ▽ More

    Submitted 20 November, 2019; originally announced November 2019.

    Comments: community question answering, thread-level inference, graph-cut, inductive logic programming

    MSC Class: 68T50 ACM Class: I.2.7

    Journal ref: EMNLP-2015