Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–9 of 9 results for author: Castellucci, G

.
  1. arXiv:2406.05255  [pdf, other

    cs.CL cs.AI

    Generative Explore-Exploit: Training-free Optimization of Generative Recommender Systems using LLM Optimizers

    Authors: Lütfi Kerem Senel, Besnik Fetahu, Davis Yoshida, Zhiyu Chen, Giuseppe Castellucci, Nikhita Vedula, Jason Choi, Shervin Malmasi

    Abstract: Recommender systems are widely used to suggest engaging content, and Large Language Models (LLMs) have given rise to generative recommenders. Such systems can directly generate items, including for open-set tasks like question suggestion. While the world knowledge of LLMs enable good recommendations, improving the generated content through user feedback is challenging as continuously fine-tuning L… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted at ACL 2024 Main Proceedings

  2. arXiv:2404.06659  [pdf, other

    cs.CL

    Leveraging Interesting Facts to Enhance User Engagement with Conversational Interfaces

    Authors: Nikhita Vedula, Giuseppe Castellucci, Eugene Agichtein, Oleg Rokhlenko, Shervin Malmasi

    Abstract: Conversational Task Assistants (CTAs) guide users in performing a multitude of activities, such as making recipes. However, ensuring that interactions remain engaging, interesting, and enjoyable for CTA users is not trivial, especially for time-consuming or challenging tasks. Grounded in psychological theories of human interest, we propose to engage users with contextual and interesting statements… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 10 pages, 1 figure

  3. arXiv:2404.02422  [pdf, other

    cs.CL cs.LG

    Enhancing Low-Resource LLMs Classification with PEFT and Synthetic Data

    Authors: Parth Patwa, Simone Filice, Zhiyu Chen, Giuseppe Castellucci, Oleg Rokhlenko, Shervin Malmasi

    Abstract: Large Language Models (LLMs) operating in 0-shot or few-shot settings achieve competitive results in Text Classification tasks. In-Context Learning (ICL) typically achieves better accuracy than the 0-shot setting, but it pays in terms of efficiency, due to the longer input prompt. In this paper, we propose a strategy to make LLMs as efficient as 0-shot text classifiers, while getting comparable or… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted at LREC-COLING 2024

  4. arXiv:2311.12534  [pdf, other

    cs.CL

    Evaluation Metrics of Language Generation Models for Synthetic Traffic Generation Tasks

    Authors: Simone Filice, Jason Ingyu Choi, Giuseppe Castellucci, Eugene Agichtein, Oleg Rokhlenko

    Abstract: Many Natural Language Generation (NLG) tasks aim to generate a single output text given an input prompt. Other settings require the generation of multiple texts, e.g., for Synthetic Traffic Generation (STG). This generation task is crucial for training and evaluating QA systems as well as conversational agents, where the goal is to generate multiple questions or utterances resembling the linguisti… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  5. arXiv:2310.17034  [pdf, other

    cs.CL

    Follow-on Question Suggestion via Voice Hints for Voice Assistants

    Authors: Besnik Fetahu, Pedro Faustini, Giuseppe Castellucci, Anjie Fang, Oleg Rokhlenko, Shervin Malmasi

    Abstract: The adoption of voice assistants like Alexa or Siri has grown rapidly, allowing users to instantly access information via voice search. Query suggestion is a standard feature of screen-based search experiences, allowing users to explore additional topics. However, this is not trivial to implement in voice-based settings. To enable this, we tackle the novel task of suggesting questions with compact… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: Accepted as Long Paper at EMNLP'23 Findings

  6. arXiv:2302.11074  [pdf, other

    cs.CL cs.AI cs.LG

    Preventing Catastrophic Forgetting in Continual Learning of New Natural Language Tasks

    Authors: Sudipta Kar, Giuseppe Castellucci, Simone Filice, Shervin Malmasi, Oleg Rokhlenko

    Abstract: Multi-Task Learning (MTL) is widely-accepted in Natural Language Processing as a standard technique for learning multiple related tasks in one model. Training an MTL model requires having the training data for all tasks available at the same time. As systems usually evolve over time, (e.g., to support new functionalities), adding a new task to an existing MTL model usually requires retraining the… ▽ More

    Submitted 21 February, 2023; originally announced February 2023.

    Comments: KDD 2022

  7. arXiv:2209.06321  [pdf, other

    cs.CL cs.AI cs.HC

    Alexa, Let's Work Together: Introducing the First Alexa Prize TaskBot Challenge on Conversational Task Assistance

    Authors: Anna Gottardi, Osman Ipek, Giuseppe Castellucci, Shui Hu, Lavina Vaz, Yao Lu, Anju Khatri, Anjali Chadha, Desheng Zhang, Sattvik Sahai, Prerna Dwivedi, Hangjie Shi, Lucy Hu, Andy Huang, Luke Dai, Bofei Yang, Varun Somani, Pankaj Rajan, Ron Rezac, Michael Johnston, Savanna Stiff, Leslie Ball, David Carmel, Yang Liu, Dilek Hakkani-Tur , et al. (5 additional authors not shown)

    Abstract: Since its inception in 2016, the Alexa Prize program has enabled hundreds of university students to explore and compete to develop conversational agents through the SocialBot Grand Challenge. The goal of the challenge is to build agents capable of conversing coherently and engagingly with humans on popular topics for 20 minutes, while achieving an average rating of at least 4.0/5.0. However, as co… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: 14 pages, Proceedings of Alexa Prize Taskbot (Alexa Prize 2021)

    ACM Class: I.2.7; J.0; H.5.1; H.5.2

  8. arXiv:1907.07526  [pdf, other

    cs.CL

    Almawave-SLU: A new dataset for SLU in Italian

    Authors: Valentina Bellomaria, Giuseppe Castellucci, Andrea Favalli, Raniero Romagnoli

    Abstract: The widespread use of conversational and question answering systems made it necessary to improve the performances of speaker intent detection and understanding of related semantic slots, i.e., Spoken Language Understanding (SLU). Often, these tasks are approached with supervised learning methods, which needs considerable labeled datasets. This paper presents the first Italian dataset for SLU. It i… ▽ More

    Submitted 17 July, 2019; originally announced July 2019.

  9. arXiv:1907.02884  [pdf, other

    cs.CL cs.LG

    Multi-lingual Intent Detection and Slot Filling in a Joint BERT-based Model

    Authors: Giuseppe Castellucci, Valentina Bellomaria, Andrea Favalli, Raniero Romagnoli

    Abstract: Intent Detection and Slot Filling are two pillar tasks in Spoken Natural Language Understanding. Common approaches adopt joint Deep Learning architectures in attention-based recurrent frameworks. In this work, we aim at exploiting the success of "recurrence-less" models for these tasks. We introduce Bert-Joint, i.e., a multi-lingual joint text classification and sequence labeling framework. The ex… ▽ More

    Submitted 5 July, 2019; originally announced July 2019.