Showing 1–26 of 26 results for author: Antognini, D

Searching in archive cs.
  1. arXiv:2405.17258  [pdf, other]

    cs.LG cs.AI

    Trans-LoRA: towards data-free Transferable Parameter Efficient Finetuning

    Authors: Runqian Wang, Soumya Ghosh, David Cox, Diego Antognini, Aude Oliva, Rogerio Feris, Leonid Karlinsky

    Abstract: Low-rank adapters (LoRA) and their variants are popular parameter-efficient fine-tuning (PEFT) techniques that closely match full model fine-tune performance while requiring only a small number of additional parameters. These additional LoRA parameters are specific to the base model being adapted. When the base model needs to be deprecated and replaced with a new one, all the associated LoRA modul…

    Submitted 27 May, 2024; originally announced May 2024.

  2. arXiv:2404.11500  [pdf, other]

    cs.CL cs.AI

    Paraphrase and Solve: Exploring and Exploiting the Impact of Surface Form on Mathematical Reasoning in Large Language Models

    Authors: Yue Zhou, Yada Zhu, Diego Antognini, Yoon Kim, Yang Zhang

    Abstract: This paper studies the relationship between the surface form of a mathematical problem and its solvability by large language models. We find that subtle alterations in the surface form can significantly impact the answer distribution and the solve rate, exposing the language model's lack of robustness and sensitivity to the surface form in reasoning through complex problems. To improve mathematica…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: Accepted to the main conference of NAACL (2024)

  3. ESG Accountability Made Easy: DocQA at Your Service

    Authors: Lokesh Mishra, Cesar Berrospi, Kasper Dinkla, Diego Antognini, Francesco Fusco, Benedikt Bothur, Maksym Lysak, Nikolaos Livathinos, Ahmed Nassar, Panagiotis Vagenas, Lucas Morin, Christoph Auer, Michele Dolfi, Peter Staar

    Abstract: We present Deep Search DocQA. This application enables information extraction from documents via a question-answering conversational assistant. The system integrates several technologies from different AI disciplines consisting of document conversion to machine-readable format (via computer vision), finding relevant data (via natural language processing), and formulating an eloquent response (via…

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: Accepted at the Demonstration Track of the 38th Annual AAAI Conference on Artificial Intelligence (AAAI 24)

    Journal ref: AAAI 2024, 38, 23814-23816

  4. arXiv:2305.15867  [pdf, other]

    cs.CL cs.AI cs.LG

    Extracting Text Representations for Terms and Phrases in Technical Domains

    Authors: Francesco Fusco, Diego Antognini

    Abstract: Extracting dense representations for terms and phrases is a task of great importance for knowledge discovery platforms targeting highly-technical fields. Dense representations are used as features for downstream components and have multiple applications ranging from ranking results in search to summarization. Common approaches to create dense representations include training domain-specific embedd…

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: Accepted at ACL 2023 (industry). 10 pages, 3 figures, 5 tables

  5. arXiv:2210.13118  [pdf, other]

    cs.CL cs.AI cs.LG

    Unsupervised Term Extraction for Highly Technical Domains

    Authors: Francesco Fusco, Peter Staar, Diego Antognini

    Abstract: Term extraction is an information extraction task at the root of knowledge discovery platforms. Developing term extractors that are able to generalize across very diverse and potentially highly technical domains is challenging, as annotations for domains requiring in-depth expertise are scarce and expensive to obtain. In this paper, we describe the term extraction subsystem of a commercial knowled…

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: Accepted at EMNLP 2022 (industry). 8 pages, 3 figures, 3 tables

  6. arXiv:2210.10586  [pdf, other]

    cs.CV cs.AI cs.LG

    Active Learning for Imbalanced Civil Infrastructure Data

    Authors: Thomas Frick, Diego Antognini, Mattia Rigotti, Ioana Giurgiu, Benjamin Grewe, Cristiano Malossi

    Abstract: Aging civil infrastructures are closely monitored by engineers for damage and critical defects. As the manual inspection of such large structures is costly and time-consuming, we are working towards fully automating the visual inspections to support the prioritization of maintenance activities. To that end we combine recent advances in drone technology and deep learning. Unfortunately, annotation…

    Submitted 19 October, 2022; originally announced October 2022.

  7. arXiv:2205.07268  [pdf, other]

    cs.LG cs.CL cs.IR

    Textual Explanations and Critiques in Recommendation Systems

    Authors: Diego Antognini

    Abstract: Artificial intelligence and machine learning algorithms have become ubiquitous. Although they offer a wide range of benefits, their adoption in decision-critical fields is limited by their lack of interpretability, particularly with textual data. Moreover, with more data available than ever before, it has become increasingly important to explain automated predictions. Generally, users find it di…

    Submitted 26 January, 2023; v1 submitted 15 May, 2022; originally announced May 2022.

    Comments: Ph.D. Thesis, Ecole Polytechnique Fédérale de Lausanne (EPFL). See https://infoscience.epfl.ch/record/292597 for the original version. This current version fixes two references

  8. arXiv:2205.06756  [pdf, other]

    cs.CL cs.IR cs.LG

    Interlock-Free Multi-Aspect Rationalization for Text Classification

    Authors: Shuangqi Li, Diego Antognini, Boi Faltings

    Abstract: Explanation is important for text classification tasks. One prevalent type of explanation is rationales, which are text snippets of input text that suffice to yield the prediction and are meaningful to humans. A lot of research on rationalization has been based on the selective rationalization framework, which has recently been shown to be problematic due to the interlocking dynamics. In this pape…

    Submitted 13 May, 2022; originally announced May 2022.

    Comments: Work in progress. 5 pages, 2 figures, 1 table

  9. arXiv:2205.02454  [pdf, other]

    cs.CL cs.IR cs.LG

    Assistive Recipe Editing through Critiquing

    Authors: Diego Antognini, Shuyang Li, Boi Faltings, Julian McAuley

    Abstract: There has recently been growing interest in the automatic generation of cooking recipes that satisfy some form of dietary restrictions, thanks in part to the availability of online recipe data. Prior studies have used pre-trained language models, or relied on small paired recipe data (e.g., a recipe paired with a similar one that satisfies a dietary constraint). However, pre-trained language model…

    Submitted 26 January, 2023; v1 submitted 5 May, 2022; originally announced May 2022.

    Comments: Accepted at EACL 2023. 10 pages, 2 figures, 6 tables, 1 algorithm

  10. arXiv:2204.02162  [pdf, other]

    cs.IR cs.AI cs.LG

    Positive and Negative Critiquing for VAE-based Recommenders

    Authors: Diego Antognini, Boi Faltings

    Abstract: Providing explanations for recommended items allows users to refine the recommendations by critiquing parts of the explanations. As a result of revisiting critiquing from the perspective of multimodal generative models, recent work has proposed M&Ms-VAE, which achieves state-of-the-art performance in terms of recommendation, explanation, and critiquing. M&Ms-VAE and similar models allow users to n…

    Submitted 5 April, 2022; originally announced April 2022.

    Comments: 5 pages, 2 figures, 2 tables

  11. arXiv:2202.04350  [pdf, other]

    cs.CL cs.AI cs.LG

    pNLP-Mixer: an Efficient all-MLP Architecture for Language

    Authors: Francesco Fusco, Damian Pascual, Peter Staar, Diego Antognini

    Abstract: Large pre-trained language models based on transformer architecture have drastically changed the natural language processing (NLP) landscape. However, deploying those models for on-device applications in constrained devices such as smart watches is completely impractical due to their size and inference cost. As an alternative to transformer-based architectures, recent work on efficient NLP has sho…

    Submitted 25 May, 2023; v1 submitted 9 February, 2022; originally announced February 2022.

    Comments: Accepted at ACL 2023 (industry). 8 pages, 2 figures, 4 tables

  12. arXiv:2107.06416  [pdf, other]

    cs.IR

    Multi-Step Critiquing User Interface for Recommender Systems

    Authors: Diana Petrescu, Diego Antognini, Boi Faltings

    Abstract: Recommendations with personalized explanations have been shown to increase user trust and perceived quality and help users make better decisions. Moreover, such explanations allow users to provide feedback by critiquing them. Several algorithms for recommender systems with multi-step critiquing have therefore been developed. However, providing a user-friendly interface based on personalized explan…

    Submitted 5 August, 2021; v1 submitted 13 July, 2021; originally announced July 2021.

    Comments: Accepted at RecSys 2021. 5 pages, 7 figures

  13. arXiv:2105.04837  [pdf, other]

    cs.CL cs.LG

    Rationalization through Concepts

    Authors: Diego Antognini, Boi Faltings

    Abstract: Automated predictions require explanations to be interpretable by humans. One type of explanation is a rationale, i.e., a selection of input features such as relevant text snippets from which the model computes the outcome. However, a single overall selection does not provide a complete explanation, e.g., weighing several aspects for decisions. To this end, we present a novel self-interpretable mo…

    Submitted 11 May, 2021; originally announced May 2021.

    Comments: Accepted at ACL 2021 (findings). 15 pages, 10 figures, 7 tables

  14. arXiv:2105.00774  [pdf, other]

    cs.IR cs.AI cs.LG

    Fast Multi-Step Critiquing for VAE-based Recommender Systems

    Authors: Diego Antognini, Boi Faltings

    Abstract: Recent studies have shown that providing personalized explanations alongside recommendations increases trust and perceived quality. Furthermore, it gives users an opportunity to refine the recommendations by critiquing parts of the explanations. On one hand, current recommender systems model the recommendation, explanation, and critiquing objectives jointly, but this creates an inherent trade-off…

    Submitted 7 July, 2021; v1 submitted 3 May, 2021; originally announced May 2021.

    Comments: Accepted at RecSys 2021. 19 pages, 7 figures, 5 tables

  15. arXiv:2104.12822  [pdf, other]

    cs.IR cs.LG

    Recommending Burgers based on Pizza Preferences: Addressing Data Sparsity with a Product of Experts

    Authors: Martin Milenkoski, Diego Antognini, Claudiu Musat

    Abstract: In this paper, we describe a method to tackle data sparsity and create recommendations in domains with limited knowledge about user preferences. We expand the variational autoencoder collaborative filtering from a single-domain to a multi-domain setting. The intuition is that user-item interactions in a source domain can augment the recommendation quality in a target domain. The intuition can be t…

    Submitted 7 September, 2021; v1 submitted 26 April, 2021; originally announced April 2021.

    Comments: 10 pages, 2 figures, 1 table, accepted at RecSys 2021 - Workshop on Cross-Market Recommendation (XMRec)

  16. arXiv:2012.03656   

    cs.CL cs.AI

    An Enhanced MeanSum Method For Generating Hotel Multi-Review Summarizations

    Authors: Saibo Geng, Diego Antognini

    Abstract: Multi-document summarization is the process of taking multiple texts as input and producing a short summary text based on the content of input texts. Up until recently, multi-document summarizers were mostly supervised and extractive. However, supervised methods require datasets of large, paired document-summary examples which are rare and expensive to produce. In 2018, an unsupervised multi-document a…

    Submitted 20 April, 2021; v1 submitted 7 December, 2020; originally announced December 2020.

    Comments: Work is not complete and may mislead readers

  17. arXiv:2009.08978  [pdf, other]

    cs.IR cs.AI cs.LG stat.ML

    Modeling Online Behavior in Recommender Systems: The Importance of Temporal Context

    Authors: Milena Filipovic, Blagoj Mitrevski, Diego Antognini, Emma Lejal Glaude, Boi Faltings, Claudiu Musat

    Abstract: Recommender systems research tends to evaluate model performance offline and on randomly sampled targets, yet the same systems are later used to predict user behavior sequentially from a fixed point in time. Simulating online recommender system performance is notoriously difficult and the discrepancy between online and offline behaviors is typically not accounted for in offline evaluations. This d…

    Submitted 5 September, 2021; v1 submitted 19 September, 2020; originally announced September 2020.

    Comments: 11 pages, 3 figures, 2 tables, accepted at RecSys 2021 - Workshop on Perspectives on the Evaluation of Recommender Systems (PERSPECTIVES)

  18. arXiv:2009.04695  [pdf, other]

    cs.LG cs.AI cs.IR stat.ML

    Momentum-based Gradient Methods in Multi-Objective Recommendation

    Authors: Blagoj Mitrevski, Milena Filipovic, Diego Antognini, Emma Lejal Glaude, Boi Faltings, Claudiu Musat

    Abstract: Multi-objective gradient methods are becoming the standard for solving multi-objective problems. Among others, they show promising results in developing multi-objective recommender systems with both correlated and conflicting objectives. Classic multi-gradient descent usually relies on the combination of the gradients, not including the computation of first and second moments of the gradients. Thi…

    Submitted 1 September, 2021; v1 submitted 10 September, 2020; originally announced September 2020.

    Comments: 10 pages, 2 figures, 2 tables, accepted at RecSys 2021 - Workshop on Multi-Objective Recommender Systems (MORS)

  19. arXiv:2009.04441  [pdf, other]

    cs.LG cs.AI cs.IR stat.ML

    Addressing Fairness in Classification with a Model-Agnostic Multi-Objective Algorithm

    Authors: Kirtan Padh, Diego Antognini, Emma Lejal Glaude, Boi Faltings, Claudiu Musat

    Abstract: The goal of fairness in classification is to learn a classifier that does not discriminate against groups of individuals based on sensitive attributes, such as race and gender. One approach to designing fair algorithms is to use relaxations of fairness notions as regularization terms or in a constrained optimization problem. We observe that the hyperbolic tangent function can approximate the indic…

    Submitted 8 June, 2021; v1 submitted 9 September, 2020; originally announced September 2020.

    Comments: Accepted at UAI 2021. 14 pages, 5 figures, 4 tables

  20. arXiv:2005.11067  [pdf, other]

    cs.CL cs.LG stat.ML

    Interacting with Explanations through Critiquing

    Authors: Diego Antognini, Claudiu Musat, Boi Faltings

    Abstract: Using personalized explanations to support recommendations has been shown to increase trust and perceived quality. However, to actually obtain better recommendations, there needs to be a means for users to modify the recommendation criteria by interacting with the explanation. We present a novel technique using aspect markers that learns to generate personalized explanations of recommendations fro…

    Submitted 12 January, 2022; v1 submitted 22 May, 2020; originally announced May 2020.

    Comments: Accepted at IJCAI 2021. 15 pages, 10 figures, 12 tables

  21. arXiv:2002.06854  [pdf, other]

    cs.IR cs.CL

    HotelRec: a Novel Very Large-Scale Hotel Recommendation Dataset

    Authors: Diego Antognini, Boi Faltings

    Abstract: Today, recommender systems are an inevitable part of everyone's daily digital routine and are present on most internet platforms. State-of-the-art deep learning-based models require a large amount of data to achieve their best performance. Many datasets fulfilling this criterion have been proposed for multiple domains, such as Amazon products, restaurants, or beers. However, works and datasets in…

    Submitted 17 February, 2020; originally announced February 2020.

    Comments: 7 pages, 3 figures, 5 tables. Accepted at LREC 2020

  22. arXiv:2002.06851  [pdf, other]

    cs.CL

    GameWikiSum: a Novel Large Multi-Document Summarization Dataset

    Authors: Diego Antognini, Boi Faltings

    Abstract: Today's research progress in the field of multi-document summarization is obstructed by the small number of available datasets. Since the acquisition of reference summaries is costly, existing datasets contain only hundreds of samples at most, resulting in heavy reliance on hand-crafted features or necessitating additional, manually annotated data. The lack of large corpora therefore hinders the d…

    Submitted 17 February, 2020; originally announced February 2020.

    Comments: 6 pages, 1 figure, 4 tables. Accepted at LREC 2020

  23. arXiv:2001.00846  [pdf, other]

    cs.IR cs.AI cs.LG stat.ML

    Multi-Gradient Descent for Multi-Objective Recommender Systems

    Authors: Nikola Milojkovic, Diego Antognini, Giancarlo Bergamin, Boi Faltings, Claudiu Musat

    Abstract: Recommender systems need to mirror the complexity of the environment they are applied in. The more we know about what might benefit the user, the more objectives the recommender system has. In addition there may be multiple stakeholders - sellers, buyers, shareholders - in addition to legal and ethical constraints. Simultaneously optimizing for a multitude of objectives, correlated and not correla…

    Submitted 17 April, 2020; v1 submitted 9 December, 2019; originally announced January 2020.

    Comments: 9 pages, 4 figures, Accepted at AAAI 2020 - Workshop on Interactive and Conversational Recommendation Systems (WICRS)

  24. arXiv:1909.12231  [pdf, other]

    cs.CL cs.LG stat.ML

    Learning to Create Sentence Semantic Relation Graphs for Multi-Document Summarization

    Authors: Diego Antognini, Boi Faltings

    Abstract: Linking facts across documents is a challenging task, as the language used to express the same information in a sentence can vary significantly, which complicates the task of multi-document summarization. Consequently, existing approaches heavily rely on hand-crafted features, which are domain-dependent and hard to craft, or additional annotated data, which is costly to gather. To overcome these l…

    Submitted 20 September, 2019; originally announced September 2019.

    Comments: 10 pages, 4 tables, 1 figure, Accepted at 2019 Empirical Methods in Natural Language Processing - Workshop on New Frontiers in Summarization

  25. arXiv:1909.11386  [pdf, other]

    cs.CL cs.LG

    Multi-Dimensional Explanation of Target Variables from Documents

    Authors: Diego Antognini, Claudiu Musat, Boi Faltings

    Abstract: Automated predictions require explanations to be interpretable by humans. Past work used attention and rationale mechanisms to find words that predict the target variable of a document. Often though, they result in a tradeoff between noisy explanations and a drop in accuracy. Furthermore, rationale methods cannot capture the multi-faceted nature of justifications for multiple targets, because of th…

    Submitted 21 December, 2020; v1 submitted 25 September, 2019; originally announced September 2019.

    Comments: Accepted at AAAI 2021. 18 pages, 14 figures, 9 tables

  26. arXiv:1709.09220  [pdf, other]

    cs.CL

    Dataset Construction via Attention for Aspect Term Extraction with Distant Supervision

    Authors: Athanasios Giannakopoulos, Diego Antognini, Claudiu Musat, Andreea Hossmann, Michael Baeriswyl

    Abstract: Aspect Term Extraction (ATE) detects opinionated aspect terms in sentences or text spans, with the end goal of performing aspect-based sentiment analysis. The small number of available datasets for supervised ATE and the fact that they cover only a few domains raise the need for exploiting other data sources in new and creative ways. Publicly available review corpora contain a plethora of opiniona…

    Submitted 26 September, 2017; originally announced September 2017.