
Showing 1–14 of 14 results for author: Baralis, E

Searching in archive cs.
  1. arXiv:2406.15862  [pdf, other]

    cs.CL

    Speech Analysis of Language Varieties in Italy

    Authors: Moreno La Quatra, Alkis Koudounas, Elena Baralis, Sabato Marco Siniscalchi

    Abstract: Italy exhibits rich linguistic diversity across its territory due to the distinct regional languages spoken in different areas. Recent advances in self-supervised learning provide new opportunities to analyze Italy's linguistic varieties using speech data alone. This includes the potential to leverage representations learned from large amounts of data to better examine nuances between closely rela…

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: Accepted to LREC-COLING 2024 - https://aclanthology.org/2024.lrec-main.1317/

  2. arXiv:2406.14693  [pdf, other]

    eess.AS cs.LG

    Voice Disorder Analysis: a Transformer-based Approach

    Authors: Alkis Koudounas, Gabriele Ciravegna, Marco Fantini, Giovanni Succo, Erika Crosetti, Tania Cerquitelli, Elena Baralis

    Abstract: Voice disorders are pathologies significantly affecting patient quality of life. However, non-invasive automated diagnosis of these pathologies is still under-explored, due to both a shortage of pathological voice data, and diversity of the recording types used for the diagnosis. This paper proposes a novel solution that adopts transformers directly working on raw voice signals and addresses data…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted at Interspeech 2024

  3. arXiv:2406.14686  [pdf, other]

    cs.CL cs.LG eess.AS

    A Contrastive Learning Approach to Mitigate Bias in Speech Models

    Authors: Alkis Koudounas, Flavio Giobergia, Eliana Pastor, Elena Baralis

    Abstract: Speech models may be affected by performance imbalance in different population subgroups, raising concerns about fair treatment across these groups. Prior attempts to mitigate unfairness either focus on user-defined subgroups, potentially overlooking other affected subgroups, or do not explicitly improve the internal representation at the subgroup level. This paper proposes the first adoption of c…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted at Interspeech 2024

  4. arXiv:2406.14529  [pdf, other]

    cs.LG cs.AI

    A Benchmarking Study of Kolmogorov-Arnold Networks on Tabular Data

    Authors: Eleonora Poeta, Flavio Giobergia, Eliana Pastor, Tania Cerquitelli, Elena Baralis

    Abstract: Kolmogorov-Arnold Networks (KANs) have very recently been introduced into the world of machine learning, quickly capturing the attention of the entire community. However, KANs have mostly been tested for approximating complex functions or processing synthetic data, while a test on real-world tabular datasets is currently lacking. In this paper, we present a benchmarking study comparing KANs and Mu…

    Submitted 20 June, 2024; originally announced June 2024.

  5. arXiv:2405.00934  [pdf, ps, other]

    eess.AS cs.LG cs.SD

    Benchmarking Representations for Speech, Music, and Acoustic Events

    Authors: Moreno La Quatra, Alkis Koudounas, Lorenzo Vaiani, Elena Baralis, Luca Cagliero, Paolo Garza, Sabato Marco Siniscalchi

    Abstract: Limited diversity in standardized benchmarks for evaluating audio representation learning (ARL) methods may hinder systematic comparison of current methods' capabilities. We present ARCH, a comprehensive benchmark for evaluating ARL methods on diverse audio classification domains, covering acoustic events, music, and speech. ARCH comprises 12 datasets, that allow us to thoroughly assess pre-traine…

    Submitted 1 May, 2024; originally announced May 2024.

  6. arXiv:2312.12936  [pdf, other]

    cs.AI cs.HC

    Concept-based Explainable Artificial Intelligence: A Survey

    Authors: Eleonora Poeta, Gabriele Ciravegna, Eliana Pastor, Tania Cerquitelli, Elena Baralis

    Abstract: The field of explainable artificial intelligence emerged in response to the growing need for more transparent and reliable models. However, using raw features to provide explanations has been disputed in several works lately, advocating for more user-understandable explanations. To address this issue, a wide range of papers proposing Concept-based eXplainable Artificial Intelligence (C-XAI) method…

    Submitted 20 December, 2023; originally announced December 2023.

  7. arXiv:2310.01227  [pdf, other]

    astro-ph.EP cs.LG

    Reconstructing Atmospheric Parameters of Exoplanets Using Deep Learning

    Authors: Flavio Giobergia, Alkis Koudounas, Elena Baralis

    Abstract: Exploring exoplanets has transformed our understanding of the universe by revealing many planetary systems that defy our current understanding. To study their atmospheres, spectroscopic observations are used to infer essential atmospheric properties that are not directly measurable. Estimating atmospheric parameters that best fit the observed spectrum within a specified atmospheric model is a comp…

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: 5 pages + references

  8. arXiv:2309.07733  [pdf, other]

    cs.CL cs.SD eess.AS

    Explaining Speech Classification Models via Word-Level Audio Segments and Paralinguistic Features

    Authors: Eliana Pastor, Alkis Koudounas, Giuseppe Attanasio, Dirk Hovy, Elena Baralis

    Abstract: Recent advances in eXplainable AI (XAI) have provided new insights into how models for vision, language, and tabular data operate. However, few approaches exist for understanding speech models. Existing work focuses on a few spoken language understanding (SLU) tasks, and explanations are difficult to interpret for most users. We introduce a new approach to explain speech classification models. We…

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: 8 pages

  9. ITALIC: An Italian Intent Classification Dataset

    Authors: Alkis Koudounas, Moreno La Quatra, Lorenzo Vaiani, Luca Colomba, Giuseppe Attanasio, Eliana Pastor, Luca Cagliero, Elena Baralis

    Abstract: Recent large-scale Spoken Language Understanding datasets focus predominantly on English and do not account for language-specific phenomena such as particular phonemes or words in different lects. We introduce ITALIC, the first large-scale speech dataset designed for intent classification in Italian. The dataset comprises 16,521 crowdsourced audio samples recorded by 70 speakers from various Itali…

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: Accepted at INTERSPEECH 2023. Data and code at https://github.com/RiTA-nlp/ITALIC

  10. arXiv:2203.09192  [pdf, other]

    cs.CL

    Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists

    Authors: Giuseppe Attanasio, Debora Nozza, Dirk Hovy, Elena Baralis

    Abstract: Natural Language Processing (NLP) models risk overfitting to specific terms in the training data, thereby reducing their performance, fairness, and generalizability. E.g., neural hate speech detection models are strongly influenced by identity terms like gay, or women, resulting in false positives, severe unintended bias, and lower performance. Most mitigation techniques use lists of identity term…

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: Accepted to Findings of ACL 2022

  11. arXiv:2108.07450  [pdf, other]

    cs.LG cs.CY cs.IR

    Identifying Biased Subgroups in Ranking and Classification

    Authors: Eliana Pastor, Luca de Alfaro, Elena Baralis

    Abstract: When analyzing the behavior of machine learning algorithms, it is important to identify specific data subgroups for which the considered algorithm shows different performance with respect to the entire dataset. The intervention of domain experts is normally required to identify relevant attributes that define these subgroups. We introduce the notion of divergence to measure this performance diff…

    Submitted 17 August, 2021; originally announced August 2021.

    Comments: 5 pages

    Journal ref: In Responsible AI @ KDD 2021 Workshop, 2021

  12. arXiv:1907.08120  [pdf, other]

    cs.LG stat.ML

    Automating concept-drift detection by self-evaluating predictive model degradation

    Authors: Tania Cerquitelli, Stefano Proto, Francesco Ventura, Daniele Apiletti, Elena Baralis

    Abstract: A key aspect of automating predictive machine learning entails the capability of properly triggering the update of the trained model. To this aim, suitable automatic solutions to self-assess the prediction quality and the data distribution drift between the original training set and the new data have to be devised. In this paper, we propose a novel methodology to automatically detect prediction-qu…

    Submitted 18 July, 2019; originally announced July 2019.

    Comments: 5 pages, 4 figures

    ACM Class: I.2

  13. arXiv:1805.03887  [pdf, ps, other]

    cs.LG cs.AI cs.DC stat.ML

    Scaling associative classification for very large datasets

    Authors: Luca Venturini, Elena Baralis, Paolo Garza

    Abstract: Supervised learning algorithms are nowadays successfully scaling up to datasets that are very large in volume, leveraging the potential of in-memory cluster-computing Big Data frameworks. Still, massive datasets with a number of large-domain categorical features are a difficult challenge for any classifier. Most off-the-shelf solutions cannot cope with this problem. In this work we introduce DAC,…

    Submitted 10 May, 2018; originally announced May 2018.

    Journal ref: J Big Data (2017) 4: 44

  14. arXiv:1503.05426  [pdf, other]

    cs.NI

    YouLighter: An Unsupervised Methodology to Unveil YouTube CDN Changes

    Authors: Danilo Giordano, Stefano Traverso, Luigi Grimaudo, Marco Mellia, Elena Baralis, Alok Tongaonkar, Sabyasachi Saha

    Abstract: YouTube relies on a massively distributed Content Delivery Network (CDN) to stream the billions of videos in its catalogue. Unfortunately, very little information about the design of such CDN is available. This, combined with the pervasiveness of YouTube, poses a big challenge for Internet Service Providers (ISPs), which are compelled to optimize end-users' Quality of Experience (QoE) while having…

    Submitted 18 March, 2015; originally announced March 2015.