
Showing 1–20 of 20 results for author: Staerman, G

Searching in archive cs.
  1. arXiv:2406.16938  [pdf, other]

    eess.SP cs.LG stat.ML

    Unmixing Noise from Hawkes Process to Model Learned Physiological Events

    Authors: Guillaume Staerman, Virginie Loison, Thomas Moreau

    Abstract: Physiological signal analysis often involves identifying events crucial to understanding biological dynamics. Traditional methods rely on handcrafted procedures or supervised learning, presenting challenges such as expert dependence, lack of robustness, and the need for extensive labeled data. Data-driven methods like Convolutional Dictionary Learning (CDL) offer an alternative but tend to produce…

    Submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2406.06849  [pdf, other]

    stat.ML cs.LG

    Flexible Parametric Inference for Space-Time Hawkes Processes

    Authors: Emilia Siviero, Guillaume Staerman, Stephan Clémençon, Thomas Moreau

    Abstract: Many modern spatio-temporal data sets, in sociology, epidemiology or seismology, for example, exhibit self-exciting characteristics, triggering and clustering behaviors both at the same time, that a suitable Hawkes space-time process can accurately capture. This paper aims to develop a fast and flexible parametric inference technique to recover the parameters of the kernel functions involved in th…

    Submitted 17 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  3. arXiv:2403.04405  [pdf, other]

    stat.ML cs.LG

    Signature Isolation Forest

    Authors: Guillaume Staerman, Marta Campi, Gareth W. Peters

    Abstract: Functional Isolation Forest (FIF) is a recent state-of-the-art Anomaly Detection (AD) algorithm designed for functional data. It relies on a tree partition procedure where an abnormality score is computed by projecting each curve observation on a drawn dictionary through a linear inner product. Such linear inner product and the dictionary are a priori choices that highly influence the algorithm's…

    Submitted 7 March, 2024; originally announced March 2024.

  4. arXiv:2402.13331  [pdf, other]

    cs.CL

    Enhanced Hallucination Detection in Neural Machine Translation through Simple Detector Aggregation

    Authors: Anas Himmi, Guillaume Staerman, Marine Picot, Pierre Colombo, Nuno M. Guerreiro

    Abstract: Hallucinated translations pose significant threats and safety concerns when it comes to the practical deployment of machine translation systems. Previous research works have identified that detectors exhibit complementary performance: different detectors excel at detecting different types of hallucinations. In this paper, we propose to address the limitations of individual detectors by combining th…

    Submitted 20 February, 2024; originally announced February 2024.

  5. arXiv:2310.14001  [pdf, other]

    cs.CL

    Toward Stronger Textual Attack Detectors

    Authors: Pierre Colombo, Marine Picot, Nathan Noiry, Guillaume Staerman, Pablo Piantanida

    Abstract: The landscape of available textual adversarial attacks keeps growing, posing severe threats and raising concerns regarding the deep NLP system's integrity. However, the crucial problem of defending against malicious attacks has only drawn the attention of the NLP community. The latter is nonetheless instrumental in developing robust and trustworthy systems. This paper makes two important contribut…

    Submitted 21 October, 2023; originally announced October 2023.

    Comments: Findings EMNLP 2023

  6. arXiv:2310.13990  [pdf, other]

    cs.LG cs.CL

    A Novel Information-Theoretic Objective to Disentangle Representations for Fair Classification

    Authors: Pierre Colombo, Nathan Noiry, Guillaume Staerman, Pablo Piantanida

    Abstract: One of the pursued objectives of deep learning is to provide tools that learn abstract representations of reality from the observation of multiple contextual situations. More precisely, one wishes to extract disentangled representations which are (i) low dimensional and (ii) whose components are independent and correspond to concepts capturing the essence of the objects under consideration (Locate…

    Submitted 21 October, 2023; originally announced October 2023.

    Comments: Findings AACL 2023

  7. arXiv:2306.03522  [pdf, other]

    cs.LG cs.CV stat.ML

    A Functional Data Perspective and Baseline On Multi-Layer Out-of-Distribution Detection

    Authors: Eduardo Dadalto, Pierre Colombo, Guillaume Staerman, Nathan Noiry, Pablo Piantanida

    Abstract: A key feature of out-of-distribution (OOD) detection is to exploit a trained neural network by extracting statistical patterns and relationships through the multi-layer classifier to detect shifts in the expected input data distribution. Despite achieving solid results, several state-of-the-art methods rely on the penultimate or last layer outputs only, leaving behind valuable information for OOD…

    Submitted 6 June, 2023; originally announced June 2023.

  8. arXiv:2305.19694  [pdf, other]

    stat.ML cs.LG

    Hypothesis Transfer Learning with Surrogate Classification Losses: Generalization Bounds through Algorithmic Stability

    Authors: Anass Aghbalou, Guillaume Staerman

    Abstract: Hypothesis transfer learning (HTL) contrasts domain adaptation by allowing for a previous task leverage, named the source, into a new one, the target, without requiring access to the source data. Indeed, HTL relies only on a hypothesis learnt from such source data, relieving the hurdle of expansive data storage and providing great practical benefits. Hence, HTL is highly beneficial for real-world…

    Submitted 14 July, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

  9. arXiv:2302.09852  [pdf, other]

    cs.CL cs.AI

    Unsupervised Layer-wise Score Aggregation for Textual OOD Detection

    Authors: Maxime Darrin, Guillaume Staerman, Eduardo Dadalto Câmara Gomes, Jackie CK Cheung, Pablo Piantanida, Pierre Colombo

    Abstract: Out-of-distribution (OOD) detection is a rapidly growing field due to new robustness and security requirements driven by an increased number of AI-based systems. Existing OOD textual detectors often rely on an anomaly score (e.g., Mahalanobis distance) computed on the embedding output of the last layer of the encoder. In this work, we observe that OOD detection performance varies greatly depending…

    Submitted 21 February, 2024; v1 submitted 20 February, 2023; originally announced February 2023.

  10. arXiv:2211.13527  [pdf, other]

    cs.CL

    Beyond Mahalanobis-Based Scores for Textual OOD Detection

    Authors: Pierre Colombo, Eduardo D. C. Gomes, Guillaume Staerman, Nathan Noiry, Pablo Piantanida

    Abstract: Deep learning methods have boosted the adoption of NLP systems in real-life applications. However, they turn out to be vulnerable to distribution shifts over time which may cause severe dysfunctions in production systems, urging practitioners to develop tools to detect out-of-distribution (OOD) samples through the lens of the neural network. In this paper, we introduce TRUSTED, a new OOD detector…

    Submitted 24 November, 2022; originally announced November 2022.

    Journal ref: NeurIPS 2022

  11. arXiv:2210.04635  [pdf, other]

    stat.ML cs.LG

    FaDIn: Fast Discretized Inference for Hawkes Processes with General Parametric Kernels

    Authors: Guillaume Staerman, Cédric Allain, Alexandre Gramfort, Thomas Moreau

    Abstract: Temporal point processes (TPP) are a natural tool for modeling event-based data. Among all TPP models, Hawkes processes have proven to be the most widely used, mainly due to their adequate modeling for various applications, particularly when considering exponential or non-parametric kernels. Although non-parametric kernels are an option, such models require large datasets. While exponential kernel…

    Submitted 2 August, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

  12. arXiv:2205.03589  [pdf, other]

    cs.CL

    Learning Disentangled Textual Representations via Statistical Measures of Similarity

    Authors: Pierre Colombo, Guillaume Staerman, Nathan Noiry, Pablo Piantanida

    Abstract: When working with textual data, a natural application of disentangled representations is fair classification where the goal is to make predictions without being biased (or influenced) by sensitive attributes that may be present in the data (e.g., age, gender or race). Dominant approaches to disentangle a sensitive attribute from textual representations rely on learning simultaneously a penalizatio…

    Submitted 7 October, 2022; v1 submitted 7 May, 2022; originally announced May 2022.

    Comments: ACL 2022

  13. arXiv:2201.05115  [pdf, other]

    stat.ML cs.LG

    Functional Anomaly Detection: a Benchmark Study

    Authors: Guillaume Staerman, Eric Adjakossa, Pavlo Mozharovskyi, Vera Hofer, Jayant Sen Gupta, Stephan Clémençon

    Abstract: The increasing automation in many areas of the Industry expressly demands to design efficient machine-learning solutions for the detection of abnormal events. With the ubiquitous deployment of sensors monitoring nearly continuously the health of complex infrastructures, anomaly detection can now rely on measurements sampled at a very high frequency, providing a very rich representation of the phen…

    Submitted 13 January, 2022; originally announced January 2022.

  14. arXiv:2108.12463  [pdf, other]

    cs.CL cs.AI

    Automatic Text Evaluation through the Lens of Wasserstein Barycenters

    Authors: Pierre Colombo, Guillaume Staerman, Chloe Clavel, Pablo Piantanida

    Abstract: A new metric BaryScore to evaluate text generation based on deep contextualized embeddings (e.g., BERT, Roberta, ELMo) is introduced. This metric is motivated by a new framework relying on optimal transport tools, i.e., Wasserstein distance and barycenter. By modelling the layer output of deep contextualized embeddings as a probability distribution rather than by a vector embedding; this f…

    Submitted 9 September, 2021; v1 submitted 27 August, 2021; originally announced August 2021.

    Journal ref: EMNLP 2021

  15. arXiv:2106.11068  [pdf, other]

    stat.ML cs.LG

    Affine-Invariant Integrated Rank-Weighted Depth: Definition, Properties and Finite Sample Analysis

    Authors: Guillaume Staerman, Pavlo Mozharovskyi, Stéphan Clémençon

    Abstract: Because it determines a center-outward ordering of observations in $\mathbb{R}^d$ with $d\geq 2$, the concept of statistical depth permits to define quantiles and ranks for multivariate data and use them for various statistical tasks (e.g. inference, hypothesis testing). Whereas many depth functions have been proposed ad-hoc in the literature since the seminal contribution of \cite{Tukey7…

    Submitted 4 February, 2022; v1 submitted 21 June, 2021; originally announced June 2021.

  16. arXiv:2103.12711  [pdf, other]

    stat.ML cs.LG

    A Pseudo-Metric between Probability Distributions based on Depth-Trimmed Regions

    Authors: Guillaume Staerman, Pavlo Mozharovskyi, Pierre Colombo, Stéphan Clémençon, Florence d'Alché-Buc

    Abstract: The design of a metric between probability distributions is a longstanding problem motivated by numerous applications in Machine Learning. Focusing on continuous probability distributions on the Euclidean space $\mathbb{R}^d$, we introduce a novel pseudo-metric between probability distributions by leveraging the extension of univariate quantiles to multivariate spaces. Data depth is a nonparametri…

    Submitted 10 October, 2022; v1 submitted 23 March, 2021; originally announced March 2021.

  17. arXiv:2006.10325  [pdf, other]

    stat.ML cs.LG

    When OT meets MoM: Robust estimation of Wasserstein Distance

    Authors: Guillaume Staerman, Pierre Laforgue, Pavlo Mozharovskyi, Florence d'Alché-Buc

    Abstract: Issued from Optimal Transport, the Wasserstein distance has gained importance in Machine Learning due to its appealing geometrical properties and the increasing availability of efficient approximations. In this work, we consider the problem of estimating the Wasserstein distance between two probability distributions when observations are polluted by outliers. To that end, we investigate how to lev…

    Submitted 18 February, 2022; v1 submitted 18 June, 2020; originally announced June 2020.

    Journal ref: Proceedings of The 24th International Conference on Artificial Intelligence and Statistics, AISTATS 2021

  18. arXiv:2006.05240  [pdf, other]

    stat.ML cs.LG

    Generalization Bounds in the Presence of Outliers: a Median-of-Means Study

    Authors: Pierre Laforgue, Guillaume Staerman, Stephan Clémençon

    Abstract: In contrast to the empirical mean, the Median-of-Means (MoM) is an estimator of the mean $θ$ of a square integrable r.v. $Z$, around which accurate nonasymptotic confidence bounds can be built, even when $Z$ does not exhibit a sub-Gaussian tail behavior. Thanks to the high confidence it achieves on heavy-tailed data, MoM has found various applications in machine learning, where it is used to desig…

    Submitted 7 February, 2021; v1 submitted 9 June, 2020; originally announced June 2020.

  19. arXiv:1910.04085  [pdf, other]

    stat.ML cs.LG math.ST stat.ME

    The Area of the Convex Hull of Sampled Curves: a Robust Functional Statistical Depth Measure

    Authors: Guillaume Staerman, Pavlo Mozharovskyi, Stephan Clémençon

    Abstract: With the ubiquity of sensors in the IoT era, statistical observations are becoming increasingly available in the form of massive (multivariate) time-series. Formulated as unsupervised anomaly detection tasks, an abundance of applications like aviation safety management, the health monitoring of complex infrastructures or fraud detection can now rely on such functional data, acquired and stored wit…

    Submitted 13 February, 2020; v1 submitted 9 October, 2019; originally announced October 2019.

  20. arXiv:1904.04573  [pdf, other]

    stat.ML cs.LG

    Functional Isolation Forest

    Authors: Guillaume Staerman, Pavlo Mozharovskyi, Stephan Clémençon, Florence d'Alché-Buc

    Abstract: For the purpose of monitoring the behavior of complex infrastructures (e.g. aircrafts, transport or energy networks), high-rate sensors are deployed to capture multivariate data, generally unlabeled, in quasi continuous-time to detect quickly the occurrence of anomalies that may jeopardize the smooth operation of the system of interest. The statistical analysis of such massive data of functional n…

    Submitted 9 October, 2019; v1 submitted 9 April, 2019; originally announced April 2019.