Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 81 results for author: Gramfort, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.07593  [pdf, other

    stat.ML cs.LG stat.ME

    Diffusion posterior sampling for simulation-based inference in tall data settings

    Authors: Julia Linhart, Gabriel Victorino Cardoso, Alexandre Gramfort, Sylvain Le Corff, Pedro L. C. Rodrigues

    Abstract: Determining which parameters of a non-linear model best describe a set of experimental data is a fundamental problem in science and it has gained much traction lately with the rise of complex large-scale simulators. The likelihood of such models is typically intractable, which is why classical MCMC methods can not be used. Simulation-based inference (SBI) stands out in this context by only requiri… ▽ More

    Submitted 7 June, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

    Comments: 49 pages, 24 figures, 3 tables, 2 algorithms, 12 appendices, in proceedings

  2. arXiv:2403.19394  [pdf, ps, other

    cs.CY q-bio.OT

    Cycling on the Freeway: The Perilous State of Open Source Neuroscience Software

    Authors: Britta U. Westner, Daniel R. McCloy, Eric Larson, Alexandre Gramfort, Daniel S. Katz, Arfon M. Smith, invited co-signees

    Abstract: Most scientists need software to perform their research (Barker et al., 2020; Carver et al., 2022; Hettrick, 2014; Hettrick et al., 2014; Switters and Osimo, 2019), and neuroscientists are no exception. Whether we work with reaction times, electrophysiological signals, or magnetic resonance imaging data, we rely on software to acquire, analyze, and statistically evaluate the raw data we obtain - o… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  3. arXiv:2403.15415  [pdf, other

    eess.SP cs.LG

    Physics-informed and Unsupervised Riemannian Domain Adaptation for Machine Learning on Heterogeneous EEG Datasets

    Authors: Apolline Mellot, Antoine Collas, Sylvain Chevallier, Denis Engemann, Alexandre Gramfort

    Abstract: Combining electroencephalogram (EEG) datasets for supervised machine learning (ML) is challenging due to session, subject, and device variability. ML algorithms typically require identical features at train and test time, complicating analysis due to varying sensor numbers and positions across datasets. Simple channel selection discards valuable data, leading to poorer performance, especially with… ▽ More

    Submitted 27 June, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

  4. arXiv:2402.03345  [pdf, other

    eess.SP cs.LG stat.ML

    Weakly supervised covariance matrices alignment through Stiefel matrices estimation for MEG applications

    Authors: Antoine Collas, Rémi Flamary, Alexandre Gramfort

    Abstract: This paper introduces a novel domain adaptation technique for time series data, called Mixing model Stiefel Adaptation (MSA), specifically addressing the challenge of limited labeled signals in the target dataset. Leveraging a domain-dependent mixing model and the optimal transport domain adaptation assumption, we exploit abundant unlabeled data in the target domain to ensure effective prediction… ▽ More

    Submitted 24 January, 2024; originally announced February 2024.

  5. arXiv:2312.00484  [pdf, other

    cs.LG eess.SP

    MultiView Independent Component Analysis with Delays

    Authors: Ambroise Heurtebise, Pierre Ablin, Alexandre Gramfort

    Abstract: Linear Independent Component Analysis (ICA) is a blind source separation technique that has been used in various domains to identify independent latent sources from observed signals. In order to obtain a higher signal-to-noise ratio, the presence of multiple views of the same sources can be used. In this work, we present MultiView Independent Component Analysis with Delays (MVICAD). This algorithm… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

  6. arXiv:2308.02408  [pdf, other

    eess.SP cs.AI cs.LG q-bio.NC

    Evaluating the structure of cognitive tasks with transfer learning

    Authors: Bruno Aristimunha, Raphael Y. de Camargo, Walter H. Lopez Pinaya, Sylvain Chevallier, Alexandre Gramfort, Cedric Rommel

    Abstract: Electroencephalography (EEG) decoding is a challenging task due to the limited availability of labelled data. While transfer learning is a promising technique to address this challenge, it assumes that transferable data domains and task are known, which is not the case in this setting. This study investigates the transferability of deep learning representations between different EEG decoding tasks… ▽ More

    Submitted 28 July, 2023; originally announced August 2023.

    Comments: 19 pages, 9 figures

    ACM Class: I.5.1; I.6.3; I.2.6; K.3.2

  7. arXiv:2306.03580  [pdf, other

    stat.ML cs.AI cs.LG q-bio.NC

    L-C2ST: Local Diagnostics for Posterior Approximations in Simulation-Based Inference

    Authors: Julia Linhart, Alexandre Gramfort, Pedro L. C. Rodrigues

    Abstract: Many recent works in simulation-based inference (SBI) rely on deep generative models to approximate complex, high-dimensional posterior distributions. However, evaluating whether or not these approximations can be trusted remains a challenge. Most approaches evaluate the posterior estimator only in expectation over the observation space. This limits their interpretability and is not sufficient to… ▽ More

    Submitted 9 October, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

    Comments: 27 pages, 6 figures, 3 tables, 6 appendices, NeurIPS 2023

  8. arXiv:2305.18831  [pdf, other

    eess.SP cs.LG

    Convolutional Monge Mapping Normalization for learning on sleep data

    Authors: Théo Gnassounou, Rémi Flamary, Alexandre Gramfort

    Abstract: In many machine learning applications on signals and biomedical data, especially electroencephalogram (EEG), one major challenge is the variability of the data across subjects, sessions, and hardware devices. In this work, we propose a new method called Convolutional Monge Mapping Normalization (CMMN), which consists in filtering the signals in order to adapt their power spectrum density (PSD) to… ▽ More

    Submitted 13 November, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

  9. arXiv:2301.09696  [pdf, other

    stat.ML cs.LG

    Optimizing the Noise in Self-Supervised Learning: from Importance Sampling to Noise-Contrastive Estimation

    Authors: Omar Chehab, Alexandre Gramfort, Aapo Hyvarinen

    Abstract: Self-supervised learning is an increasingly popular approach to unsupervised learning, achieving state-of-the-art results. A prevalent approach consists in contrasting data points and noise points within a classification task: this requires a good noise distribution which is notoriously hard to specify. While a comprehensive theory is missing, it is widely assumed that the optimal noise distributi… ▽ More

    Submitted 23 January, 2023; originally announced January 2023.

    Comments: arXiv admin note: text overlap with arXiv:2203.01110

  10. arXiv:2211.09602  [pdf, other

    stat.ML cs.AI cs.LG q-bio.QM

    Validation Diagnostics for SBI algorithms based on Normalizing Flows

    Authors: Julia Linhart, Alexandre Gramfort, Pedro L. C. Rodrigues

    Abstract: Building on the recent trend of new deep generative models known as Normalizing Flows (NF), simulation-based inference (SBI) algorithms can now efficiently accommodate arbitrary complex and high-dimensional data distributions. The development of appropriate validation methods however has fallen behind. Indeed, most of the existing metrics either require access to the true posterior distribution, o… ▽ More

    Submitted 24 November, 2022; v1 submitted 17 November, 2022; originally announced November 2022.

    Comments: 7 pages, 2 figures, 1 appendix, published at "Machine Learning and the Physical Sciences" workshop (NeurIPS 2022): https://ml4physicalsciences.github.io/2022/

  11. arXiv:2210.04635  [pdf, other

    stat.ML cs.LG

    FaDIn: Fast Discretized Inference for Hawkes Processes with General Parametric Kernels

    Authors: Guillaume Staerman, Cédric Allain, Alexandre Gramfort, Thomas Moreau

    Abstract: Temporal point processes (TPP) are a natural tool for modeling event-based data. Among all TPP models, Hawkes processes have proven to be the most widely used, mainly due to their adequate modeling for various applications, particularly when considering exponential or non-parametric kernels. Although non-parametric kernels are an option, such models require large datasets. While exponential kernel… ▽ More

    Submitted 2 August, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

  12. arXiv:2206.14483  [pdf, other

    cs.LG cs.AI eess.SP

    Data augmentation for learning predictive models on EEG: a systematic comparison

    Authors: Cédric Rommel, Joseph Paillard, Thomas Moreau, Alexandre Gramfort

    Abstract: Objective: The use of deep learning for electroencephalography (EEG) classification tasks has been rapidly growing in the last years, yet its application has been limited by the relatively small size of EEG datasets. Data augmentation, which consists in artificially increasing the size of the dataset during training, can be employed to alleviate this problem. While a few augmentation transformatio… ▽ More

    Submitted 15 November, 2022; v1 submitted 29 June, 2022; originally announced June 2022.

    Comments: Accepted in Journal of Neural Engineering

  13. arXiv:2206.13424  [pdf, other

    cs.LG math.OC stat.ML

    Benchopt: Reproducible, efficient and collaborative optimization benchmarks

    Authors: Thomas Moreau, Mathurin Massias, Alexandre Gramfort, Pierre Ablin, Pierre-Antoine Bannier, Benjamin Charlier, Mathieu Dagréou, Tom Dupré la Tour, Ghislain Durif, Cassio F. Dantas, Quentin Klopfenstein, Johan Larsson, En Lai, Tanguy Lefort, Benoit Malézieux, Badr Moufad, Binh T. Nguyen, Alain Rakotomamonjy, Zaccharie Ramzi, Joseph Salmon, Samuel Vaiter

    Abstract: Numerical validation is at the core of machine learning research as it allows to assess the actual impact of new methods, and to confirm the agreement between theory and practice. Yet, the rapid development of the field poses several challenges: researchers are confronted with a profusion of methods to compare, limited transparency and consensus on best practices, as well as tedious re-implementat… ▽ More

    Submitted 28 October, 2022; v1 submitted 27 June, 2022; originally announced June 2022.

    Comments: Accepted in proceedings of NeurIPS 22; Benchopt library documentation is available at https://benchopt.github.io/

  14. arXiv:2206.01685  [pdf, other

    q-bio.NC cs.AI cs.CL

    Toward a realistic model of speech processing in the brain with self-supervised learning

    Authors: Juliette Millet, Charlotte Caucheteux, Pierre Orhan, Yves Boubenec, Alexandre Gramfort, Ewan Dunbar, Christophe Pallier, Jean-Remi King

    Abstract: Several deep neural networks have recently been shown to generate activations similar to those of the brain in response to the same input. These algorithms, however, remain largely implausible: they require (1) extraordinarily large amounts of data, (2) unobtainable supervised labels, (3) textual rather than raw sensory input, and / or (4) implausibly large memory (e.g. thousands of contextual wor… ▽ More

    Submitted 20 March, 2023; v1 submitted 3 June, 2022; originally announced June 2022.

    Comments: Accepted to NeurIPS 2022

    Journal ref: Neural Information Processing Systems (NeurIPS), 2022

  15. arXiv:2203.05813  [pdf, other

    stat.ML cs.LG

    Averaging Spatio-temporal Signals using Optimal Transport and Soft Alignments

    Authors: Hicham Janati, Marco Cuturi, Alexandre Gramfort

    Abstract: Several fields in science, from genomics to neuroimaging, require monitoring populations (measures) that evolve with time. These complex datasets, describing dynamics with both time and spatial components, pose new challenges for data analysis. We propose in this work a new framework to carry out averaging of these datasets, with the goal of synthesizing a representative template trajectory from m… ▽ More

    Submitted 8 April, 2022; v1 submitted 11 March, 2022; originally announced March 2022.

  16. arXiv:2203.01110  [pdf, other

    stat.ML cs.LG

    The Optimal Noise in Noise-Contrastive Learning Is Not What You Think

    Authors: Omar Chehab, Alexandre Gramfort, Aapo Hyvarinen

    Abstract: Learning a parametric model of a data distribution is a well-known statistical problem that has seen renewed interest as it is brought to scale in deep learning. Framing the problem as a self-supervised task, where data samples are discriminated from noise samples, is at the core of state-of-the-art methods, beginning with Noise-Contrastive Estimation (NCE). Yet, such contrastive learning requires… ▽ More

    Submitted 26 July, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

  17. arXiv:2202.12950  [pdf, other

    eess.SP cs.AI cs.LG

    2021 BEETL Competition: Advancing Transfer Learning for Subject Independence & Heterogenous EEG Data Sets

    Authors: Xiaoxi Wei, A. Aldo Faisal, Moritz Grosse-Wentrup, Alexandre Gramfort, Sylvain Chevallier, Vinay Jayaram, Camille Jeunet, Stylianos Bakas, Siegfried Ludwig, Konstantinos Barmpas, Mehdi Bahri, Yannis Panagakis, Nikolaos Laskaris, Dimitrios A. Adamos, Stefanos Zafeiriou, William C. Duong, Stephen M. Gordon, Vernon J. Lawhern, Maciej Śliwowski, Vincent Rouanne, Piotr Tempczyk

    Abstract: Transfer learning and meta-learning offer some of the most promising avenues to unlock the scalability of healthcare and consumer technologies driven by biosignal data. This is because current methods cannot generalise well across human subjects' data and handle learning from different heterogeneously collected data sets, thus limiting the scale of training data. On the other side, developments in… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

    Comments: PrePrint of the NeurIPS2021 BEETL Competition Submitted to Proceedings of Machine Learning Research (PMLR)

  18. arXiv:2202.02142  [pdf, other

    cs.LG cs.AI

    Deep invariant networks with differentiable augmentation layers

    Authors: Cédric Rommel, Thomas Moreau, Alexandre Gramfort

    Abstract: Designing learning systems which are invariant to certain data transformations is critical in machine learning. Practitioners can typically enforce a desired invariance on the trained model through the choice of a network architecture, e.g. using convolutions for translations, or using data augmentation. Yet, enforcing true invariance in the network can be difficult, and data invariances are not a… ▽ More

    Submitted 25 October, 2022; v1 submitted 4 February, 2022; originally announced February 2022.

    Comments: Accepted to NeurIPS 2022

  19. arXiv:2112.06652  [pdf, other

    eess.SP cs.LG math.ST stat.AP

    DriPP: Driven Point Processes to Model Stimuli Induced Patterns in M/EEG Signals

    Authors: Cédric Allain, Alexandre Gramfort, Thomas Moreau

    Abstract: The quantitative analysis of non-invasive electrophysiology signals from electroencephalography (EEG) and magnetoencephalography (MEG) boils down to the identification of temporal patterns such as evoked responses, transient bursts of neural oscillations but also blinks or heartbeats for data cleaning. Several works have shown that these patterns can be extracted efficiently in an unsupervised way… ▽ More

    Submitted 11 July, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

  20. arXiv:2111.14232  [pdf, other

    q-bio.NC cs.AI cs.CL cs.LG cs.NE

    Long-range and hierarchical language predictions in brains and algorithms

    Authors: Charlotte Caucheteux, Alexandre Gramfort, Jean-Remi King

    Abstract: Deep learning has recently made remarkable progress in natural language processing. Yet, the resulting algorithms remain far from competing with the language abilities of the human brain. Predictive coding theory offers a potential explanation to this discrepancy: while deep language algorithms are optimized to predict adjacent words, the human brain would be tuned to make long-range and hierarchi… ▽ More

    Submitted 28 November, 2021; originally announced November 2021.

  21. arXiv:2111.08693  [pdf, other

    q-bio.QM cs.LG

    Inverting brain grey matter models with likelihood-free inference: a tool for trustable cytoarchitecture measurements

    Authors: Maëliss Jallais, Pedro Luiz Coelho Rodrigues, Alexandre Gramfort, Demian Wassermann

    Abstract: Effective characterisation of the brain grey matter cytoarchitecture with quantitative sensitivity to soma density and volume remains an unsolved challenge in diffusion MRI (dMRI). Solving the problem of relating the dMRI signal with cytoarchitectural characteristics calls for the definition of a mathematical model that describes brain tissue via a handful of physiologically-relevant parameters an… ▽ More

    Submitted 4 May, 2022; v1 submitted 15 November, 2021; originally announced November 2021.

    Journal ref: Journal of Machine Learning for Biomedical Imaging, Melba editors, 2022, pp.1-27

  22. arXiv:2111.02790  [pdf, other

    cs.LG

    LassoBench: A High-Dimensional Hyperparameter Optimization Benchmark Suite for Lasso

    Authors: Kenan Šehić, Alexandre Gramfort, Joseph Salmon, Luigi Nardi

    Abstract: While Weighted Lasso sparse regression has appealing statistical guarantees that would entail a major real-world impact in finance, genomics, and brain imaging applications, it is typically scarcely adopted due to its complex high-dimensional space composed by thousands of hyperparameters. On the other hand, the latest progress with high-dimensional hyperparameter optimization (HD-HPO) methods for… ▽ More

    Submitted 10 June, 2022; v1 submitted 4 November, 2021; originally announced November 2021.

    Comments: 21 pages, 13 figures, Accepted as a conference paper at AutoML2022

  23. arXiv:2110.13502  [pdf, other

    cs.LG

    Shared Independent Component Analysis for Multi-Subject Neuroimaging

    Authors: Hugo Richard, Pierre Ablin, Bertrand Thirion, Alexandre Gramfort, Aapo Hyvärinen

    Abstract: We consider shared response modeling, a multi-view learning problem where one wants to identify common components from multiple datasets or views. We introduce Shared Independent Component Analysis (ShICA) that models each view as a linear transform of shared independent components contaminated by additive Gaussian noise. We show that this model is identifiable if the components are either non-Gau… ▽ More

    Submitted 26 October, 2021; originally announced October 2021.

    Comments: Accepted at NeurIPS 2021

  24. arXiv:2110.06135  [pdf, other

    cs.LG q-bio.NC

    Label scarcity in biomedicine: Data-rich latent factor discovery enhances phenotype prediction

    Authors: Marc-Andre Schulz, Bertrand Thirion, Alexandre Gramfort, Gaël Varoquaux, Danilo Bzdok

    Abstract: High-quality data accumulation is now becoming ubiquitous in the health domain. There is increasing opportunity to exploit rich data from normal subjects to improve supervised estimators in specific diseases with notorious data scarcity. We demonstrate that low-dimensional embedding spaces can be derived from the UK Biobank population dataset and used to enhance data-scarce prediction of health in… ▽ More

    Submitted 12 October, 2021; originally announced October 2021.

    Comments: Accepted at NIPS 2017 Workshop on Machine Learning for Health

  25. arXiv:2110.06078  [pdf

    q-bio.NC cs.AI cs.CL cs.LG

    Model-based analysis of brain activity reveals the hierarchy of language in 305 subjects

    Authors: Charlotte Caucheteux, Alexandre Gramfort, Jean-Rémi King

    Abstract: A popular approach to decompose the neural bases of language consists in correlating, across individuals, the brain responses to different stimuli (e.g. regular speech versus scrambled words, sentences, or paragraphs). Although successful, this `model-free' approach necessitates the acquisition of a large and costly set of neuroimaging data. Here, we show that a model-based approach can reach equi… ▽ More

    Submitted 12 October, 2021; originally announced October 2021.

    Comments: Accepted to EMNLP 2021 (Findings)

    Journal ref: Findings of the Association for Computational Linguistics (EMNLP 2021)

  26. arXiv:2106.13695  [pdf, other

    cs.LG

    CADDA: Class-wise Automatic Differentiable Data Augmentation for EEG Signals

    Authors: Cédric Rommel, Thomas Moreau, Joseph Paillard, Alexandre Gramfort

    Abstract: Data augmentation is a key element of deep learning pipelines, as it informs the network during training about transformations of the input data that keep the label unchanged. Manually finding adequate augmentation methods and parameters for a given pipeline is however rapidly cumbersome. In particular, while intuition can guide this decision for images, the design and choice of augmentation polic… ▽ More

    Submitted 7 February, 2022; v1 submitted 25 June, 2021; originally announced June 2021.

  27. arXiv:2105.12916  [pdf, other

    cs.LG eess.SP q-bio.NC q-bio.QM stat.ML

    Robust learning from corrupted EEG with dynamic spatial filtering

    Authors: Hubert Banville, Sean U. N. Wood, Chris Aimone, Denis-Alexander Engemann, Alexandre Gramfort

    Abstract: Building machine learning models using EEG recorded outside of the laboratory setting requires methods robust to noisy data and randomly missing channels. This need is particularly great when working with sparse EEG montages (1-6 channels), often encountered in consumer-grade or mobile EEG devices. Neither classical machine learning models nor deep neural networks trained end-to-end on EEG are typ… ▽ More

    Submitted 26 May, 2021; originally announced May 2021.

    Comments: 42 pages, 9 figures

  28. arXiv:2105.01637  [pdf, other

    stat.ML cs.LG math.OC

    Implicit differentiation for fast hyperparameter selection in non-smooth convex learning

    Authors: Quentin Bertrand, Quentin Klopfenstein, Mathurin Massias, Mathieu Blondel, Samuel Vaiter, Alexandre Gramfort, Joseph Salmon

    Abstract: Finding the optimal hyperparameters of a model can be cast as a bilevel optimization problem, typically solved using zero-order techniques. In this work we study first-order methods when the inner optimization problem is convex but non-smooth. We show that the forward-mode differentiation of proximal gradient descent and proximal coordinate descent yield sequences of Jacobians converging toward th… ▽ More

    Submitted 8 August, 2022; v1 submitted 4 May, 2021; originally announced May 2021.

  29. arXiv:2103.02339  [pdf, other

    q-bio.NC cs.LG cs.NE

    Deep Recurrent Encoder: A scalable end-to-end network to model brain signals

    Authors: Omar Chehab, Alexandre Defossez, Jean-Christophe Loiseau, Alexandre Gramfort, Jean-Remi King

    Abstract: Understanding how the brain responds to sensory inputs is challenging: brain recordings are partial, noisy, and high dimensional; they vary across sessions and subjects and they capture highly nonlinear dynamics. These challenges have led the community to develop a variety of preprocessing and analytical (almost exclusively linear) methods, each designed to tackle one of these issues. Instead, we… ▽ More

    Submitted 30 September, 2022; v1 submitted 3 March, 2021; originally announced March 2021.

  30. arXiv:2103.01620  [pdf, other

    cs.CL cs.LG q-bio.NC

    Disentangling Syntax and Semantics in the Brain with Deep Networks

    Authors: Charlotte Caucheteux, Alexandre Gramfort, Jean-Remi King

    Abstract: The activations of language transformers like GPT-2 have been shown to linearly map onto brain activity during speech comprehension. However, the nature of these activations remains largely unknown and presumably conflate distinct linguistic classes. Here, we propose a taxonomy to factorize the high-dimensional activations of language models into four combinatorial classes: lexical, compositional,… ▽ More

    Submitted 15 June, 2021; v1 submitted 2 March, 2021; originally announced March 2021.

    Comments: Accepted to ICML 2021

    Journal ref: International Conference on Machine Learning (ICML), 2021

  31. arXiv:2102.10964  [pdf, other

    stat.ML cs.LG

    Adaptive Multi-View ICA: Estimation of noise levels for optimal inference

    Authors: Hugo Richard, Pierre Ablin, Aapo Hyvärinen, Alexandre Gramfort, Bertrand Thirion

    Abstract: We consider a multi-view learning problem known as group independent component analysis (group ICA), where the goal is to recover shared independent sources from many views. The statistical modeling of this problem requires to take noise into account. When the model includes additive noise on the observations, the likelihood is intractable. By contrast, we propose Adaptive multiView ICA (AVICA), a… ▽ More

    Submitted 22 February, 2021; originally announced February 2021.

  32. arXiv:2102.06477  [pdf, other

    stat.ML cs.LG q-bio.QM

    HNPE: Leveraging Global Parameters for Neural Posterior Estimation

    Authors: Pedro L. C. Rodrigues, Thomas Moreau, Gilles Louppe, Alexandre Gramfort

    Abstract: Inferring the parameters of a stochastic model based on experimental observations is central to the scientific method. A particularly challenging setting is when the model is strongly indeterminate, i.e. when distinct sets of parameters yield identical observations. This arises in many practical situations, such as when inferring the distance and power of a radio source (is the source close and we… ▽ More

    Submitted 9 November, 2021; v1 submitted 12 February, 2021; originally announced February 2021.

  33. arXiv:2012.02807  [pdf, other

    stat.ML cs.LG stat.AP

    Learning summary features of time series for likelihood free inference

    Authors: Pedro L. C. Rodrigues, Alexandre Gramfort

    Abstract: There has been an increasing interest from the scientific community in using likelihood-free inference (LFI) to determine which parameters of a given simulator model could best describe a set of experimental data. Despite exciting recent results and a wide range of possible applications, an important bottleneck of LFI when applied to time series data is the necessity of defining a set of summary f… ▽ More

    Submitted 4 December, 2020; originally announced December 2020.

  34. arXiv:2010.11825  [pdf, other

    stat.ML cs.LG math.OC

    Model identification and local linear convergence of coordinate descent

    Authors: Quentin Klopfenstein, Quentin Bertrand, Alexandre Gramfort, Joseph Salmon, Samuel Vaiter

    Abstract: For composite nonsmooth optimization problems, Forward-Backward algorithm achieves model identification (e.g. support identification for the Lasso) after a finite number of iterations, provided the objective function is regular enough. Results concerning coordinate descent are scarcer and model identification has only been shown for specific estimators, the support-vector machine for instance. In… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

  35. arXiv:2009.14310  [pdf, other

    stat.ML cs.LG stat.AP

    Statistical control for spatio-temporal MEG/EEG source imaging with desparsified multi-task Lasso

    Authors: Jérôme-Alexis Chevalier, Alexandre Gramfort, Joseph Salmon, Bertrand Thirion

    Abstract: Detecting where and when brain regions activate in a cognitive task or in a given clinical condition is the promise of non-invasive techniques like magnetoencephalography (MEG) or electroencephalography (EEG). This problem, referred to as source localization, or source imaging, poses however a high-dimensional statistical inference challenge. While sparsity promoting regularizations have been prop… ▽ More

    Submitted 25 November, 2020; v1 submitted 29 September, 2020; originally announced September 2020.

    Comments: 21 pages

  36. arXiv:2007.16104  [pdf, other

    stat.ML cs.LG eess.SP q-bio.NC q-bio.QM

    Uncovering the structure of clinical EEG signals with self-supervised learning

    Authors: Hubert Banville, Omar Chehab, Aapo Hyvärinen, Denis-Alexander Engemann, Alexandre Gramfort

    Abstract: Objective. Supervised learning paradigms are often limited by the amount of labeled data that is available. This phenomenon is particularly problematic in clinically-relevant data, such as electroencephalography (EEG), where labeling can be costly in terms of specialized expertise and human processing time. Consequently, deep learning architectures designed to learn on EEG data have yielded relati… ▽ More

    Submitted 31 July, 2020; originally announced July 2020.

    Comments: 32 pages, 9 figures

  37. arXiv:2006.06635  [pdf, other

    stat.ML cs.LG

    Modeling Shared Responses in Neuroimaging Studies through MultiView ICA

    Authors: Hugo Richard, Luigi Gresele, Aapo Hyvärinen, Bertrand Thirion, Alexandre Gramfort, Pierre Ablin

    Abstract: Group studies involving large cohorts of subjects are important to draw general conclusions about brain functional organization. However, the aggregation of data coming from multiple subjects is challenging, since it requires accounting for large variability in anatomy, functional topography and stimulus response across individuals. Data modeling is especially hard for ecologically relevant condit… ▽ More

    Submitted 24 December, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: Accepted to NeurIPS 2020

  38. arXiv:2006.02575  [pdf, other

    stat.ML cs.LG

    Debiased Sinkhorn barycenters

    Authors: Hicham Janati, Marco Cuturi, Alexandre Gramfort

    Abstract: Entropy regularization in optimal transport (OT) has been the driver of many recent interests for Wasserstein metrics and barycenters in machine learning. It allows to keep the appealing geometrical properties of the unregularized Wasserstein distance while having a significantly lower complexity thanks to Sinkhorn's algorithm. However, entropy brings some inherent smoothing bias, resulting for ex… ▽ More

    Submitted 3 June, 2020; originally announced June 2020.

    Comments: ICML 2020

  39. arXiv:2005.11890  [pdf, other

    stat.ML cs.LG stat.CO

    mvlearn: Multiview Machine Learning in Python

    Authors: Ronan Perry, Gavin Mischler, Richard Guo, Theodore Lee, Alexander Chang, Arman Koul, Cameron Franz, Hugo Richard, Iain Carmichael, Pierre Ablin, Alexandre Gramfort, Joshua T. Vogelstein

    Abstract: As data are generated more and more from multiple disparate sources, multiview data sets, where each sample has features in distinct views, have ballooned in recent years. However, no comprehensive package exists that enables non-specialists to use these methods easily. mvlearn is a Python library which implements the leading multiview machine learning methods. Its simple API closely follows that… ▽ More

    Submitted 25 May, 2021; v1 submitted 24 May, 2020; originally announced May 2020.

    Comments: 6 pages, 2 figures, 1 table

  40. arXiv:2002.08943  [pdf, other

    stat.ML cs.LG

    Implicit differentiation of Lasso-type models for hyperparameter optimization

    Authors: Quentin Bertrand, Quentin Klopfenstein, Mathieu Blondel, Samuel Vaiter, Alexandre Gramfort, Joseph Salmon

    Abstract: Setting regularization parameters for Lasso-type estimators is notoriously difficult, though crucial in practice. The most popular hyperparameter optimization approach is grid-search using held-out validation data. Grid-search however requires to choose a predefined grid for each parameter, which scales exponentially in the number of parameters. Another approach is to cast hyperparameter optimizat… ▽ More

    Submitted 3 September, 2020; v1 submitted 20 February, 2020; originally announced February 2020.

  41. arXiv:2001.05401  [pdf, other

    stat.ML cs.LG math.OC

    Support recovery and sup-norm convergence rates for sparse pivotal estimation

    Authors: Mathurin Massias, Quentin Bertrand, Alexandre Gramfort, Joseph Salmon

    Abstract: In high dimensional sparse regression, pivotal estimators are estimators for which the optimal regularization parameter is independent of the noise level. The canonical pivotal estimator is the square-root Lasso, formulated along with its derivatives as a "non-smooth + non-smooth" optimization problem. Modern techniques to solve these include smoothing the datafitting term, to benefit from fast ef… ▽ More

    Submitted 3 September, 2020; v1 submitted 15 January, 2020; originally announced January 2020.

  42. arXiv:1911.05419  [pdf, other

    cs.LG eess.SP stat.ML

    Self-supervised representation learning from electroencephalography signals

    Authors: Hubert Banville, Isabela Albuquerque, Aapo Hyvärinen, Graeme Moffat, Denis-Alexander Engemann, Alexandre Gramfort

    Abstract: The supervised learning paradigm is limited by the cost - and sometimes the impracticality - of data collection and labeling in multiple domains. Self-supervised learning, a paradigm which exploits the structure of unlabeled data to create learning problems that can be solved with standard supervised approaches, has shown great promise as a pretraining or feature learning approach in fields like c… ▽ More

    Submitted 13 November, 2019; originally announced November 2019.

  43. arXiv:1910.03860  [pdf, other

    stat.ML cs.LG

    Spatio-Temporal Alignments: Optimal transport through space and time

    Authors: Hicham Janati, Marco Cuturi, Alexandre Gramfort

    Abstract: Comparing data defined over space and time is notoriously hard, because it involves quantifying both spatial and temporal variability, while at the same time taking into account the chronological structure of data. Dynamic Time Warping (DTW) computes an optimal alignment between time series in agreement with the chronological order, but is inherently blind to spatial shifts. In this paper, we prop… ▽ More

    Submitted 10 November, 2019; v1 submitted 9 October, 2019; originally announced October 2019.

  44. arXiv:1910.01914  [pdf, other

    stat.ML cs.LG q-bio.NC

    Multi-subject MEG/EEG source imaging with sparse multi-task regression

    Authors: Hicham Janati, Thomas Bazeille, Bertrand Thirion, Marco Cuturi, Alexandre Gramfort

    Abstract: Magnetoencephalography and electroencephalography (M/EEG) are non-invasive modalities that measure the weak electromagnetic fields generated by neural activity. Estimating the location and magnitude of the current sources that generated these electromagnetic fields is a challenging ill-posed regression problem known as \emph{source imaging}. When considering a group study, a common approach consis… ▽ More

    Submitted 14 October, 2019; v1 submitted 3 October, 2019; originally announced October 2019.

    Comments: version 2. arXiv admin note: text overlap with arXiv:1902.04812

  45. arXiv:1907.05830  [pdf, other

    stat.ML cs.LG

    Dual Extrapolation for Sparse Generalized Linear Models

    Authors: Mathurin Massias, Samuel Vaiter, Alexandre Gramfort, Joseph Salmon

    Abstract: Generalized Linear Models (GLM) form a wide class of regression and classification models, where prediction is a function of a linear combination of the input variables. For statistical inference in high dimension, sparsity inducing regularizations have proven to be useful while offering statistical guarantees. However, solving the resulting optimization problems can be challenging: even for popul… ▽ More

    Submitted 24 August, 2022; v1 submitted 12 July, 2019; originally announced July 2019.

  46. arXiv:1906.02687  [pdf, other

    eess.SP cs.LG stat.ML

    Manifold-regression to predict from MEG/EEG brain signals without source modeling

    Authors: David Sabbagh, Pierre Ablin, Gael Varoquaux, Alexandre Gramfort, Denis A. Engemann

    Abstract: Magnetoencephalography and electroencephalography (M/EEG) can reveal neuronal dynamics non-invasively in real-time and are therefore appreciated methods in medicine and neuroscience. Recent advances in modeling brain-behavior relationships have highlighted the effectiveness of Riemannian geometry for summarizing the spatially correlated time-series from M/EEG in terms of their covariance. However,… ▽ More

    Submitted 22 November, 2019; v1 submitted 4 June, 2019; originally announced June 2019.

  47. arXiv:1905.11071  [pdf, other

    stat.ML cs.LG

    Learning step sizes for unfolded sparse coding

    Authors: Pierre Ablin, Thomas Moreau, Mathurin Massias, Alexandre Gramfort

    Abstract: Sparse coding is typically solved by iterative optimization techniques, such as the Iterative Shrinkage-Thresholding Algorithm (ISTA). Unfolding and learning weights of ISTA using neural networks is a practical way to accelerate estimation. In this paper, we study the selection of adapted step sizes for ISTA. We show that a simple step size strategy can improve the convergence rate of ISTA by leve… ▽ More

    Submitted 27 May, 2019; originally announced May 2019.

    Comments: 22 pages

  48. arXiv:1902.04812  [pdf, other

    stat.ML cs.LG

    Group level MEG/EEG source imaging via optimal transport: minimum Wasserstein estimates

    Authors: Hicham Janati, Thomas Bazeille, Bertrand Thirion, Marco Cuturi, Alexandre Gramfort

    Abstract: Magnetoencephalography (MEG) and electroencephalogra-phy (EEG) are non-invasive modalities that measure the weak electromagnetic fields generated by neural activity. Inferring the location of the current sources that generated these magnetic fields is an ill-posed inverse problem known as source imaging. When considering a group study, a baseline approach consists in carrying out the estimation of… ▽ More

    Submitted 13 February, 2019; originally announced February 2019.

  49. arXiv:1902.02509  [pdf, other

    stat.ML cs.LG math.OC stat.AP

    Handling correlated and repeated measurements with the smoothed multivariate square-root Lasso

    Authors: Quentin Bertrand, Mathurin Massias, Alexandre Gramfort, Joseph Salmon

    Abstract: Sparsity promoting norms are frequently used in high dimensional regression. A limitation of such Lasso-type estimators is that the optimal regularization parameter depends on the unknown noise level. Estimators such as the concomitant Lasso address this dependence by jointly estimating the noise level and the regression coefficients. Additionally, in many applications, the data is obtained by ave… ▽ More

    Submitted 3 September, 2020; v1 submitted 7 February, 2019; originally announced February 2019.

  50. arXiv:1901.09235  [pdf, other

    cs.LG cs.DC stat.ML

    Distributed Convolutional Dictionary Learning (DiCoDiLe): Pattern Discovery in Large Images and Signals

    Authors: Thomas Moreau, Alexandre Gramfort

    Abstract: Convolutional dictionary learning (CDL) estimates shift invariant basis adapted to multidimensional data. CDL has proven useful for image denoising or inpainting, as well as for pattern discovery on multivariate signals. As estimated patterns can be positioned anywhere in signals or images, optimization techniques face the difficulty of working in extremely high dimensions with millions of pixels… ▽ More

    Submitted 26 January, 2019; originally announced January 2019.