Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–40 of 40 results for author: Moreau, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.11676  [pdf, other

    cs.LG cs.AI stat.ME stat.ML

    SKADA-Bench: Benchmarking Unsupervised Domain Adaptation Methods with Realistic Validation

    Authors: Yanis Lalou, Théo Gnassounou, Antoine Collas, Antoine de Mathelin, Oleksii Kachaiev, Ambroise Odonnat, Alexandre Gramfort, Thomas Moreau, Rémi Flamary

    Abstract: Unsupervised Domain Adaptation (DA) consists of adapting a model trained on a labeled source domain to perform well on an unlabeled target domain with some data distribution shift. While many methods have been proposed in the literature, fair and realistic evaluation remains an open question, particularly due to methodological difficulties in selecting hyperparameters in the unsupervised setting.… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

  2. arXiv:2406.16938  [pdf, other

    eess.SP cs.LG stat.ML

    Unmixing Noise from Hawkes Process to Model Learned Physiological Events

    Authors: Guillaume Staerman, Virginie Loison, Thomas Moreau

    Abstract: Physiological signal analysis often involves identifying events crucial to understanding biological dynamics. Traditional methods rely on handcrafted procedures or supervised learning, presenting challenges such as expert dependence, lack of robustness, and the need for extensive labeled data. Data-driven methods like Convolutional Dictionary Learning (CDL) offer an alternative but tend to produce… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  3. arXiv:2406.06849  [pdf, other

    stat.ML cs.LG

    Flexible Parametric Inference for Space-Time Hawkes Processes

    Authors: Emilia Siviero, Guillaume Staerman, Stephan Clémençon, Thomas Moreau

    Abstract: Many modern spatio-temporal data sets, in sociology, epidemiology or seismology, for example, exhibit self-exciting characteristics, triggering and clustering behaviors both at the same time, that a suitable Hawkes space-time process can accurately capture. This paper aims to develop a fast and flexible parametric inference technique to recover the parameters of the kernel functions involved in th… ▽ More

    Submitted 17 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  4. arXiv:2404.15319  [pdf, other

    eess.SP cs.AI cs.HC cs.LG q-bio.NC

    The largest EEG-based BCI reproducibility study for open science: the MOABB benchmark

    Authors: Sylvain Chevallier, Igor Carrara, Bruno Aristimunha, Pierre Guetschel, Sara Sedlar, Bruna Lopes, Sebastien Velut, Salim Khazem, Thomas Moreau

    Abstract: Objective. This study conduct an extensive Brain-computer interfaces (BCI) reproducibility analysis on open electroencephalography datasets, aiming to assess existing solutions and establish open and reproducible benchmarks for effective comparison within the field. The need for such benchmark lies in the rapid industrial progress that has given rise to undisclosed proprietary solutions. Furthermo… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: 43 pages, 13 figures, 5 tables

  5. S-JEPA: towards seamless cross-dataset transfer through dynamic spatial attention

    Authors: Pierre Guetschel, Thomas Moreau, Michael Tangermann

    Abstract: Motivated by the challenge of seamless cross-dataset transfer in EEG signal processing, this article presents an exploratory study on the use of Joint Embedding Predictive Architectures (JEPAs). In recent years, self-supervised learning has emerged as a promising approach for transfer learning in various domains. However, its application to EEG signals remains largely unexplored. In this article,… ▽ More

    Submitted 7 October, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Journal ref: 9th Graz Brain-Computer Interface Conference (2024) 11-16

  6. arXiv:2312.01831  [pdf, other

    eess.IV cs.CV

    Equivariant plug-and-play image reconstruction

    Authors: Matthieu Terris, Thomas Moreau, Nelly Pustelnik, Julian Tachella

    Abstract: Plug-and-play algorithms constitute a popular framework for solving inverse imaging problems that rely on the implicit definition of an image prior via a denoiser. These algorithms can leverage powerful pre-trained denoisers to solve a wide range of imaging tasks, circumventing the necessity to train models on a per-task basis. Unfortunately, plug-and-play methods often show unstable behaviors, ha… ▽ More

    Submitted 23 May, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

  7. arXiv:2311.18710  [pdf, other

    cs.CV cs.LG

    Meta-Prior: Meta learning for Adaptive Inverse Problem Solvers

    Authors: Matthieu Terris, Thomas Moreau

    Abstract: Deep neural networks have become a foundational tool for addressing imaging inverse problems. They are typically trained for a specific task, with a supervised loss to learn a mapping from the observations to the image to recover. However, real-world imaging challenges often lack ground truth data, rendering traditional supervised approaches ineffective. Moreover, for each new imaging task, a new… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

  8. arXiv:2308.16022  [pdf, other

    stat.ML cs.LG

    PAVI: Plate-Amortized Variational Inference

    Authors: Louis Rouillard, Alexandre Le Bris, Thomas Moreau, Demian Wassermann

    Abstract: Given observed data and a probabilistic generative model, Bayesian inference searches for the distribution of the model's parameters that could have yielded the data. Inference is challenging for large population studies where millions of measurements are performed over a cohort of hundreds of subjects, resulting in a massive parameter space. This large cardinality renders off-the-shelf Variationa… ▽ More

    Submitted 30 August, 2023; originally announced August 2023.

  9. arXiv:2305.15042  [pdf, other

    cs.LG stat.ML

    Test like you Train in Implicit Deep Learning

    Authors: Zaccharie Ramzi, Pierre Ablin, Gabriel Peyré, Thomas Moreau

    Abstract: Implicit deep learning has recently gained popularity with applications ranging from meta-learning to Deep Equilibrium Networks (DEQs). In its general formulation, it relies on expressing some components of deep learning pipelines implicitly, typically via a root equation called the inner problem. In practice, the solution of the inner problem is approximated during training with an iterative proc… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

  10. arXiv:2303.05798  [pdf, other

    cs.LG eess.SP stat.ML

    Sliced-Wasserstein on Symmetric Positive Definite Matrices for M/EEG Signals

    Authors: Clément Bonet, Benoît Malézieux, Alain Rakotomamonjy, Lucas Drumetz, Thomas Moreau, Matthieu Kowalski, Nicolas Courty

    Abstract: When dealing with electro or magnetoencephalography records, many supervised prediction tasks are solved by working with covariance matrices to summarize the signals. Learning with these matrices requires using Riemanian geometry to account for their structure. In this paper, we propose a new method to deal with distributions of covariance matrices and demonstrate its computational efficiency on M… ▽ More

    Submitted 24 May, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

    Comments: Published as a conference paper at ICML2023

  11. arXiv:2302.08766  [pdf, other

    stat.ML cs.LG math.OC

    A Lower Bound and a Near-Optimal Algorithm for Bilevel Empirical Risk Minimization

    Authors: Mathieu Dagréou, Thomas Moreau, Samuel Vaiter, Pierre Ablin

    Abstract: Bilevel optimization problems, which are problems where two optimization problems are nested, have more and more applications in machine learning. In many practical cases, the upper and the lower objectives correspond to empirical risk minimization problems and therefore have a sum structure. In this context, we propose a bilevel extension of the celebrated SARAH algorithm. We demonstrate that the… ▽ More

    Submitted 20 February, 2024; v1 submitted 17 February, 2023; originally announced February 2023.

    Comments: Accepted at AISTATS 2024

  12. arXiv:2210.04635  [pdf, other

    stat.ML cs.LG

    FaDIn: Fast Discretized Inference for Hawkes Processes with General Parametric Kernels

    Authors: Guillaume Staerman, Cédric Allain, Alexandre Gramfort, Thomas Moreau

    Abstract: Temporal point processes (TPP) are a natural tool for modeling event-based data. Among all TPP models, Hawkes processes have proven to be the most widely used, mainly due to their adequate modeling for various applications, particularly when considering exponential or non-parametric kernels. Although non-parametric kernels are an option, such models require large datasets. While exponential kernel… ▽ More

    Submitted 2 August, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

  13. arXiv:2206.14483  [pdf, other

    cs.LG cs.AI eess.SP

    Data augmentation for learning predictive models on EEG: a systematic comparison

    Authors: Cédric Rommel, Joseph Paillard, Thomas Moreau, Alexandre Gramfort

    Abstract: Objective: The use of deep learning for electroencephalography (EEG) classification tasks has been rapidly growing in the last years, yet its application has been limited by the relatively small size of EEG datasets. Data augmentation, which consists in artificially increasing the size of the dataset during training, can be employed to alleviate this problem. While a few augmentation transformatio… ▽ More

    Submitted 15 November, 2022; v1 submitted 29 June, 2022; originally announced June 2022.

    Comments: Accepted in Journal of Neural Engineering

  14. arXiv:2206.13424  [pdf, other

    cs.LG math.OC stat.ML

    Benchopt: Reproducible, efficient and collaborative optimization benchmarks

    Authors: Thomas Moreau, Mathurin Massias, Alexandre Gramfort, Pierre Ablin, Pierre-Antoine Bannier, Benjamin Charlier, Mathieu Dagréou, Tom Dupré la Tour, Ghislain Durif, Cassio F. Dantas, Quentin Klopfenstein, Johan Larsson, En Lai, Tanguy Lefort, Benoit Malézieux, Badr Moufad, Binh T. Nguyen, Alain Rakotomamonjy, Zaccharie Ramzi, Joseph Salmon, Samuel Vaiter

    Abstract: Numerical validation is at the core of machine learning research as it allows to assess the actual impact of new methods, and to confirm the agreement between theory and practice. Yet, the rapid development of the field poses several challenges: researchers are confronted with a profusion of methods to compare, limited transparency and consensus on best practices, as well as tedious re-implementat… ▽ More

    Submitted 28 October, 2022; v1 submitted 27 June, 2022; originally announced June 2022.

    Comments: Accepted in proceedings of NeurIPS 22; Benchopt library documentation is available at https://benchopt.github.io/

  15. arXiv:2206.05111  [pdf, other

    cs.AI cs.LG q-bio.NC stat.ME stat.ML

    PAVI: Plate-Amortized Variational Inference

    Authors: Louis Rouillard, Thomas Moreau, Demian Wassermann

    Abstract: Given some observed data and a probabilistic generative model, Bayesian inference aims at obtaining the distribution of a model's latent parameters that could have yielded the data. This task is challenging for large population studies where thousands of measurements are performed over a cohort of hundreds of subjects, resulting in a massive latent parameter space. This large cardinality renders o… ▽ More

    Submitted 10 June, 2022; originally announced June 2022.

  16. arXiv:2202.02142  [pdf, other

    cs.LG cs.AI

    Deep invariant networks with differentiable augmentation layers

    Authors: Cédric Rommel, Thomas Moreau, Alexandre Gramfort

    Abstract: Designing learning systems which are invariant to certain data transformations is critical in machine learning. Practitioners can typically enforce a desired invariance on the trained model through the choice of a network architecture, e.g. using convolutions for translations, or using data augmentation. Yet, enforcing true invariance in the network can be difficult, and data invariances are not a… ▽ More

    Submitted 25 October, 2022; v1 submitted 4 February, 2022; originally announced February 2022.

    Comments: Accepted to NeurIPS 2022

  17. arXiv:2201.13409  [pdf, other

    stat.ML cs.LG math.OC

    A framework for bilevel optimization that enables stochastic and global variance reduction algorithms

    Authors: Mathieu Dagréou, Pierre Ablin, Samuel Vaiter, Thomas Moreau

    Abstract: Bilevel optimization, the problem of minimizing a value function which involves the arg-minimum of another function, appears in many areas of machine learning. In a large scale empirical risk minimization setting where the number of samples is huge, it is crucial to develop stochastic methods, which only use a few samples at a time to progress. However, computing the gradient of the value function… ▽ More

    Submitted 10 November, 2022; v1 submitted 31 January, 2022; originally announced January 2022.

    Comments: Accepted at NeurIPS 2022

  18. arXiv:2112.06652  [pdf, other

    eess.SP cs.LG math.ST stat.AP

    DriPP: Driven Point Processes to Model Stimuli Induced Patterns in M/EEG Signals

    Authors: Cédric Allain, Alexandre Gramfort, Thomas Moreau

    Abstract: The quantitative analysis of non-invasive electrophysiology signals from electroencephalography (EEG) and magnetoencephalography (MEG) boils down to the identification of temporal patterns such as evoked responses, transient bursts of neural oscillations but also blinks or heartbeats for data cleaning. Several works have shown that these patterns can be extracted efficiently in an unsupervised way… ▽ More

    Submitted 11 July, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

  19. arXiv:2106.13695  [pdf, other

    cs.LG

    CADDA: Class-wise Automatic Differentiable Data Augmentation for EEG Signals

    Authors: Cédric Rommel, Thomas Moreau, Joseph Paillard, Alexandre Gramfort

    Abstract: Data augmentation is a key element of deep learning pipelines, as it informs the network during training about transformations of the input data that keep the label unchanged. Manually finding adequate augmentation methods and parameters for a given pipeline is however rapidly cumbersome. In particular, while intuition can guide this decision for images, the design and choice of augmentation polic… ▽ More

    Submitted 7 February, 2022; v1 submitted 25 June, 2021; originally announced June 2021.

  20. arXiv:2106.06338  [pdf, other

    cs.LG math.OC stat.ML

    Understanding approximate and unrolled dictionary learning for pattern recovery

    Authors: Benoît Malézieux, Thomas Moreau, Matthieu Kowalski

    Abstract: Dictionary learning consists of finding a sparse representation from noisy data and is a common way to encode data-driven prior knowledge on signals. Alternating minimization (AM) is standard for the underlying optimization, where gradient descent steps alternate with sparse coding procedures. The major drawback of this method is its prohibitive computational cost, making it unpractical on large r… ▽ More

    Submitted 8 February, 2022; v1 submitted 11 June, 2021; originally announced June 2021.

  21. arXiv:2106.00553  [pdf, other

    cs.LG stat.ML

    SHINE: SHaring the INverse Estimate from the forward pass for bi-level optimization and implicit models

    Authors: Zaccharie Ramzi, Florian Mannel, Shaojie Bai, Jean-Luc Starck, Philippe Ciuciu, Thomas Moreau

    Abstract: In recent years, implicit deep learning has emerged as a method to increase the effective depth of deep neural networks. While their training is memory-efficient, they are still significantly slower to train than their explicit counterparts. In Deep Equilibrium Models (DEQs), the training is performed as a bi-level problem, and its computational complexity is partially driven by the iterative inve… ▽ More

    Submitted 10 March, 2023; v1 submitted 1 June, 2021; originally announced June 2021.

    Comments: Accepted as a spotlight to ICLR 2022

  22. arXiv:2102.06477  [pdf, other

    stat.ML cs.LG q-bio.QM

    HNPE: Leveraging Global Parameters for Neural Posterior Estimation

    Authors: Pedro L. C. Rodrigues, Thomas Moreau, Gilles Louppe, Alexandre Gramfort

    Abstract: Inferring the parameters of a stochastic model based on experimental observations is central to the scientific method. A particularly challenging setting is when the model is strongly indeterminate, i.e. when distinct sets of parameters yield identical observations. This arises in many practical situations, such as when inferring the distance and power of a radio source (is the source close and we… ▽ More

    Submitted 9 November, 2021; v1 submitted 12 February, 2021; originally announced February 2021.

  23. arXiv:2011.14962  [pdf, other

    eess.SP cs.LG

    Extraction of Nystagmus Patterns from Eye-Tracker Data with Convolutional Sparse Coding

    Authors: Clément Lalanne, Maxence Rateaux, Laurent Oudre, Matthieu Robert, Thomas Moreau

    Abstract: The analysis of the Nystagmus waveforms from eye-tracking records is crucial for the clinicial interpretation of this pathological movement. A major issue to automatize this analysis is the presence of natural eye movements and eye blink artefacts that are mixed with the signal of interest. We propose a method based on Convolutional Dictionary Learning that is able to automaticcaly highlight the N… ▽ More

    Submitted 25 November, 2020; originally announced November 2020.

    Journal ref: Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Jul 2020, Montreal, QC, Canada. pp.928-931

  24. arXiv:2007.01627  [pdf, other

    cs.LG cs.AI stat.ML

    NeuMiss networks: differentiable programming for supervised learning with missing values

    Authors: Marine Le Morvan, Julie Josse, Thomas Moreau, Erwan Scornet, Gaël Varoquaux

    Abstract: The presence of missing values makes supervised learning much more challenging. Indeed, previous work has shown that even when the response is a linear function of the complete data, the optimal predictor is a complex function of the observed entries and the missingness indicator. As a result, the computational or sample complexities of consistent approaches depend on the number of missing pattern… ▽ More

    Submitted 4 November, 2020; v1 submitted 3 July, 2020; originally announced July 2020.

    Journal ref: Advances in Neural Information Processing Systems 33, Dec 2020, Vancouver, Canada

  25. arXiv:2002.03722  [pdf, other

    stat.ML cs.LG

    Super-efficiency of automatic differentiation for functions defined as a minimum

    Authors: Pierre Ablin, Gabriel Peyré, Thomas Moreau

    Abstract: In min-min optimization or max-min optimization, one has to compute the gradient of a function defined as a minimum. In most cases, the minimum has no closed-form, and an approximation is obtained via an iterative algorithm. There are two usual ways of estimating the gradient of the function: using either an analytic formula obtained by assuming exactness of the approximation, or automatic differe… ▽ More

    Submitted 10 February, 2020; originally announced February 2020.

    Comments: 31 pages

  26. arXiv:1905.11071  [pdf, other

    stat.ML cs.LG

    Learning step sizes for unfolded sparse coding

    Authors: Pierre Ablin, Thomas Moreau, Mathurin Massias, Alexandre Gramfort

    Abstract: Sparse coding is typically solved by iterative optimization techniques, such as the Iterative Shrinkage-Thresholding Algorithm (ISTA). Unfolding and learning weights of ISTA using neural networks is a practical way to accelerate estimation. In this paper, we study the selection of adapted step sizes for ISTA. We show that a simple step size strategy can improve the convergence rate of ISTA by leve… ▽ More

    Submitted 27 May, 2019; originally announced May 2019.

    Comments: 22 pages

  27. arXiv:1904.08368  [pdf, other

    cs.LG cs.PL stat.ML

    Relay: A High-Level Compiler for Deep Learning

    Authors: Jared Roesch, Steven Lyubomirsky, Marisa Kirisame, Logan Weber, Josh Pollock, Luis Vega, Ziheng Jiang, Tianqi Chen, Thierry Moreau, Zachary Tatlock

    Abstract: Frameworks for writing, compiling, and optimizing deep learning (DL) models have recently enabled progress in areas like computer vision and natural language processing. Extending these frameworks to accommodate the rapidly diversifying landscape of DL models and hardware platforms presents challenging tradeoffs between expressivity, composability, and portability. We present Relay, a new compiler… ▽ More

    Submitted 24 August, 2019; v1 submitted 17 April, 2019; originally announced April 2019.

  28. arXiv:1901.09235  [pdf, other

    cs.LG cs.DC stat.ML

    Distributed Convolutional Dictionary Learning (DiCoDiLe): Pattern Discovery in Large Images and Signals

    Authors: Thomas Moreau, Alexandre Gramfort

    Abstract: Convolutional dictionary learning (CDL) estimates shift invariant basis adapted to multidimensional data. CDL has proven useful for image denoising or inpainting, as well as for pattern discovery on multivariate signals. As estimated patterns can be positioned anywhere in signals or images, optimization techniques face the difficulty of working in extremely high dimensions with millions of pixels… ▽ More

    Submitted 26 January, 2019; originally announced January 2019.

  29. arXiv:1810.11066  [pdf, other

    cs.LG stat.ML

    Automating Generation of Low Precision Deep Learning Operators

    Authors: Meghan Cowan, Thierry Moreau, Tianqi Chen, Luis Ceze

    Abstract: State of the art deep learning models have made steady progress in the fields of computer vision and natural language processing, at the expense of growing model sizes and computational complexity. Deploying these models on low power and mobile devices poses a challenge due to their limited compute capabilities and strict energy budgets. One solution that has generated significant research interes… ▽ More

    Submitted 25 October, 2018; originally announced October 2018.

    Comments: 10 pages, 11 figures

  30. arXiv:1809.05859  [pdf, other

    cs.AR cs.ET cs.PL

    Exploiting Errors for Efficiency: A Survey from Circuits to Algorithms

    Authors: Phillip Stanley-Marbell, Armin Alaghi, Michael Carbin, Eva Darulova, Lara Dolecek, Andreas Gerstlauer, Ghayoor Gillani, Djordje Jevdjic, Thierry Moreau, Mattia Cacciotti, Alexandros Daglis, Natalie Enright Jerger, Babak Falsafi, Sasa Misailovic, Adrian Sampson, Damien Zufferey

    Abstract: When a computational task tolerates a relaxation of its specification or when an algorithm tolerates the effects of noise in its execution, hardware, programming languages, and system software can trade deviations from correct behavior for lower resource usage. We present, for the first time, a synthesis of research results on computing systems that only make as many errors as their users can tole… ▽ More

    Submitted 16 September, 2018; originally announced September 2018.

    Comments: 35 pages

  31. arXiv:1807.04188  [pdf, other

    cs.LG cs.DC stat.ML

    A Hardware-Software Blueprint for Flexible Deep Learning Specialization

    Authors: Thierry Moreau, Tianqi Chen, Luis Vega, Jared Roesch, Eddie Yan, Lianmin Zheng, Josh Fromm, Ziheng Jiang, Luis Ceze, Carlos Guestrin, Arvind Krishnamurthy

    Abstract: Specialized Deep Learning (DL) acceleration stacks, designed for a specific set of frameworks, model architectures, operators, and data types, offer the allure of high performance while sacrificing flexibility. Changes in algorithms, models, operators, or numerical systems threaten the viability of specialized hardware accelerators. We propose VTA, a programmable deep learning architecture templat… ▽ More

    Submitted 22 April, 2019; v1 submitted 11 July, 2018; originally announced July 2018.

    Comments: 6 pages plus references, 8 figures

  32. arXiv:1805.09654  [pdf, other

    eess.SP cs.LG stat.ML

    Multivariate Convolutional Sparse Coding for Electromagnetic Brain Signals

    Authors: Tom Dupré La Tour, Thomas Moreau, Mainak Jas, Alexandre Gramfort

    Abstract: Frequency-specific patterns of neural activity are traditionally interpreted as sustained rhythmic oscillations, and related to cognitive mechanisms such as attention, high level visual processing or motor control. While alpha waves (8-12 Hz) are known to closely resemble short sinusoids, and thus are revealed by Fourier analysis or wavelet transforms, there is an evolving debate that electromagne… ▽ More

    Submitted 26 May, 2018; v1 submitted 24 May, 2018; originally announced May 2018.

  33. arXiv:1805.08166  [pdf, other

    cs.LG stat.ML

    Learning to Optimize Tensor Programs

    Authors: Tianqi Chen, Lianmin Zheng, Eddie Yan, Ziheng Jiang, Thierry Moreau, Luis Ceze, Carlos Guestrin, Arvind Krishnamurthy

    Abstract: We introduce a learning-based framework to optimize tensor programs for deep learning workloads. Efficient implementations of tensor operators, such as matrix multiplication and high dimensional convolution, are key enablers of effective deep learning systems. However, existing systems rely on manually optimized libraries such as cuDNN where only a narrow range of server class GPUs are well-suppor… ▽ More

    Submitted 8 January, 2019; v1 submitted 21 May, 2018; originally announced May 2018.

    Comments: NeurIPS 2018

  34. arXiv:1802.04799  [pdf, other

    cs.LG cs.AI cs.PL

    TVM: An Automated End-to-End Optimizing Compiler for Deep Learning

    Authors: Tianqi Chen, Thierry Moreau, Ziheng Jiang, Lianmin Zheng, Eddie Yan, Meghan Cowan, Haichen Shen, Leyuan Wang, Yuwei Hu, Luis Ceze, Carlos Guestrin, Arvind Krishnamurthy

    Abstract: There is an increasing need to bring machine learning to a wide diversity of hardware devices. Current frameworks rely on vendor-specific operator libraries and optimize for a narrow range of server-class GPUs. Deploying workloads to new platforms -- such as mobile phones, embedded devices, and accelerators (e.g., FPGAs, ASICs) -- requires significant manual effort. We propose TVM, a compiler that… ▽ More

    Submitted 5 October, 2018; v1 submitted 12 February, 2018; originally announced February 2018.

    Comments: Significantly improved version, add automated optimization

  35. arXiv:1801.06378  [pdf, other

    stat.ML cs.LG cs.SE

    Introducing ReQuEST: an Open Platform for Reproducible and Quality-Efficient Systems-ML Tournaments

    Authors: Thierry Moreau, Anton Lokhmotov, Grigori Fursin

    Abstract: Co-designing efficient machine learning based systems across the whole hardware/software stack to trade off speed, accuracy, energy and costs is becoming extremely complex and time consuming. Researchers often struggle to evaluate and compare different published works across rapidly evolving software frameworks, heterogeneous hardware platforms, compilers, libraries, algorithms, data sets, models,… ▽ More

    Submitted 19 January, 2018; originally announced January 2018.

    Comments: ReQuEST tournament website: http://cKnowledge.org/request

  36. arXiv:1706.04332  [pdf, other

    cs.NE

    MATIC: Learning Around Errors for Efficient Low-Voltage Neural Network Accelerators

    Authors: Sung Kim, Patrick Howe, Thierry Moreau, Armin Alaghi, Luis Ceze, Visvesh Sathe

    Abstract: As a result of the increasing demand for deep neural network (DNN)-based services, efforts to develop dedicated hardware accelerators for DNNs are growing rapidly. However,while accelerators with high performance and efficiency on convolutional deep neural networks (Conv-DNNs) have been developed, less progress has been made with regards to fully-connected DNNs (FC-DNNs). In this paper, we propose… ▽ More

    Submitted 23 March, 2018; v1 submitted 14 June, 2017; originally announced June 2017.

    Comments: 6 pages, 12 figures, 3 tables. Published at Design, Automation and Test in Europe Conference and Exhibition (DATE) 2018

  37. arXiv:1706.03864  [pdf, other

    cs.AR

    Exploring Computation-Communication Tradeoffs in Camera Systems

    Authors: Amrita Mazumdar, Thierry Moreau, Sung Kim, Meghan Cowan, Armin Alaghi, Luis Ceze, Mark Oskin, Visvesh Sathe

    Abstract: Cameras are the defacto sensor. The growing demand for real-time and low-power computer vision, coupled with trends towards high-efficiency heterogeneous systems, has given rise to a wide range of image processing acceleration techniques at the camera node and in the cloud. In this paper, we characterize two novel camera systems that use acceleration techniques to push the extremes of energy and p… ▽ More

    Submitted 16 October, 2017; v1 submitted 12 June, 2017; originally announced June 2017.

    Journal ref: 2017 IEEE International Symposium on Workload Characterization (IISWC)

  38. arXiv:1705.10087  [pdf, other

    cs.LG stat.ML

    DICOD: Distributed Convolutional Sparse Coding

    Authors: Thomas Moreau, Laurent Oudre, Nicolas Vayatis

    Abstract: In this paper, we introduce DICOD, a convolutional sparse coding algorithm which builds shift invariant representations for long signals. This algorithm is designed to run in a distributed setting, with local message passing, making it communication efficient. It is based on coordinate descent and uses locally greedy updates which accelerate the resolution compared to greedy coordinate selection.… ▽ More

    Submitted 13 May, 2018; v1 submitted 29 May, 2017; originally announced May 2017.

  39. arXiv:1611.04499  [pdf, other

    stat.ML cs.LG

    Post Training in Deep Learning with Last Kernel

    Authors: Thomas Moreau, Julien Audiffren

    Abstract: One of the main challenges of deep learning methods is the choice of an appropriate training strategy. In particular, additional steps, such as unsupervised pre-training, have been shown to greatly improve the performances of deep structures. In this article, we propose an extra training step, called post-training, which only optimizes the last layer of the network. We show that this procedure can… ▽ More

    Submitted 31 October, 2017; v1 submitted 14 November, 2016; originally announced November 2016.

    Comments: submitted to ICLR 2018

  40. arXiv:1305.2426   

    cs.CR

    Towards a Better Approximation of Full Domain Hash - or - The Reef and Shoal Integrity Arrangement

    Authors: Thierry Moreau

    Abstract: For RSA and Rabin-Williams public key digital signatures, proper message hashing and padding procedures are critical to the overall digital signature security. The theoretical work in this field coined the term `full domain hash' for a conceptually simple approach, a message hashing step with an output value as large as the signature public modulus. The practitioners learned from the theory but di… ▽ More

    Submitted 10 May, 2013; originally announced May 2013.

    Report number: C005404