Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–6 of 6 results for author: Blondé, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2306.09805  [pdf, other

    cs.LG

    Mimicking Better by Matching the Approximate Action Distribution

    Authors: João A. Cândido Ramos, Lionel Blondé, Naoya Takeishi, Alexandros Kalousis

    Abstract: In this paper, we introduce MAAD, a novel, sample-efficient on-policy algorithm for Imitation Learning from Observations. MAAD utilizes a surrogate reward signal, which can be derived from various sources such as adversarial games, trajectory matching objectives, or optimal transport criteria. To compensate for the non-availability of expert actions, we rely on an inverse dynamics model that infer… ▽ More

    Submitted 9 February, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

  2. arXiv:2107.01407  [pdf, other

    cs.LG cs.AI

    Optimality Inductive Biases and Agnostic Guidelines for Offline Reinforcement Learning

    Authors: Lionel Blondé, Alexandros Kalousis, Stéphane Marchand-Maillet

    Abstract: The performance of state-of-the-art offline RL methods varies widely over the spectrum of dataset qualities, ranging from far-from-optimal random data to close-to-optimal expert demonstrations. We re-implement these methods to test their reproducibility, and show that when a given method outperforms the others on one end of the spectrum, it never does on the other end. This prevents us from naming… ▽ More

    Submitted 19 January, 2022; v1 submitted 3 July, 2021; originally announced July 2021.

  3. arXiv:2106.11083  [pdf, other

    cs.LG

    Conditional Neural Relational Inference for Interacting Systems

    Authors: Joao A. Candido Ramos, Lionel Blondé, Stéphane Armand, Alexandros Kalousis

    Abstract: In this work, we want to learn to model the dynamics of similar yet distinct groups of interacting objects. These groups follow some common physical laws that exhibit specificities that are captured through some vectorial description. We develop a model that allows us to do conditional generation from any such group given its vectorial description. Unlike previous work on learning dynamical system… ▽ More

    Submitted 2 July, 2021; v1 submitted 21 June, 2021; originally announced June 2021.

  4. Lipschitzness Is All You Need To Tame Off-policy Generative Adversarial Imitation Learning

    Authors: Lionel Blondé, Pablo Strasser, Alexandros Kalousis

    Abstract: Despite the recent success of reinforcement learning in various domains, these approaches remain, for the most part, deterringly sensitive to hyper-parameters and are often riddled with essential engineering feats allowing their success. We consider the case of off-policy generative adversarial imitation learning, and perform an in-depth review, qualitative and quantitative, of the method. We show… ▽ More

    Submitted 25 October, 2023; v1 submitted 28 June, 2020; originally announced June 2020.

    Comments: Accepted for publication in Machine Learning 2022

  5. arXiv:1912.08444  [pdf, other

    cs.LG cs.AI cs.CV cs.RO stat.ML

    Relational Mimic for Visual Adversarial Imitation Learning

    Authors: Lionel Blondé, Yichuan Charlie Tang, Jian Zhang, Russ Webb

    Abstract: In this work, we introduce a new method for imitation learning from video demonstrations. Our method, Relational Mimic (RM), improves on previous visual imitation learning methods by combining generative adversarial networks and relational learning. RM is flexible and can be used in conjunction with other recent advances in generative adversarial imitation learning to better address the need for m… ▽ More

    Submitted 18 December, 2019; originally announced December 2019.

  6. arXiv:1809.02064  [pdf, other

    cs.LG stat.ML

    Sample-Efficient Imitation Learning via Generative Adversarial Nets

    Authors: Lionel Blondé, Alexandros Kalousis

    Abstract: GAIL is a recent successful imitation learning architecture that exploits the adversarial training procedure introduced in GANs. Albeit successful at generating behaviours similar to those demonstrated to the agent, GAIL suffers from a high sample complexity in the number of interactions it has to carry out in the environment in order to achieve satisfactory performance. We dramatically shrink the… ▽ More

    Submitted 8 March, 2019; v1 submitted 6 September, 2018; originally announced September 2018.

    Comments: Published as a conference paper for AISTATS 2019