
Showing 1–4 of 4 results for author: Gosselin, A

  1. arXiv:2406.05630  [pdf, other]

    cs.CV

    Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion

    Authors: Ge Ya Luo, Zhi Hao Luo, Anthony Gosselin, Alexia Jolicoeur-Martineau, Christopher Pal

    Abstract: With recent advances in video prediction, controllable video generation has been attracting more attention. Generating high fidelity videos according to simple and flexible conditioning is of particular interest. To this end, we propose a controllable video generation model using pixel level renderings of 2D or 3D bounding boxes as conditioning. In addition, we also create a bounding box predictor…

    Submitted 21 June, 2024; v1 submitted 8 June, 2024; originally announced June 2024.

  2. arXiv:2403.19918  [pdf, other]

    cs.RO cs.AI cs.LG

    CtRL-Sim: Reactive and Controllable Driving Agents with Offline Reinforcement Learning

    Authors: Luke Rowe, Roger Girgis, Anthony Gosselin, Bruno Carrez, Florian Golemo, Felix Heide, Liam Paull, Christopher Pal

    Abstract: Evaluating autonomous vehicle stacks (AVs) in simulation typically involves replaying driving logs from real-world recorded traffic. However, agents replayed from offline data are not reactive and hard to intuitively control. Existing approaches address these challenges by proposing methods that rely on heuristics or generative models of real-world data but these approaches either lack realism or…

    Submitted 14 June, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: 21 pages, 9 figures, 8 tables

  3. arXiv:2303.00949  [pdf, other]

    eess.AS

    Real-time Audio Video Enhancement with a Microphone Array and Headphones

    Authors: Jacob Kealey, Anthony Gosselin, Étienne Deshaies-Samson, Francis Cardinal, Félix Ducharme-Turcotte, Olivier Bergeron, Amélie Rioux-Joyal, Jérémy Bélec, François Grondin

    Abstract: This paper presents a complete hardware and software pipeline for real-time speech enhancement in noisy and reverberant conditions. The device consists of a microphone array and a camera mounted on eyeglasses, connected to an embedded system that enhances speech and plays back the audio in headphones, with a latency of maximum 120 msec. The proposed approach relies on face detection, tracking and…

    Submitted 1 March, 2023; originally announced March 2023.

    Comments: Submitted to IROS 2023

  4. arXiv:2207.07825  [pdf, other]

    cs.AI cs.LG

    ChronosPerseus: Randomized Point-based Value Iteration with Importance Sampling for POSMDPs

    Authors: Richard Kohar, François Rivest, Alain Gosselin

    Abstract: In reinforcement learning, agents have successfully used environments modeled with Markov decision processes (MDPs). However, in many problem domains, an agent may suffer from noisy observations or random times until its subsequent decision. While partially observable Markov decision processes (POMDPs) have dealt with noisy observations, they have yet to deal with the unknown time aspect. Of cours…

    Submitted 15 July, 2022; originally announced July 2022.

    Comments: 33 pages, 9 figures