Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–20 of 20 results for author: Golemo, F

.
  1. arXiv:2407.15723  [pdf, other

    cs.CL cs.AI

    DStruct2Design: Data and Benchmarks for Data Structure Driven Generative Floor Plan Design

    Authors: Zhi Hao Luo, Luis Lara, Ge Ya Luo, Florian Golemo, Christopher Beckham, Christopher Pal

    Abstract: Text conditioned generative models for images have yielded impressive results. Text conditioned floorplan generation as a special type of raster image generation task also received particular attention. However there are many use cases in floorpla generation where numerical properties of the generated result are more important than the aesthetics. For instance, one might want to specify sizes for… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

  2. arXiv:2403.19918  [pdf, other

    cs.RO cs.AI cs.LG

    CtRL-Sim: Reactive and Controllable Driving Agents with Offline Reinforcement Learning

    Authors: Luke Rowe, Roger Girgis, Anthony Gosselin, Bruno Carrez, Florian Golemo, Felix Heide, Liam Paull, Christopher Pal

    Abstract: Evaluating autonomous vehicle stacks (AVs) in simulation typically involves replaying driving logs from real-world recorded traffic. However, agents replayed from offline data are not reactive and hard to intuitively control. Existing approaches address these challenges by proposing methods that rely on heuristics or generative models of real-world data but these approaches either lack realism or… ▽ More

    Submitted 14 June, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: 21 pages, 9 figures, 8 tables

  3. arXiv:2212.01639  [pdf, other

    stat.ML cs.CV cs.LG

    Visual Question Answering From Another Perspective: CLEVR Mental Rotation Tests

    Authors: Christopher Beckham, Martin Weiss, Florian Golemo, Sina Honari, Derek Nowrouzezahrai, Christopher Pal

    Abstract: Different types of mental rotation tests have been used extensively in psychology to understand human visual reasoning and perception. Understanding what an object or visual scene would look like from another viewpoint is a challenging problem that is made even harder if it must be performed from a single image. We explore a controlled setting whereby questions are posed about the properties of a… ▽ More

    Submitted 3 December, 2022; originally announced December 2022.

    Comments: Accepted for publication to Pattern Recognition journal

  4. arXiv:2203.10351  [pdf, other

    cs.LG

    The Sandbox Environment for Generalizable Agent Research (SEGAR)

    Authors: R Devon Hjelm, Bogdan Mazoure, Florian Golemo, Felipe Frujeri, Mihai Jalobeanu, Andrey Kolobov

    Abstract: A broad challenge of research on generalization for sequential decision-making tasks in interactive environments is designing benchmarks that clearly landmark progress. While there has been notable headway, current benchmarks either do not provide suitable exposure nor intuitive control of the underlying factors, are not easy-to-implement, customizable, or extensible, or are computationally expens… ▽ More

    Submitted 19 March, 2022; originally announced March 2022.

  5. arXiv:2203.03570  [pdf, other

    cs.CV cs.GR cs.LG

    Kubric: A scalable dataset generator

    Authors: Klaus Greff, Francois Belletti, Lucas Beyer, Carl Doersch, Yilun Du, Daniel Duckworth, David J. Fleet, Dan Gnanapragasam, Florian Golemo, Charles Herrmann, Thomas Kipf, Abhijit Kundu, Dmitry Lagun, Issam Laradji, Hsueh-Ti, Liu, Henning Meyer, Yishu Miao, Derek Nowrouzezahrai, Cengiz Oztireli, Etienne Pot, Noha Radwan, Daniel Rebain, Sara Sabour, Mehdi S. M. Sajjadi , et al. (10 additional authors not shown)

    Abstract: Data is the driving force of machine learning, with the amount and quality of training data often being more important for the performance of a system than architecture and training details. But collecting, processing and annotating real data at scale is difficult, expensive, and frequently raises additional privacy, fairness and legal concerns. Synthetic data is a powerful tool with the potential… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

    Comments: 21 pages, CVPR2022

  6. arXiv:2110.08307  [pdf, other

    cs.LG cs.AI

    GrowSpace: Learning How to Shape Plants

    Authors: Yasmeen Hitti, Ionelia Buzatu, Manuel Del Verme, Mark Lefsrud, Florian Golemo, Audrey Durand

    Abstract: Plants are dynamic systems that are integral to our existence and survival. Plants face environment changes and adapt over time to their surrounding conditions. We argue that plant responses to an environmental stimulus are a good example of a real-world problem that can be approached within a reinforcement learning (RL)framework. With the objective of controlling a plant by moving the light sourc… ▽ More

    Submitted 15 October, 2021; originally announced October 2021.

  7. arXiv:2108.01005  [pdf, other

    cs.LG

    Sequoia: A Software Framework to Unify Continual Learning Research

    Authors: Fabrice Normandin, Florian Golemo, Oleksiy Ostapenko, Pau Rodriguez, Matthew D Riemer, Julio Hurtado, Khimya Khetarpal, Ryan Lindeborg, Lucas Cecchi, Timothée Lesort, Laurent Charlin, Irina Rish, Massimo Caccia

    Abstract: The field of Continual Learning (CL) seeks to develop algorithms that accumulate knowledge and skills over time through interaction with non-stationary environments. In practice, a plethora of evaluation procedures (settings) and algorithmic solutions (methods) exist, each with their own potentially disjoint set of assumptions. This variety makes measuring progress in CL difficult. We propose a ta… ▽ More

    Submitted 5 June, 2023; v1 submitted 2 August, 2021; originally announced August 2021.

  8. arXiv:2104.02646  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    gradSim: Differentiable simulation for system identification and visuomotor control

    Authors: Krishna Murthy Jatavallabhula, Miles Macklin, Florian Golemo, Vikram Voleti, Linda Petrini, Martin Weiss, Breandan Considine, Jerome Parent-Levesque, Kevin Xie, Kenny Erleben, Liam Paull, Florian Shkurti, Derek Nowrouzezahrai, Sanja Fidler

    Abstract: We consider the problem of estimating an object's physical properties such as mass, friction, and elasticity directly from video sequences. Such a system identification problem is fundamentally ill-posed due to the loss of information during image formation. Current solutions require precise 3D labels which are labor-intensive to gather, and infeasible to create for many systems such as deformable… ▽ More

    Submitted 6 April, 2021; originally announced April 2021.

    Comments: ICLR 2021. Project page (and a dynamic web version of the article): https://gradsim.github.io

  9. arXiv:2104.00563  [pdf, other

    cs.RO cs.AI cs.CV cs.LG cs.MA

    Latent Variable Sequential Set Transformers For Joint Multi-Agent Motion Prediction

    Authors: Roger Girgis, Florian Golemo, Felipe Codevilla, Martin Weiss, Jim Aldon D'Souza, Samira Ebrahimi Kahou, Felix Heide, Christopher Pal

    Abstract: Robust multi-agent trajectory prediction is essential for the safe control of robotic systems. A major challenge is to efficiently learn a representation that approximates the true joint distribution of contextual, social, and temporal information to enable planning. We propose Latent Variable Sequential Set Transformers which are encoder-decoder architectures that generate scene-consistent multi-… ▽ More

    Submitted 10 February, 2022; v1 submitted 19 February, 2021; originally announced April 2021.

    Comments: 26 pages, 17 figures, 8 tables

  10. arXiv:2104.00442  [pdf, other

    cs.LG cs.AI cs.CV cs.RO

    Touch-based Curiosity for Sparse-Reward Tasks

    Authors: Sai Rajeswar, Cyril Ibrahim, Nitin Surya, Florian Golemo, David Vazquez, Aaron Courville, Pedro O. Pinheiro

    Abstract: Robots in many real-world settings have access to force/torque sensors in their gripper and tactile sensing is often necessary in tasks that involve contact-rich motion. In this work, we leverage surprise from mismatches in touch feedback to guide exploration in hard sparse-reward reinforcement learning tasks. Our approach, Touch-based Curiosity (ToC), learns what visible objects interactions are… ▽ More

    Submitted 26 June, 2021; v1 submitted 1 April, 2021; originally announced April 2021.

  11. arXiv:2012.03806  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Perspectives on Sim2Real Transfer for Robotics: A Summary of the R:SS 2020 Workshop

    Authors: Sebastian Höfer, Kostas Bekris, Ankur Handa, Juan Camilo Gamboa, Florian Golemo, Melissa Mozifian, Chris Atkeson, Dieter Fox, Ken Goldberg, John Leonard, C. Karen Liu, Jan Peters, Shuran Song, Peter Welinder, Martha White

    Abstract: This report presents the debates, posters, and discussions of the Sim2Real workshop held in conjunction with the 2020 edition of the "Robotics: Science and System" conference. Twelve leaders of the field took competing debate positions on the definition, viability, and importance of transferring skills from simulation to the real world in the context of robotics problems. The debaters also joined… ▽ More

    Submitted 7 December, 2020; originally announced December 2020.

    Comments: Summary of the "2nd Workshop on Closing the Reality Gap in Sim2Real Transfer for Robotics" held in conjunction with "Robotics: Science and System 2020". Website: https://sim2real.github.io/

  12. arXiv:2011.05499  [pdf, other

    cs.CV

    Unsupervised Learning of Dense Visual Representations

    Authors: Pedro O. Pinheiro, Amjad Almahairi, Ryan Y. Benmalek, Florian Golemo, Aaron Courville

    Abstract: Contrastive self-supervised learning has emerged as a promising approach to unsupervised visual representation learning. In general, these methods learn global (image-level) representations that are invariant to different views (i.e., compositions of data augmentation) of the same image. However, many visual understanding tasks require dense (pixel-level) representations. In this paper, we propose… ▽ More

    Submitted 7 December, 2020; v1 submitted 10 November, 2020; originally announced November 2020.

  13. arXiv:2003.14166  [pdf, other

    cs.CV cs.LG stat.ML

    Pix2Shape: Towards Unsupervised Learning of 3D Scenes from Images using a View-based Representation

    Authors: Sai Rajeswar, Fahim Mannan, Florian Golemo, Jérôme Parent-Lévesque, David Vazquez, Derek Nowrouzezahrai, Aaron Courville

    Abstract: We infer and generate three-dimensional (3D) scene information from a single input image and without supervision. This problem is under-explored, with most prior work relying on supervision from, e.g., 3D ground-truth, multiple images of a scene, image silhouettes or key-points. We propose Pix2Shape, an approach to solve this problem with four components: (i) an encoder that infers the latent 3D r… ▽ More

    Submitted 17 April, 2020; v1 submitted 22 March, 2020; originally announced March 2020.

    Comments: This is a pre-print of an article published in International Journal of Computer Vision. The final authenticated version is available online at: https://doi.org/10.1007/s11263-020-01322-1

    Journal ref: International Journal of Computer Vision, (2020), 1-16

  14. arXiv:2002.07911  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Generating Automatic Curricula via Self-Supervised Active Domain Randomization

    Authors: Sharath Chandra Raparthy, Bhairav Mehta, Florian Golemo, Liam Paull

    Abstract: Goal-directed Reinforcement Learning (RL) traditionally considers an agent interacting with an environment, prescribing a real-valued reward to an agent proportional to the completion of some goal. Goal-directed RL has seen large gains in sample efficiency, due to the ease of reusing or generating new experience by proposing goals. One approach,self-play, allows an agent to "play" against itself b… ▽ More

    Submitted 26 October, 2020; v1 submitted 18 February, 2020; originally announced February 2020.

  15. arXiv:1911.03594  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Robo-PlaNet: Learning to Poke in a Day

    Authors: Maxime Chevalier-Boisvert, Guillaume Alain, Florian Golemo, Derek Nowrouzezahrai

    Abstract: Recently, the Deep Planning Network (PlaNet) approach was introduced as a model-based reinforcement learning method that learns environment dynamics directly from pixel observations. This architecture is useful for learning tasks in which either the agent does not have access to meaningful states (like position/velocity of robotic joints) or where the observed states significantly deviate from the… ▽ More

    Submitted 19 November, 2019; v1 submitted 8 November, 2019; originally announced November 2019.

    Comments: 4 pages, 3 figures. Version 2: added reference and acknowledgement

  16. arXiv:1910.13249  [pdf, other

    cs.CV cs.HC cs.LG

    Navigation Agents for the Visually Impaired: A Sidewalk Simulator and Experiments

    Authors: Martin Weiss, Simon Chamorro, Roger Girgis, Margaux Luck, Samira E. Kahou, Joseph P. Cohen, Derek Nowrouzezahrai, Doina Precup, Florian Golemo, Chris Pal

    Abstract: Millions of blind and visually-impaired (BVI) people navigate urban environments every day, using smartphones for high-level path-planning and white canes or guide dogs for local information. However, many BVI people still struggle to travel to new places. In our endeavor to create a navigation assistant for the BVI, we found that existing Reinforcement Learning (RL) environments were unsuitable f… ▽ More

    Submitted 29 October, 2019; originally announced October 2019.

    Comments: Accepted at CoRL2019. Code & video available at https://mweiss17.github.io/SEVN/

  17. arXiv:1904.04762  [pdf, other

    cs.LG cs.AI cs.RO

    Active Domain Randomization

    Authors: Bhairav Mehta, Manfred Diaz, Florian Golemo, Christopher J. Pal, Liam Paull

    Abstract: Domain randomization is a popular technique for improving domain transfer, often used in a zero-shot setting when the target domain is unknown or cannot easily be used for training. In this work, we empirically examine the effects of domain randomization on agent generalization. Our experiments show that domain randomization may lead to suboptimal, high-variance policies, which we attribute to the… ▽ More

    Submitted 10 July, 2019; v1 submitted 9 April, 2019; originally announced April 2019.

    Comments: Code available at https://github.com/montrealrobotics/active-domainrand

  18. arXiv:1903.02503  [pdf, other

    cs.RO

    The AI Driving Olympics at NeurIPS 2018

    Authors: Julian Zilly, Jacopo Tani, Breandan Considine, Bhairav Mehta, Andrea F. Daniele, Manfred Diaz, Gianmarco Bernasconi, Claudio Ruch, Jan Hakenberg, Florian Golemo, A. Kirsten Bowser, Matthew R. Walter, Ruslan Hristov, Sunil Mallya, Emilio Frazzoli, Andrea Censi, Liam Paull

    Abstract: Despite recent breakthroughs, the ability of deep learning and reinforcement learning to outperform traditional approaches to control physically embodied robotic agents remains largely unproven. To help bridge this gap, we created the 'AI Driving Olympics' (AI-DO), a competition with the objective of evaluating the state of the art in machine learning and artificial intelligence for mobile robotic… ▽ More

    Submitted 6 March, 2019; originally announced March 2019.

    Comments: Competition, robotics, safety-critical AI, self-driving cars, autonomous mobility on demand, Duckietown

  19. arXiv:1901.07186  [pdf, other

    cs.LG cs.RO stat.ML

    Towards Learning to Imitate from a Single Video Demonstration

    Authors: Glen Berseth, Florian Golemo, Christopher Pal

    Abstract: Agents that can learn to imitate given video observation -- \emph{without direct access to state or action information} are more applicable to learning in the natural world. However, formulating a reinforcement learning (RL) agent that facilitates this goal remains a significant challenge. We approach this challenge using contrastive training to learn a reward function comparing an agent's behavio… ▽ More

    Submitted 12 July, 2023; v1 submitted 22 January, 2019; originally announced January 2019.

    Comments: Published in JMLR. https://jmlr.org/papers/v24/21-1174.html

  20. arXiv:1711.11017  [pdf, other

    cs.AI cs.CL cs.CV cs.RO cs.SD eess.AS

    HoME: a Household Multimodal Environment

    Authors: Simon Brodeur, Ethan Perez, Ankesh Anand, Florian Golemo, Luca Celotti, Florian Strub, Jean Rouat, Hugo Larochelle, Aaron Courville

    Abstract: We introduce HoME: a Household Multimodal Environment for artificial agents to learn from vision, audio, semantics, physics, and interaction with objects and other agents, all within a realistic context. HoME integrates over 45,000 diverse 3D house layouts based on the SUNCG dataset, a scale which may facilitate learning, generalization, and transfer. HoME is an open-source, OpenAI Gym-compatible… ▽ More

    Submitted 29 November, 2017; originally announced November 2017.

    Comments: Presented at NIPS 2017's Visually-Grounded Interaction and Language Workshop