Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 71 results for author: Schaal, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.06536  [pdf, other

    cs.RO cs.LG

    A Comparison of Imitation Learning Algorithms for Bimanual Manipulation

    Authors: Michael Drolet, Simon Stepputtis, Siva Kailas, Ajinkya Jain, Jan Peters, Stefan Schaal, Heni Ben Amor

    Abstract: Amidst the wide popularity of imitation learning algorithms in robotics, their properties regarding hyperparameter sensitivity, ease of training, data efficiency, and performance have not been well-studied in high-precision industry-inspired environments. In this work, we demonstrate the limitations and benefits of prominent imitation learning approaches and analyze their capabilities regarding th… ▽ More

    Submitted 24 August, 2024; v1 submitted 12 August, 2024; originally announced August 2024.

  2. arXiv:2404.06645  [pdf, other

    cs.RO cs.AI

    GenCHiP: Generating Robot Policy Code for High-Precision and Contact-Rich Manipulation Tasks

    Authors: Kaylee Burns, Ajinkya Jain, Keegan Go, Fei Xia, Michael Stark, Stefan Schaal, Karol Hausman

    Abstract: Large Language Models (LLMs) have been successful at generating robot policy code, but so far these results have been limited to high-level tasks that do not require precise movement. It is an open question how well such approaches work for tasks that require reasoning over contact forces and working within tight success tolerances. We find that, with the right action space, LLMs are capable of su… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 14 pages, 12 figures

    ACM Class: I.2.9

  3. arXiv:2403.02709  [pdf, other

    cs.RO

    RT-Sketch: Goal-Conditioned Imitation Learning from Hand-Drawn Sketches

    Authors: Priya Sundaresan, Quan Vuong, Jiayuan Gu, Peng Xu, Ted Xiao, Sean Kirmani, Tianhe Yu, Michael Stark, Ajinkya Jain, Karol Hausman, Dorsa Sadigh, Jeannette Bohg, Stefan Schaal

    Abstract: Natural language and images are commonly used as goal representations in goal-conditioned imitation learning (IL). However, natural language can be ambiguous and images can be over-specified. In this work, we propose hand-drawn sketches as a modality for goal specification in visual imitation learning. Sketches are easy for users to provide on the fly like language, but similar to images they can… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  4. arXiv:2401.16013  [pdf, other

    cs.RO cs.AI

    SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning

    Authors: Jianlan Luo, Zheyuan Hu, Charles Xu, You Liang Tan, Jacob Berg, Archit Sharma, Stefan Schaal, Chelsea Finn, Abhishek Gupta, Sergey Levine

    Abstract: In recent years, significant progress has been made in the field of robotic reinforcement learning (RL), enabling methods that handle complex image observations, train in the real world, and incorporate auxiliary data, such as demonstrations and prior experience. However, despite these advances, robotic RL remains hard to use. It is acknowledged among practitioners that the particular implementati… ▽ More

    Submitted 12 February, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: ICRA 2024

  5. arXiv:2312.09190  [pdf, other

    cs.RO

    Efficient Online Learning of Contact Force Models for Connector Insertion

    Authors: Kevin Tracy, Zachary Manchester, Ajinkya Jain, Keegan Go, Stefan Schaal, Tom Erez, Yuval Tassa

    Abstract: Contact-rich manipulation tasks with stiff frictional elements like connector insertion are difficult to model with rigid-body simulators. In this work, we propose a new approach for modeling these environments by learning a quasi-static contact force model instead of a full simulator. Using a feature vector that contains information about the configuration and control, we find a linear mapping ad… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

  6. arXiv:2310.08864  [pdf, other

    cs.RO

    Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    Authors: Open X-Embodiment Collaboration, Abby O'Neill, Abdul Rehman, Abhinav Gupta, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, Ajinkya Jain, Albert Tung, Alex Bewley, Alex Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Andrey Kolobov, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie , et al. (267 additional authors not shown)

    Abstract: Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning method… ▽ More

    Submitted 1 June, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Project website: https://robotics-transformer-x.github.io

  7. arXiv:2307.08927  [pdf, other

    cs.RO cs.AI

    Multi-Stage Cable Routing through Hierarchical Imitation Learning

    Authors: Jianlan Luo, Charles Xu, Xinyang Geng, Gilbert Feng, Kuan Fang, Liam Tan, Stefan Schaal, Sergey Levine

    Abstract: We study the problem of learning to perform multi-stage robotic manipulation tasks, with applications to cable routing, where the robot must route a cable through a series of clips. This setting presents challenges representative of complex multi-stage robotic manipulation scenarios: handling deformable objects, closing the loop on visual perception, and handling extended behaviors consisting of m… ▽ More

    Submitted 13 January, 2024; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: T-RO 2024

  8. arXiv:2212.00955  [pdf, other

    cs.RO

    Prim-LAfD: A Framework to Learn and Adapt Primitive-Based Skills from Demonstrations for Insertion Tasks

    Authors: Zheng Wu, Wenzhao Lian, Changhao Wang, Mengxi Li, Stefan Schaal, Masayoshi Tomizuka

    Abstract: Learning generalizable insertion skills in a data-efficient manner has long been a challenge in the robot learning community. While the current state-of-the-art methods with reinforcement learning (RL) show promising performance in acquiring manipulation skills, the algorithms are data-hungry and hard to generalize. To overcome the issues, in this paper we present Prim-LAfD, a simple yet effective… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

    Comments: 6 pages, 4 figures

  9. arXiv:2210.00350  [pdf, other

    cs.RO cs.LG

    Zero-Shot Policy Transfer with Disentangled Task Representation of Meta-Reinforcement Learning

    Authors: Zheng Wu, Yichen Xie, Wenzhao Lian, Changhao Wang, Yanjiang Guo, Jianyu Chen, Stefan Schaal, Masayoshi Tomizuka

    Abstract: Humans are capable of abstracting various tasks as different combinations of multiple attributes. This perspective of compositionality is vital for human rapid learning and adaption since previous experiences from related tasks can be combined to generalize across novel compositional settings. In this work, we aim to achieve zero-shot policy generalization of Reinforcement Learning (RL) agents by… ▽ More

    Submitted 1 October, 2022; originally announced October 2022.

    Comments: 7 pages, 9 figures

  10. arXiv:2208.00596  [pdf, other

    cs.RO cs.HC

    A System for Imitation Learning of Contact-Rich Bimanual Manipulation Policies

    Authors: Simon Stepputtis, Maryam Bandari, Stefan Schaal, Heni Ben Amor

    Abstract: In this paper, we discuss a framework for teaching bimanual manipulation tasks by imitation. To this end, we present a system and algorithms for learning compliant and contact-rich robot behavior from human demonstrations. The presented system combines insights from admittance control and machine learning to extract control policies that can (a) recover from and adapt to a variety of disturbances… ▽ More

    Submitted 31 July, 2022; originally announced August 2022.

    Comments: Accepted to the 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2022), Kyoto, Japan

  11. arXiv:2205.08041  [pdf, other

    cs.RO cs.CV eess.IV

    Detection and Physical Interaction with Deformable Linear Objects

    Authors: Azarakhsh Keipour, Mohammadreza Mousaei, Maryam Bandari, Stefan Schaal, Sebastian Scherer

    Abstract: Deformable linear objects (e.g., cables, ropes, and threads) commonly appear in our everyday lives. However, perception of these objects and the study of physical interaction with them is still a growing area. There have already been successful methods to model and track deformable linear objects. However, the number of methods that can automatically extract the initial conditions in non-trivial s… ▽ More

    Submitted 8 April, 2023; v1 submitted 16 May, 2022; originally announced May 2022.

    Comments: Presented at ICRA 2022 2nd Workshop on Representing and Manipulating Deformable Objects (https://deformable-workshop.github.io/icra2022/)

  12. arXiv:2203.02468  [pdf, other

    cs.RO

    Symbolic State Estimation with Predicates for Contact-Rich Manipulation Tasks

    Authors: Toki Migimatsu, Wenzhao Lian, Jeannette Bohg, Stefan Schaal

    Abstract: Manipulation tasks often require a robot to adjust its sensorimotor skills based on the state it finds itself in. Taking peg-in-hole as an example: once the peg is aligned with the hole, the robot should push the peg downwards. While high level execution frameworks such as state machines and behavior trees are commonly used to formalize such decision-making problems, these frameworks require a mec… ▽ More

    Submitted 4 March, 2022; originally announced March 2022.

  13. Efficient Spatial Representation and Routing of Deformable One-Dimensional Objects for Manipulation

    Authors: Azarakhsh Keipour, Maryam Bandari, Stefan Schaal

    Abstract: With the field of rigid-body robotics having matured in the last fifty years, routing, planning, and manipulation of deformable objects have recently emerged as a more untouched research area in many fields ranging from surgical robotics to industrial assembly and construction. Routing approaches for deformable objects which rely on learned implicit spatial representations (e.g., Learning-from-Dem… ▽ More

    Submitted 2 January, 2023; v1 submitted 12 February, 2022; originally announced February 2022.

    Comments: 6 pages

    Journal ref: Proceedings of the 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 211-216

  14. arXiv:2201.12716  [pdf, other

    cs.RO cs.AI cs.CV eess.SY

    You Only Demonstrate Once: Category-Level Manipulation from Single Visual Demonstration

    Authors: Bowen Wen, Wenzhao Lian, Kostas Bekris, Stefan Schaal

    Abstract: Promising results have been achieved recently in category-level manipulation that generalizes across object instances. Nevertheless, it often requires expensive real-world data collection and manual specification of semantic keypoints for each object category and task. Additionally, coarse keypoint predictions and ignoring intermediate action sequences hinder adoption in complex manipulation tasks… ▽ More

    Submitted 6 May, 2022; v1 submitted 29 January, 2022; originally announced January 2022.

    Journal ref: Robotics: Science and Systems (RSS) 2022

  15. Deformable One-Dimensional Object Detection for Routing and Manipulation

    Authors: Azarakhsh Keipour, Maryam Bandari, Stefan Schaal

    Abstract: Many methods exist to model and track deformable one-dimensional objects (e.g., cables, ropes, and threads) across a stream of video frames. However, these methods depend on the existence of some initial conditions. To the best of our knowledge, the topic of detection methods that can extract those initial conditions in non-trivial situations has hardly been addressed. The lack of detection method… ▽ More

    Submitted 18 January, 2022; originally announced January 2022.

    Comments: Accepted to IEEE Robotics and Automation Letters, January 2022. 8 pages

  16. arXiv:2112.00597  [pdf, other

    cs.RO stat.ML

    Wish you were here: Hindsight Goal Selection for long-horizon dexterous manipulation

    Authors: Todor Davchev, Oleg Sushkov, Jean-Baptiste Regli, Stefan Schaal, Yusuf Aytar, Markus Wulfmeier, Jon Scholz

    Abstract: Complex sequential tasks in continuous-control settings often require agents to successfully traverse a set of "narrow passages" in their state space. Solving such tasks with a sparse reward in a sample-efficient manner poses a challenge to modern reinforcement learning (RL) due to the associated long-horizon nature of the problem and the lack of sufficient positive signal during learning. Various… ▽ More

    Submitted 22 March, 2022; v1 submitted 1 December, 2021; originally announced December 2021.

    Journal ref: International Conference on Learning Representations (ICLR 2022)

  17. arXiv:2110.15245  [pdf, ps, other

    cs.RO cs.LG

    From Machine Learning to Robotics: Challenges and Opportunities for Embodied Intelligence

    Authors: Nicholas Roy, Ingmar Posner, Tim Barfoot, Philippe Beaudoin, Yoshua Bengio, Jeannette Bohg, Oliver Brock, Isabelle Depatie, Dieter Fox, Dan Koditschek, Tomas Lozano-Perez, Vikash Mansinghka, Christopher Pal, Blake Richards, Dorsa Sadigh, Stefan Schaal, Gaurav Sukhatme, Denis Therien, Marc Toussaint, Michiel Van de Panne

    Abstract: Machine learning has long since become a keystone technology, accelerating science and applications in a broad range of domains. Consequently, the notion of applying learning methods to a particular problem set has become an established and valuable modus operandi to advance a particular field. In this article we argue that such an approach does not straightforwardly extended to robotics -- or to… ▽ More

    Submitted 28 October, 2021; originally announced October 2021.

  18. arXiv:2110.04276  [pdf, other

    cs.RO

    Offline Meta-Reinforcement Learning for Industrial Insertion

    Authors: Tony Z. Zhao, Jianlan Luo, Oleg Sushkov, Rugile Pevceviciute, Nicolas Heess, Jon Scholz, Stefan Schaal, Sergey Levine

    Abstract: Reinforcement learning (RL) can in principle let robots automatically adapt to new tasks, but current RL methods require a large number of trials to accomplish this. In this paper, we tackle rapid adaptation to new tasks through the framework of meta-learning, which utilizes past tasks to learn to adapt with a specific focus on industrial insertion tasks. Fast adaptation is crucial because prohibi… ▽ More

    Submitted 1 September, 2022; v1 submitted 8 October, 2021; originally announced October 2021.

    Comments: ICRA 2022

  19. arXiv:2109.09163  [pdf, other

    cs.RO cs.AI cs.CV eess.SY

    CaTGrasp: Learning Category-Level Task-Relevant Grasping in Clutter from Simulation

    Authors: Bowen Wen, Wenzhao Lian, Kostas Bekris, Stefan Schaal

    Abstract: Task-relevant grasping is critical for industrial assembly, where downstream manipulation tasks constrain the set of valid grasps. Learning how to perform this task, however, is challenging, since task-relevant grasp labels are hard to define and annotate. There is also yet no consensus on proper representations for modeling or off-the-shelf tools for performing task-relevant grasps. This work pro… ▽ More

    Submitted 25 February, 2022; v1 submitted 19 September, 2021; originally announced September 2021.

    Comments: IEEE International Conference on Robotics and Automation (ICRA) 2022

  20. arXiv:2109.07578  [pdf, other

    cs.LG cs.AI cs.RO eess.SY

    Multi-Task Learning with Sequence-Conditioned Transporter Networks

    Authors: Michael H. Lim, Andy Zeng, Brian Ichter, Maryam Bandari, Erwin Coumans, Claire Tomlin, Stefan Schaal, Aleksandra Faust

    Abstract: Enabling robots to solve multiple manipulation tasks has a wide range of industrial applications. While learning-based approaches enjoy flexibility and generalizability, scaling these approaches to solve such compositional tasks remains a challenge. In this work, we aim to solve multi-task learning through the lens of sequence-conditioning and weighted sampling. First, we propose a new suite of be… ▽ More

    Submitted 15 September, 2021; originally announced September 2021.

  21. arXiv:2104.12042  [pdf, other

    cs.RO

    A Robustness Analysis of Inverse Optimal Control of Bipedal Walking

    Authors: John R. Rebula, Stefan Schaal, James Finley, Ludovic Righetti

    Abstract: Cost functions have the potential to provide compact and understandable generalizations of motion. The goal of Inverse Optimal Control (IOC) is to analyze an observed behavior which is assumed to be optimal with respect to an unknown cost function, and infer this cost function. Here we develop a method for characterizing cost functions of legged locomotion, with the goal of representing complex hu… ▽ More

    Submitted 24 April, 2021; originally announced April 2021.

    Comments: 9 pages, 5 figures

  22. arXiv:2103.11512  [pdf, other

    cs.AI cs.RO

    Robust Multi-Modal Policies for Industrial Assembly via Reinforcement Learning and Demonstrations: A Large-Scale Study

    Authors: Jianlan Luo, Oleg Sushkov, Rugile Pevceviciute, Wenzhao Lian, Chang Su, Mel Vecerik, Ning Ye, Stefan Schaal, Jon Scholz

    Abstract: Over the past several years there has been a considerable research investment into learning-based approaches to industrial assembly, but despite significant progress these techniques have yet to be adopted by industry. We argue that it is the prohibitively large design space for Deep Reinforcement Learning (DRL), rather than algorithmic limitations per se, that are truly responsible for this lack… ▽ More

    Submitted 31 July, 2021; v1 submitted 21 March, 2021; originally announced March 2021.

    Comments: RSS 2021

  23. arXiv:2103.05140  [pdf, other

    cs.RO cs.AI

    Benchmarking Off-The-Shelf Solutions to Robotic Assembly Tasks

    Authors: Wenzhao Lian, Tim Kelch, Dirk Holz, Adam Norton, Stefan Schaal

    Abstract: In recent years, many learning based approaches have been studied to realize robotic manipulation and assembly tasks, often including vision and force/tactile feedback. However, it remains frequently unclear what is the baseline state-of-the-art performance and what are the bottleneck problems. In this work, we evaluate some off-the-shelf (OTS) industrial solutions on a recently introduced benchma… ▽ More

    Submitted 8 March, 2021; originally announced March 2021.

    Comments: 7 pages, 6 figures

  24. arXiv:2011.08458  [pdf, other

    cs.RO

    Learning Dense Rewards for Contact-Rich Manipulation Tasks

    Authors: Zheng Wu, Wenzhao Lian, Vaibhav Unhelkar, Masayoshi Tomizuka, Stefan Schaal

    Abstract: Rewards play a crucial role in reinforcement learning. To arrive at the desired policy, the design of a suitable reward function often requires significant domain expertise as well as trial-and-error. Here, we aim to minimize the effort involved in designing reward functions for contact-rich manipulation tasks. In particular, we provide an approach capable of extracting dense reward functions algo… ▽ More

    Submitted 17 November, 2020; originally announced November 2020.

    Comments: 8 pages, 5 figures

  25. Residual Learning from Demonstration: Adapting DMPs for Contact-rich Manipulation

    Authors: Todor Davchev, Kevin Sebastian Luck, Michael Burke, Franziska Meier, Stefan Schaal, Subramanian Ramamoorthy

    Abstract: Manipulation skills involving contact and friction are inherent to many robotics tasks. Using the class of motor primitives for peg-in-hole like insertions, we study how robots can learn such skills. Dynamic Movement Primitives (DMP) are a popular way of extracting such policies through behaviour cloning (BC) but can struggle in the context of insertion. Policy adaptation strategies such as residu… ▽ More

    Submitted 22 March, 2022; v1 submitted 17 August, 2020; originally announced August 2020.

    Journal ref: IEEE Robotics and Automation Letters 7 (2), 4488-4495, 2022

  26. arXiv:2007.04842  [pdf, other

    cs.RO cs.CG

    An Interior Point Method Solving Motion Planning Problems with Narrow Passages

    Authors: Jim Mainprice, Nathan Ratliff, Marc Toussaint, Stefan Schaal

    Abstract: Algorithmic solutions for the motion planning problem have been investigated for five decades. Since the development of A* in 1969 many approaches have been investigated, traditionally classified as either grid decomposition, potential fields or sampling-based. In this work, we focus on using numerical optimization, which is understudied for solving motion planning problems. This lack of interest… ▽ More

    Submitted 24 July, 2020; v1 submitted 9 July, 2020; originally announced July 2020.

    Comments: IEEE RO-MAN 2020, 6 pages

  27. Supervised Learning and Reinforcement Learning of Feedback Models for Reactive Behaviors: Tactile Feedback Testbed

    Authors: Giovanni Sutanto, Katharina Rombach, Yevgen Chebotar, Zhe Su, Stefan Schaal, Gaurav S. Sukhatme, Franziska Meier

    Abstract: Robots need to be able to adapt to unexpected changes in the environment such that they can autonomously succeed in their tasks. However, hand-designing feedback models for adaptation is tedious, if at all possible, making data-driven methods a promising alternative. In this paper we introduce a full framework for learning feedback models for reactive motion planning. Our pipeline starts by segmen… ▽ More

    Submitted 2 December, 2022; v1 submitted 29 June, 2020; originally announced July 2020.

    Comments: Accepted for publication in the International Journal of Robotics Research (IJRR). Paper length is 22 pages (including references) with 12 figures. A video overview of the reinforcement learning experiment on the real robot can be seen at https://www.youtube.com/watch?v=yu5v-ZXo4-E. arXiv admin note: text overlap with arXiv:1710.08555

  28. arXiv:2004.12926  [pdf

    cs.CY cs.AI q-bio.NC

    A New Age of Computing and the Brain

    Authors: Polina Golland, Jack Gallant, Greg Hager, Hanspeter Pfister, Christos Papadimitriou, Stefan Schaal, Joshua T. Vogelstein

    Abstract: The history of computer science and brain sciences are intertwined. In his unfinished manuscript "The Computer and the Brain," von Neumann debates whether or not the brain can be thought of as a computing machine and identifies some of the similarities and differences between natural and artificial computation. Turing, in his 1950 article in Mind, argues that computing devices could ultimately emu… ▽ More

    Submitted 27 April, 2020; originally announced April 2020.

    Comments: A Computing Community Consortium (CCC) workshop report, 24 pages

    Report number: ccc2014report_5

  29. arXiv:1810.02422  [pdf, other

    cs.RO cs.AI cs.LG stat.ML

    Simulator Predictive Control: Using Learned Task Representations and MPC for Zero-Shot Generalization and Sequencing

    Authors: Zhanpeng He, Ryan Julian, Eric Heiden, Hejia Zhang, Stefan Schaal, Joseph J. Lim, Gaurav Sukhatme, Karol Hausman

    Abstract: Simulation-to-real transfer is an important strategy for making reinforcement learning practical with real robots. Successful sim-to-real transfer systems have difficulty producing policies which generalize across tasks, despite training for thousands of hours equivalent real robot time. To address this shortcoming, we present a novel approach to efficiently learning new robotic skills directly on… ▽ More

    Submitted 27 January, 2021; v1 submitted 4 October, 2018; originally announced October 2018.

    Comments: Presented at NeurIPS 2018 Workshop: Deep Reinforcement Learning. See https://youtu.be/te4JWe7LPKw for supplemental video

  30. arXiv:1809.10253  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Scaling simulation-to-real transfer by learning composable robot skills

    Authors: Ryan Julian, Eric Heiden, Zhanpeng He, Hejia Zhang, Stefan Schaal, Joseph J. Lim, Gaurav Sukhatme, Karol Hausman

    Abstract: We present a novel solution to the problem of simulation-to-real transfer, which builds on recent advances in robot skill decomposition. Rather than focusing on minimizing the simulation-reality gap, we learn a set of diverse policies that are parameterized in a way that makes them easily reusable. This diversity and parameterization of low-level skills allows us to find a transferable policy that… ▽ More

    Submitted 13 November, 2018; v1 submitted 26 September, 2018; originally announced September 2018.

    Comments: Presented at ISER 2018. See https://www.youtube.com/watch?v=Syr2RQTHqTs for supplemental video

  31. Learning Task-Specific Dynamics to Improve Whole-Body Control

    Authors: Andrej Gams, Sean A. Mason, Aleš Ude, Stefan Schaal, Ludovic Righetti

    Abstract: In task-based inverse dynamics control, reference accelerations used to follow a desired plan can be broken down into feedforward and feedback trajectories. The feedback term accounts for tracking errors that are caused from inaccurate dynamic models or external disturbances. On underactuated, free-floating robots, such as humanoids, good tracking accuracy often necessitates high feedback gains, w… ▽ More

    Submitted 29 June, 2021; v1 submitted 5 March, 2018; originally announced March 2018.

    Comments: this version was uploaded to fulfill the open access requirements of the EU commission

  32. An MPC Walking Framework With External Contact Forces

    Authors: Sean Mason, Nicholas Rotella, Stefan Schaal, Ludovic Righetti

    Abstract: In this work, we present an extension to a linear Model Predictive Control (MPC) scheme that plans external contact forces for the robot when given multiple contact locations and their corresponding friction cone. To this end, we set up a two-step optimization problem. In the first optimization, we compute the Center of Mass (CoM) trajectory, foot step locations, and introduce slack variables to a… ▽ More

    Submitted 27 February, 2018; v1 submitted 26 December, 2017; originally announced December 2017.

  33. Learning Sensor Feedback Models from Demonstrations via Phase-Modulated Neural Networks

    Authors: Giovanni Sutanto, Zhe Su, Stefan Schaal, Franziska Meier

    Abstract: In order to robustly execute a task under environmental uncertainty, a robot needs to be able to reactively adapt to changes arising in its environment. The environment changes are usually reflected in deviation from expected sensory traces. These deviations in sensory traces can be used to drive the motion adaptation, and for this purpose, a feedback model is required. The feedback model maps the… ▽ More

    Submitted 15 March, 2018; v1 submitted 23 October, 2017; originally announced October 2017.

    Comments: 8 pages, accepted to be published at the International Conference on Robotics and Automation (ICRA) 2018

  34. Combining Learned and Analytical Models for Predicting Action Effects from Sensory Data

    Authors: Alina Kloss, Stefan Schaal, Jeannette Bohg

    Abstract: One of the most basic skills a robot should possess is predicting the effect of physical interactions with objects in the environment. This enables optimal action selection to reach a certain goal state. Traditionally, dynamics are approximated by physics-based analytical models. These models rely on specific state representations that may be hard to obtain from raw sensory data, especially if no… ▽ More

    Submitted 12 October, 2020; v1 submitted 11 October, 2017; originally announced October 2017.

    Comments: The International Journal of Robotics Research (2020)

  35. arXiv:1710.02513  [pdf, other

    cs.RO

    A New Data Source for Inverse Dynamics Learning

    Authors: Daniel Kappler, Franziska Meier, Nathan Ratliff, Stefan Schaal

    Abstract: Modern robotics is gravitating toward increasingly collaborative human robot interaction. Tools such as acceleration policies can naturally support the realization of reactive, adaptive, and compliant robots. These tools require us to model the system dynamics accurately -- a difficult task. The fundamental problem remains that simulation and reality diverge--we do not know how to accurately chang… ▽ More

    Submitted 6 October, 2017; originally announced October 2017.

    Comments: IROS 2017

  36. arXiv:1709.09265  [pdf, other

    cs.RO

    On Time Optimization of Centroidal Momentum Dynamics

    Authors: Brahayam Ponton, Alexander Herzog, Andrea Del Prete, Stefan Schaal, Ludovic Righetti

    Abstract: Recently, the centroidal momentum dynamics has received substantial attention to plan dynamically consistent motions for robots with arms and legs in multi-contact scenarios. However, it is also non convex which renders any optimization approach difficult and timing is usually kept fixed in most trajectory optimization techniques to not introduce additional non convexities to the problem. But this… ▽ More

    Submitted 25 February, 2018; v1 submitted 26 September, 2017; originally announced September 2017.

    Comments: 7 pages, 4 figures, ICRA 2018

  37. arXiv:1709.07472  [pdf, ps, other

    cs.RO

    Unsupervised Contact Learning for Humanoid Estimation and Control

    Authors: Nicholas Rotella, Stefan Schaal, Ludovic Righetti

    Abstract: This work presents a method for contact state estimation using fuzzy clustering to learn contact probability for full, six-dimensional humanoid contacts. The data required for training is solely from proprioceptive sensors - endeffector contact wrench sensors and inertial measurement units (IMUs) - and the method is completely unsupervised. The resulting cluster means are used to efficiently compu… ▽ More

    Submitted 21 September, 2017; originally announced September 2017.

    Comments: Submitted to the IEEE International Conference on Robotics and Automation (ICRA) 2018

  38. arXiv:1709.07089  [pdf, other

    eess.SY cs.LG stat.ML

    On the Design of LQR Kernels for Efficient Controller Learning

    Authors: Alonso Marco, Philipp Hennig, Stefan Schaal, Sebastian Trimpe

    Abstract: Finding optimal feedback controllers for nonlinear dynamic systems from data is hard. Recently, Bayesian optimization (BO) has been proposed as a powerful framework for direct controller tuning from experimental trials. For selecting the next query point and finding the global optimum, BO relies on a probabilistic description of the latent objective function, typically a Gaussian process (GP). As… ▽ More

    Submitted 20 September, 2017; originally announced September 2017.

    Comments: 8 pages, 5 figures, to appear in 56th IEEE Conference on Decision and Control (CDC 2017)

  39. arXiv:1709.06709  [pdf, other

    cs.LG

    Online Learning of a Memory for Learning Rates

    Authors: Franziska Meier, Daniel Kappler, Stefan Schaal

    Abstract: The promise of learning to learn for robotics rests on the hope that by extracting some information about the learning process itself we can speed up subsequent similar learning tasks. Here, we introduce a computationally efficient online meta-learning algorithm that builds and optimizes a memory model of the optimal learning rate landscape from previously observed gradient behaviors. While perfor… ▽ More

    Submitted 23 March, 2018; v1 submitted 19 September, 2017; originally announced September 2017.

    Comments: accepted to ICRA 2018, code available: https://github.com/fmeier/online-meta-learning ; video pitch available: https://youtu.be/9PzQ25FPPOM

  40. arXiv:1705.10479  [pdf, other

    cs.RO cs.LG

    Multi-Modal Imitation Learning from Unstructured Demonstrations using Generative Adversarial Nets

    Authors: Karol Hausman, Yevgen Chebotar, Stefan Schaal, Gaurav Sukhatme, Joseph Lim

    Abstract: Imitation learning has traditionally been applied to learn a single task from demonstrations thereof. The requirement of structured and isolated demonstrations limits the scalability of imitation learning approaches as they are difficult to apply to real-world scenarios, where robots have to be able to execute a multitude of tasks. In this paper, we propose a multi-modal imitation learning framewo… ▽ More

    Submitted 23 November, 2017; v1 submitted 30 May, 2017; originally announced May 2017.

    Comments: Paper accepted to NIPS 2017

  41. arXiv:1704.06888  [pdf, other

    cs.CV cs.RO

    Time-Contrastive Networks: Self-Supervised Learning from Video

    Authors: Pierre Sermanet, Corey Lynch, Yevgen Chebotar, Jasmine Hsu, Eric Jang, Stefan Schaal, Sergey Levine

    Abstract: We propose a self-supervised approach for learning representations and robotic behaviors entirely from unlabeled videos recorded from multiple viewpoints, and study how this representation can be used in two robotic imitation settings: imitating object interactions from videos of humans, and imitating human poses. Imitation of human behavior requires a viewpoint-invariant representation that captu… ▽ More

    Submitted 19 March, 2018; v1 submitted 23 April, 2017; originally announced April 2017.

  42. arXiv:1703.03512  [pdf, other

    cs.RO

    Real-time Perception meets Reactive Motion Generation

    Authors: Daniel Kappler, Franziska Meier, Jan Issac, Jim Mainprice, Cristina Garcia Cifuentes, Manuel Wüthrich, Vincent Berenz, Stefan Schaal, Nathan Ratliff, Jeannette Bohg

    Abstract: We address the challenging problem of robotic grasping and manipulation in the presence of uncertainty. This uncertainty is due to noisy sensing, inaccurate models and hard-to-predict environment dynamics. We quantify the importance of continuous, real-time perception and its tight integration with reactive motion generation methods in dynamic manipulation scenarios. We compare three different sys… ▽ More

    Submitted 6 October, 2017; v1 submitted 9 March, 2017; originally announced March 2017.

  43. arXiv:1703.03078  [pdf, other

    cs.RO

    Combining Model-Based and Model-Free Updates for Trajectory-Centric Reinforcement Learning

    Authors: Yevgen Chebotar, Karol Hausman, Marvin Zhang, Gaurav Sukhatme, Stefan Schaal, Sergey Levine

    Abstract: Reinforcement learning (RL) algorithms for real-world robotic applications need a data-efficient learning process and the ability to handle complex, unknown dynamical systems. These requirements are handled well by model-based and model-free RL approaches, respectively. In this work, we aim to combine the advantages of these two types of methods in a principled manner. By focusing on time-varying… ▽ More

    Submitted 18 June, 2017; v1 submitted 8 March, 2017; originally announced March 2017.

    Comments: Paper accepted to the International Conference on Machine Learning (ICML) 2017

  44. arXiv:1703.02899  [pdf, other

    cs.LG cs.RO eess.SY stat.ML

    Model-Based Policy Search for Automatic Tuning of Multivariate PID Controllers

    Authors: Andreas Doerr, Duy Nguyen-Tuong, Alonso Marco, Stefan Schaal, Sebastian Trimpe

    Abstract: PID control architectures are widely used in industrial applications. Despite their low number of open parameters, tuning multiple, coupled PID controllers can become tedious in practice. In this paper, we extend PILCO, a model-based policy search framework, to automatically tune multivariate PID controllers purely based on data observed on an otherwise unknown system. The system's state is extend… ▽ More

    Submitted 8 March, 2017; originally announced March 2017.

    Comments: Accepted final version to appear in 2017 IEEE International Conference on Robotics and Automation (ICRA)

  45. arXiv:1703.01250  [pdf, other

    cs.RO cs.LG eess.SY

    Virtual vs. Real: Trading Off Simulations and Physical Experiments in Reinforcement Learning with Bayesian Optimization

    Authors: Alonso Marco, Felix Berkenkamp, Philipp Hennig, Angela P. Schoellig, Andreas Krause, Stefan Schaal, Sebastian Trimpe

    Abstract: In practice, the parameters of control policies are often tuned manually. This is time-consuming and frustrating. Reinforcement learning is a promising alternative that aims to automate this process, yet often requires too many experiments to be practical. In this paper, we propose a solution to this problem by exploiting prior knowledge from simulations, which are readily available for most robot… ▽ More

    Submitted 3 March, 2017; originally announced March 2017.

    Comments: 7 pages, 6 figures, to appear in IEEE 2017 International Conference on Robotics and Automation (ICRA)

  46. Balancing and Walking Using Full Dynamics LQR Control With Contact Constraints

    Authors: Sean Mason, Nicholas Rotella, Stefan Schaal, Ludovic Righetti

    Abstract: Torque control algorithms which consider robot dynamics and contact constraints are important for creating dynamic behaviors for humanoids. As computational power increases, algorithms tend to also increase in complexity. However, it is not clear how much complexity is really required to create controllers which exhibit good performance. In this paper, we study the capabilities of a simple approac… ▽ More

    Submitted 27 January, 2017; originally announced January 2017.

  47. arXiv:1612.05932  [pdf, other

    cs.RO

    A Probabilistic Representation for Dynamic Movement Primitives

    Authors: Franziska Meier, Stefan Schaal

    Abstract: Dynamic Movement Primitives have successfully been used to realize imitation learning, trial-and-error learning, reinforce- ment learning, movement recognition and segmentation and control. Because of this they have become a popular represen- tation for motor primitives. In this work, we showcase how DMPs can be reformulated as a probabilistic linear dynamical system with control inputs. Through t… ▽ More

    Submitted 18 December, 2016; originally announced December 2016.

  48. arXiv:1610.04871  [pdf, other

    cs.RO

    Probabilistic Articulated Real-Time Tracking for Robot Manipulation

    Authors: Cristina Garcia Cifuentes, Jan Issac, Manuel Wüthrich, Stefan Schaal, Jeannette Bohg

    Abstract: We propose a probabilistic filtering method which fuses joint measurements with depth images to yield a precise, real-time estimate of the end-effector pose in the camera frame. This avoids the need for frame transformations when using it in combination with visual object tracking methods. Precision is achieved by modeling and correcting biases in the joint measurements as well as inaccuracies i… ▽ More

    Submitted 25 November, 2016; v1 submitted 16 October, 2016; originally announced October 2016.

    Comments: 8 pages, 7 figures. Revision submitted to IEEE Robotics and Automation Letters (RA-L). Fixed wrong order of bars in boxplots; further argumentation

  49. Learning Feedback Terms for Reactive Planning and Control

    Authors: Akshara Rai, Giovanni Sutanto, Stefan Schaal, Franziska Meier

    Abstract: With the advancement of robotics, machine learning, and machine perception, increasingly more robots will enter human environments to assist with daily tasks. However, dynamically-changing human environments requires reactive motion plans. Reactivity can be accomplished through replanning, e.g. model-predictive control, or through a reactive feedback policy that modifies on-going behavior in respo… ▽ More

    Submitted 3 March, 2017; v1 submitted 11 October, 2016; originally announced October 2016.

    Comments: 8 pages, accepted to be published at ICRA 2017 conference

  50. arXiv:1610.00529  [pdf, other

    cs.RO cs.LG

    Path Integral Guided Policy Search

    Authors: Yevgen Chebotar, Mrinal Kalakrishnan, Ali Yahya, Adrian Li, Stefan Schaal, Sergey Levine

    Abstract: We present a policy search method for learning complex feedback control policies that map from high-dimensional sensory inputs to motor torques, for manipulation tasks with discontinuous contact dynamics. We build on a prior technique called guided policy search (GPS), which iteratively optimizes a set of local policies for specific instances of a task, and uses these to train a complex, high-dime… ▽ More

    Submitted 11 October, 2018; v1 submitted 3 October, 2016; originally announced October 2016.

    Comments: Published at the International Conference on Robotics and Automation (ICRA), 2017