Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 59 results for author: Tomlin, C J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.05279  [pdf, other

    cs.LG

    Safety Filters for Black-Box Dynamical Systems by Learning Discriminating Hyperplanes

    Authors: Will Lavanakul, Jason J. Choi, Koushil Sreenath, Claire J. Tomlin

    Abstract: Learning-based approaches are emerging as an effective approach for safety filters for black-box dynamical systems. Existing methods have relied on certificate functions like Control Barrier Functions (CBFs) and Hamilton-Jacobi (HJ) reachability value functions. The primary motivation for our work is the recognition that ultimately, enforcing the safety constraint as a control input constraint at… ▽ More

    Submitted 21 May, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: * Indicate co-first authors. This is an extended version of the paper presented at L4DC 2024

  2. arXiv:2311.13824  [pdf, other

    cs.RO eess.SY

    Constraint-Guided Online Data Selection for Scalable Data-Driven Safety Filters in Uncertain Robotic Systems

    Authors: Jason J. Choi, Fernando Castañeda, Wonsuhk Jung, Bike Zhang, Claire J. Tomlin, Koushil Sreenath

    Abstract: As the use of autonomous robotic systems expands in tasks that are complex and challenging to model, the demand for robust data-driven control methods that can certify safety and stability in uncertain conditions is increasing. However, the practical implementation of these methods often faces scalability issues due to the growing amount of data points with system complexity, and a significant rel… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: The first three authors contributed equally to the work. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  3. arXiv:2309.06655  [pdf, other

    cs.RO cs.LG

    Out of Distribution Detection via Domain-Informed Gaussian Process State Space Models

    Authors: Alonso Marco, Elias Morley, Claire J. Tomlin

    Abstract: In order for robots to safely navigate in unseen scenarios using learning-based methods, it is important to accurately detect out-of-training-distribution (OoD) situations online. Recently, Gaussian process state-space models (GPSSMs) have proven useful to discriminate unexpected observations by comparing them against probabilistic predictions. However, the capability for the model to correctly di… ▽ More

    Submitted 15 September, 2023; v1 submitted 12 September, 2023; originally announced September 2023.

    Comments: 7 pages, 4 figures

  4. arXiv:2307.01917  [pdf, other

    eess.SY cs.AI cs.RO

    Stranding Risk for Underactuated Vessels in Complex Ocean Currents: Analysis and Controllers

    Authors: Andreas Doering, Marius Wiggert, Hanna Krasowski, Manan Doshi, Pierre F. J. Lermusiaux, Claire J. Tomlin

    Abstract: Low-propulsion vessels can take advantage of powerful ocean currents to navigate towards a destination. Recent results demonstrated that vessels can reach their destination with high probability despite forecast errors. However, these results do not consider the critical aspect of safety of such vessels: because of their low propulsion which is much smaller than the magnitude of currents, they mig… ▽ More

    Submitted 4 July, 2023; originally announced July 2023.

    Comments: 6 pages, 3 figures, submitted to 2023 IEEE 62th Annual Conference on Decision and Control (CDC) Andreas Doering and Marius Wiggert contributed equally to this work

  5. arXiv:2307.01916  [pdf, other

    eess.SY cs.AI cs.RO

    Maximizing Seaweed Growth on Autonomous Farms: A Dynamic Programming Approach for Underactuated Systems Navigating on Uncertain Ocean Currents

    Authors: Matthias Killer, Marius Wiggert, Hanna Krasowski, Manan Doshi, Pierre F. J. Lermusiaux, Claire J. Tomlin

    Abstract: Seaweed biomass offers significant potential for climate mitigation, but large-scale, autonomous open-ocean farms are required to fully exploit it. Such farms typically have low propulsion and are heavily influenced by ocean currents. We want to design a controller that maximizes seaweed growth over months by taking advantage of the non-linear time-varying ocean currents for reaching high-growth r… ▽ More

    Submitted 29 August, 2023; v1 submitted 4 July, 2023; originally announced July 2023.

    Comments: 8 pages, submitted to 2023 IEEE 62th Annual Conference on Decision and Control (CDC) Matthias Killer and Marius Wiggert contributed equally to this work

  6. arXiv:2302.01999  [pdf, other

    cs.RO

    Online and Offline Learning of Player Objectives from Partial Observations in Dynamic Games

    Authors: Lasse Peters, Vicenç Rubies-Royo, Claire J. Tomlin, Laura Ferranti, Javier Alonso-Mora, Cyrill Stachniss, David Fridovich-Keil

    Abstract: Robots deployed to the real world must be able to interact with other agents in their environment. Dynamic game theory provides a powerful mathematical framework for modeling scenarios in which agents have individual objectives and interactions evolve over time. However, a key limitation of such techniques is that they require a-priori knowledge of all players' objectives. In this work, we address… ▽ More

    Submitted 14 May, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: arXiv admin note: text overlap with arXiv:2106.03611

  7. arXiv:2210.05015  [pdf, other

    cs.AI cs.RO eess.SY stat.ML

    Optimality Guarantees for Particle Belief Approximation of POMDPs

    Authors: Michael H. Lim, Tyler J. Becker, Mykel J. Kochenderfer, Claire J. Tomlin, Zachary N. Sunberg

    Abstract: Partially observable Markov decision processes (POMDPs) provide a flexible representation for real-world decision and control problems. However, POMDPs are notoriously difficult to solve, especially when the state and observation spaces are continuous or hybrid, which is often the case for physical systems. While recent online sampling-based POMDP algorithms that plan with observation likelihood w… ▽ More

    Submitted 19 October, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

    Journal ref: Journal of Artificial Intelligence Research, 77, 1591-1636 (2023)

  8. arXiv:2208.10733  [pdf, other

    eess.SY cs.LG math.OC

    Recursively Feasible Probabilistic Safe Online Learning with Control Barrier Functions

    Authors: Fernando Castañeda, Jason J. Choi, Wonsuhk Jung, Bike Zhang, Claire J. Tomlin, Koushil Sreenath

    Abstract: Learning-based control has recently shown great efficacy in performing complex tasks for various applications. However, to deploy it in real systems, it is of vital importance to guarantee the system will stay safe. Control Barrier Functions (CBFs) offer mathematical tools for designing safety-preserving controllers for systems with known dynamics. In this article, we first introduce a model-uncer… ▽ More

    Submitted 3 September, 2024; v1 submitted 23 August, 2022; originally announced August 2022.

    Comments: Journal article. Includes the results of the 2021 CDC paper titled "Pointwise feasibility of gaussian process-based safety-critical control under model uncertainty" and proposes a recursively feasible safe online learning algorithm as new contribution

  9. arXiv:2203.10142  [pdf, other

    eess.SY cs.AI cs.LG math.OC

    Infinite-Horizon Reach-Avoid Zero-Sum Games via Deep Reinforcement Learning

    Authors: Jingqi Li, Donggun Lee, Somayeh Sojoudi, Claire J. Tomlin

    Abstract: In this paper, we consider the infinite-horizon reach-avoid zero-sum game problem, where the goal is to find a set in the state space, referred to as the reach-avoid set, such that the system starting at a state therein could be controlled to reach a given target set without violating constraints under the worst-case disturbance. We address this problem by designing a new value function with a con… ▽ More

    Submitted 24 August, 2024; v1 submitted 18 March, 2022; originally announced March 2022.

  10. arXiv:2201.08538  [pdf, other

    cs.RO eess.SY

    Computation of Regions of Attraction for Hybrid Limit Cycles Using Reachability: An Application to Walking Robots

    Authors: Jason J. Choi, Ayush Agrawal, Koushil Sreenath, Claire J. Tomlin, Somil Bansal

    Abstract: Contact-rich robotic systems, such as legged robots and manipulators, are often represented as hybrid systems. However, the stability analysis and region-of-attraction computation for these systems are often challenging because of the discontinuous state changes upon contact (also referred to as state resets). In this work, we cast the computation of region-ofattraction as a Hamilton-Jacobi (HJ) r… ▽ More

    Submitted 8 February, 2022; v1 submitted 20 January, 2022; originally announced January 2022.

    Comments: Accepted to IEEE RA-L & ICRA, 2022

  11. arXiv:2112.12288  [pdf, other

    cs.LG cs.RO eess.SY

    Safety and Liveness Guarantees through Reach-Avoid Reinforcement Learning

    Authors: Kai-Chieh Hsu, Vicenç Rubies-Royo, Claire J. Tomlin, Jaime F. Fisac

    Abstract: Reach-avoid optimal control problems, in which the system must reach certain goal conditions while staying clear of unacceptable failure modes, are central to safety and liveness assurance for autonomous robotic systems, but their exact solutions are intractable for complex dynamics and environments. Recent successes in reinforcement learning methods to approximately solve optimal control problems… ▽ More

    Submitted 22 December, 2021; originally announced December 2021.

    Comments: Accepted in Robotics: Science and Systems (RSS), 2021

  12. arXiv:2112.09456  [pdf, other

    cs.AI cs.LG cs.RO eess.SY

    Compositional Learning-based Planning for Vision POMDPs

    Authors: Sampada Deglurkar, Michael H. Lim, Johnathan Tucker, Zachary N. Sunberg, Aleksandra Faust, Claire J. Tomlin

    Abstract: The Partially Observable Markov Decision Process (POMDP) is a powerful framework for capturing decision-making problems that involve state and transition uncertainty. However, most current POMDP planners cannot effectively handle high-dimensional image observations prevalent in real world applications, and often require lengthy online training that requires interaction with the environment. In thi… ▽ More

    Submitted 2 December, 2022; v1 submitted 17 December, 2021; originally announced December 2021.

  13. arXiv:2109.10450  [pdf, other

    eess.SY cs.RO math.DS

    Towards cyber-physical systems robust to communication delays: A differential game approach

    Authors: Shankar A. Deka, Donggun Lee, Claire J. Tomlin

    Abstract: Collaboration between interconnected cyber-physical systems is becoming increasingly pervasive. Time-delays in communication channels between such systems are known to induce catastrophic failure modes, like high frequency oscillations in robotic manipulators in bilateral teleoperation or string instability in platoons of autonomous vehicles. This paper considers nonlinear time-delay systems repre… ▽ More

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: 7 pages, 5 figures, Submitted to IEEE Control Systems Letters

    MSC Class: 34K35; 49L12; 93D21

  14. arXiv:2109.04874  [pdf, other

    cs.RO eess.SY

    Discretizing Dynamics for Maximum Likelihood Constraint Inference

    Authors: Kaylene C. Stocking, David L. McPherson, Robert P. Matthew, Claire J. Tomlin

    Abstract: Maximum likelihood constraint inference is a powerful technique for identifying unmodeled constraints that affect the behavior of a demonstrator acting under a known objective function. However, it was originally formulated only for discrete state-action spaces. Continuous dynamics are more useful for modeling many real-world systems of interest, including the movements of humans and robots. We pr… ▽ More

    Submitted 10 September, 2021; originally announced September 2021.

    Comments: 10 pages, 7 figures

  15. arXiv:2106.07108  [pdf, other

    eess.SY cs.LG math.OC

    Pointwise Feasibility of Gaussian Process-based Safety-Critical Control under Model Uncertainty

    Authors: Fernando Castañeda, Jason J. Choi, Bike Zhang, Claire J. Tomlin, Koushil Sreenath

    Abstract: Control Barrier Functions (CBFs) and Control Lyapunov Functions (CLFs) are popular tools for enforcing safety and stability of a controlled system, respectively. They are commonly utilized to build constraints that can be incorporated in a min-norm quadratic program (CBF-CLF-QP) which solves for a safety-critical control input. However, since these constraints rely on a model of the system, when t… ▽ More

    Submitted 1 October, 2021; v1 submitted 13 June, 2021; originally announced June 2021.

    Comments: The first two authors contributed equally. Accepted for publication in IEEE 60th Conference on Decision and Control (CDC 2021)

  16. arXiv:2106.03611  [pdf, other

    cs.RO cs.MA

    Inferring Objectives in Continuous Dynamic Games from Noise-Corrupted Partial State Observations

    Authors: Lasse Peters, David Fridovich-Keil, Vicenç Rubies-Royo, Claire J. Tomlin, Cyrill Stachniss

    Abstract: Robots and autonomous systems must interact with one another and their environment to provide high-quality services to their users. Dynamic game theory provides an expressive theoretical framework for modeling scenarios involving multiple agents with differing objectives interacting over time. A core challenge when formulating a dynamic game is designing objectives for each agent that capture desi… ▽ More

    Submitted 7 August, 2021; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: Submitted to RSS2021

  17. arXiv:2103.05746  [pdf, other

    cs.RO cs.AI cs.HC eess.SY

    Analyzing Human Models that Adapt Online

    Authors: Andrea Bajcsy, Anand Siththaranjan, Claire J. Tomlin, Anca D. Dragan

    Abstract: Predictive human models often need to adapt their parameters online from human data. This raises previously ignored safety-related questions for robots relying on these models such as what the model could learn online and how quickly could it learn it. For instance, when will the robot have a confident estimate in a nearby human's goal? Or, what parameter initializations guarantee that the robot c… ▽ More

    Submitted 30 September, 2021; v1 submitted 9 March, 2021; originally announced March 2021.

    Comments: ICRA 2021

  18. FaSTrack: a Modular Framework for Real-Time Motion Planning and Guaranteed Safe Tracking

    Authors: Mo Chen, Sylvia L. Herbert, Haimin Hu, Ye Pu, Jaime F. Fisac, Somil Bansal, SooJean Han, Claire J. Tomlin

    Abstract: Real-time, guaranteed safe trajectory planning is vital for navigation in unknown environments. However, real-time navigation algorithms typically sacrifice robustness for computation speed. Alternatively, provably safe trajectory planning tends to be too computationally intensive for real-time replanning. We propose FaSTrack, Fast and Safe Tracking, a framework that achieves both real-time replan… ▽ More

    Submitted 13 March, 2021; v1 submitted 13 February, 2021; originally announced February 2021.

    Comments: Published in the IEEE Transactions on Automatic Control

  19. arXiv:2101.05916  [pdf, other

    cs.RO cs.LG eess.SY

    Scalable Learning of Safety Guarantees for Autonomous Systems using Hamilton-Jacobi Reachability

    Authors: Sylvia Herbert, Jason J. Choi, Suvansh Sanjeev, Marsalis Gibson, Koushil Sreenath, Claire J. Tomlin

    Abstract: Autonomous systems like aircraft and assistive robots often operate in scenarios where guaranteeing safety is critical. Methods like Hamilton-Jacobi reachability can provide guaranteed safe sets and controllers for such systems. However, often these same scenarios have unknown or uncertain environments, system dynamics, or predictions of other agents. As the system is operating, it may learn new k… ▽ More

    Submitted 2 April, 2021; v1 submitted 14 January, 2021; originally announced January 2021.

    Comments: The first two authors are co-first authors. ICRA 2021

  20. arXiv:2012.10140  [pdf, other

    cs.LG cs.AI cs.RO eess.SY

    Voronoi Progressive Widening: Efficient Online Solvers for Continuous State, Action, and Observation POMDPs

    Authors: Michael H. Lim, Claire J. Tomlin, Zachary N. Sunberg

    Abstract: This paper introduces Voronoi Progressive Widening (VPW), a generalization of Voronoi optimistic optimization (VOO) and action progressive widening to partially observable Markov decision processes (POMDPs). Tree search algorithms can use VPW to effectively handle continuous or hybrid action spaces by efficiently balancing local and global action searching. This paper proposes two VPW-based algori… ▽ More

    Submitted 1 April, 2021; v1 submitted 18 December, 2020; originally announced December 2020.

  21. arXiv:2011.07183  [pdf, other

    eess.SY cs.LG math.OC

    Gaussian Process-based Min-norm Stabilizing Controller for Control-Affine Systems with Uncertain Input Effects and Dynamics

    Authors: Fernando Castañeda, Jason J. Choi, Bike Zhang, Claire J. Tomlin, Koushil Sreenath

    Abstract: This paper presents a method to design a min-norm Control Lyapunov Function (CLF)-based stabilizing controller for a control-affine system with uncertain dynamics using Gaussian Process (GP) regression. In order to estimate both state and input-dependent model uncertainty, we propose a novel compound kernel that captures the control-affine nature of the problem. Furthermore, by the use of GP Upper… ▽ More

    Submitted 23 March, 2021; v1 submitted 13 November, 2020; originally announced November 2020.

    Comments: The first two authors contributed equally. To appear at the 2021 American Control Conference (ACC)

  22. arXiv:2011.04815  [pdf, other

    cs.RO eess.SY

    Encoding Defensive Driving as a Dynamic Nash Game

    Authors: Chih-Yuan Chiu, David Fridovich-Keil, Claire J. Tomlin

    Abstract: Robots deployed in real-world environments should operate safely in a robust manner. In scenarios where an "ego" agent navigates in an environment with multiple other "non-ego" agents, two modes of safety are commonly proposed -- adversarial robustness and probabilistic constraint satisfaction. However, while the former is generally computationally intractable and leads to overconservative solutio… ▽ More

    Submitted 30 March, 2021; v1 submitted 9 November, 2020; originally announced November 2020.

    Comments: Accepted to ICRA 2021

  23. arXiv:2011.00601  [pdf, other

    eess.SY cs.GT cs.RO

    Approximate Solutions to a Class of Reachability Games

    Authors: David Fridovich-Keil, Claire J. Tomlin

    Abstract: In this paper, we present a method for finding approximate Nash equilibria in a broad class of reachability games. These games are often used to formulate both collision avoidance and goal satisfaction. Our method is computationally efficient, running in real-time for scenarios involving multiple players and more than ten state dimensions. The proposed approach forms a family of increasingly exact… ▽ More

    Submitted 20 March, 2021; v1 submitted 1 November, 2020; originally announced November 2020.

    Comments: Conference paper at ICRA 2021

  24. arXiv:2009.02874  [pdf

    cs.LG eess.SY math.DS stat.ML

    Dynamically Computing Adversarial Perturbations for Recurrent Neural Networks

    Authors: Shankar A. Deka, Dušan M. Stipanović, Claire J. Tomlin

    Abstract: Convolutional and recurrent neural networks have been widely employed to achieve state-of-the-art performance on classification tasks. However, it has also been noted that these networks can be manipulated adversarially with relative ease, by carefully crafted additive perturbations to the input. Though several experimentally established prior works exist on crafting and defending against attacks,… ▽ More

    Submitted 6 September, 2020; originally announced September 2020.

    Comments: Submitted to IEEE Transactions on Neural Networks and Learning Systems

    MSC Class: 68T07; 93B52; 93C10; 49N90 ACM Class: I.2.8

  25. arXiv:2004.07584  [pdf, other

    eess.SY cs.LG cs.RO

    Reinforcement Learning for Safety-Critical Control under Model Uncertainty, using Control Lyapunov Functions and Control Barrier Functions

    Authors: Jason Choi, Fernando Castañeda, Claire J. Tomlin, Koushil Sreenath

    Abstract: In this paper, the issue of model uncertainty in safety-critical control is addressed with a data-driven approach. For this purpose, we utilize the structure of an input-ouput linearization controller based on a nominal model along with a Control Barrier Function and Control Lyapunov Function based Quadratic Program (CBF-CLF-QP). Specifically, we propose a novel reinforcement learning framework wh… ▽ More

    Submitted 4 June, 2020; v1 submitted 16 April, 2020; originally announced April 2020.

    Comments: The first two authors contributed equally to this work

  26. arXiv:2004.07276  [pdf, other

    eess.SY cs.LG cs.RO

    Improving Input-Output Linearizing Controllers for Bipedal Robots via Reinforcement Learning

    Authors: Fernando Castañeda, Mathias Wulfman, Ayush Agrawal, Tyler Westenbroek, Claire J. Tomlin, S. Shankar Sastry, Koushil Sreenath

    Abstract: The main drawbacks of input-output linearizing controllers are the need for precise dynamics models and not being able to account for input constraints. Model uncertainty is common in almost every robotic application and input saturation is present in every real world system. In this paper, we address both challenges for the specific case of bipedal robot control by the use of reinforcement learni… ▽ More

    Submitted 2 May, 2020; v1 submitted 15 April, 2020; originally announced April 2020.

    Comments: Final version appearing in Learning for Dynamics and Control (L4DC) 2020 Conference

  27. arXiv:2004.02766  [pdf, other

    cs.LG math.DS math.OC stat.ML

    Technical Report: Adaptive Control for Linearizable Systems Using On-Policy Reinforcement Learning

    Authors: Tyler Westenbroek, Eric Mazumdar, David Fridovich-Keil, Valmik Prabhu, Claire J. Tomlin, S. Shankar Sastry

    Abstract: This paper proposes a framework for adaptively learning a feedback linearization-based tracking controller for an unknown system using discrete-time model-free policy-gradient parameter update rules. The primary advantage of the scheme over standard model-reference adaptive control techniques is that it does not require the learned inverse model to be invertible at all instances of time. This enab… ▽ More

    Submitted 6 April, 2020; originally announced April 2020.

  28. arXiv:2002.04354  [pdf, other

    cs.RO eess.SY

    Inference-Based Strategy Alignment for General-Sum Differential Games

    Authors: Lasse Peters, David Fridovich-Keil, Claire J. Tomlin, Zachary N. Sunberg

    Abstract: In many settings where multiple agents interact, the optimal choices for each agent depend heavily on the choices of the others. These coupled interactions are well-described by a general-sum differential game, in which players have differing objectives, the state evolves in continuous time, and optimal play may be characterized by one of many equilibrium concepts, e.g., a Nash equilibrium. Often,… ▽ More

    Submitted 6 May, 2020; v1 submitted 11 February, 2020; originally announced February 2020.

  29. arXiv:1910.13369  [pdf, other

    cs.RO cs.LG eess.SY

    A Hamilton-Jacobi Reachability-Based Framework for Predicting and Analyzing Human Motion for Safe Planning

    Authors: Somil Bansal, Andrea Bajcsy, Ellis Ratner, Anca D. Dragan, Claire J. Tomlin

    Abstract: Real-world autonomous systems often employ probabilistic predictive models of human behavior during planning to reason about their future motion. Since accurately modeling human behavior a priori is challenging, such models are often parameterized, enabling the robot to adapt predictions based on observations by maintaining a distribution over the model parameters. Although this enables data and p… ▽ More

    Submitted 5 April, 2020; v1 submitted 29 October, 2019; originally announced October 2019.

  30. arXiv:1910.13272  [pdf, other

    math.OC cs.AI cs.LG eess.SY

    Feedback Linearization for Unknown Systems via Reinforcement Learning

    Authors: Tyler Westenbroek, David Fridovich-Keil, Eric Mazumdar, Shreyas Arora, Valmik Prabhu, S. Shankar Sastry, Claire J. Tomlin

    Abstract: We present a novel approach to control design for nonlinear systems which leverages model-free policy optimization techniques to learn a linearizing controller for a physical plant with unknown dynamics. Feedback linearization is a technique from nonlinear control which renders the input-output dynamics of a nonlinear plant \emph{linear} under application of an appropriate feedback controller. Onc… ▽ More

    Submitted 21 April, 2020; v1 submitted 29 October, 2019; originally announced October 2019.

  31. arXiv:1910.04332  [pdf, other

    cs.LG cs.RO eess.SY stat.ML

    Sparse tree search optimality guarantees in POMDPs with continuous observation spaces

    Authors: Michael H. Lim, Claire J. Tomlin, Zachary N. Sunberg

    Abstract: Partially observable Markov decision processes (POMDPs) with continuous state and observation spaces have powerful flexibility for representing real-world decision and control problems but are notoriously difficult to solve. Recent online sampling-based algorithms that use observation likelihood weighting have shown unprecedented effectiveness in domains with continuous observation spaces. However… ▽ More

    Submitted 5 June, 2023; v1 submitted 9 October, 2019; originally announced October 2019.

  32. arXiv:1910.00681  [pdf, other

    eess.SY cs.GT cs.MA cs.RO

    An Iterative Quadratic Method for General-Sum Differential Games with Feedback Linearizable Dynamics

    Authors: David Fridovich-Keil, Vicenc Rubies-Royo, Claire J. Tomlin

    Abstract: Iterative linear-quadratic (ILQ) methods are widely used in the nonlinear optimal control community. Recent work has applied similar methodology in the setting of multiplayer general-sum differential games. Here, ILQ methods are capable of finding local equilibria in interactive motion planning problems in real-time. As in most iterative procedures, however, this approach can be sensitive to initi… ▽ More

    Submitted 19 March, 2020; v1 submitted 1 October, 2019; originally announced October 2019.

    Comments: 7 pages, 5 figures, accepted to IEEE International Conference on Robotics and Automation (2020)

  33. arXiv:1909.05699  [pdf, other

    eess.SY cs.LG

    Closed-loop Model Selection for Kernel-based Models using Bayesian Optimization

    Authors: Thomas Beckers, Somil Bansal, Claire J. Tomlin, Sandra Hirche

    Abstract: Kernel-based nonparametric models have become very attractive for model-based control approaches for nonlinear systems. However, the selection of the kernel and its hyperparameters strongly influences the quality of the learned model. Classically, these hyperparameters are optimized to minimize the prediction error of the model but this process totally neglects its later usage in the control loop.… ▽ More

    Submitted 12 September, 2019; originally announced September 2019.

    Journal ref: IEEE Conference on Decision and Control 2019

  34. arXiv:1909.04694  [pdf, other

    eess.SY cs.RO

    Efficient Iterative Linear-Quadratic Approximations for Nonlinear Multi-Player General-Sum Differential Games

    Authors: David Fridovich-Keil, Ellis Ratner, Lasse Peters, Anca D. Dragan, Claire J. Tomlin

    Abstract: Many problems in robotics involve multiple decision making agents. To operate efficiently in such settings, a robot must reason about the impact of its decisions on the behavior of other agents. Differential games offer an expressive theoretical framework for formulating these types of multi-agent problems. Unfortunately, most numerical solution techniques scale poorly with state dimension and are… ▽ More

    Submitted 18 March, 2020; v1 submitted 10 September, 2019; originally announced September 2019.

    Comments: 8 pages, 4 figures, accepted to the IEEE International Conference on Robotics and Automation

  35. arXiv:1905.00532  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    An Efficient Reachability-Based Framework for Provably Safe Autonomous Navigation in Unknown Environments

    Authors: Andrea Bajcsy, Somil Bansal, Eli Bronstein, Varun Tolani, Claire J. Tomlin

    Abstract: Real-world autonomous vehicles often operate in a priori unknown environments. Since most of these systems are safety-critical, it is important to ensure they operate safely in the face of environment uncertainty, such as unseen obstacles. Current safety analysis tools enable autonomous systems to reason about safety given full information about the state of the environment a priori. However, thes… ▽ More

    Submitted 1 May, 2019; originally announced May 2019.

  36. A New Simulation Metric to Determine Safe Environments and Controllers for Systems with Unknown Dynamics

    Authors: Shromona Ghosh, Somil Bansal, Alberto Sangiovanni-Vincentelli, Sanjit A. Seshia, Claire J. Tomlin

    Abstract: We consider the problem of extracting safe environments and controllers for reach-avoid objectives for systems with known state and control spaces, but unknown dynamics. In a given environment, a common approach is to synthesize a controller from an abstraction or a model of the system (potentially learned from data). However, in many situations, the relationship between the dynamics of the model… ▽ More

    Submitted 26 February, 2019; originally announced February 2019.

    Comments: 22nd ACM International Conference on Hybrid Systems: Computation and Control (2019)

  37. arXiv:1811.07834  [pdf, other

    cs.RO eess.SY

    Safely Probabilistically Complete Real-Time Planning and Exploration in Unknown Environments

    Authors: David Fridovich-Keil, Jaime F. Fisac, Claire J. Tomlin

    Abstract: We present a new framework for motion planning that wraps around existing kinodynamic planners and guarantees recursive feasibility when operating in a priori unknown, static environments. Our approach makes strong guarantees about overall safety and collision avoidance by utilizing a robust controller derived from reachability analysis. We ensure that motion plans never exit the safe backward rea… ▽ More

    Submitted 6 March, 2019; v1 submitted 19 November, 2018; originally announced November 2018.

    Comments: 7 pages, accepted to ICRA 2019

  38. arXiv:1811.05929  [pdf, other

    cs.RO

    A Scalable Framework For Real-Time Multi-Robot, Multi-Human Collision Avoidance

    Authors: Andrea Bajcsy, Sylvia L. Herbert, David Fridovich-Keil, Jaime F. Fisac, Sampada Deglurkar, Anca D. Dragan, Claire J. Tomlin

    Abstract: Robust motion planning is a well-studied problem in the robotics literature, yet current algorithms struggle to operate scalably and safely in the presence of other moving agents, such as humans. This paper introduces a novel framework for robot navigation that accounts for high-order system dynamics and maintains safety in the presence of external disturbances, other robots, and non-deterministic… ▽ More

    Submitted 14 November, 2018; originally announced November 2018.

  39. arXiv:1809.00706  [pdf, other

    math.OC cs.RO

    A Minimum Discounted Reward Hamilton-Jacobi Formulation for Computing Reachable Sets

    Authors: Anayo K. Akametalu, Shromona Ghosh, Jaime F. Fisac, Claire J. Tomlin

    Abstract: We propose a novel formulation for approximating reachable sets through a minimum discounted reward optimal control problem. The formulation yields a continuous solution that can be obtained by solving a Hamilton-Jacobi equation. Furthermore, the numerical approximation to this solution can be obtained as the unique fixed-point to a contraction mapping. This allows for more efficient solution meth… ▽ More

    Submitted 3 September, 2018; originally announced September 2018.

  40. arXiv:1808.00649  [pdf, other

    eess.SY cs.RO math.OC

    Robust Tracking with Model Mismatch for Fast and Safe Planning: an SOS Optimization Approach

    Authors: Sumeet Singh, Mo Chen, Sylvia L. Herbert, Claire J. Tomlin, Marco Pavone

    Abstract: In the pursuit of real-time motion planning, a commonly adopted practice is to compute a trajectory by running a planning algorithm on a simplified, low-dimensional dynamical model, and then employ a feedback tracking controller that tracks such a trajectory by accounting for the full, high-dimensional system dynamics. While this strategy of planning with model mismatch generally yields fast compu… ▽ More

    Submitted 28 July, 2019; v1 submitted 1 August, 2018; originally announced August 2018.

    Comments: Presented at WAFR 2018; final version v2 -- fixed typos

  41. arXiv:1806.00109  [pdf, other

    cs.RO cs.LG

    Probabilistically Safe Robot Planning with Confidence-Based Human Predictions

    Authors: Jaime F. Fisac, Andrea Bajcsy, Sylvia L. Herbert, David Fridovich-Keil, Steven Wang, Claire J. Tomlin, Anca D. Dragan

    Abstract: In order to safely operate around humans, robots can employ predictive models of human motion. Unfortunately, these models cannot capture the full complexity of human behavior and necessarily introduce simplifying assumptions. As a result, predictions may degrade whenever the observed human behavior departs from the assumed structure, which can have negative implications for safety. In this paper,… ▽ More

    Submitted 31 May, 2018; originally announced June 2018.

    Comments: Robotics Science and Systems (RSS) 2018

  42. A Classification-based Approach for Approximate Reachability

    Authors: Vicenc Rubies-Royo, David Fridovich-Keil, Sylvia Herbert, Claire J. Tomlin

    Abstract: Hamilton-Jacobi (HJ) reachability analysis has been developed over the past decades into a widely-applicable tool for determining goal satisfaction and safety verification in nonlinear systems. While HJ reachability can be formulated very generally, computational complexity can be a serious impediment for many systems of practical interest. Much prior work has been devoted to computing approximate… ▽ More

    Submitted 20 May, 2019; v1 submitted 8 March, 2018; originally announced March 2018.

  43. arXiv:1802.04929  [pdf, other

    eess.SY cs.LG cs.RO

    Context-Specific Validation of Data-Driven Models

    Authors: Somil Bansal, Shromona Ghosh, Alberto Sangiovanni-Vincentelli, Sanjit A. Seshia, Claire J. Tomlin

    Abstract: With an increasing use of data-driven models to control robotic systems, it has become important to develop a methodology for validating such models before they can be deployed to design a controller for the actual system. Specifically, it must be ensured that the controller designed for a learned model would perform as expected on the actual physical system. We propose a context-specific validati… ▽ More

    Submitted 25 March, 2018; v1 submitted 13 February, 2018; originally announced February 2018.

  44. arXiv:1711.05928  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Budget-Constrained Multi-Armed Bandits with Multiple Plays

    Authors: Datong P. Zhou, Claire J. Tomlin

    Abstract: We study the multi-armed bandit problem with multiple plays and a budget constraint for both the stochastic and the adversarial setting. At each round, exactly $K$ out of $N$ possible arms have to be played (with $1\leq K \leq N$). In addition to observing the individual rewards for each arm played, the player also learns a vector of costs which has to be covered with an a-priori defined budget… ▽ More

    Submitted 16 November, 2017; originally announced November 2017.

    Comments: 20 pages

  45. arXiv:1710.04731  [pdf, other

    eess.SY cs.GT

    Planning, Fast and Slow: A Framework for Adaptive Real-Time Safe Trajectory Planning

    Authors: David Fridovich-Keil, Sylvia L. Herbert, Jaime F. Fisac, Sampada Deglurkar, Claire J. Tomlin

    Abstract: Motion planning is an extremely well-studied problem in the robotics community, yet existing work largely falls into one of two categories: computationally efficient but with few if any safety guarantees, or able to give stronger guarantees but at high computational cost. This work builds on a recent development called FaSTrack in which a slow offline computation provides a modular safety guarante… ▽ More

    Submitted 6 March, 2018; v1 submitted 12 October, 2017; originally announced October 2017.

    Comments: ICRA, International Conference on Robotics and Automation, ICRA 2018, 8 pages, 9 figures

  46. arXiv:1705.01292  [pdf, other

    cs.RO eess.SY

    A General Safety Framework for Learning-Based Control in Uncertain Robotic Systems

    Authors: Jaime F. Fisac, Anayo K. Akametalu, Melanie N. Zeilinger, Shahab Kaynama, Jeremy Gillula, Claire J. Tomlin

    Abstract: The proven efficacy of learning-based control schemes strongly motivates their application to robotic systems operating in the physical world. However, guaranteeing correct operation during the learning process is currently an unresolved issue, which is of vital importance in safety-critical systems. We propose a general safety framework based on Hamilton-Jacobi reachability methods that can work… ▽ More

    Submitted 14 February, 2018; v1 submitted 3 May, 2017; originally announced May 2017.

    Comments: Accepted for publication in IEEE Transactions on Automatic Control. Video with experiments: https://youtu.be/WAAxyeSk2bw

    ACM Class: I.2.9; I.2.8; I.2.6

  47. arXiv:1703.09260  [pdf, other

    eess.SY cs.LG

    Goal-Driven Dynamics Learning via Bayesian Optimization

    Authors: Somil Bansal, Roberto Calandra, Ted Xiao, Sergey Levine, Claire J. Tomlin

    Abstract: Real-world robots are becoming increasingly complex and commonly act in poorly understood environments where it is extremely challenging to model or learn their true dynamics. Therefore, it might be desirable to take a task-specific approach, wherein the focus is on explicitly learning the dynamics model which achieves the best control performance for the task at hand, rather than learning the tru… ▽ More

    Submitted 21 September, 2017; v1 submitted 27 March, 2017; originally announced March 2017.

    Comments: This is the extended version of the CDC'17 paper titled "Goal-Driven Dynamics Learning via Bayesian Optimization."

  48. FaSTrack: a Modular Framework for Fast and Guaranteed Safe Motion Planning

    Authors: Sylvia L. Herbert, Mo Chen, SooJean Han, Somil Bansal, Jaime F. Fisac, Claire J. Tomlin

    Abstract: Fast and safe navigation of dynamical systems through a priori unknown cluttered environments is vital to many applications of autonomous systems. However, trajectory planning for autonomous systems is computationally intensive, often requiring simplified dynamics that sacrifice safety and dynamic feasibility in order to plan efficiently. Conversely, safe trajectories can be computed using more so… ▽ More

    Submitted 13 February, 2021; v1 submitted 21 March, 2017; originally announced March 2017.

    Comments: Published in the Proceedings of the IEEE Conference on Decision and Control, 2017

  49. arXiv:1703.00980  [pdf, other

    cs.SI

    How Peer Effects Influence Energy Consumption

    Authors: Datong P. Zhou, Mardavij Roozbehani, Munther A. Dahleh, Claire J. Tomlin

    Abstract: This paper analyzes the impact of peer effects on electricity consumption of a network of rational, utility-maximizing users. Users derive utility from consuming electricity as well as consuming less energy than their neighbors. However, a disutility is incurred for consuming more than their neighbors. To maximize the profit of the load-serving entity that provides electricity to such users, we de… ▽ More

    Submitted 17 March, 2017; v1 submitted 2 March, 2017; originally announced March 2017.

    Comments: 9 pages, 4 figures

  50. arXiv:1703.00972  [pdf, other

    cs.GT

    Eliciting Private User Information for Residential Demand Response

    Authors: Datong P. Zhou, Maximilian Balandat, Munther A. Dahleh, Claire J. Tomlin

    Abstract: Residential Demand Response has emerged as a viable tool to alleviate supply and demand imbalances of electricity, particularly during times when the electric grid is strained due a shortage of supply. Demand Response providers bid reduction capacity into the wholesale electricity market by asking their customers under contract to temporarily reduce their consumption in exchange for a monetary inc… ▽ More

    Submitted 3 September, 2017; v1 submitted 2 March, 2017; originally announced March 2017.

    Comments: 8 pages, 7 figures