Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–42 of 42 results for author: Ivanovic, B

.
  1. arXiv:2407.00959  [pdf, other

    cs.AI cs.RO

    Tokenize the World into Object-level Knowledge to Address Long-tail Events in Autonomous Driving

    Authors: Ran Tian, Boyi Li, Xinshuo Weng, Yuxiao Chen, Edward Schmerling, Yue Wang, Boris Ivanovic, Marco Pavone

    Abstract: The autonomous driving industry is increasingly adopting end-to-end learning from sensory inputs to minimize human biases in system design. Traditional end-to-end driving models, however, suffer from long-tail events due to rare or unseen inputs within their training distributions. To address this, we propose TOKEN, a novel Multi-Modal Large Language Model (MM-LLM) that tokenizes the world into ob… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2406.15349  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    NAVSIM: Data-Driven Non-Reactive Autonomous Vehicle Simulation and Benchmarking

    Authors: Daniel Dauner, Marcel Hallgarten, Tianyu Li, Xinshuo Weng, Zhiyu Huang, Zetong Yang, Hongyang Li, Igor Gilitschenski, Boris Ivanovic, Marco Pavone, Andreas Geiger, Kashyap Chitta

    Abstract: Benchmarking vision-based driving policies is challenging. On one hand, open-loop evaluation with real data is easy, but these results do not reflect closed-loop performance. On the other, closed-loop evaluation is possible in simulation, but is hard to scale due to its significant computational demands. Further, the simulators available today exhibit a large domain gap to real data. This has resu… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  3. arXiv:2406.12095  [pdf, other

    cs.CV cs.AI cs.RO

    DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features

    Authors: Letian Wang, Seung Wook Kim, Jiawei Yang, Cunjun Yu, Boris Ivanovic, Steven L. Waslander, Yue Wang, Sanja Fidler, Marco Pavone, Peter Karkus

    Abstract: We propose DistillNeRF, a self-supervised learning framework addressing the challenge of understanding 3D environments from limited 2D observations in autonomous driving. Our method is a generalizable feedforward model that predicts a rich neural scene representation from sparse, single-frame multi-view camera inputs, and is trained self-supervised with differentiable rendering to reconstruct RGB,… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  4. arXiv:2406.10789  [pdf, other

    cs.CV

    Learning Traffic Crashes as Language: Datasets, Benchmarks, and What-if Causal Analyses

    Authors: Zhiwen Fan, Pu Wang, Yang Zhao, Yibo Zhao, Boris Ivanovic, Zhangyang Wang, Marco Pavone, Hao Frank Yang

    Abstract: The increasing rate of road accidents worldwide results not only in significant loss of life but also imposes billions financial burdens on societies. Current research in traffic crash frequency modeling and analysis has predominantly approached the problem as classification tasks, focusing mainly on learning-based classification or ensemble learning methods. These approaches often overlook the in… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

  5. arXiv:2406.04557  [pdf, other

    cs.CY

    Countrywide natural experiment reveals impact of built environment on physical activity

    Authors: Tim Althoff, Boris Ivanovic, Jennifer L. Hicks, Scott L. Delp, Abby C. King, Jure Leskovec

    Abstract: While physical activity is critical to human health, most people do not meet recommended guidelines. More walkable built environments have the potential to increase activity across the population. However, previous studies on the built environment and physical activity have led to mixed findings, possibly due to methodological limitations such as small cohorts, few or single locations, over-relian… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  6. arXiv:2405.03685  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Language-Image Models with 3D Understanding

    Authors: Jang Hyun Cho, Boris Ivanovic, Yulong Cao, Edward Schmerling, Yue Wang, Xinshuo Weng, Boyi Li, Yurong You, Philipp Krähenbühl, Yan Wang, Marco Pavone

    Abstract: Multi-modal large language models (MLLMs) have shown incredible capabilities in a variety of 2D vision and language tasks. We extend MLLMs' perceptual capabilities to ground and reason about images in 3-dimensional space. To that end, we first develop a large-scale pre-training dataset for 2D and 3D called LV3D by combining multiple existing 2D and 3D recognition datasets under a common task formu… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: Project page: https://janghyuncho.github.io/Cube-LLM

  7. arXiv:2403.20309  [pdf, other

    cs.CV

    InstantSplat: Unbounded Sparse-view Pose-free Gaussian Splatting in 40 Seconds

    Authors: Zhiwen Fan, Wenyan Cong, Kairun Wen, Kevin Wang, Jian Zhang, Xinghao Ding, Danfei Xu, Boris Ivanovic, Marco Pavone, Georgios Pavlakos, Zhangyang Wang, Yue Wang

    Abstract: While novel view synthesis (NVS) from a sparse set of images has advanced significantly in 3D computer vision, it relies on precise initial estimation of camera parameters using Structure-from-Motion (SfM). For instance, the recently developed Gaussian Splatting depends heavily on the accuracy of SfM-derived points and poses. However, SfM processes are time-consuming and often prove unreliable in… ▽ More

    Submitted 30 June, 2024; v1 submitted 29 March, 2024; originally announced March 2024.

    Comments: Project Page: https://instantsplat.github.io/

  8. arXiv:2403.16439  [pdf, other

    cs.RO cs.CV cs.LG

    Producing and Leveraging Online Map Uncertainty in Trajectory Prediction

    Authors: Xunjiang Gu, Guanyu Song, Igor Gilitschenski, Marco Pavone, Boris Ivanovic

    Abstract: High-definition (HD) maps have played an integral role in the development of modern autonomous vehicle (AV) stacks, albeit with high associated labeling and maintenance costs. As a result, many recent works have proposed methods for estimating HD maps online from sensor data, enabling AVs to operate outside of previously-mapped regions. However, current online map estimation approaches are develop… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 14 pages, 14 figures, 6 tables. CVPR 2024

  9. arXiv:2402.17077  [pdf, other

    cs.LG cs.CV

    Parallelized Spatiotemporal Binding

    Authors: Gautam Singh, Yue Wang, Jiawei Yang, Boris Ivanovic, Sungjin Ahn, Marco Pavone, Tong Che

    Abstract: While modern best practices advocate for scalable architectures that support long-range interactions, object-centric models are yet to fully embrace these architectures. In particular, existing object-centric models for handling sequential inputs, due to their reliance on RNN-based implementation, show poor stability and capacity and are slow to train on long sequences. We introduce Parallelizable… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: See project page at https://parallel-st-binder.github.io

  10. arXiv:2402.05932  [pdf, other

    cs.RO cs.AI cs.CL

    Driving Everywhere with Large Language Model Policy Adaptation

    Authors: Boyi Li, Yue Wang, Jiageng Mao, Boris Ivanovic, Sushant Veer, Karen Leung, Marco Pavone

    Abstract: Adapting driving behavior to new environments, customs, and laws is a long-standing problem in autonomous driving, precluding the widespread deployment of autonomous vehicles (AVs). In this paper, we present LLaDA, a simple yet powerful tool that enables human drivers and autonomous vehicles alike to drive everywhere by adapting their tasks and motion plans to traffic rules in new locations. LLaDA… ▽ More

    Submitted 10 April, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: CVPR 2024, featured in GTC 2024: https://www.youtube.com/watch?v=t-UPlPlrYgQ&t=51s

  11. arXiv:2311.02077  [pdf, other

    cs.CV

    EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision

    Authors: Jiawei Yang, Boris Ivanovic, Or Litany, Xinshuo Weng, Seung Wook Kim, Boyi Li, Tong Che, Danfei Xu, Sanja Fidler, Marco Pavone, Yue Wang

    Abstract: We present EmerNeRF, a simple yet powerful approach for learning spatial-temporal representations of dynamic driving scenes. Grounded in neural fields, EmerNeRF simultaneously captures scene geometry, appearance, motion, and semantics via self-bootstrapping. EmerNeRF hinges upon two core components: First, it stratifies scenes into static and dynamic fields. This decomposition emerges purely from… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: See the project page for code, data, and request pre-trained models: https://emernerf.github.io

  12. arXiv:2310.05885  [pdf, other

    cs.RO

    DTPP: Differentiable Joint Conditional Prediction and Cost Evaluation for Tree Policy Planning in Autonomous Driving

    Authors: Zhiyu Huang, Peter Karkus, Boris Ivanovic, Yuxiao Chen, Marco Pavone, Chen Lv

    Abstract: Motion prediction and cost evaluation are vital components in the decision-making system of autonomous vehicles. However, existing methods often ignore the importance of cost learning and treat them as separate modules. In this study, we employ a tree-structured policy planner and propose a differentiable joint training framework for both ego-conditioned prediction and cost models, resulting in a… ▽ More

    Submitted 23 February, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: 2024 IEEE International Conference on Robotics and Automation

  13. arXiv:2309.00709  [pdf, other

    cs.AI cs.LG cs.RO

    Reinforcement Learning with Human Feedback for Realistic Traffic Simulation

    Authors: Yulong Cao, Boris Ivanovic, Chaowei Xiao, Marco Pavone

    Abstract: In light of the challenges and costs of real-world testing, autonomous vehicle developers often rely on testing in simulation for the creation of reliable systems. A key element of effective simulation is the incorporation of realistic traffic models that align with human knowledge, an aspect that has proven challenging due to the need to balance realism and diversity. This works aims to address t… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

    Comments: 9 pages, 4 figures

  14. arXiv:2307.13924  [pdf, other

    cs.CV cs.LG cs.RO

    trajdata: A Unified Interface to Multiple Human Trajectory Datasets

    Authors: Boris Ivanovic, Guanyu Song, Igor Gilitschenski, Marco Pavone

    Abstract: The field of trajectory forecasting has grown significantly in recent years, partially owing to the release of numerous large-scale, real-world human trajectory datasets for autonomous vehicles (AVs) and pedestrian motion tracking. While such datasets have been a boon for the community, they each use custom and unique data formats and APIs, making it cumbersome for researchers to train and evaluat… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

    Comments: 15 pages, 15 figures, 3 tables

  15. arXiv:2307.07947  [pdf, other

    cs.CV

    Language Conditioned Traffic Generation

    Authors: Shuhan Tan, Boris Ivanovic, Xinshuo Weng, Marco Pavone, Philipp Kraehenbuehl

    Abstract: Simulation forms the backbone of modern self-driving development. Simulators help develop, test, and improve driving systems without putting humans, vehicles, or their environment at risk. However, simulators face a major challenge: They rely on realistic, scalable, yet interesting content. While recent advances in rendering and scene reconstruction make great strides in creating static scene asse… ▽ More

    Submitted 16 July, 2023; originally announced July 2023.

    Comments: Technical Report. Website available at https://ariostgx.github.io/lctgen

  16. arXiv:2306.06344  [pdf, other

    cs.RO cs.AI cs.LG

    Language-Guided Traffic Simulation via Scene-Level Diffusion

    Authors: Ziyuan Zhong, Davis Rempe, Yuxiao Chen, Boris Ivanovic, Yulong Cao, Danfei Xu, Marco Pavone, Baishakhi Ray

    Abstract: Realistic and controllable traffic simulation is a core capability that is necessary to accelerate autonomous vehicle (AV) development. However, current approaches for controlling learning-based traffic models require significant domain expertise and are difficult for practitioners to use. To remedy this, we present CTG++, a scene-level conditional diffusion model that can be guided by language in… ▽ More

    Submitted 18 October, 2023; v1 submitted 10 June, 2023; originally announced June 2023.

  17. arXiv:2304.00673  [pdf, other

    cs.CV

    Partial-View Object View Synthesis via Filtered Inversion

    Authors: Fan-Yun Sun, Jonathan Tremblay, Valts Blukis, Kevin Lin, Danfei Xu, Boris Ivanovic, Peter Karkus, Stan Birchfield, Dieter Fox, Ruohan Zhang, Yunzhu Li, Jiajun Wu, Marco Pavone, Nick Haber

    Abstract: We propose Filtering Inversion (FINV), a learning framework and optimization process that predicts a renderable 3D object representation from one or few partial views. FINV addresses the challenge of synthesizing novel views of objects from partial observations, spanning cases where the object is not entirely in view, is partially occluded, or is only observed from similar views. To achieve this,… ▽ More

    Submitted 2 April, 2023; originally announced April 2023.

    Comments: project website: http://cs.stanford.edu/~sunfanyun/finv

  18. arXiv:2301.11902  [pdf, other

    cs.RO eess.SY

    Tree-structured Policy Planning with Learned Behavior Models

    Authors: Yuxiao Chen, Peter Karkus, Boris Ivanovic, Xinshuo Weng, Marco Pavone

    Abstract: Autonomous vehicles (AVs) need to reason about the multimodal behavior of neighboring agents while planning their own motion. Many existing trajectory planners seek a single trajectory that performs well under \emph{all} plausible futures simultaneously, ignoring bi-directional interactions and thus leading to overly conservative plans. Policy planning, whereby the ego agent plans a policy that re… ▽ More

    Submitted 26 February, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

  19. arXiv:2212.06437  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    DiffStack: A Differentiable and Modular Control Stack for Autonomous Vehicles

    Authors: Peter Karkus, Boris Ivanovic, Shie Mannor, Marco Pavone

    Abstract: Autonomous vehicle (AV) stacks are typically built in a modular fashion, with explicit components performing detection, tracking, prediction, planning, control, etc. While modularity improves reusability, interpretability, and generalizability, it also suffers from compounding errors, information bottlenecks, and integration challenges. To overcome these challenges, a prominent approach is to conv… ▽ More

    Submitted 13 December, 2022; originally announced December 2022.

    Comments: CoRL 2022 camera ready

  20. arXiv:2210.14584  [pdf, other

    cs.LG cs.RO

    Planning with Occluded Traffic Agents using Bi-Level Variational Occlusion Models

    Authors: Filippos Christianos, Peter Karkus, Boris Ivanovic, Stefano V. Albrecht, Marco Pavone

    Abstract: Reasoning with occluded traffic agents is a significant open challenge for planning for autonomous vehicles. Recent deep learning models have shown impressive results for predicting occluded agents based on the behaviour of nearby visible agents; however, as we show in experiments, these models are difficult to integrate into downstream planning. To this end, we propose Bi-level Variational Occlus… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: 7 pages, 6 figures

  21. arXiv:2210.05519  [pdf, other

    cs.LG

    Robust and Controllable Object-Centric Learning through Energy-based Models

    Authors: Ruixiang Zhang, Tong Che, Boris Ivanovic, Renhao Wang, Marco Pavone, Yoshua Bengio, Liam Paull

    Abstract: Humans are remarkably good at understanding and reasoning about complex visual scenes. The capability to decompose low-level observations into discrete objects allows us to build a grounded abstract representation and identify the compositional structure of the world. Accordingly, it is a crucial step for machine learning models to be capable of inferring objects and their properties from visual s… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

  22. arXiv:2209.11820  [pdf, other

    cs.LG cs.CV cs.RO

    Expanding the Deployment Envelope of Behavior Prediction via Adaptive Meta-Learning

    Authors: Boris Ivanovic, James Harrison, Marco Pavone

    Abstract: Learning-based behavior prediction methods are increasingly being deployed in real-world autonomous systems, e.g., in fleets of self-driving vehicles, which are beginning to commercially operate in major cities across the world. Despite their advancements, however, the vast majority of prediction systems are specialized to a set of well-explored geographic regions or operational design domains, co… ▽ More

    Submitted 23 May, 2023; v1 submitted 23 September, 2022; originally announced September 2022.

    Comments: 12 pages, 13 figures, 2 tables. ICRA 2023

  23. arXiv:2208.12403  [pdf, other

    cs.RO cs.LG

    BITS: Bi-level Imitation for Traffic Simulation

    Authors: Danfei Xu, Yuxiao Chen, Boris Ivanovic, Marco Pavone

    Abstract: Simulation is the key to scaling up validation and verification for robotic systems such as autonomous vehicles. Despite advances in high-fidelity physics and sensor simulation, a critical gap remains in simulating realistic behaviors of road users. This is because, unlike simulating physics and graphics, devising first principle models for human-like behaviors is generally infeasible. In this wor… ▽ More

    Submitted 25 August, 2022; originally announced August 2022.

  24. arXiv:2207.12380  [pdf, other

    cs.RO

    Task-Relevant Failure Detection for Trajectory Predictors in Autonomous Vehicles

    Authors: Alec Farid, Sushant Veer, Boris Ivanovic, Karen Leung, Marco Pavone

    Abstract: In modern autonomy stacks, prediction modules are paramount to planning motions in the presence of other mobile agents. However, failures in prediction modules can mislead the downstream planner into making unsafe decisions. Indeed, the high uncertainty inherent to the task of trajectory forecasting ensures that such mispredictions occur frequently. Motivated by the need to improve safety of auton… ▽ More

    Submitted 14 April, 2023; v1 submitted 25 July, 2022; originally announced July 2022.

  25. arXiv:2206.13387  [pdf, other

    cs.AI cs.CV cs.LG cs.RO

    ScePT: Scene-consistent, Policy-based Trajectory Predictions for Planning

    Authors: Yuxiao Chen, Boris Ivanovic, Marco Pavone

    Abstract: Trajectory prediction is a critical functionality of autonomous systems that share environments with uncontrolled agents, one prominent example being self-driving vehicles. Currently, most prediction methods do not enforce scene consistency, i.e., there are a substantial amount of self-collisions between predicted trajectories of different agents in the scene. Moreover, many approaches generate in… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

  26. arXiv:2110.09481  [pdf, other

    cs.CV cs.MA cs.RO

    MTP: Multi-Hypothesis Tracking and Prediction for Reduced Error Propagation

    Authors: Xinshuo Weng, Boris Ivanovic, Marco Pavone

    Abstract: Recently, there has been tremendous progress in developing each individual module of the standard perception-planning robot autonomy pipeline, including detection, tracking, prediction of other agents' trajectories, and ego-agent trajectory planning. Nevertheless, there has been less attention given to the principled integration of these components, particularly in terms of the characterization an… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

    Comments: Project page: https://www.xinshuoweng.com/projects/MTP

  27. arXiv:2110.03270  [pdf, other

    cs.RO cs.CV cs.LG eess.SY

    Injecting Planning-Awareness into Prediction and Detection Evaluation

    Authors: Boris Ivanovic, Marco Pavone

    Abstract: Detecting other agents and forecasting their behavior is an integral part of the modern robotic autonomy stack, especially in safety-critical scenarios entailing human-robot interaction such as autonomous driving. Due to the importance of these components, there has been a significant amount of interest and research in perception and trajectory forecasting, resulting in a wide variety of approache… ▽ More

    Submitted 7 October, 2021; originally announced October 2021.

    Comments: 8 pages, 9 figures. arXiv admin note: substantial text overlap with arXiv:2107.10297

  28. arXiv:2110.03267  [pdf, other

    cs.RO cs.CV cs.LG

    Propagating State Uncertainty Through Trajectory Forecasting

    Authors: Boris Ivanovic, Yifeng Lin, Shubham Shrivastava, Punarjay Chakravarty, Marco Pavone

    Abstract: Uncertainty pervades through the modern robotic autonomy stack, with nearly every component (e.g., sensors, detection, classification, tracking, behavior prediction) producing continuous or discrete probabilistic distributions. Trajectory forecasting, in particular, is surrounded by uncertainty as its inputs are produced by (noisy) upstream perception and its outputs are predictions that are often… ▽ More

    Submitted 12 July, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: IEEE International Conference on Robotics and Automation (ICRA) 2022 -- 8 pages, 6 figures, 4 tables

  29. Sample-Efficient Safety Assurances using Conformal Prediction

    Authors: Rachel Luo, Shengjia Zhao, Jonathan Kuck, Boris Ivanovic, Silvio Savarese, Edward Schmerling, Marco Pavone

    Abstract: When deploying machine learning models in high-stakes robotics applications, the ability to detect unsafe situations is crucial. Early warning systems can provide alerts when an unsafe situation is imminent (in the absence of corrective action). To reliably improve safety, these warning systems should have a provable false negative rate; i.e. of the situations that are unsafe, fewer than $ε$ will… ▽ More

    Submitted 2 January, 2024; v1 submitted 28 September, 2021; originally announced September 2021.

    Comments: International Journal of Robotics Research, 2023

  30. arXiv:2107.10297  [pdf, other

    cs.RO cs.CV cs.LG eess.SY

    Rethinking Trajectory Forecasting Evaluation

    Authors: Boris Ivanovic, Marco Pavone

    Abstract: Forecasting the behavior of other agents is an integral part of the modern robotic autonomy stack, especially in safety-critical scenarios with human-robot interaction, such as autonomous driving. In turn, there has been a significant amount of interest and research in trajectory forecasting, resulting in a wide variety of approaches. Common to all works, however, is the use of the same few accura… ▽ More

    Submitted 21 July, 2021; originally announced July 2021.

    Comments: 4 pages, 2 figures

  31. arXiv:2104.12446  [pdf, other

    cs.CV cs.LG cs.RO

    Heterogeneous-Agent Trajectory Forecasting Incorporating Class Uncertainty

    Authors: Boris Ivanovic, Kuan-Hui Lee, Pavel Tokmakov, Blake Wulfe, Rowan McAllister, Adrien Gaidon, Marco Pavone

    Abstract: Reasoning about the future behavior of other agents is critical to safe robot navigation. The multiplicity of plausible futures is further amplified by the uncertainty inherent to agent state estimation from data, including positions, velocities, and semantic class. Forecasting methods, however, typically neglect class uncertainty, conditioning instead only on the agent's most likely class, even t… ▽ More

    Submitted 2 March, 2022; v1 submitted 26 April, 2021; originally announced April 2021.

    Comments: 15 pages, 15 figures, 6 tables

  32. arXiv:2012.01027  [pdf, other

    cs.RO cs.AI

    Leveraging Neural Network Gradients within Trajectory Optimization for Proactive Human-Robot Interactions

    Authors: Simon Schaefer, Karen Leung, Boris Ivanovic, Marco Pavone

    Abstract: To achieve seamless human-robot interactions, robots need to intimately reason about complex interaction dynamics and future human behaviors within their motion planning process. However, there is a disconnect between state-of-the-art neural network-based human behavior models and robot motion planners -- either the behavior models are limited in their consideration of downstream planning or a sim… ▽ More

    Submitted 2 December, 2020; originally announced December 2020.

  33. arXiv:2010.09164  [pdf, other

    cs.LG cs.AI cs.CV cs.RO

    Evidential Sparsification of Multimodal Latent Spaces in Conditional Variational Autoencoders

    Authors: Masha Itkina, Boris Ivanovic, Ransalu Senanayake, Mykel J. Kochenderfer, Marco Pavone

    Abstract: Discrete latent spaces in variational autoencoders have been shown to effectively capture the data distribution for many real-world problems such as natural language understanding, human intent prediction, and visual scene representation. However, discrete latent spaces need to be sufficiently large to capture the complexities of real-world data, rendering downstream tasks computationally challeng… ▽ More

    Submitted 18 January, 2021; v1 submitted 18 October, 2020; originally announced October 2020.

    Comments: 21 pages, 15 figures, 34th Conference on Neural Information Processing Systems (NeurIPS 2020)

    ACM Class: I.2.10; I.2.9; I.2.6

  34. arXiv:2009.07517  [pdf, other

    cs.RO cs.HC cs.LG eess.SY

    MATS: An Interpretable Trajectory Forecasting Representation for Planning and Control

    Authors: Boris Ivanovic, Amine Elhafsi, Guy Rosman, Adrien Gaidon, Marco Pavone

    Abstract: Reasoning about human motion is a core component of modern human-robot interactive systems. In particular, one of the main uses of behavior prediction in autonomous systems is to inform robot motion planning and control. However, a majority of planning and control algorithms reason about system dynamics rather than the predicted agent tracklets (i.e., ordered sets of waypoints) that are commonly o… ▽ More

    Submitted 14 January, 2021; v1 submitted 16 September, 2020; originally announced September 2020.

    Comments: 14 pages, 6 figures, 1 table. All code, models, and data can be found at https://github.com/StanfordASL/MATS . Conference on Robot Learning (CoRL) 2020

  35. arXiv:2009.05702  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    Risk-Sensitive Sequential Action Control with Multi-Modal Human Trajectory Forecasting for Safe Crowd-Robot Interaction

    Authors: Haruki Nishimura, Boris Ivanovic, Adrien Gaidon, Marco Pavone, Mac Schwager

    Abstract: This paper presents a novel online framework for safe crowd-robot interaction based on risk-sensitive stochastic optimal control, wherein the risk is modeled by the entropic risk measure. The sampling-based model predictive control relies on mode insertion gradient optimization for this risk measure as well as Trajectron++, a state-of-the-art generative model that produces multimodal probabilistic… ▽ More

    Submitted 11 September, 2020; originally announced September 2020.

    Comments: To appear in 2020 IEEE/RSJ IROS

  36. arXiv:2008.03880  [pdf, other

    cs.RO cs.HC cs.LG

    Multimodal Deep Generative Models for Trajectory Prediction: A Conditional Variational Autoencoder Approach

    Authors: Boris Ivanovic, Karen Leung, Edward Schmerling, Marco Pavone

    Abstract: Human behavior prediction models enable robots to anticipate how humans may react to their actions, and hence are instrumental to devising safe and proactive robot planning algorithms. However, modeling complex interaction dynamics and capturing the possibility of many possible outcomes in such interactive settings is very challenging, which has recently prompted the study of several different app… ▽ More

    Submitted 20 November, 2020; v1 submitted 9 August, 2020; originally announced August 2020.

    Comments: 8 pages, 3 figures, 2 tables. IEEE Robotics and Automation Letters (RA-L), 2020

  37. arXiv:2001.03093  [pdf, other

    cs.RO cs.HC cs.LG

    Trajectron++: Dynamically-Feasible Trajectory Forecasting With Heterogeneous Data

    Authors: Tim Salzmann, Boris Ivanovic, Punarjay Chakravarty, Marco Pavone

    Abstract: Reasoning about human motion is an important prerequisite to safe and socially-aware robotic navigation. As a result, multi-agent behavior prediction has become a core component of modern human-robot interactive systems, such as self-driving cars. While there exist many methods for trajectory forecasting, most do not enforce dynamic constraints and do not account for environmental information (e.g… ▽ More

    Submitted 13 January, 2021; v1 submitted 9 January, 2020; originally announced January 2020.

    Comments: 23 pages, 6 figures, 5 tables. All code, models, and data can be found at https://github.com/StanfordASL/Trajectron-plus-plus . European Conference on Computer Vision (ECCV) 2020. Fixed a few typos

  38. arXiv:1910.08184  [pdf, other

    cs.RO eess.SY

    Map-Predictive Motion Planning in Unknown Environments

    Authors: Amine Elhafsi, Boris Ivanovic, Lucas Janson, Marco Pavone

    Abstract: Algorithms for motion planning in unknown environments are generally limited in their ability to reason about the structure of the unobserved environment. As such, current methods generally navigate unknown environments by relying on heuristic methods to choose intermediate objectives along frontiers. We present a unified method that combines map prediction and motion planning for safe, time-effic… ▽ More

    Submitted 17 October, 2019; originally announced October 2019.

  39. arXiv:1810.05993  [pdf, other

    cs.RO cs.HC cs.LG

    The Trajectron: Probabilistic Multi-Agent Trajectory Modeling With Dynamic Spatiotemporal Graphs

    Authors: Boris Ivanovic, Marco Pavone

    Abstract: Developing safe human-robot interaction systems is a necessary step towards the widespread integration of autonomous agents in society. A key component of such systems is the ability to reason about the many potential futures (e.g. trajectories) of other agents in the scene. Towards this end, we present the Trajectron, a graph-structured model that predicts many potential future trajectories of mu… ▽ More

    Submitted 23 August, 2019; v1 submitted 14 October, 2018; originally announced October 2018.

    Comments: IEEE/CVF International Conference on Computer Vision (ICCV) 2019 -- 10 pages, 10 figures, 2 tables

  40. arXiv:1806.06161  [pdf, other

    cs.RO cs.LG eess.SY

    BaRC: Backward Reachability Curriculum for Robotic Reinforcement Learning

    Authors: Boris Ivanovic, James Harrison, Apoorva Sharma, Mo Chen, Marco Pavone

    Abstract: Model-free Reinforcement Learning (RL) offers an attractive approach to learn control policies for high-dimensional systems, but its relatively poor sample complexity often forces training in simulated environments. Even in simulation, goal-directed tasks whose natural reward function is sparse remain intractable for state-of-the-art model-free algorithms for continuous control. The bottleneck in… ▽ More

    Submitted 16 September, 2018; v1 submitted 15 June, 2018; originally announced June 2018.

  41. arXiv:1803.02015  [pdf, other

    cs.RO cs.HC

    Generative Modeling of Multimodal Multi-Human Behavior

    Authors: Boris Ivanovic, Edward Schmerling, Karen Leung, Marco Pavone

    Abstract: This work presents a methodology for modeling and predicting human behavior in settings with N humans interacting in highly multimodal scenarios (i.e. where there are many possible highly-distinct futures). A motivating example includes robots interacting with humans in crowded environments, such as self-driving cars operating alongside human-driven vehicles or human-robot collaborative bin packin… ▽ More

    Submitted 26 July, 2018; v1 submitted 5 March, 2018; originally announced March 2018.

    Comments: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2018 -- 8 pages, 5 figures

  42. arXiv:1707.04674  [pdf, other

    cs.RO

    ADAPT: Zero-Shot Adaptive Policy Transfer for Stochastic Dynamical Systems

    Authors: James Harrison, Animesh Garg, Boris Ivanovic, Yuke Zhu, Silvio Savarese, Li Fei-Fei, Marco Pavone

    Abstract: Model-free policy learning has enabled robust performance of complex tasks with relatively simple algorithms. However, this simplicity comes at the cost of requiring an Oracle and arguably very poor sample complexity. This renders such methods unsuitable for physical systems. Variants of model-based methods address this problem through the use of simulators, however, this gives rise to the problem… ▽ More

    Submitted 8 November, 2017; v1 submitted 14 July, 2017; originally announced July 2017.

    Comments: International Symposium on Robotics Research (ISRR), 2017