Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–16 of 16 results for author: Itkina, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.21126  [pdf, other

    cs.CV cs.RO

    Self-supervised Multi-future Occupancy Forecasting for Autonomous Driving

    Authors: Bernard Lange, Masha Itkina, Jiachen Li, Mykel J. Kochenderfer

    Abstract: Environment prediction frameworks are critical for the safe navigation of autonomous vehicles (AVs) in dynamic settings. LiDAR-generated occupancy grid maps (L-OGMs) offer a robust bird's-eye view for the scene representation, enabling self-supervised joint scene predictions while exhibiting resilience to partial observability and perception detection failures. Prior approaches have focused on det… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

  2. arXiv:2405.05439  [pdf, other

    cs.RO cs.AI cs.LG stat.AP

    How Generalizable Is My Behavior Cloning Policy? A Statistical Approach to Trustworthy Performance Evaluation

    Authors: Joseph A. Vincent, Haruki Nishimura, Masha Itkina, Paarth Shah, Mac Schwager, Thomas Kollar

    Abstract: With the rise of stochastic generative models in robot policy learning, end-to-end visuomotor policies are increasingly successful at solving complex tasks by learning from human demonstrations. Nevertheless, since real-world evaluation costs afford users only a small number of policy rollouts, it remains a challenge to accurately gauge the performance of such policies. This is exacerbated by dist… ▽ More

    Submitted 18 July, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  3. arXiv:2403.15941  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Explore until Confident: Efficient Exploration for Embodied Question Answering

    Authors: Allen Z. Ren, Jaden Clark, Anushri Dixit, Masha Itkina, Anirudha Majumdar, Dorsa Sadigh

    Abstract: We consider the problem of Embodied Question Answering (EQA), which refers to settings where an embodied agent such as a robot needs to actively explore an environment to gather information until it is confident about the answer to a question. In this work, we leverage the strong semantic reasoning capabilities of large vision-language models (VLMs) to efficiently explore and answer such questions… ▽ More

    Submitted 7 July, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

    Comments: Robotics: Science and Systems (RSS) 2024

  4. arXiv:2403.12945  [pdf, other

    cs.RO

    DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset

    Authors: Alexander Khazatsky, Karl Pertsch, Suraj Nair, Ashwin Balakrishna, Sudeep Dasari, Siddharth Karamcheti, Soroush Nasiriany, Mohan Kumar Srirama, Lawrence Yunliang Chen, Kirsty Ellis, Peter David Fagan, Joey Hejna, Masha Itkina, Marion Lepert, Yecheng Jason Ma, Patrick Tree Miller, Jimmy Wu, Suneel Belkhale, Shivin Dass, Huy Ha, Arhan Jain, Abraham Lee, Youngwoon Lee, Marius Memmel, Sungjae Park , et al. (74 additional authors not shown)

    Abstract: The creation of large, diverse, high-quality robot manipulation datasets is an important stepping stone on the path toward more capable and robust robotic manipulation policies. However, creating such datasets is challenging: collecting robot manipulation data in diverse environments poses logistical and safety challenges and requires substantial investments in hardware and human labour. As a resu… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Project website: https://droid-dataset.github.io/

  5. arXiv:2310.08864  [pdf, other

    cs.RO

    Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    Authors: Open X-Embodiment Collaboration, Abby O'Neill, Abdul Rehman, Abhinav Gupta, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, Ajinkya Jain, Albert Tung, Alex Bewley, Alex Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Andrey Kolobov, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie , et al. (267 additional authors not shown)

    Abstract: Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning method… ▽ More

    Submitted 1 June, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Project website: https://robotics-transformer-x.github.io

  6. arXiv:2211.08701  [pdf, other

    cs.RO cs.CV cs.LG

    Interpretable Self-Aware Neural Networks for Robust Trajectory Prediction

    Authors: Masha Itkina, Mykel J. Kochenderfer

    Abstract: Although neural networks have seen tremendous success as predictive models in a variety of domains, they can be overly confident in their predictions on out-of-distribution (OOD) data. To be viable for safety-critical applications, like autonomous vehicles, neural networks must accurately estimate their epistemic or model uncertainty, achieving a level of system self-awareness. Techniques for epis… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: Conference on Robot Learning (CoRL) 2022, 15 pages, 4 figures

    ACM Class: I.2.9; I.2.6; I.2.10

  7. arXiv:2210.01249  [pdf, other

    cs.RO cs.CV

    LOPR: Latent Occupancy PRediction using Generative Models

    Authors: Bernard Lange, Masha Itkina, Mykel J. Kochenderfer

    Abstract: Environment prediction frameworks are integral for autonomous vehicles, enabling safe navigation in dynamic environments. LiDAR generated occupancy grid maps (L-OGMs) offer a robust bird's eye-view scene representation that facilitates joint scene predictions without relying on manual labeling unlike commonly used trajectory prediction frameworks. Prior approaches have optimized deterministic L-OG… ▽ More

    Submitted 24 August, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

  8. arXiv:2210.00552  [pdf, other

    cs.RO cs.HC cs.LG

    Occlusion-Aware Crowd Navigation Using People as Sensors

    Authors: Ye-Ji Mun, Masha Itkina, Shuijing Liu, Katherine Driggs-Campbell

    Abstract: Autonomous navigation in crowded spaces poses a challenge for mobile robots due to the highly dynamic, partially observable environment. Occlusions are highly prevalent in such settings due to a limited sensor field of view and obstructing human agents. Previous work has shown that observed interactive behaviors of human agents can be used to estimate potential obstacles despite occlusions. We pro… ▽ More

    Submitted 28 April, 2023; v1 submitted 2 October, 2022; originally announced October 2022.

    Comments: 7 pages, 01041552993 figures, Accepted to 2023 IEEE International Conference on Robotics and Automation (ICRA)

  9. arXiv:2203.14155  [pdf, other

    cs.RO cs.AI cs.LG

    How Do We Fail? Stress Testing Perception in Autonomous Vehicles

    Authors: Harrison Delecki, Masha Itkina, Bernard Lange, Ransalu Senanayake, Mykel J. Kochenderfer

    Abstract: Autonomous vehicles (AVs) rely on environment perception and behavior prediction to reason about agents in their surroundings. These perception systems must be robust to adverse weather such as rain, fog, and snow. However, validation of these systems is challenging due to their complexity and dependence on observation histories. This paper presents a method for characterizing failures of LiDAR-ba… ▽ More

    Submitted 26 March, 2022; originally announced March 2022.

    Comments: Submitted to IEEE IROS 2022

  10. arXiv:2110.14182  [pdf, other

    cs.LG cs.AI cs.CL cs.CV stat.ML

    Evidential Softmax for Sparse Multimodal Distributions in Deep Generative Models

    Authors: Phil Chen, Masha Itkina, Ransalu Senanayake, Mykel J. Kochenderfer

    Abstract: Many applications of generative models rely on the marginalization of their high-dimensional output probability distributions. Normalization functions that yield sparse probability distributions can make exact marginalization more computationally tractable. However, sparse normalization functions usually require alternative loss functions for training since the log-likelihood is undefined for spar… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.

    Comments: Accepted to NeurIPS 2021. Code is available at https://github.com/sisl/EvSoftmax

  11. arXiv:2109.02173  [pdf, other

    cs.RO cs.AI cs.CV cs.LG cs.MA

    Multi-Agent Variational Occlusion Inference Using People as Sensors

    Authors: Masha Itkina, Ye-Ji Mun, Katherine Driggs-Campbell, Mykel J. Kochenderfer

    Abstract: Autonomous vehicles must reason about spatial occlusions in urban environments to ensure safety without being overly cautious. Prior work explored occlusion inference from observed social behaviors of road agents, hence treating people as sensors. Inferring occupancy from agent behaviors is an inherently multimodal problem; a driver may behave similarly for different occupancy patterns ahead of th… ▽ More

    Submitted 2 March, 2022; v1 submitted 5 September, 2021; originally announced September 2021.

    Comments: 12 pages, 9 figures, International Conference on Robotics and Automation (ICRA) 2022

    ACM Class: I.2.9; I.2.10

  12. arXiv:2011.09045  [pdf, other

    cs.RO

    Double-Prong ConvLSTM for Spatiotemporal Occupancy Prediction in Dynamic Environments

    Authors: Maneekwan Toyungyernsub, Masha Itkina, Ransalu Senanayake, Mykel J. Kochenderfer

    Abstract: Predicting the future occupancy state of an environment is important to enable informed decisions for autonomous vehicles. Common challenges in occupancy prediction include vanishing dynamic objects and blurred predictions, especially for long prediction horizons. In this work, we propose a double-prong neural network architecture to predict the spatiotemporal evolution of the occupancy state. One… ▽ More

    Submitted 27 September, 2022; v1 submitted 17 November, 2020; originally announced November 2020.

    Comments: Accepted at 2021 International Conference on Robotics and Automation (ICRA 2021)

    ACM Class: I.2.9; I.2.10

  13. arXiv:2011.01413  [pdf, other

    cs.CV cs.RO

    Out-of-Distribution Detection for Automotive Perception

    Authors: Julia Nitsch, Masha Itkina, Ransalu Senanayake, Juan Nieto, Max Schmidt, Roland Siegwart, Mykel J. Kochenderfer, Cesar Cadena

    Abstract: Neural networks (NNs) are widely used for object classification in autonomous driving. However, NNs can fail on input data not well represented by the training dataset, known as out-of-distribution (OOD) data. A mechanism to detect OOD samples is important for safety-critical applications, such as automotive perception, to trigger a safe fallback mode. NNs often rely on softmax normalization for c… ▽ More

    Submitted 5 September, 2021; v1 submitted 2 November, 2020; originally announced November 2020.

    Comments: 6 pages, 4 figures, paper accepted at Intelligent Transportation Systems Conference (ITSC) 2021

    ACM Class: I.2.10; I.2.9

  14. arXiv:2010.09662  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    Attention Augmented ConvLSTM for Environment Prediction

    Authors: Bernard Lange, Masha Itkina, Mykel J. Kochenderfer

    Abstract: Safe and proactive planning in robotic systems generally requires accurate predictions of the environment. Prior work on environment prediction applied video frame prediction techniques to bird's-eye view environment representations, such as occupancy grids. ConvLSTM-based frameworks used previously often result in significant blurring and vanishing of moving objects, thus hindering their applicab… ▽ More

    Submitted 10 September, 2021; v1 submitted 19 October, 2020; originally announced October 2020.

    Comments: Accepted to be published on 2021 International Conference on Intelligent Robots and Systems (IROS)

    ACM Class: I.2.9; I.2.10

  15. arXiv:2010.09164  [pdf, other

    cs.LG cs.AI cs.CV cs.RO

    Evidential Sparsification of Multimodal Latent Spaces in Conditional Variational Autoencoders

    Authors: Masha Itkina, Boris Ivanovic, Ransalu Senanayake, Mykel J. Kochenderfer, Marco Pavone

    Abstract: Discrete latent spaces in variational autoencoders have been shown to effectively capture the data distribution for many real-world problems such as natural language understanding, human intent prediction, and visual scene representation. However, discrete latent spaces need to be sufficiently large to capture the complexities of real-world data, rendering downstream tasks computationally challeng… ▽ More

    Submitted 18 January, 2021; v1 submitted 18 October, 2020; originally announced October 2020.

    Comments: 21 pages, 15 figures, 34th Conference on Neural Information Processing Systems (NeurIPS 2020)

    ACM Class: I.2.10; I.2.9; I.2.6

  16. arXiv:1904.12374  [pdf, other

    cs.CV cs.LG cs.RO

    Dynamic Environment Prediction in Urban Scenes using Recurrent Representation Learning

    Authors: Masha Itkina, Katherine Driggs-Campbell, Mykel J. Kochenderfer

    Abstract: A key challenge for autonomous driving is safe trajectory planning in cluttered, urban environments with dynamic obstacles, such as pedestrians, bicyclists, and other vehicles. A reliable prediction of the future environment, including the behavior of dynamic agents, would allow planning algorithms to proactively generate a trajectory in response to a rapidly changing environment. We present a nov… ▽ More

    Submitted 18 August, 2019; v1 submitted 28 April, 2019; originally announced April 2019.

    Comments: 8 pages, updated final draft, accepted into Intelligent Transportation Systems Conference (ITSC) 2019