Showing 1–50 of 136 results for author: Burgard, W

  1. arXiv:2405.19035  [pdf, other]

    cs.RO cs.CV

    A Good Foundation is Worth Many Labels: Label-Efficient Panoptic Segmentation

    Authors: Niclas Vödisch, Kürsat Petek, Markus Käppeler, Abhinav Valada, Wolfram Burgard

    Abstract: A key challenge for the widespread application of learning-based models for robotic perception is to significantly reduce the required amount of annotated training data while achieving accurate predictions. This is essential not only to decrease operating costs but also to speed up deployment time. In this work, we address this challenge for PAnoptic SegmenTation with fEw Labels (PASTEL) by exploi…

    Submitted 29 May, 2024; originally announced May 2024.

  2. arXiv:2405.18852  [pdf, other]

    cs.CV cs.AI cs.RO

    LetsMap: Unsupervised Representation Learning for Semantic BEV Mapping

    Authors: Nikhil Gosala, Kürsat Petek, B Ravi Kiran, Senthil Yogamani, Paulo Drews-Jr, Wolfram Burgard, Abhinav Valada

    Abstract: Semantic Bird's Eye View (BEV) maps offer a rich representation with strong occlusion reasoning for various decision making tasks in autonomous driving. However, most BEV mapping approaches employ a fully supervised learning paradigm that relies on large amounts of human-annotated BEV ground truth data. In this work, we address this limitation by proposing the first unsupervised representation lea…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 23 pages, 5 figures

  3. arXiv:2404.17298  [pdf, other]

    cs.RO

    Automatic Target-Less Camera-LiDAR Calibration From Motion and Deep Point Correspondences

    Authors: Kürsat Petek, Niclas Vödisch, Johannes Meyer, Daniele Cattaneo, Abhinav Valada, Wolfram Burgard

    Abstract: Sensor setups of robotic platforms commonly include both camera and LiDAR as they provide complementary information. However, fusing these two modalities typically requires a highly accurate calibration between them. In this paper, we propose MDPCalib which is a novel method for camera-LiDAR calibration that requires neither human supervision nor any specific target objects. Instead, we utilize se…

    Submitted 26 April, 2024; originally announced April 2024.

  4. arXiv:2403.17846  [pdf, other]

    cs.RO cs.AI cs.CL cs.CV cs.LG

    Hierarchical Open-Vocabulary 3D Scene Graphs for Language-Grounded Robot Navigation

    Authors: Abdelrhman Werby, Chenguang Huang, Martin Büchner, Abhinav Valada, Wolfram Burgard

    Abstract: Recent open-vocabulary robot mapping methods enrich dense geometric maps with pre-trained visual-language features. While these maps allow for the prediction of point-wise saliency maps when queried for a certain language concept, large-scale environments and abstract queries beyond the object level still pose a considerable hurdle, ultimately limiting language-grounded robotic navigation. In this…

    Submitted 3 June, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: Code and video are available at http://hovsg.github.io/

  5. arXiv:2403.14305  [pdf, other]

    cs.RO

    Bayesian Optimization for Sample-Efficient Policy Improvement in Robotic Manipulation

    Authors: Adrian Röfer, Iman Nematollahi, Tim Welschehold, Wolfram Burgard, Abhinav Valada

    Abstract: Sample efficient learning of manipulation skills poses a major challenge in robotics. While recent approaches demonstrate impressive advances in the type of task that can be addressed and the sensing modalities that can be incorporated, they still require large amounts of training data. Especially with regard to learning actions on robots in the real world, this poses a major problem due to the hi…

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 7 pages, 5 figures, 2 tables, submitted to IROS2024

  6. arXiv:2403.11914  [pdf, other]

    cs.LG cs.RO

    Single-Agent Actor Critic for Decentralized Cooperative Driving

    Authors: Shengchao Yan, Lukas König, Wolfram Burgard

    Abstract: Active traffic management incorporating autonomous vehicles (AVs) promises a future with diminished congestion and enhanced traffic flow. However, developing algorithms for real-world application requires addressing the challenges posed by continuous traffic flow and partial observability. To bridge this gap and advance the field of active traffic management towards greater decentralization, we in…

    Submitted 18 March, 2024; originally announced March 2024.

  7. arXiv:2403.11761  [pdf, other]

    cs.RO cs.CV

    BEVCar: Camera-Radar Fusion for BEV Map and Object Segmentation

    Authors: Jonas Schramm, Niclas Vödisch, Kürsat Petek, B Ravi Kiran, Senthil Yogamani, Wolfram Burgard, Abhinav Valada

    Abstract: Semantic scene segmentation from a bird's-eye-view (BEV) perspective plays a crucial role in facilitating planning and decision-making for mobile robots. Although recent vision-only methods have demonstrated notable advancements in performance, they often struggle under adverse illumination conditions such as rain or nighttime. While active sensors offer a solution to this challenge, the prohibiti…

    Submitted 18 March, 2024; originally announced March 2024.

  8. arXiv:2402.07691  [pdf, other]

    cs.RO

    Evaluation of a Smart Mobile Robotic System for Industrial Plant Inspection and Supervision

    Authors: Georg K. J. Fischer, Max Bergau, D. Adriana Gómez-Rosal, Andreas Wachaja, Johannes Gräter, Matthias Odenweller, Uwe Piechottka, Fabian Hoeflinger, Nikhil Gosala, Niklas Wetzel, Daniel Büscher, Abhinav Valada, Wolfram Burgard

    Abstract: Automated and autonomous industrial inspection is a longstanding research field, driven by the necessity to enhance safety and efficiency within industrial settings. In addressing this need, we introduce an autonomously navigating robotic system designed for comprehensive plant inspection. This innovative system comprises a robotic platform equipped with a diverse array of sensors integrated to fa…

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: Submitted for publication in IEEE Sensors Journal

  9. arXiv:2402.05840  [pdf, other]

    cs.RO

    uPLAM: Robust Panoptic Localization and Mapping Leveraging Perception Uncertainties

    Authors: Kshitij Sirohi, Daniel Büscher, Wolfram Burgard

    Abstract: The availability of a robust map-based localization system is essential for the operation of many autonomously navigating vehicles. Since uncertainty is an inevitable part of perception, it is beneficial for the robustness of the robot to consider it in typical downstream tasks of navigation stacks. In particular localization and mapping methods, which in modern systems often employ convolutional…

    Submitted 20 March, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

  10. arXiv:2312.08240  [pdf, other]

    cs.RO cs.CV

    CenterGrasp: Object-Aware Implicit Representation Learning for Simultaneous Shape Reconstruction and 6-DoF Grasp Estimation

    Authors: Eugenio Chisari, Nick Heppert, Tim Welschehold, Wolfram Burgard, Abhinav Valada

    Abstract: Reliable object grasping is a crucial capability for autonomous robots. However, many existing grasping approaches focus on general clutter removal without explicitly modeling objects and thus only relying on the visible local geometry. We introduce CenterGrasp, a novel framework that combines object awareness and holistic grasping. CenterGrasp learns a general object prior by encoding shapes and…

    Submitted 5 April, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: Accepted at RA-L. Video, code and models available at http://centergrasp.cs.uni-freiburg.de

  11. arXiv:2310.15059  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Robot Skill Generalization via Keypoint Integrated Soft Actor-Critic Gaussian Mixture Models

    Authors: Iman Nematollahi, Kirill Yankov, Wolfram Burgard, Tim Welschehold

    Abstract: A long-standing challenge for a robotic manipulation system operating in real-world scenarios is adapting and generalizing its acquired motor skills to unseen environments. We tackle this challenge employing hybrid skill models that integrate imitation and reinforcement paradigms, to explore how the learning and adaptation of a skill, along with its core grounding in the scene through a learned ke…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: Accepted at the International Symposium on Experimental Robotics (ISER) 2023. Videos at http://kis-gmm.cs.uni-freiburg.de/

  12. arXiv:2310.08864  [pdf, other]

    cs.RO

    Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    Authors: Open X-Embodiment Collaboration, Abby O'Neill, Abdul Rehman, Abhinav Gupta, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, Ajinkya Jain, Albert Tung, Alex Bewley, Alex Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Andrey Kolobov, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie, et al. (267 additional authors not shown)

    Abstract: Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning method…

    Submitted 1 June, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Project website: https://robotics-transformer-x.github.io

  13. arXiv:2310.05600  [pdf, other]

    cs.RO cs.CV

    Care3D: An Active 3D Object Detection Dataset of Real Robotic-Care Environments

    Authors: Michael G. Adam, Sebastian Eger, Martin Piccolrovazzi, Maged Iskandar, Joern Vogel, Alexander Dietrich, Seongjien Bien, Jon Skerlj, Abdeldjallil Naceri, Eckehard Steinbach, Alin Albu-Schaeffer, Sami Haddadin, Wolfram Burgard

    Abstract: As labor shortage increases in the health sector, the demand for assistive robotics grows. However, the needed test data to develop those robots is scarce, especially for the application of active 3D object detection, where no real data exists at all. This short paper counters this by introducing such an annotated dataset of real environments. The captured environments represent areas which are al…

    Submitted 9 October, 2023; originally announced October 2023.

  14. arXiv:2310.05239  [pdf, other]

    cs.RO

    LAN-grasp: Using Large Language Models for Semantic Object Grasping

    Authors: Reihaneh Mirjalili, Michael Krawez, Simone Silenzi, Yannik Blei, Wolfram Burgard

    Abstract: In this paper, we propose LAN-grasp, a novel approach towards more appropriate semantic grasping. We use foundation models to provide the robot with a deeper understanding of the objects, the right place to grasp an object, or even the parts to avoid. This allows our robot to grasp and utilize objects in a more meaningful and safe manner. We leverage the combination of a Large Language Model, a Vi…

    Submitted 8 October, 2023; originally announced October 2023.

  15. arXiv:2309.10726  [pdf, other]

    cs.CV cs.RO

    Few-Shot Panoptic Segmentation With Foundation Models

    Authors: Markus Käppeler, Kürsat Petek, Niclas Vödisch, Wolfram Burgard, Abhinav Valada

    Abstract: Current state-of-the-art methods for panoptic segmentation require an immense amount of annotated training data that is both arduous and expensive to obtain posing a significant challenge for their widespread adoption. Concurrently, recent breakthroughs in visual representation learning have sparked a paradigm shift leading to the advent of large foundation models that can be trained with complete…

    Submitted 1 March, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

    Comments: Accepted for "IEEE International Conference on Robotics and Automation (ICRA) 2024"

  16. arXiv:2309.06635  [pdf, other]

    cs.RO

    Collaborative Dynamic 3D Scene Graphs for Automated Driving

    Authors: Elias Greve, Martin Büchner, Niclas Vödisch, Wolfram Burgard, Abhinav Valada

    Abstract: Maps have played an indispensable role in enabling safe and automated driving. Although there have been many advances on different fronts ranging from SLAM to semantics, building an actionable hierarchical semantic representation of urban dynamic scenes and processing information from multiple agents are still challenging problems. In this work, we present Collaborative URBan Scene Graphs (CURB-SG…

    Submitted 4 March, 2024; v1 submitted 12 September, 2023; originally announced September 2023.

    Comments: Accepted for "IEEE International Conference on Robotics and Automation (ICRA) 2024"

  17. arXiv:2308.05612  [pdf, other]

    cs.RO cs.AI

    A Smart Robotic System for Industrial Plant Supervision

    Authors: D. Adriana Gómez-Rosal, Max Bergau, Georg K. J. Fischer, Andreas Wachaja, Johannes Gräter, Matthias Odenweller, Uwe Piechottka, Fabian Hoeflinger, Nikhil Gosala, Niklas Wetzel, Daniel Büscher, Abhinav Valada, Wolfram Burgard

    Abstract: In today's chemical plants, human field operators perform frequent integrity checks to guarantee high safety standards, and thus are possibly the first to encounter dangerous operating conditions. To alleviate their task, we present a system consisting of an autonomously navigating robot integrated with various sensors and intelligent data processing. It is able to detect methane leaks and estimat…

    Submitted 1 September, 2023; v1 submitted 10 August, 2023; originally announced August 2023.

    Comments: Final submission for IEEE Sensors 2023

  18. arXiv:2307.00488  [pdf, other]

    cs.RO

    POV-SLAM: Probabilistic Object-Aware Variational SLAM in Semi-Static Environments

    Authors: Jingxing Qian, Veronica Chatrath, James Servos, Aaron Mavrinac, Wolfram Burgard, Steven L. Waslander, Angela P. Schoellig

    Abstract: Simultaneous localization and mapping (SLAM) in slowly varying scenes is important for long-term robot task completion. Failing to detect scene changes may lead to inaccurate maps and, ultimately, lost robots. Classical SLAM algorithms assume static scenes, and recent works take dynamics into account, but require scene changes to be observed in consecutive frames. Semi-static scenes, wherein objec…

    Submitted 2 July, 2023; originally announced July 2023.

    Comments: Published in Robotics: Science and Systems (RSS) 2023

  19. arXiv:2306.16316  [pdf, other]

    cs.RO

    Learning Continuous Control with Geometric Regularity from Robot Intrinsic Symmetry

    Authors: Shengchao Yan, Baohe Zhang, Yuan Zhang, Joschka Boedecker, Wolfram Burgard

    Abstract: Geometric regularity, which leverages data symmetry, has been successfully incorporated into deep learning architectures such as CNNs, RNNs, GNNs, and Transformers. While this concept has been widely applied in robotics to address the curse of dimensionality when learning from high-dimensional data, the inherent reflectional and rotational symmetry of robot structures has not been adequately explo…

    Submitted 18 March, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: accepted by ICRA 2024

  20. arXiv:2306.15410  [pdf, other]

    cs.CV

    AutoGraph: Predicting Lane Graphs from Traffic Observations

    Authors: Jannik Zürn, Ingmar Posner, Wolfram Burgard

    Abstract: Lane graph estimation is a long-standing problem in the context of autonomous driving. Previous works aimed at solving this problem by relying on large-scale, hand-annotated lane graphs, introducing a data bottleneck for training models to solve this task. To overcome this limitation, we propose to use the motion patterns of traffic participants as lane graph annotations. In our AutoGraph approach…

    Submitted 10 November, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: 8 pages, 6 figures

  21. arXiv:2306.11346  [pdf, other]

    cs.RO cs.CV

    End-to-end 2D-3D Registration between Image and LiDAR Point Cloud for Vehicle Localization

    Authors: Guangming Wang, Yu Zheng, Yanfeng Guo, Zhe Liu, Yixiang Zhu, Wolfram Burgard, Hesheng Wang

    Abstract: Robot localization using a previously built map is essential for a variety of tasks including highly accurate navigation and mobile manipulation. A popular approach to robot localization is based on image-to-point cloud registration, which combines illumination-invariant LiDAR-based mapping with economical image-based localization. However, the recent works for image-to-point cloud registration ei…

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: 18 pages, 14 figures, under review

  22. Fast yet predictable braking manoeuvers for real-time robot control

    Authors: Mazin Hamad, Jesus Gutierrez-Moreno, Hugo T. M. Kussaba, Nico Mansfeld, Saeed Abdolshah, Abdalla Swikir, Wolfram Burgard, Sami Haddadin

    Abstract: This paper proposes a framework for generating fast, smooth and predictable braking manoeuvers for a controlled robot. The proposed framework integrates two approaches to obtain feasible modal limits for designing braking trajectories. The first approach is real-time capable but conservative considering the usage of the available feasible actuator control region, resulting in longer braking times.…

    Submitted 10 June, 2023; originally announced June 2023.

    Comments: This work has been accepted to the 22nd IFAC World Congress

  23. arXiv:2305.04718  [pdf, other]

    cs.RO cs.AI cs.CV

    The Treachery of Images: Bayesian Scene Keypoints for Deep Policy Learning in Robotic Manipulation

    Authors: Jan Ole von Hartz, Eugenio Chisari, Tim Welschehold, Wolfram Burgard, Joschka Boedecker, Abhinav Valada

    Abstract: In policy learning for robotic manipulation, sample efficiency is of paramount importance. Thus, learning and extracting more compact representations from camera observations is a promising avenue. However, current methods often assume full observability of the scene and struggle with scale invariance. In many tasks and settings, this assumption does not hold as objects in the scene are often occl…

    Submitted 20 September, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

    Journal ref: IEEE Robotics and Automation Letters, vol. 8, no. 11, pp. 6931-6938, Nov. 2023

  24. arXiv:2304.07058  [pdf, other]

    cs.RO

    FM-Loc: Using Foundation Models for Improved Vision-based Localization

    Authors: Reihaneh Mirjalili, Michael Krawez, Wolfram Burgard

    Abstract: Visual place recognition is essential for vision-based robot localization and SLAM. Despite the tremendous progress made in recent years, place recognition in changing environments remains challenging. A promising approach to cope with appearance variations is to leverage high-level semantic features like objects or place categories. In this paper, we propose FM-Loc which is a novel image-based lo…

    Submitted 14 April, 2023; originally announced April 2023.

  25. arXiv:2303.11756  [pdf, other]

    cs.RO cs.LG

    Improving Deep Dynamics Models for Autonomous Vehicles with Multimodal Latent Mapping of Surfaces

    Authors: Johan Vertens, Nicolai Dorka, Tim Welschehold, Michael Thompson, Wolfram Burgard

    Abstract: The safe deployment of autonomous vehicles relies on their ability to effectively react to environmental changes. This can require maneuvering on varying surfaces which is still a difficult problem, especially for slippery terrains. To address this issue we propose a new approach that learns a surface-aware dynamics model by conditioning it on a latent variable vector storing surface information a…

    Submitted 21 March, 2023; originally announced March 2023.

  26. CoVIO: Online Continual Learning for Visual-Inertial Odometry

    Authors: Niclas Vödisch, Daniele Cattaneo, Wolfram Burgard, Abhinav Valada

    Abstract: Visual odometry is a fundamental task for many applications on mobile devices and robotic platforms. Since such applications are oftentimes not limited to predefined target domains and learning-based vision systems are known to generalize poorly to unseen environments, methods for continual adaptation during inference time are of significant interest. In this work, we introduce CoVIO for online co…

    Submitted 17 March, 2023; originally announced March 2023.

    Journal ref: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

  27. arXiv:2303.10147  [pdf, other]

    cs.RO cs.CV

    CoDEPS: Online Continual Learning for Depth Estimation and Panoptic Segmentation

    Authors: Niclas Vödisch, Kürsat Petek, Wolfram Burgard, Abhinav Valada

    Abstract: Operating a robot in the open world requires a high level of robustness with respect to previously unseen environments. Optimally, the robot is able to adapt by itself to new conditions without human supervision, e.g., automatically adjusting its perception system to changing lighting conditions. In this work, we address the task of continual learning for deep learning-based monocular depth estima…

    Submitted 31 May, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

    Comments: Accepted for "Robotics: Science and Systems (RSS) 2023"

  28. arXiv:2303.10144  [pdf, other]

    cs.LG stat.ML

    Dynamic Update-to-Data Ratio: Minimizing World Model Overfitting

    Authors: Nicolai Dorka, Tim Welschehold, Wolfram Burgard

    Abstract: Early stopping based on the validation set performance is a popular approach to find the right balance between under- and overfitting in the context of supervised learning. However, in reinforcement learning, even for supervised sub-problems such as world model learning, early stopping is not applicable as the dataset is continually evolving. As a solution, we propose a new general method that dyn…

    Submitted 17 March, 2023; originally announced March 2023.

    Comments: ICLR 2023

  29. arXiv:2303.07522  [pdf, other]

    cs.RO cs.AI cs.CL cs.CV cs.LG

    Audio Visual Language Maps for Robot Navigation

    Authors: Chenguang Huang, Oier Mees, Andy Zeng, Wolfram Burgard

    Abstract: While interacting in the world is a multi-sensory experience, many robots continue to predominantly rely on visual perception to map and navigate in their environments. In this work, we propose Audio-Visual-Language Maps (AVLMaps), a unified 3D spatial map representation for storing cross-modal information from audio, visual, and language cues. AVLMaps integrate the open-vocabulary capabilities of…

    Submitted 27 March, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

    Comments: Project page: https://avlmaps.github.io/

  30. arXiv:2303.03037  [pdf, other]

    cs.CV cs.AI

    EvCenterNet: Uncertainty Estimation for Object Detection using Evidential Learning

    Authors: Monish R. Nallapareddy, Kshitij Sirohi, Paulo L. J. Drews-Jr, Wolfram Burgard, Chih-Hong Cheng, Abhinav Valada

    Abstract: Uncertainty estimation is crucial in safety-critical settings such as automated driving as it provides valuable information for several downstream tasks including high-level decision making and path planning. In this work, we propose EvCenterNet, a novel uncertainty-aware 2D object detection framework using evidential learning to directly estimate both classification and regression uncertainties.…

    Submitted 28 September, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

  31. arXiv:2302.06175  [pdf, other]

    cs.CV cs.RO

    Learning and Aggregating Lane Graphs for Urban Automated Driving

    Authors: Martin Büchner, Jannik Zürn, Ion-George Todoran, Abhinav Valada, Wolfram Burgard

    Abstract: Lane graph estimation is an essential and highly challenging task in automated driving and HD map learning. Existing methods using either onboard or aerial imagery struggle with complex lane topologies, out-of-distribution scenarios, or significant occlusions in the image space. Moreover, merging overlapping lane graphs to obtain consistent large-scale graphs remains difficult. To overcome these c…

    Submitted 17 March, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: 22 pages, 17 figures

  32. arXiv:2302.04233  [pdf, other]

    cs.CV cs.AI cs.RO

    SkyEye: Self-Supervised Bird's-Eye-View Semantic Mapping Using Monocular Frontal View Images

    Authors: Nikhil Gosala, Kürsat Petek, Paulo L. J. Drews-Jr, Wolfram Burgard, Abhinav Valada

    Abstract: Bird's-Eye-View (BEV) semantic maps have become an essential component of automated driving pipelines due to the rich representation they provide for decision-making tasks. However, existing approaches for generating these maps still follow a fully supervised training paradigm and hence rely on large amounts of annotated BEV data. In this work, we address this limitation by proposing the first sel…

    Submitted 8 February, 2023; originally announced February 2023.

    Comments: 14 pages, 7 figures

  33. arXiv:2210.05714  [pdf, other]

    cs.RO cs.AI cs.CL cs.CV cs.LG

    Visual Language Maps for Robot Navigation

    Authors: Chenguang Huang, Oier Mees, Andy Zeng, Wolfram Burgard

    Abstract: Grounding language to the visual observations of a navigating agent can be performed using off-the-shelf visual-language models pretrained on Internet-scale data (e.g., image captions). While this is useful for matching images to natural language descriptions of object goals, it remains disjoint from the process of mapping the environment, so that it lacks the spatial precision of classic geometri…

    Submitted 8 March, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: Accepted at the 2023 IEEE International Conference on Robotics and Automation (ICRA). Project page: https://vlmaps.github.io

  34. arXiv:2210.04472  [pdf, other]

    cs.CV cs.RO

    Uncertainty-aware LiDAR Panoptic Segmentation

    Authors: Kshitij Sirohi, Sajad Marvi, Daniel Büscher, Wolfram Burgard

    Abstract: Modern autonomous systems often rely on LiDAR scanners, in particular for autonomous driving scenarios. In this context, reliable scene understanding is indispensable. Current learning-based methods typically try to achieve maximum performance for this task, while neglecting a proper estimation of the associated uncertainties. In this work, we introduce a novel approach for solving the task of unc…

    Submitted 10 October, 2022; originally announced October 2022.

  35. arXiv:2210.01911  [pdf, other]

    cs.RO cs.AI cs.CL cs.CV cs.LG

    Grounding Language with Visual Affordances over Unstructured Data

    Authors: Oier Mees, Jessica Borja-Diaz, Wolfram Burgard

    Abstract: Recent works have shown that Large Language Models (LLMs) can be applied to ground natural language to a wide variety of robot skills. However, in practice, learning multi-task, language-conditioned robotic skills typically requires large-scale data collection and frequent human intervention to reset the environment or help correcting the current policies. In this work, we propose a novel approach…

    Submitted 8 March, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: Accepted at the 2023 IEEE International Conference on Robotics and Automation (ICRA). Project website: http://hulc2.cs.uni-freiburg.de

  36. arXiv:2209.11693  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO

    T3VIP: Transformation-based 3D Video Prediction

    Authors: Iman Nematollahi, Erick Rosete-Beas, Seyed Mahdi B. Azad, Raghu Rajan, Frank Hutter, Wolfram Burgard

    Abstract: For autonomous skill acquisition, robots have to learn about the physical rules governing the 3D world dynamics from their own past experience to predict and reason about plausible future outcomes. To this end, we propose a transformation-based 3D video prediction (T3VIP) approach that explicitly models the 3D motion by decomposing a scene into its object parts and predicting their corresponding r…

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: Accepted at the 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  37. PADLoC: LiDAR-Based Deep Loop Closure Detection and Registration Using Panoptic Attention

    Authors: José Arce, Niclas Vödisch, Daniele Cattaneo, Wolfram Burgard, Abhinav Valada

    Abstract: A key component of graph-based SLAM systems is the ability to detect loop closures in a trajectory to reduce the drift accumulated over time from the odometry. Most LiDAR-based methods achieve this goal by using only the geometric information, disregarding the semantics of the scene. In this work, we introduce PADLoC for joint loop closure detection and registration in LiDAR-based SLAM frameworks.…

    Submitted 28 March, 2023; v1 submitted 20 September, 2022; originally announced September 2022.

    Journal ref: IEEE Robotics and Automation Letters, vol. 8, no. 3, pp. 1319-1326, March 2023

  38. arXiv:2209.08959  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Latent Plans for Task-Agnostic Offline Reinforcement Learning

    Authors: Erick Rosete-Beas, Oier Mees, Gabriel Kalweit, Joschka Boedecker, Wolfram Burgard

    Abstract: Everyday tasks of long-horizon and comprising a sequence of multiple implicit subtasks still impose a major challenge in offline robot control. While a number of prior methods aimed to address this setting with variants of imitation and offline reinforcement learning, the learned behavior is typically narrow and often struggles to reach configurable long-horizon goals. As both paradigms have compl…

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: CoRL 2022. Project website: http://tacorl.cs.uni-freiburg.de/

  39. arXiv:2209.05247  [pdf, other]

    cs.RO cs.CV

    TrackletMapper: Ground Surface Segmentation and Mapping from Traffic Participant Trajectories

    Authors: Jannik Zürn, Sebastian Weber, Wolfram Burgard

    Abstract: Robustly classifying ground infrastructure such as roads and street crossings is an essential task for mobile robots operating alongside pedestrians. While many semantic segmentation datasets are available for autonomous vehicles, models trained on such datasets exhibit a large domain gap when deployed on robots operating in pedestrian spaces. Manually annotating images recorded from pedestrian vi…

    Submitted 8 January, 2023; v1 submitted 12 September, 2022; originally announced September 2022.

    Comments: 19 pages, 14 figures, CoRL 2022 v4 (updated acknowledgements)

  40. arXiv:2207.07469  [pdf, other]

    cs.CV cs.RO

    USegScene: Unsupervised Learning of Depth, Optical Flow and Ego-Motion with Semantic Guidance and Coupled Networks

    Authors: Johan Vertens, Wolfram Burgard

    Abstract: In this paper we propose USegScene, a framework for semantically guided unsupervised learning of depth, optical flow and ego-motion estimation for stereo camera images using convolutional neural networks. Our framework leverages semantic information for improved regularization of depth and optical flow maps, multimodal fusion and occlusion filling considering dynamic rigid object motions as indepe…

    Submitted 15 July, 2022; originally announced July 2022.

  41. arXiv:2206.14554  [pdf, other]

    cs.CV cs.RO

    Uncertainty-aware Panoptic Segmentation

    Authors: Kshitij Sirohi, Sajad Marvi, Daniel Büscher, Wolfram Burgard

    Abstract: Reliable scene understanding is indispensable for modern autonomous systems. Current learning-based methods typically try to maximize their performance based on segmentation metrics that only consider the quality of the segmentation. However, for the safe operation of a system in the real world it is crucial to consider the uncertainty in the prediction as well. In this work, we introduce the nove…

    Submitted 24 December, 2022; v1 submitted 29 June, 2022; originally announced June 2022.

  42. arXiv:2204.06252  [pdf, other]

    cs.RO cs.AI cs.CL cs.CV

    What Matters in Language Conditioned Robotic Imitation Learning over Unstructured Data

    Authors: Oier Mees, Lukas Hermann, Wolfram Burgard

    Abstract: A long-standing goal in robotics is to build robots that can perform a wide range of daily tasks from perceptions obtained with their onboard sensors and specified only via natural language. While recently substantial advances have been achieved in language-driven robotics by leveraging end-to-end learning from pixels, there is no clear and well-understood process for making various design choices…

    Submitted 30 August, 2022; v1 submitted 13 April, 2022; originally announced April 2022.

    Comments: Accepted for publication at IEEE Robotics and Automation Letters (RAL). Codebase and trained models available at http://hulc.cs.uni-freiburg.de

  43. Continual SLAM: Beyond Lifelong Simultaneous Localization and Mapping Through Continual Learning

    Authors: Niclas Vödisch, Daniele Cattaneo, Wolfram Burgard, Abhinav Valada

    Abstract: Robots operating in the open world encounter various different environments that can substantially differ from each other. This domain gap also poses a challenge for Simultaneous Localization and Mapping (SLAM) being one of the fundamental tasks for navigation. In particular, learning-based SLAM methods are known to generalize poorly to unseen environments hindering their general adoption. In this…

    Submitted 13 March, 2023; v1 submitted 3 March, 2022; originally announced March 2022.

    Journal ref: Robotics Research. ISRR 2022. Springer Proceedings in Advanced Robotics, vol 27, pp 19-35

  44. arXiv:2203.00403  [pdf, other]

    cs.RO cs.AI

    OpenDR: An Open Toolkit for Enabling High Performance, Low Footprint Deep Learning for Robotics

    Authors: N. Passalis, S. Pedrazzi, R. Babuska, W. Burgard, D. Dias, F. Ferro, M. Gabbouj, O. Green, A. Iosifidis, E. Kayacan, J. Kober, O. Michel, N. Nikolaidis, P. Nousi, R. Pieters, M. Tzelepi, A. Valada, A. Tefas

    Abstract: Existing Deep Learning (DL) frameworks typically do not provide ready-to-use solutions for robotics, where very specific learning, reasoning, and embodiment problems exist. Their relatively steep learning curve and the different methodologies employed by DL compared to traditional approaches, along with the high complexity of DL models, which often leads to the need of employing specialized hardwa…

    Submitted 1 March, 2022; originally announced March 2022.

  45. arXiv:2203.00352  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Affordance Learning from Play for Sample-Efficient Policy Learning

    Authors: Jessica Borja-Diaz, Oier Mees, Gabriel Kalweit, Lukas Hermann, Joschka Boedecker, Wolfram Burgard

    Abstract: Robots operating in human-centered environments should have the ability to understand how objects function: what can be done with each object, where this interaction may occur, and how the object is used to achieve a goal. To this end, we propose a novel approach that extracts a self-supervised visual affordance model from human teleoperated play data and leverages it to enable efficient policy le…

    Submitted 1 March, 2022; originally announced March 2022.

    Comments: Accepted at the 2022 IEEE International Conference on Robotics and Automation (ICRA). Videos at http://vapo.cs.uni-freiburg.de/

  46. arXiv:2201.12771  [pdf, other]

    cs.CV

    Self-Supervised Moving Vehicle Detection from Audio-Visual Cues

    Authors: Jannik Zürn, Wolfram Burgard

    Abstract: Robust detection of moving vehicles is a critical task for any autonomously operating outdoor robot or self-driving vehicle. Most modern approaches for solving this task rely on training image-based detectors using large-scale vehicle detection datasets such as nuScenes or the Waymo Open Dataset. Providing manual annotations is an expensive and laborious exercise that does not scale well in practi…

    Submitted 13 June, 2022; v1 submitted 30 January, 2022; originally announced January 2022.

    Comments: 8 pages, 6 figures

  47. arXiv:2112.03227  [pdf, other]

    cs.RO cs.AI cs.CL cs.CV cs.LG

    CALVIN: A Benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks

    Authors: Oier Mees, Lukas Hermann, Erick Rosete-Beas, Wolfram Burgard

    Abstract: General-purpose robots coexisting with humans in their environment must learn to relate human language to their perceptions and actions to be useful in a range of daily tasks. Moreover, they need to acquire a diverse repertoire of general-purpose skills that allow composing long-horizon tasks by following unconstrained language instructions. In this paper, we present CALVIN (Composing Actions from…

    Submitted 13 July, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

    Comments: Accepted for publication at IEEE Robotics and Automation Letters (RAL). Code, models and dataset available at http://calvin.cs.uni-freiburg.de

  48. arXiv:2111.13129  [pdf, other]

    cs.RO cs.CV cs.LG

    Robot Skill Adaptation via Soft Actor-Critic Gaussian Mixture Models

    Authors: Iman Nematollahi, Erick Rosete-Beas, Adrian Röfer, Tim Welschehold, Abhinav Valada, Wolfram Burgard

    Abstract: A core challenge for an autonomous agent acting in the real world is to adapt its repertoire of skills to cope with its noisy perception and dynamics. To scale learning of skills to long-horizon tasks, robots should be able to learn and later refine their skills in a structured manner through trajectories rather than making instantaneous decisions individually at each time step. To this end, we pr…

    Submitted 19 September, 2022; v1 submitted 25 November, 2021; originally announced November 2021.

    Comments: Accepted at the 2022 IEEE International Conference on Robotics and Automation (ICRA)

  49. arXiv:2111.12673  [pdf, other]

    cs.LG cs.AI cs.RO

    Adaptively Calibrated Critic Estimates for Deep Reinforcement Learning

    Authors: Nicolai Dorka, Tim Welschehold, Joschka Boedecker, Wolfram Burgard

    Abstract: Accurate value estimates are important for off-policy reinforcement learning. Algorithms based on temporal difference learning typically are prone to an over- or underestimation bias building up over time. In this paper, we propose a general method called Adaptively Calibrated Critics (ACC) that uses the most recent high variance but unbiased on-policy rollouts to alleviate the bias of the low var…

    Submitted 21 October, 2022; v1 submitted 24 November, 2021; originally announced November 2021.

    Comments: Submitted to RA-L

  50. arXiv:2110.10563  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Robust Monocular Localization in Sparse HD Maps Leveraging Multi-Task Uncertainty Estimation

    Authors: Kürsat Petek, Kshitij Sirohi, Daniel Büscher, Wolfram Burgard

    Abstract: Robust localization in dense urban scenarios using a low-cost sensor setup and sparse HD maps is highly relevant for the current advances in autonomous driving, but remains a challenging topic in research. We present a novel monocular localization approach based on a sliding-window pose graph that leverages predicted uncertainties for increased precision and robustness against challenging scenario…

    Submitted 20 October, 2021; originally announced October 2021.