Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleNovember 2024
Semantic Environment Atlas for Object-Goal Navigation
AbstractIn this paper, we introduce the Semantic Environment Atlas (SEA), a novel mapping approach designed to enhance visual navigation capabilities of embodied agents. The SEA utilizes semantic graph maps that intricately delineate the relationships ...
Highlights- Enhanced Navigation Capabilities: The SEA enhances navigation using semantic graph maps, improving localization and tasks.
- Robust Against Sensor Noise: SEA achieves robust navigation by leveraging semantic knowledge, even with sensor ...
- research-articleDecember 2023
Sequential preference ranking for efficient reinforcement learning from human feedback
NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing SystemsArticle No.: 2133, Pages 49088–49099Reinforcement learning from human feedback (RLHF) alleviates the problem of designing a task-specific reward function in reinforcement learning by learning it from human preference. However, existing RLHF models are considered inefficient as they produce ...
- research-articleDecember 2023
Trust region-based safe distributional reinforcement learning for multiple constraints
NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing SystemsArticle No.: 874, Pages 19908–19939In safety-critical robotic tasks, potential failures must be reduced, and multiple constraints must be met, such as avoiding collisions, limiting energy consumption, and maintaining balance. Thus, applying safe reinforcement learning (RL) in such robotic ...
- research-articleDecember 2023
Diffused task-agnostic milestone planner
NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing SystemsArticle No.: 19, Pages 387–405Addressing decision-making problems using sequence modeling to predict future trajectories shows promising results in recent years. In this paper, we take a step further to leverage the sequence predictive method in wider areas such as long-term planning,...
- research-articleMarch 2023
Local Selective Vision Transformer for Depth Estimation Using a Compound Eye Camera
Pattern Recognition Letters (PTRL), Volume 167, Issue CPages 82–89https://doi.org/10.1016/j.patrec.2023.02.010Highlights- We propose a coarse depth estimation method for compound eye images based on deep learning.
- We propose a network suitable for compound eye structure based on Vision Transformer.
- Performance of coarse depth estimation improves ...
A compound eye camera is a hemispherical camera made by mimicking the structure of an insect’s eye. In general, a compound eye camera is composed of a set of single eye cameras. The compound eye camera has various advantages due to its unique ...
-
- research-articleMay 2022
Visually Grounding Language Instruction for History-Dependent Manipulation
2022 International Conference on Robotics and Automation (ICRA)Pages 675–682https://doi.org/10.1109/ICRA46639.2022.9812279This paper emphasizes the importance of a robot's ability to refer to its task history, especially when it exe-cutes a series of pick-and-place manipulations by following language instructions given one by one. The advantage of referring to the ...
- research-articleApril 2022
Deep latent-space sequential skill chaining from incomplete demonstrations
Intelligent Service Robotics (SPISR), Volume 15, Issue 2Pages 203–213https://doi.org/10.1007/s11370-021-00409-zAbstractImitation learning is a methodology, which trains an agent using demonstrations from skilled experts without external rewards. However, for a complex task with a long horizon, it is challenging to obtain data that exactly match the desired task. ...
- research-articleOctober 2021
Image-Goal Navigation Algorithm using Viewpoint Estimation
2021 21st International Conference on Control, Automation and Systems (ICCAS)Pages 689–692https://doi.org/10.23919/ICCAS52745.2021.9650044This paper tackles the image-goal navigation problem, in which a robot needs to find a goal pose based on the target image. The proposed algorithm estimates the geometric information between the target pose and the current pose of the robot. Using the ...
- research-articleOctober 2021
Learning Latent Dynamics from Multi-View Observations for Image-Based Control
2021 21st International Conference on Control, Automation and Systems (ICCAS)Pages 699–702https://doi.org/10.23919/ICCAS52745.2021.9650040In this paper, we tackle the problem of efficient data-driven control of a system, of which the observations are given by raw images. By learning latent representations of image observations and their dynamics model, the image-based control problems can ...
- research-articleOctober 2021
Image-Goal Navigation via Metric Mapping and Keypoint based Reinforcement Learning
2021 21st International Conference on Control, Automation and Systems (ICCAS)Pages 319–324https://doi.org/10.23919/ICCAS52745.2021.9650026In this paper, we tackle the problem of Image-Goal Navigation, where an agent is asked to find the viewpoint of a given goal image in an unseen environment. Past methods rely on depth cameras, require panoramic observations and a pre-built graph network, ...
- research-articleOctober 2021
Decomposed Q-Learning for Non-Prehensile Rearrangement Problem
2021 21st International Conference on Control, Automation and Systems (ICCAS)Pages 1756–1759https://doi.org/10.23919/ICCAS52745.2021.9649975In this paper, we address a planar non-prehensile rearrangement task. As a problem of pushing objects to desired target points, we model the problem as Multi-objective Markov decision processes (MOMDPs) to efficiently solve it and propose a method of ...
- research-articleOctober 2021
Vision-Based 3D Reconstruction Using a Compound Eye Camera
2021 21st International Conference on Control, Automation and Systems (ICCAS)Pages 418–423https://doi.org/10.23919/ICCAS52745.2021.9649968The vision-based 3D reconstruction methods have various advantages and can be used in various applications such as navigation. Although various vision-based methods are being studied, it is difficult to reconstruct many parts at once with a general camera ...
- research-articleOctober 2021
Semantic Descriptors into Representation for Robust Indoor Visual Place Recognition
2021 21st International Conference on Control, Automation and Systems (ICCAS)Pages 715–718https://doi.org/10.23919/ICCAS52745.2021.9649944Visual place localization (VPL) is a problem finding the closest database image from a query image. Since the outdoor images can be recognized from GPS sensors, VPL in an indoor scene is a difficult problem. Also, Image changes indoors are more severe ...
- research-articleSeptember 2021
Road Graphical Neural Networks for Autonomous Roundabout Driving
2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)Pages 162–167https://doi.org/10.1109/IROS51168.2021.9636411We propose a novel autonomous driving frame-work that leverages graph-based features of roads, such as road positions and connections. The proposed method is divided into two parts: a low-level controller which follows the trajectory calculated by a graph-...
- research-articleFebruary 2021
Reconstruct as Far as You Can: Consensus of Non-Rigid Reconstruction from Feasible Regions
IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 43, Issue 2Pages 623–637https://doi.org/10.1109/TPAMI.2019.2931317Much progress has been made for non-rigid structure from motion (NRSfM) during the last two decades, which made it possible to provide reasonable solutions for synthetically-created benchmark data. In order to utilize these NRSfM techniques in more ...
- research-articleDecember 2020
Optimal algorithms for stochastic multi-armed bandits with heavy tailed rewards
NIPS '20: Proceedings of the 34th International Conference on Neural Information Processing SystemsArticle No.: 708, Pages 8452–8462In this paper, we consider stochastic multi-armed bandits (MABs) with heavy-tailed rewards, whose p-th moment is bounded by a constant νp for 1 < p ≤ 2. First, we propose a novel robust estimator which does not require νp as prior information, while ...
- research-articleOctober 2020
No-Regret Shannon Entropy Regularized Neural Contextual Bandit Online Learning for Robotic Grasping
2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)Pages 9620–9625https://doi.org/10.1109/IROS45743.2020.9341123In this paper, we propose a novel contextual bandit algorithm that employs a neural network as a reward estimator and utilizes Shannon entropy regularization to encourage exploration, which is called Shannon entropy regularized neural contextual bandits (...
- research-articleOctober 2020
MixGAIL: Autonomous Driving Using Demonstrations with Mixed Qualities
2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)Pages 5425–5430https://doi.org/10.1109/IROS45743.2020.9341104In this paper, we consider autonomous driving of a vehicle using imitation learning. Generative adversarial imitation learning (GAIL) is a widely used algorithm for imitation learning. This algorithm leverages positive demonstrations to imitate the ...
- research-articleOctober 2020
Pedestrian Intention Prediction for Autonomous Driving Using a Multiple Stakeholder Perspective Model
2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)Pages 7957–7962https://doi.org/10.1109/IROS45743.2020.9341083This paper proposes a multiple stakeholder perspective model (MSPM) which predicts the future pedestrian trajectory observed from vehicle’s point of view. For the vehicle-pedestrian interaction, the estimation of the pedestrian’s intention ...
- research-articleOctober 2020
GOPE: Geometry-Aware Optimal Viewpoint Path Estimation Using a Monocular Camera
2020 20th International Conference on Control, Automation and Systems (ICCAS)Pages 1062–1067https://doi.org/10.23919/ICCAS50221.2020.9268299The goal of the optimal viewpoint path estimation is to generate a path to the optimal viewpoint location where the robot can best see the Point of Interest (POI). There are several learning-based methods to find an optimal viewpoint, but these methods ...