Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–18 of 18 results for author: Shin, U

Searching in archive cs. Search in all archives.
.
  1. arXiv:2409.07715  [pdf, other

    cs.CV cs.AI cs.RO

    FIReStereo: Forest InfraRed Stereo Dataset for UAS Depth Perception in Visually Degraded Environments

    Authors: Devansh Dhrafani, Yifei Liu, Andrew Jong, Ukcheol Shin, Yao He, Tyler Harp, Yaoyu Hu, Jean Oh, Sebastian Scherer

    Abstract: Robust depth perception in visually-degraded environments is crucial for autonomous aerial systems. Thermal imaging cameras, which capture infrared radiation, are robust to visual degradation. However, due to lack of a large-scale dataset, the use of thermal cameras for unmanned aerial system (UAS) depth perception has remained largely unexplored. This paper presents a stereo thermal depth percept… ▽ More

    Submitted 11 September, 2024; originally announced September 2024.

    Comments: Under review in RA-L. The first 2 authors contributed equally

  2. arXiv:2408.03551  [pdf, other

    cs.CV cs.RO

    VPOcc: Exploiting Vanishing Point for Monocular 3D Semantic Occupancy Prediction

    Authors: Junsu Kim, Junhee Lee, Ukcheol Shin, Jean Oh, Kyungdon Joo

    Abstract: Monocular 3D semantic occupancy prediction is becoming important in robot vision due to the compactness of using a single RGB camera. However, existing methods often do not adequately account for camera perspective geometry, resulting in information imbalance along the depth range of the image. To address this issue, we propose a vanishing point (VP) guided monocular 3D semantic occupancy predicti… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

  3. arXiv:2407.07995  [pdf, other

    cs.CV

    Flow4D: Leveraging 4D Voxel Network for LiDAR Scene Flow Estimation

    Authors: Jaeyeul Kim, Jungwan Woo, Ukcheol Shin, Jean Oh, Sunghoon Im

    Abstract: Understanding the motion states of the surrounding environment is critical for safe autonomous driving. These motion states can be accurately derived from scene flow, which captures the three-dimensional motion field of points. Existing LiDAR scene flow methods extract spatial features from each point cloud and then fuse them channel-wise, resulting in the implicit extraction of spatio-temporal fe… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: 8 pages, 4 figures

  4. arXiv:2405.10780  [pdf

    eess.SP cs.AR cs.HC cs.LG q-bio.NC

    Intelligent and Miniaturized Neural Interfaces: An Emerging Era in Neurotechnology

    Authors: Mahsa Shoaran, Uisub Shin, MohammadAli Shaeri

    Abstract: Integrating smart algorithms on neural devices presents significant opportunities for various brain disorders. In this paper, we review the latest advancements in the development of three categories of intelligent neural prostheses featuring embedded signal processing on the implantable or wearable device. These include: 1) Neural interfaces for closed-loop symptom tracking and responsive stimulat… ▽ More

    Submitted 31 May, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

    Journal ref: 2024 IEEE Custom Integrated Circuits Conference (CICC), Denver, CO, USA, 2024, pp. 1-7

  5. arXiv:2404.01636  [pdf, other

    cs.CV cs.AI cs.LG cs.RO eess.SY

    Learning to Control Camera Exposure via Reinforcement Learning

    Authors: Kyunghyun Lee, Ukcheol Shin, Byeong-Uk Lee

    Abstract: Adjusting camera exposure in arbitrary lighting conditions is the first step to ensure the functionality of computer vision applications. Poorly adjusted camera exposure often leads to critical failure and performance degradation. Traditional camera exposure control methods require multiple convergence steps and time-consuming processes, making them unsuitable for dynamic lighting conditions. In t… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted at CVPR 2024, *First two authors contributed equally to this work. Project page link: https://sites.google.com/view/drl-ae

  6. arXiv:2403.19985  [pdf, other

    cs.CV

    Stable Surface Regularization for Fast Few-Shot NeRF

    Authors: Byeongin Joung, Byeong-Uk Lee, Jaesung Choe, Ukcheol Shin, Minjun Kang, Taeyeop Lee, In So Kweon, Kuk-Jin Yoon

    Abstract: This paper proposes an algorithm for synthesizing novel views under few-shot setup. The main concept is to develop a stable surface regularization technique called Annealing Signed Distance Function (ASDF), which anneals the surface in a coarse-to-fine manner to accelerate convergence speed. We observe that the Eikonal loss - which is a widely known geometric regularization - requires dense traini… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: 3DV 2024

  7. arXiv:2403.00398  [pdf, other

    cs.RO

    Learning Quadrupedal Locomotion with Impaired Joints Using Random Joint Masking

    Authors: Mincheol Kim, Ukcheol Shin, Jung-Yup Kim

    Abstract: Quadrupedal robots have played a crucial role in various environments, from structured environments to complex harsh terrains, thanks to their agile locomotion ability. However, these robots can easily lose their locomotion functionality if damaged by external accidents or internal malfunctions. In this paper, we propose a novel deep reinforcement learning framework to enable a quadrupedal robot t… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: Appear to ICRA 2024, Project page: https://sites.google.com/view/learning-impaired-joints-loco

  8. arXiv:2312.08603  [pdf, other

    eess.AS cs.SD

    NeXt-TDNN: Modernizing Multi-Scale Temporal Convolution Backbone for Speaker Verification

    Authors: Hyun-Jun Heo, Ui-Hyeop Shin, Ran Lee, YoungJu Cheon, Hyung-Min Park

    Abstract: In speaker verification, ECAPA-TDNN has shown remarkable improvement by utilizing one-dimensional(1D) Res2Net block and squeeze-and-excitation(SE) module, along with multi-layer feature aggregation (MFA). Meanwhile, in vision tasks, ConvNet structures have been modernized by referring to Transformer, resulting in improved performance. In this paper, we present an improved block design for TDNN in… ▽ More

    Submitted 14 December, 2023; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: Accepted by ICASSP 2024

  9. arXiv:2306.07562  [pdf, other

    eess.AS cs.SD

    Statistical Beamformer Exploiting Non-stationarity and Sparsity with Spatially Constrained ICA for Robust Speech Recognition

    Authors: Ui-Hyeop Shin, Hyung-Min Park

    Abstract: In this paper, we present a statistical beamforming algorithm as a pre-processing step for robust automatic speech recognition (ASR). By modeling the target speech as a non-stationary Laplacian distribution, a mask-based statistical beamforming algorithm is proposed to exploit both its output and masked input variance for robust estimation of the beamformer. In addition, we also present a method f… ▽ More

    Submitted 5 January, 2024; v1 submitted 13 June, 2023; originally announced June 2023.

    Comments: Accepted by TASLP

  10. arXiv:2303.17386  [pdf, other

    cs.CV cs.AI cs.RO

    Complementary Random Masking for RGB-Thermal Semantic Segmentation

    Authors: Ukcheol Shin, Kyunghyun Lee, In So Kweon, Jean Oh

    Abstract: RGB-thermal semantic segmentation is one potential solution to achieve reliable semantic scene understanding in adverse weather and lighting conditions. However, the previous studies mostly focus on designing a multi-modal fusion module without consideration of the nature of multi-modality inputs. Therefore, the networks easily become over-reliant on a single modality, making it difficult to learn… ▽ More

    Submitted 4 March, 2024; v1 submitted 30 March, 2023; originally announced March 2023.

    Comments: ICRA 2024, Our source code is available at https://github.com/UkcheolShin/CRM_RGBTSeg

  11. arXiv:2207.03081  [pdf, other

    cs.CV cs.AI cs.LG cs.RO eess.IV

    DRL-ISP: Multi-Objective Camera ISP with Deep Reinforcement Learning

    Authors: Ukcheol Shin, Kyunghyun Lee, In So Kweon

    Abstract: In this paper, we propose a multi-objective camera ISP framework that utilizes Deep Reinforcement Learning (DRL) and camera ISP toolbox that consist of network-based and conventional ISP tools. The proposed DRL-based camera ISP framework iteratively selects a proper tool from the toolbox and applies it to the image to maximize a given vision task-specific reward function. For this purpose, we impl… ▽ More

    Submitted 7 July, 2022; originally announced July 2022.

    Comments: Accepted by IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2022 (*First two authors are equal contributed)

  12. arXiv:2201.04387  [pdf, other

    cs.RO cs.CV

    Maximizing Self-supervision from Thermal Image for Effective Self-supervised Learning of Depth and Ego-motion

    Authors: Ukcheol Shin, Kyunghyun Lee, Byeong-Uk Lee, In So Kweon

    Abstract: Recently, self-supervised learning of depth and ego-motion from thermal images shows strong robustness and reliability under challenging scenarios. However, the inherent thermal image properties such as weak contrast, blurry edges, and noise hinder to generate effective self-supervision from thermal images. Therefore, most research relies on additional self-supervision sources such as well-lit RGB… ▽ More

    Submitted 15 June, 2022; v1 submitted 12 January, 2022; originally announced January 2022.

    Comments: 8 pages, Accepted by IEEE Robotics and Automation Letters (RA-L) with IROS 2022 option

  13. arXiv:2111.12580  [pdf, other

    cs.CV

    UDA-COPE: Unsupervised Domain Adaptation for Category-level Object Pose Estimation

    Authors: Taeyeop Lee, Byeong-Uk Lee, Inkyu Shin, Jaesung Choe, Ukcheol Shin, In So Kweon, Kuk-Jin Yoon

    Abstract: Learning to estimate object pose often requires ground-truth (GT) labels, such as CAD model and absolute-scale object pose, which is expensive and laborious to obtain in the real world. To tackle this problem, we propose an unsupervised domain adaptation (UDA) for category-level object pose estimation, called UDA-COPE. Inspired by recent multi-modal UDA techniques, the proposed method exploits a t… ▽ More

    Submitted 5 April, 2022; v1 submitted 24 November, 2021; originally announced November 2021.

    Comments: Accepted to CVPR 2022

  14. arXiv:2109.05848  [pdf, other

    eess.SP cs.AR

    Closed-Loop Neural Prostheses with On-Chip Intelligence: A Review and A Low-Latency Machine Learning Model for Brain State Detection

    Authors: Bingzhao Zhu, Uisub Shin, Mahsa Shoaran

    Abstract: The application of closed-loop approaches in systems neuroscience and therapeutic stimulation holds great promise for revolutionizing our understanding of the brain and for developing novel neuromodulation therapies to restore lost functions. Neural prostheses capable of multi-channel neural recording, on-site signal processing, rapid symptom detection, and closed-loop stimulation are critical to… ▽ More

    Submitted 13 September, 2021; originally announced September 2021.

  15. Self-Supervised Depth and Ego-Motion Estimation for Monocular Thermal Video Using Multi-Spectral Consistency Loss

    Authors: Ukcheol Shin, Kyunghyun Lee, Seokju Lee, In So Kweon

    Abstract: A thermal camera can robustly capture thermal radiation images under harsh light conditions such as night scenes, tunnels, and disaster scenarios. However, despite this advantage, neither depth nor ego-motion estimation research for the thermal camera have not been actively explored so far. In this paper, we propose a self-supervised learning method for depth and ego-motion estimation from thermal… ▽ More

    Submitted 7 July, 2022; v1 submitted 1 March, 2021; originally announced March 2021.

    Comments: 8 pages, Accepted by IEEE Robotics and Automation Letters (RA-L) with ICRA 2022 option

    Journal ref: IEEE Robotics and Automation Letters, vol. 7, no. 2, pp. 1103-1110, April 2022

  16. arXiv:2012.05417  [pdf, other

    cs.LG cs.AI

    An Efficient Asynchronous Method for Integrating Evolutionary and Gradient-based Policy Search

    Authors: Kyunghyun Lee, Byeong-Uk Lee, Ukcheol Shin, In So Kweon

    Abstract: Deep reinforcement learning (DRL) algorithms and evolution strategies (ES) have been applied to various tasks, showing excellent performances. These have the opposite properties, with DRL having good sample efficiency and poor stability, while ES being vice versa. Recently, there have been attempts to combine these algorithms, but these methods fully rely on synchronous update scheme, making it no… ▽ More

    Submitted 6 January, 2021; v1 submitted 9 December, 2020; originally announced December 2020.

    Journal ref: NeurIPS 2020 (oral)

  17. arXiv:2010.09457  [pdf, other

    cs.AR cs.LG

    Closed-Loop Neural Interfaces with Embedded Machine Learning

    Authors: Bingzhao Zhu, Uisub Shin, Mahsa Shoaran

    Abstract: Neural interfaces capable of multi-site electrical recording, on-site signal classification, and closed-loop therapy are critical for the diagnosis and treatment of neurological disorders. However, deploying machine learning algorithms on low-power neural devices is challenging, given the tight constraints on computational and memory resources for such devices. In this paper, we review the recent… ▽ More

    Submitted 21 October, 2020; v1 submitted 15 October, 2020; originally announced October 2020.

  18. arXiv:1907.12646  [pdf, other

    cs.CV cs.RO

    Camera Exposure Control for Robust Robot Vision with Noise-Aware Image Quality Assessment

    Authors: Ukcheol Shin, Jinsun Park, Gyumin Shim, Francois Rameau, In So Kweon

    Abstract: In this paper, we propose a noise-aware exposure control algorithm for robust robot vision. Our method aims to capture the best-exposed image which can boost the performance of various computer vision and robotics tasks. For this purpose, we carefully design an image quality metric which captures complementary quality attributes and ensures light-weight computation. Specifically, our metric consis… ▽ More

    Submitted 11 July, 2019; originally announced July 2019.

    Comments: 8 pages,6 figures, accepted in IROS2019