Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–42 of 42 results for author: Yoon, H

Searching in archive eess. Search in all archives.
.
  1. arXiv:2405.01591  [pdf, other

    cs.CL cs.AI eess.IV

    Simplifying Multimodality: Unimodal Approach to Multimodal Challenges in Radiology with General-Domain Large Language Model

    Authors: Seonhee Cho, Choonghan Kim, Jiho Lee, Chetan Chilkunda, Sujin Choi, Joo Heung Yoon

    Abstract: Recent advancements in Large Multimodal Models (LMMs) have attracted interest in their generalization capability with only a few samples in the prompt. This progress is particularly relevant to the medical domain, where the quality and sensitivity of data pose unique challenges for model training and application. However, the dependency on high-quality data for effective in-context learning raises… ▽ More

    Submitted 29 April, 2024; originally announced May 2024.

    Comments: Under review

  2. arXiv:2404.16080  [pdf, other

    eess.IV cs.AI cs.CV

    Enhancing Diagnosis through AI-driven Analysis of Reflectance Confocal Microscopy

    Authors: Hong-Jun Yoon, Chris Keum, Alexander Witkowski, Joanna Ludzik, Tracy Petrie, Heidi A. Hanson, Sancy A. Leachman

    Abstract: Reflectance Confocal Microscopy (RCM) is a non-invasive imaging technique used in biomedical research and clinical dermatology. It provides virtual high-resolution images of the skin and superficial tissues, reducing the need for physical biopsies. RCM employs a laser light source to illuminate the tissue, capturing the reflected light to generate detailed images of microscopic structures at vario… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  3. arXiv:2404.15305  [pdf, other

    eess.SP cs.LG

    ADAPT^2: Adapting Pre-Trained Sensing Models to End-Users via Self-Supervision Replay

    Authors: Hyungjun Yoon, Jaehyun Kwak, Biniyam Aschalew Tolera, Gaole Dai, Mo Li, Taesik Gong, Kimin Lee, Sung-Ju Lee

    Abstract: Self-supervised learning has emerged as a method for utilizing massive unlabeled data for pre-training models, providing an effective feature extractor for various mobile sensing applications. However, when deployed to end-users, these models encounter significant domain shifts attributed to user diversity. We investigate the performance degradation that occurs when self-supervised models are fine… ▽ More

    Submitted 29 March, 2024; originally announced April 2024.

  4. arXiv:2404.01464  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Data-Efficient Unsupervised Interpolation Without Any Intermediate Frame for 4D Medical Images

    Authors: JungEun Kim, Hangyul Yoon, Geondo Park, Kyungsu Kim, Eunho Yang

    Abstract: 4D medical images, which represent 3D images with temporal information, are crucial in clinical practice for capturing dynamic changes and monitoring long-term disease progression. However, acquiring 4D medical images poses challenges due to factors such as radiation exposure and imaging duration, necessitating a balance between achieving high temporal resolution and minimizing adverse effects. Gi… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: CVPR 2024

  5. arXiv:2403.11578  [pdf, other

    eess.AS

    AdaMER-CTC: Connectionist Temporal Classification with Adaptive Maximum Entropy Regularization for Automatic Speech Recognition

    Authors: SooHwan Eom, Eunseop Yoon, Hee Suk Yoon, Chanwoo Kim, Mark Hasegawa-Johnson, Chang D. Yoo

    Abstract: In Automatic Speech Recognition (ASR) systems, a recurring obstacle is the generation of narrowly focused output distributions. This phenomenon emerges as a side effect of Connectionist Temporal Classification (CTC), a robust sequence learning tool that utilizes dynamic programming for sequence mapping. While earlier efforts have tried to combine the CTC loss with an entropy maximization regulariz… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  6. arXiv:2312.09736  [pdf, other

    cs.CL cs.SD eess.AS

    HEAR: Hearing Enhanced Audio Response for Video-grounded Dialogue

    Authors: Sunjae Yoon, Dahyun Kim, Eunseop Yoon, Hee Suk Yoon, Junyeong Kim, Chnag D. Yoo

    Abstract: Video-grounded Dialogue (VGD) aims to answer questions regarding a given multi-modal input comprising video, audio, and dialogue history. Although there have been numerous efforts in developing VGD systems to improve the quality of their responses, existing systems are competent only to incorporate the information in the video and text and tend to struggle in extracting the necessary information f… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: EMNLP 2023, 14 pages, 13 figures

  7. arXiv:2312.05790  [pdf, other

    cs.LG cs.AI eess.SP

    SimPSI: A Simple Strategy to Preserve Spectral Information in Time Series Data Augmentation

    Authors: Hyun Ryu, Sunjae Yoon, Hee Suk Yoon, Eunseop Yoon, Chang D. Yoo

    Abstract: Data augmentation is a crucial component in training neural networks to overcome the limitation imposed by data size, and several techniques have been studied for time series. Although these techniques are effective in certain tasks, they have yet to be generalized to time series benchmarks. We find that current data augmentation techniques ruin the core information contained within the frequency… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

  8. Synergistic Perception and Control Simplex for Verifiable Safe Vertical Landing

    Authors: Ayoosh Bansal, Yang Zhao, James Zhu, Sheng Cheng, Yuliang Gu, Hyung-Jin Yoon, Hunmin Kim, Naira Hovakimyan, Lui Sha

    Abstract: Perception, Planning, and Control form the essential components of autonomy in advanced air mobility. This work advances the holistic integration of these components to enhance the performance and robustness of the complete cyber-physical system. We adapt Perception Simplex, a system for verifiable collision avoidance amidst obstacle detection faults, to the vertical landing maneuver for autonomou… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: To appear in AIAA SciTech 2024

    ACM Class: C.3; C.4; J.7

    Journal ref: AIAA SCITECH 2024 Forum, p. 1167

  9. arXiv:2311.16652  [pdf, other

    cs.CV eess.IV physics.app-ph physics.comp-ph

    Augmenting x-ray single particle imaging reconstruction with self-supervised machine learning

    Authors: Zhantao Chen, Cong Wang, Mingye Gao, Chun Hong Yoon, Jana B. Thayer, Joshua J. Turner

    Abstract: The development of X-ray Free Electron Lasers (XFELs) has opened numerous opportunities to probe atomic structure and ultrafast dynamics of various materials. Single Particle Imaging (SPI) with XFELs enables the investigation of biological particles in their natural physiological states with unparalleled temporal resolution, while circumventing the need for cryogenic conditions or crystallization.… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  10. Pruning Self-Attention for Zero-Shot Multi-Speaker Text-to-Speech

    Authors: Hyungchan Yoon, Changhwan Kim, Eunwoo Song, Hyun-Wook Yoon, Hong-Goo Kang

    Abstract: For personalized speech generation, a neural text-to-speech (TTS) model must be successfully implemented with limited data from a target speaker. To this end, the baseline TTS model needs to be amply generalized to out-of-domain data (i.e., target speaker's speech). However, approaches to address this out-of-domain generalization problem in TTS have yet to be thoroughly studied. In this work, we p… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

    Comments: INTERSPEECH 2023

    Journal ref: Proc. INTERSPEECH 2023, 4299-4303

  11. arXiv:2308.08442  [pdf, other

    cs.CL cs.SD eess.AS

    Mitigating the Exposure Bias in Sentence-Level Grapheme-to-Phoneme (G2P) Transduction

    Authors: Eunseop Yoon, Hee Suk Yoon, Dhananjaya Gowda, SooHwan Eom, Daehyeok Kim, John Harvill, Heting Gao, Mark Hasegawa-Johnson, Chanwoo Kim, Chang D. Yoo

    Abstract: Text-to-Text Transfer Transformer (T5) has recently been considered for the Grapheme-to-Phoneme (G2P) transduction. As a follow-up, a tokenizer-free byte-level model based on T5 referred to as ByT5, recently gave promising results on word-level G2P conversion by representing each input character with its corresponding UTF-8 encoding. Although it is generally understood that sentence-level or parag… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

    Comments: INTERSPEECH 2023

  12. arXiv:2306.06102  [pdf, other

    eess.SY

    Backup Plan Constrained Model Predictive Control with Guaranteed Stability

    Authors: Ran Tao, Hunmin Kim, Hyung-Jin Yoon, Wenbin Wan, Naira Hovakimyan, Lui Sha, Petros Voulgaris

    Abstract: This article proposes and evaluates a new safety concept called backup plan safety for path planning of autonomous vehicles under mission uncertainty using model predictive control (MPC). Backup plan safety is defined as the ability to complete an alternative mission when the primary mission is aborted. To include this new safety concept in control problems, we formulate a feasibility maximization… ▽ More

    Submitted 6 October, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

  13. arXiv:2306.02579  [pdf, other

    cs.CL cs.SD eess.AS

    Cross-Lingual Transfer Learning for Phrase Break Prediction with Multilingual Language Model

    Authors: Hoyeon Lee, Hyun-Wook Yoon, Jong-Hwan Kim, Jae-Min Kim

    Abstract: Phrase break prediction is a crucial task for improving the prosody naturalness of a text-to-speech (TTS) system. However, most proposed phrase break prediction models are monolingual, trained exclusively on a large amount of labeled data. In this paper, we address this issue for low-resource languages with limited labeled data using cross-lingual transfer. We investigate the effectiveness of zero… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted by INTERSPEECH 2023

  14. arXiv:2305.16371  [pdf, other

    cs.CL cs.SD eess.AS

    INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition

    Authors: Eunseop Yoon, Hee Suk Yoon, John Harvill, Mark Hasegawa-Johnson, Chang D. Yoo

    Abstract: Automatic Speech Recognition (ASR) systems have attained unprecedented performance with large speech models pre-trained based on self-supervised speech representation learning. However, these pre-trained speech models suffer from representational bias as they tend to better represent those prominent accents (i.e., native (L1) English accent) in the pre-training speech corpus than less represented… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: ACL2023

  15. arXiv:2305.06806  [pdf, other

    cs.SD eess.AS

    HappyQuokka System for ICASSP 2023 Auditory EEG Challenge

    Authors: Zhenyu Piao, Miseul Kim, Hyungchan Yoon, Hong-Goo Kang

    Abstract: This report describes our submission to Task 2 of the Auditory EEG Decoding Challenge at ICASSP 2023 Signal Processing Grand Challenge (SPGC). Task 2 is a regression problem that focuses on reconstructing a speech envelope from an EEG signal. For the task, we propose a pre-layer normalized feed-forward transformer (FFT) architecture. For within-subjects generation, we additionally utilize an auxil… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

    Comments: First Place in Task 2 of Auditory EEG decoding Challenge, which is part of ICASSP Signal Processing Grand Challenge (SPGC) 2023

  16. arXiv:2212.11439  [pdf

    eess.IV cs.CV cs.LG

    Novel Deep Learning Framework For Bovine Iris Segmentation

    Authors: Heemoon Yoon, Mira Park, Sang-Hee Lee

    Abstract: Iris segmentation is the initial step to identify biometric of animals to establish a traceability system of livestock. In this study, we propose a novel deep learning framework for pixel-wise segmentation with minimum use of annotation labels using BovineAAEyes80 public dataset. In the experiment, U-Net with VGG16 backbone was selected as the best combination of encoder and decoder model, demonst… ▽ More

    Submitted 21 December, 2022; originally announced December 2022.

    Comments: 5 pages, 4 figures, 3 tables

  17. A Framework for Generalizing Critical Heat Flux Detection Models Using Unsupervised Image-to-Image Translation

    Authors: Firas Al-Hindawi, Tejaswi Soori, Han Hu, Md Mahfuzur Rahman Siddiquee, Hyunsoo Yoon, Teresa Wu, Ying Sun

    Abstract: The detection of critical heat flux (CHF) is crucial in heat boiling applications as failure to do so can cause rapid temperature ramp leading to device failures. Many machine learning models exist to detect CHF, but their performance reduces significantly when tested on data from different domains. To deal with datasets from new domains a model needs to be trained from scratch. Moreover, the data… ▽ More

    Submitted 17 March, 2023; v1 submitted 18 December, 2022; originally announced December 2022.

    Comments: This work has been submitted to the Expert Systems With Applications Journal on Sep 25, 2022

  18. arXiv:2211.11381  [pdf, other

    cs.CV cs.MM cs.SD eess.AS

    LISA: Localized Image Stylization with Audio via Implicit Neural Representation

    Authors: Seung Hyun Lee, Chanyoung Kim, Wonmin Byeon, Sang Ho Yoon, Jinkyu Kim, Sangpil Kim

    Abstract: We present a novel framework, Localized Image Stylization with Audio (LISA) which performs audio-driven localized image stylization. Sound often provides information about the specific context of the scene and is closely related to a certain part of the scene or object. However, existing image stylization works have focused on stylizing the entire image using an image or text input. Stylizing a pa… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

  19. arXiv:2209.10102  [pdf, other

    cs.LG eess.SY

    Multi-time Predictions of Wildfire Grid Map using Remote Sensing Local Data

    Authors: Hyung-Jin Yoon, Petros Voulgaris

    Abstract: Due to recent climate changes, we have seen more frequent and severe wildfires in the United States. Predicting wildfires is critical for natural disaster prevention and mitigation. Advances in technologies in data processing and communication enabled us to access remote sensing data. With the remote sensing data, valuable spatiotemporal statistical models can be created and used for resource mana… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: IEEE ICKG 2022

  20. arXiv:2208.06089  [pdf, other

    cs.AI cs.LG eess.SY

    Accurate Action Recommendation for Smart Home via Two-Level Encoders and Commonsense Knowledge

    Authors: Hyunsik Jeon, Jongjin Kim, Hoyoung Yoon, Jaeri Lee, U Kang

    Abstract: How can we accurately recommend actions for users to control their devices at home? Action recommendation for smart home has attracted increasing attention due to its potential impact on the markets of virtual assistants and Internet of Things (IoT). However, designing an effective action recommender system for smart home is challenging because it requires handling context correlations, considerin… ▽ More

    Submitted 11 August, 2022; originally announced August 2022.

    Comments: 10 pages, 8 figures, Accepted to CIKM 2022

  21. arXiv:2206.15067  [pdf, other

    cs.SD eess.AS

    Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems

    Authors: Hyun-Wook Yoon, Ohsung Kwon, Hoyeon Lee, Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim, Min-Jae Hwang

    Abstract: This paper proposes an effective emotional text-to-speech (TTS) system with a pre-trained language model (LM)-based emotion prediction method. Unlike conventional systems that require auxiliary inputs such as manually defined emotion classes, our system directly estimates emotion-related attributes from the input text. Specifically, we utilize generative pre-trained transformer (GPT)-3 to jointly… ▽ More

    Submitted 30 June, 2022; v1 submitted 30 June, 2022; originally announced June 2022.

    Comments: Accepted by INTERSPEECH2022

  22. arXiv:2206.14984  [pdf, other

    eess.AS cs.SD

    TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder

    Authors: Eunwoo Song, Ryuichi Yamamoto, Ohsung Kwon, Chan-Ho Song, Min-Jae Hwang, Suhyeon Oh, Hyun-Wook Yoon, Jin-Seob Kim, Jae-Min Kim

    Abstract: Recent advances in synthetic speech quality have enabled us to train text-to-speech (TTS) systems by using synthetic corpora. However, merely increasing the amount of synthetic data is not always advantageous for improving training efficiency. Our aim in this study is to selectively choose synthetic data that are beneficial to the training process. In the proposed method, we first adopt a variatio… ▽ More

    Submitted 29 June, 2022; originally announced June 2022.

    Comments: Accepted to the conference of INTERSPEECH 2022

  23. arXiv:2206.11985  [pdf, other

    eess.SY

    Path Integral Methods with Stochastic Control Barrier Functions

    Authors: Chuyuan Tao, Hyung-Jin Yoon, Hunmin Kim, Naira Hovakimyan, Petros Voulgaris

    Abstract: Safe control designs for robotic systems remain challenging because of the difficulties of explicitly solving optimal control with nonlinear dynamics perturbed by stochastic noise. However, recent technological advances in computing devices enable online optimization or sampling-based methods to solve control problems. For example, Control Barrier Functions (CBFs), a Lyapunov-like control algorith… ▽ More

    Submitted 23 June, 2022; originally announced June 2022.

  24. arXiv:2206.09074  [pdf, other

    cs.LG eess.SP

    Weakly Supervised Classification of Vital Sign Alerts as Real or Artifact

    Authors: Arnab Dey, Mononito Goswami, Joo Heung Yoon, Gilles Clermont, Michael Pinsky, Marilyn Hravnak, Artur Dubrawski

    Abstract: A significant proportion of clinical physiologic monitoring alarms are false. This often leads to alarm fatigue in clinical personnel, inevitably compromising patient safety. To combat this issue, researchers have attempted to build Machine Learning (ML) models capable of accurately adjudicating Vital Sign (VS) alerts raised at the bedside of hemodynamically monitored patients as real or artifact.… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: Accepted at American Medical Informatics Association (AMIA) Annual Symposium 2022. 10 pages, 4 figures and 2 tables

  25. arXiv:2205.12633  [pdf, other

    cs.CV eess.IV

    NTIRE 2022 Challenge on High Dynamic Range Imaging: Methods and Results

    Authors: Eduardo Pérez-Pellitero, Sibi Catley-Chandar, Richard Shaw, Aleš Leonardis, Radu Timofte, Zexin Zhang, Cen Liu, Yunbo Peng, Yue Lin, Gaocheng Yu, Jin Zhang, Zhe Ma, Hongbin Wang, Xiangyu Chen, Xintao Wang, Haiwei Wu, Lin Liu, Chao Dong, Jiantao Zhou, Qingsen Yan, Song Zhang, Weiye Chen, Yuhang Liu, Zhen Zhang, Yanning Zhang , et al. (68 additional authors not shown)

    Abstract: This paper reviews the challenge on constrained high dynamic range (HDR) imaging that was part of the New Trends in Image Restoration and Enhancement (NTIRE) workshop, held in conjunction with CVPR 2022. This manuscript focuses on the competition set-up, datasets, the proposed methods and their results. The challenge aims at estimating an HDR image from multiple respective low dynamic range (LDR)… ▽ More

    Submitted 25 May, 2022; originally announced May 2022.

    Comments: CVPR Workshops 2022. 15 pages, 21 figures, 2 tables

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2022

  26. arXiv:2204.10020  [pdf, other

    eess.AS cs.LG cs.SD eess.SP

    Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation

    Authors: Ryo Terashima, Ryuichi Yamamoto, Eunwoo Song, Yuma Shirahata, Hyun-Wook Yoon, Jae-Min Kim, Kentaro Tachibana

    Abstract: Data augmentation via voice conversion (VC) has been successfully applied to low-resource expressive text-to-speech (TTS) when only neutral data for the target speaker are available. Although the quality of VC is crucial for this approach, it is challenging to learn a stable VC model because the amount of data is limited in low-resource scenarios, and highly expressive speech has large acoustic va… ▽ More

    Submitted 5 July, 2022; v1 submitted 21 April, 2022; originally announced April 2022.

    Comments: Accepted to INTERSPEECH 2022

  27. Adversarial Learning of Intermediate Acoustic Feature for End-to-End Lightweight Text-to-Speech

    Authors: Hyungchan Yoon, Seyun Um, Changwhan Kim, Hong-Goo Kang

    Abstract: To simplify the generation process, several text-to-speech (TTS) systems implicitly learn intermediate latent representations instead of relying on predefined features (e.g., mel-spectrogram). However, their generation quality is unsatisfactory as these representations lack speech variances. In this paper, we improve TTS performance by adding \emph{prosody embeddings} to the latent representations… ▽ More

    Submitted 28 August, 2023; v1 submitted 5 April, 2022; originally announced April 2022.

    Comments: INTERSPEECH 2023

    MSC Class: 68T07 (Primary) 68T50; 68T99 (Secondary) ACM Class: I.2.7; I.2.6

  28. arXiv:2203.10067  [pdf, other

    cs.RO eess.SY

    Sampling Complexity of Path Integral Methods for Trajectory Optimization

    Authors: Hyung-Jin Yoon, Chuyuan Tao, Hunmin Kim, Naira Hovakimyan, Petros Voulgaris

    Abstract: The use of random sampling in decision-making and control has become popular with the ease of access to graphic processing units that can generate and calculate multiple random trajectories for real-time robotic applications. In contrast to sequential optimization, the sampling-based method can take advantage of parallel computing to maintain constant control loop frequencies. Inspired by its wide… ▽ More

    Submitted 18 March, 2022; originally announced March 2022.

    Comments: Accepted in American Control Conference 2022

  29. arXiv:2112.00007  [pdf, other

    cs.GR cs.CV cs.LG cs.SD eess.AS

    Sound-Guided Semantic Image Manipulation

    Authors: Seung Hyun Lee, Wonseok Roh, Wonmin Byeon, Sang Ho Yoon, Chan Young Kim, Jinkyu Kim, Sangpil Kim

    Abstract: The recent success of the generative model shows that leveraging the multi-modal embedding space can manipulate an image using text information. However, manipulating an image with other sources rather than text, such as sound, is not easy due to the dynamic characteristics of the sources. Especially, sound can convey vivid emotions and dynamic expressions of the real world. Here, we propose a fra… ▽ More

    Submitted 30 November, 2021; originally announced December 2021.

  30. arXiv:2111.06974  [pdf, other

    eess.SY

    Control Barrier Function Augmentation in Sampling-based Control Algorithm for Sample Efficiency

    Authors: Chuyuan Tao, Hunmin Kim, Hyungjin Yoon, Naira Hovakimyan, Petros Voulgaris

    Abstract: For a nonlinear stochastic path planning problem, sampling-based algorithms generate thousands of random sample trajectories to find the optimal path while guaranteeing safety by Lagrangian penalty methods. However, the sampling-based algorithm can perform poorly in obstacle-rich environments because most samples might violate safety constraints, invalidating the corresponding samples. To improve… ▽ More

    Submitted 12 November, 2021; originally announced November 2021.

  31. arXiv:2109.14779  [pdf, other

    eess.SY

    Time Coordination of Multiple UAVs over Switching Communication Networks with Digraph Topologies

    Authors: Hyungsoo Kang, Hyung-Jin Yoon, Venanzio Cichella, Naira Hovakimyan, Petros Voulgaris

    Abstract: This paper presents a time-coordination algorithm for multiple UAVs executing cooperative missions. Unlike previous algorithms, it does not rely on the assumption that the communication between UAVs is bidirectional. Thus, the topology of the inter-UAV information flow can be characterized by digraphs. To achieve coordination with weak connectivity, we design a switching law that orchestrates swit… ▽ More

    Submitted 12 April, 2022; v1 submitted 29 September, 2021; originally announced September 2021.

  32. arXiv:2105.00240  [pdf, other

    eess.IV cs.CV cs.LG

    Simultaneous super-resolution and motion artifact removal in diffusion-weighted MRI using unsupervised deep learning

    Authors: Hyungjin Chung, Jaehyun Kim, Jeong Hee Yoon, Jeong Min Lee, Jong Chul Ye

    Abstract: Diffusion-weighted MRI is nowadays performed routinely due to its prognostic ability, yet the quality of the scans are often unsatisfactory which can subsequently hamper the clinical utility. To overcome the limitations, here we propose a fully unsupervised quality enhancement scheme, which boosts the resolution and removes the motion artifact simultaneously. This process is done by first training… ▽ More

    Submitted 1 May, 2021; originally announced May 2021.

  33. arXiv:2103.14819  [pdf, other

    eess.SY

    Backup Plan Constrained Model Predictive Control

    Authors: Hunmin Kim, Hyungjin Yoon, Wenbin Wan, Naira Hovakimyan, Lui Sha, Petros Voulgaris

    Abstract: This article proposes a new safety concept: backup plan safety. The backup plan safety is defined as the ability to complete one of the alternative missions in the case of primary mission abortion. To incorporate this new safety concept in control problems, we formulate a feasibility maximization problem that adopts additional (virtual) input horizons toward the alternative missions on top of the… ▽ More

    Submitted 27 March, 2021; originally announced March 2021.

  34. arXiv:2012.07267  [pdf, other

    eess.AS

    Multi-SpectroGAN: High-Diversity and High-Fidelity Spectrogram Generation with Adversarial Style Combination for Speech Synthesis

    Authors: Sang-Hoon Lee, Hyun-Wook Yoon, Hyeong-Rae Noh, Ji-Hoon Kim, Seong-Whan Lee

    Abstract: While generative adversarial networks (GANs) based neural text-to-speech (TTS) systems have shown significant improvement in neural speech synthesis, there is no TTS system to learn to synthesize speech from text sequences with only adversarial feedback. Because adversarial feedback alone is not sufficient to train the generator, current models still require the reconstruction loss compared with t… ▽ More

    Submitted 14 December, 2020; originally announced December 2020.

    Comments: 9 pages, 3 figures, Accepted paper in AAAI Conference on Artificial Intelligence (AAAI), 2021

  35. arXiv:2011.06776  [pdf, other

    cs.CV cs.LG eess.IV physics.flu-dyn

    Fast and Scalable Earth Texture Synthesis using Spatially Assembled Generative Adversarial Neural Networks

    Authors: Sung Eun Kim, Hongkyu Yoon, Jonghyun Lee

    Abstract: The earth texture with complex morphological geometry and compositions such as shale and carbonate rocks, is typically characterized with sparse field samples because of an expensive and time-consuming characterization process. Accordingly, generating arbitrary large size of the geological texture with similar topological structures at a low computation cost has become one of the key tasks for rea… ▽ More

    Submitted 13 November, 2020; originally announced November 2020.

    Comments: 17 pages, 11 figures, 2 tables, and a table in Appendix

  36. arXiv:2008.06867  [pdf, other

    eess.AS cs.CL cs.SD

    Audio Dequantization for High Fidelity Audio Generation in Flow-based Neural Vocoder

    Authors: Hyun-Wook Yoon, Sang-Hoon Lee, Hyeong-Rae Noh, Seong-Whan Lee

    Abstract: In recent works, a flow-based neural vocoder has shown significant improvement in real-time speech generation task. The sequence of invertible flow operations allows the model to convert samples from simple distribution to audio samples. However, training a continuous density model on discrete audio data can degrade model performance due to the topological difference between latent and actual dist… ▽ More

    Submitted 16 August, 2020; originally announced August 2020.

    Comments: Accepted in INTERSPEECH2020

  37. arXiv:2006.13304  [pdf, ps, other

    physics.geo-ph cs.LG eess.SP stat.ML

    Connectivity-informed Drainage Network Generation using Deep Convolution Generative Adversarial Networks

    Authors: Sung Eun Kim, Yongwon Seo, Junshik Hwang, Hongkyu Yoon, Jonghyun Lee

    Abstract: Stochastic network modeling is often limited by high computational costs to generate a large number of networks enough for meaningful statistical evaluation. In this study, Deep Convolutional Generative Adversarial Networks (DCGANs) were applied to quickly reproduce drainage networks from the already generated network samples without repetitive long modeling of the stochastic network model, Gibb's… ▽ More

    Submitted 16 June, 2020; originally announced June 2020.

    Comments: 16 pages; 9 figures; Python and Matlab scripts used in this paper can be found in https://github.com/saint-kim/RiverDCGANs

  38. arXiv:2006.06385  [pdf

    eess.IV cs.CV cs.LG

    TensorFlow with user friendly Graphical Framework for object detection API

    Authors: Heemoon Yoon, Sang-Hee Lee, Mira Park

    Abstract: TensorFlow is an open-source framework for deep learning dataflow and contains application programming interfaces (APIs) of voice analysis, natural language process, and computer vision. Especially, TensorFlow object detection API in computer vision field has been widely applied to technologies of agriculture, engineering, and medicine but barriers to entry of the framework usage is still high thr… ▽ More

    Submitted 11 June, 2020; originally announced June 2020.

    Comments: "The code of TF-GraF for TensorFlow object detection API is opened at https://github.com/boguss1225/ObjectDetectionGUI"

  39. High Accuracy Tumor Diagnoses and Benchmarking of Hematoxylin and Eosin Stained Prostate Core Biopsy Images Generated by Explainable Deep Neural Networks

    Authors: Aman Rana, Alarice Lowe, Marie Lithgow, Katharine Horback, Tyler Janovitz, Annacarolina Da Silva, Harrison Tsai, Vignesh Shanmugam, Hyung-Jin Yoon, Pratik Shah

    Abstract: Histopathological diagnoses of tumors in tissue biopsy after Hematoxylin and Eosin (H&E) staining is the gold standard for oncology care. H&E staining is slow and uses dyes, reagents and precious tissue samples that cannot be reused. Thousands of native nonstained RGB Whole Slide Image (RWSI) patches of prostate core tissue biopsies were registered with their H&E stained versions. Conditional Gene… ▽ More

    Submitted 2 August, 2019; originally announced August 2019.

    Journal ref: JAMA Network. 2020;3(5):e205111

  40. arXiv:1906.05348  [pdf, other

    eess.SY cs.RO

    Towards Resilient UAV: Escape Time in GPS Denied Environment with Sensor Drift

    Authors: Hyung-Jin Yoon, Wenbin Wan, Hunmin Kim, Naira Hovakimyan, Lui Sha, Petros G. Voulgaris

    Abstract: This paper considers a resilient state estimation framework for unmanned aerial vehicles (UAVs) that integrates a Kalman filter-like state estimator and an attack detector. When an attack is detected, the state estimator uses only IMU signals as the GPS signals do not contain legitimate information. This limited sensor availability induces a sensor drift problem questioning the reliability of the… ▽ More

    Submitted 11 June, 2019; originally announced June 2019.

  41. arXiv:1905.05008  [pdf, other

    eess.IV physics.optics

    Low-signal limit of X-ray single particle imaging

    Authors: Kartik Ayyer, Andrew J. Morgan, Andrew A. Aquila, Hasan DeMirci, Brenda G. Hogue, Richard A. Kirian, P. Lourdu Xavier, Chun Hong Yoon, Henry N. Chapman, Anton Barty

    Abstract: An outstanding question in X-ray single particle imaging experiments has been the feasibility of imaging sub 10-nm-sized biomolecules under realistic experimental conditions where very few photons are expected to be measured in a single snapshot and instrument background may be significant relative to particle scattering. While analyses of simulated data have shown that the determination of an ave… ▽ More

    Submitted 27 April, 2020; v1 submitted 10 May, 2019; originally announced May 2019.

    Comments: 19 pages, 9 figures

    Journal ref: Optics Express 27.26 (2019): 37816-37833

  42. arXiv:1809.06401  [pdf, ps, other

    cs.LG eess.SY stat.ML

    Hidden Markov Model Estimation-Based Q-learning for Partially Observable Markov Decision Process

    Authors: Hyung-Jin Yoon, Donghwan Lee, Naira Hovakimyan

    Abstract: The objective is to study an on-line Hidden Markov model (HMM) estimation-based Q-learning algorithm for partially observable Markov decision process (POMDP) on finite state and action sets. When the full state observation is available, Q-learning finds the optimal action-value function given the current action (Q function). However, Q-learning can perform poorly when the full state observation is… ▽ More

    Submitted 24 September, 2018; v1 submitted 17 September, 2018; originally announced September 2018.