Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 127 results for author: Hwang, I

.
  1. arXiv:2407.09342  [pdf, other

    cs.RO

    MIXED-SENSE: A Mixed Reality Sensor Emulation Framework for Test and Evaluation of UAVs Against False Data Injection Attacks

    Authors: Kartik A. Pant, Li-Yu Lin, Jaehyeok Kim, Worawis Sribunma, James M. Goppert, Inseok Hwang

    Abstract: We present a high-fidelity Mixed Reality sensor emulation framework for testing and evaluating the resilience of Unmanned Aerial Vehicles (UAVs) against false data injection (FDI) attacks. The proposed approach can be utilized to assess the impact of FDI attacks, benchmark attack detector performance, and validate the effectiveness of mitigation/reconfiguration strategies in single-UAV and UAV swa… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 6 pages, 5 figures, IROS 2024

  2. arXiv:2407.07142  [pdf, other

    hep-ph astro-ph.CO hep-ex hep-th

    LANSCE-mQ: Dedicated search for milli/fractionally charged particles at LANL

    Authors: Yu-Dai Tsai, Insung Hwang, Ryan Schmitz, Matthew Citron, Kranti Gunthoti, Jacob Steenis, Hoyong Jeong, Hyunki Moon, Jae Hyeok Yoo, Ming Xiong Liu

    Abstract: In this paper, we propose an experiment, LANSCE-mQ, aiming to detect fractionally charged and millicharged particles (mCP) using an 800 MeV proton beam fixed target at the Los Alamos Neutron Science Center (LANSCE) facility. This search can shed new light on numerous fundamental questions, including charge quantization, the predictions of string theories and grand unification theories, the gauge s… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 8 pages, 8 figures

    Report number: FERMILAB-PUB-24-0357-T-V; LA-UR-24-27441

  3. arXiv:2406.09117  [pdf, other

    cs.CV cs.AI

    PC-LoRA: Low-Rank Adaptation for Progressive Model Compression with Knowledge Distillation

    Authors: Injoon Hwang, Haewon Park, Youngwan Lee, Jooyoung Yang, SunJae Maeng

    Abstract: Low-rank adaption (LoRA) is a prominent method that adds a small number of learnable parameters to the frozen pre-trained weights for parameter-efficient fine-tuning. Prompted by the question, ``Can we make its representation enough with LoRA weights solely at the final phase of finetuning without the pre-trained weights?'' In this work, we introduce Progressive Compression LoRA~(PC-LoRA), which u… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Accepted at T4V@CVPR

  4. arXiv:2406.03234  [pdf, other

    cs.LG cs.AI

    Fine-Grained Causal Dynamics Learning with Quantization for Improving Robustness in Reinforcement Learning

    Authors: Inwoo Hwang, Yunhyeok Kwak, Suhyung Choi, Byoung-Tak Zhang, Sanghack Lee

    Abstract: Causal dynamics learning has recently emerged as a promising approach to enhancing robustness in reinforcement learning (RL). Typically, the goal is to build a dynamics model that makes predictions based on the causal relationships among the entities. Despite the fact that causal connections often manifest only under certain contexts, existing approaches overlook such fine-grained relationships an… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: ICML 2024

  5. arXiv:2406.00614  [pdf, other

    cs.LG cs.AI

    Efficient Monte Carlo Tree Search via On-the-Fly State-Conditioned Action Abstraction

    Authors: Yunhyeok Kwak, Inwoo Hwang, Dooyoung Kim, Sanghack Lee, Byoung-Tak Zhang

    Abstract: Monte Carlo Tree Search (MCTS) has showcased its efficacy across a broad spectrum of decision-making problems. However, its performance often degrades under vast combinatorial action space, especially where an action is composed of multiple sub-actions. In this work, we propose an action abstraction based on the compositional structure between a state and sub-actions for improving the efficiency o… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: UAI 2024 (Oral). The first two authors contributed equally

  6. arXiv:2405.07220  [pdf, other

    cs.LG cs.AI stat.ML

    On Discovery of Local Independence over Continuous Variables via Neural Contextual Decomposition

    Authors: Inwoo Hwang, Yunhyeok Kwak, Yeon-Ji Song, Byoung-Tak Zhang, Sanghack Lee

    Abstract: Conditional independence provides a way to understand causal relationships among the variables of interest. An underlying system may exhibit more fine-grained causal relationships especially between a variable and its parents, which will be called the local independence relationships. One of the most widely studied local relationships is Context-Specific Independence (CSI), which holds in a specif… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: Conference on Causal Learning and Reasoning (CLeaR), 2023

  7. arXiv:2404.14647  [pdf, other

    cs.RO eess.SY

    Human Behavior Modeling via Identification of Task Objective and Variability

    Authors: Sooyung Byeon, Dawei Sun, Inseok Hwang

    Abstract: Human behavior modeling is important for the design and implementation of human-automation interactive control systems. In this context, human behavior refers to a human's control input to systems. We propose a novel method for human behavior modeling that uses human demonstrations for a given task to infer the unknown task objective and the variability. The task objective represents the human's i… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 10 pages

  8. arXiv:2404.01805  [pdf, other

    cs.LG

    Improved Text Emotion Prediction Using Combined Valence and Arousal Ordinal Classification

    Authors: Michael Mitsios, Georgios Vamvoukakis, Georgia Maniati, Nikolaos Ellinas, Georgios Dimitriou, Konstantinos Markopoulos, Panos Kakoulidis, Alexandra Vioni, Myrsini Christidou, Junkwang Oh, Gunu Jho, Inchul Hwang, Georgios Vardaxoglou, Aimilios Chalamandaris, Pirros Tsiakoulis, Spyros Raptis

    Abstract: Emotion detection in textual data has received growing interest in recent years, as it is pivotal for developing empathetic human-computer interaction systems. This paper introduces a method for categorizing emotions from text, which acknowledges and differentiates between the diversified similarities and distinctions of various emotions. Initially, we establish a baseline by training a transforme… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

  9. arXiv:2404.00856  [pdf, other

    cs.SD cs.AI eess.AS

    Removing Speaker Information from Speech Representation using Variable-Length Soft Pooling

    Authors: Injune Hwang, Kyogu Lee

    Abstract: Recently, there have been efforts to encode the linguistic information of speech using a self-supervised framework for speech synthesis. However, predicting representations from surrounding representations can inadvertently entangle speaker information in the speech representation. This paper aims to remove speaker information by exploiting the structured nature of speech, composed of discrete uni… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

  10. arXiv:2402.01520  [pdf, ps, other

    cs.SD cs.LG eess.AS

    Low-Resource Cross-Domain Singing Voice Synthesis via Reduced Self-Supervised Speech Representations

    Authors: Panos Kakoulidis, Nikolaos Ellinas, Georgios Vamvoukakis, Myrsini Christidou, Alexandra Vioni, Georgia Maniati, Junkwang Oh, Gunu Jho, Inchul Hwang, Pirros Tsiakoulis, Aimilios Chalamandaris

    Abstract: In this paper, we propose a singing voice synthesis model, Karaoker-SSL, that is trained only on text and speech data as a typical multi-speaker acoustic model. It is a low-resource pipeline that does not utilize any singing data end-to-end, since its vocoder is also trained on speech data. Karaoker-SSL is conditioned by self-supervised speech representations in an unsupervised manner. We preproce… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: Accepted to IEEE ICASSP SASB 2024

  11. arXiv:2402.01298  [pdf, other

    eess.AS cs.AI cs.SD

    Learning Semantic Information from Raw Audio Signal Using Both Contextual and Phonetic Representations

    Authors: Jaeyeon Kim, Injune Hwang, Kyogu Lee

    Abstract: We propose a framework to learn semantics from raw audio signals using two types of representations, encoding contextual and phonetic information respectively. Specifically, we introduce a speech-to-unit processing pipeline that captures two types of representations with different time resolutions. For the language model, we adopt a dual-channel architecture to incorporate both types of representa… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: Accepted to ICASSP 2024

  12. arXiv:2401.16737  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall physics.chem-ph

    Formation of highly stable interfacial nitrogen gas hydrate overlayers under ambient conditions

    Authors: Chung-Kai Fang, Cheng-Hao Chuang, Chih-Wen Yang, Zheng-Rong Guo, Wei-Hao Hsu, Chia-Hsin Wang, Ing-Shouh Hwang

    Abstract: Surfaces (interfaces) dictate many physical and chemical properties of solid materials and adsorbates considerably affect these properties. Nitrogen molecules, which are the most abundant constituent in ambient air, are considered to be inert. Our study combining atomic force microscopy (AFM), X-ray photoemission spectroscopy (XPS), and thermal desorption spectroscopy (TDS) revealed that nitrogen… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  13. arXiv:2401.14421  [pdf, other

    cs.LG cs.MA eess.SY stat.ML

    Multi-Agent Based Transfer Learning for Data-Driven Air Traffic Applications

    Authors: Chuhao Deng, Hong-Cheol Choi, Hyunsang Park, Inseok Hwang

    Abstract: Research in developing data-driven models for Air Traffic Management (ATM) has gained a tremendous interest in recent years. However, data-driven models are known to have long training time and require large datasets to achieve good performance. To address the two issues, this paper proposes a Multi-Agent Bidirectional Encoder Representations from Transformers (MA-BERT) model that fully considers… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: 12 pages, 8 figures, submitted for IEEE Transactions on Intelligent Transportation System

  14. arXiv:2310.19348  [pdf

    cond-mat.str-el cond-mat.mtrl-sci physics.optics quant-ph

    Rapid suppression of quantum many-body magnetic exciton in doped van der Waals antiferromagnet (Ni,Cd)PS3

    Authors: Junghyun Kim, Woongki Na, Jonghyeon Kim, Pyeongjae Park, Kaixuan Zhang, Inho Hwang, Young-Woo Son, Jae Hoon Kim, Hyeonsik Cheong, Je-Geun Park

    Abstract: The unique discovery of magnetic exciton in van der Waals antiferromagnet NiPS3 arises between two quantum many-body states of a Zhang-Rice singlet excited state and a Zhang-Rice triplet ground state. Simultaneously, the spectral width of photoluminescence originating from this exciton is exceedingly narrow as 0.4 meV. These extraordinary properties, including the extreme coherence of the magnetic… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: 40 pages, 4 main figures, 13 supporting figures, accepted by Nano Letters

  15. arXiv:2310.16191  [pdf, other

    cs.CR

    Can Virtual Reality Protect Users from Keystroke Inference Attacks?

    Authors: Zhuolin Yang, Zain Sarwar, Iris Hwang, Ronik Bhaskar, Ben Y. Zhao, Haitao Zheng

    Abstract: Virtual Reality (VR) has gained popularity by providing immersive and interactive experiences without geographical limitations. It also provides a sense of personal privacy through physical separation. In this paper, we show that despite assumptions of enhanced privacy, VR is unable to shield its users from side-channel attacks that steal private information. Ironically, this vulnerability arises… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Accepted by USENIX 2024

  16. arXiv:2310.07049  [pdf, other

    physics.comp-ph

    Robust Machine Learning Inference from X-ray Absorption Near Edge Spectra through Featurization

    Authors: Yiming Chen, Chi Chen, Inhui Hwang, Michael J. Davis, Wanli Yang, Chengjun Sun, Shyue Ping Ong, Maria K. Y. Chan

    Abstract: X-ray absorption spectroscopy (XAS) is a commonly-employed technique for characterizing functional materials. In particular, x-ray absorption near edge spectra (XANES) encodes local coordination and electronic information and machine learning approaches to extract this information is of significant interest. To date, most ML approaches for XANES have primarily focused on using the raw spectral int… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

  17. arXiv:2310.05299  [pdf

    eess.IV cs.CV cs.LG

    Image Compression and Decompression Framework Based on Latent Diffusion Model for Breast Mammography

    Authors: InChan Hwang, MinJae Woo

    Abstract: This research presents a novel framework for the compression and decompression of medical images utilizing the Latent Diffusion Model (LDM). The LDM represents advancement over the denoising diffusion probabilistic model (DDPM) with a potential to yield superior image quality while requiring fewer computational resources in the image decompression process. A possible application of LDM and Torchvi… ▽ More

    Submitted 8 October, 2023; originally announced October 2023.

    Comments: 6 pages IEEE conference

  18. arXiv:2309.11784  [pdf, other

    eess.SY eess.SP

    Collaborative Fault-Identification & Reconstruction in Multi-Agent Systems

    Authors: Shiraz Khan, Inseok Hwang

    Abstract: The conventional solutions for fault-detection, identification, and reconstruction (FDIR) require centralized decision-making mechanisms which are typically combinatorial in their nature, necessitating the design of an efficient distributed FDIR mechanism that is suitable for multi-agent applications. To this end, we develop a general framework for efficiently reconstructing a sparse vector being… ▽ More

    Submitted 22 September, 2023; v1 submitted 21 September, 2023; originally announced September 2023.

  19. arXiv:2308.16880  [pdf, other

    cs.CV

    Text2Scene: Text-driven Indoor Scene Stylization with Part-aware Details

    Authors: Inwoo Hwang, Hyeonwoo Kim, Young Min Kim

    Abstract: We propose Text2Scene, a method to automatically create realistic textures for virtual scenes composed of multiple objects. Guided by a reference image and text descriptions, our pipeline adds detailed texture on labeled 3D geometries in the room such that the generated colors respect the hierarchical structure or semantic parts that are often composed of similar materials. Instead of applying fla… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: Accepted to CVPR 2023

  20. arXiv:2308.00274  [pdf, other

    eess.SY

    Exploiting Sparsity for Localization of Large-Scale Wireless Sensor Networks

    Authors: Shiraz Khan, Inseok Hwang, James Goppert

    Abstract: Wireless Sensor Network (WSN) localization refers to the problem of determining the position of each of the agents in a WSN using noisy measurement information. In many cases, such as in distance and bearing-based localization, the measurement model is a nonlinear function of the agents' positions, leading to pairwise interconnections between the agents. As the optimal solution for the WSN localiz… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

  21. arXiv:2308.00268  [pdf, other

    eess.SP

    Distributed Gaussian Mixture PHD Filtering under Communication Constraints

    Authors: Shiraz Khan, Yi-Chieh Sun, Inseok Hwang

    Abstract: The Gaussian Mixture Probability Hypothesis Density (GM-PHD) filter is an almost exact closed-form approximation to the Bayes-optimal multi-target tracking algorithm. Due to its optimality guarantees and ease of implementation, it has been studied extensively in the literature. However, the challenges involved in implementing the GM-PHD filter efficiently in a distributed (multi-sensor) setting ha… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

  22. arXiv:2307.12078  [pdf, other

    eess.SY eess.SP

    Recovery of Localization Errors in Sensor Networks using Inter-Agent Measurements

    Authors: Shiraz Khan, Inseok Hwang

    Abstract: A practical challenge which arises in the operation of sensor networks is the presence of sensor faults, biases, or adversarial attacks, which can lead to significant errors incurring in the localization of the agents, thereby undermining the security and performance of the network. We consider the problem of identifying and correcting the localization errors using inter-agent measurements, such a… ▽ More

    Submitted 22 July, 2023; originally announced July 2023.

  23. arXiv:2305.04422  [pdf

    eess.IV cs.CV cs.CY cs.LG

    Multivariate Analysis on Performance Gaps of Artificial Intelligence Models in Screening Mammography

    Authors: Linglin Zhang, Beatrice Brown-Mulry, Vineela Nalla, InChan Hwang, Judy Wawira Gichoya, Aimilia Gastounioti, Imon Banerjee, Laleh Seyyed-Kalantari, MinJae Woo, Hari Trivedi

    Abstract: Although deep learning models for abnormality classification can perform well in screening mammography, the demographic, imaging, and clinical characteristics associated with increased risk of model failure remain unclear. This retrospective study uses the Emory BrEast Imaging Dataset(EMBED) containing mammograms from 115931 patients imaged at Emory Healthcare between 2013-2020, with BI-RADS asses… ▽ More

    Submitted 19 October, 2023; v1 submitted 7 May, 2023; originally announced May 2023.

    Comments: 29 pages, 6 tables, 7 figures, 2 supplemental tables

  24. arXiv:2304.08204  [pdf, other

    cs.CV

    Learning Geometry-aware Representations by Sketching

    Authors: Hyundo Lee, Inwoo Hwang, Hyunsung Go, Won-Seok Choi, Kibeom Kim, Byoung-Tak Zhang

    Abstract: Understanding geometric concepts, such as distance and shape, is essential for understanding the real world and also for many vision tasks. To incorporate such information into a visual representation of a scene, we propose learning to represent the scene by sketching, inspired by human behavior. Our method, coined Learning by Sketching (LBS), learns to convert an image into a set of colored strok… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: CVPR 2023

  25. arXiv:2302.00671  [pdf, other

    cs.LG cs.AI cs.RO

    Efficient Multi-Task Reinforcement Learning via Selective Behavior Sharing

    Authors: Grace Zhang, Ayush Jain, Injune Hwang, Shao-Hua Sun, Joseph J. Lim

    Abstract: The ability to leverage shared behaviors between tasks is critical for sample-efficient multi-task reinforcement learning (MTRL). While prior methods have primarily explored parameter and data sharing, direct behavior-sharing has been limited to task families requiring similar behaviors. Our goal is to extend the efficacy of behavior-sharing to more general task families that could require a mix o… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

  26. arXiv:2212.04396  [pdf, other

    eess.SY math.DS

    On Attack Detection and Identification for the Cyber-Physical System using Lifted System Model

    Authors: Dawei Sun, Minhyun Cho, Inseok Hwang

    Abstract: Motivated by the safety and security issues related to cyber-physical systems with potentially multi-rate, delayed, and nonuniformly sampled measurements, we investigate the attack detection and identification using the lifted system model in this paper. Attack detectability and identifiability based on the lifted system model are formally defined and rigorously characterized in a novel approach.… ▽ More

    Submitted 8 December, 2022; originally announced December 2022.

    Comments: It is the preprint of a paper submitted to Automatica

  27. arXiv:2212.04018  [pdf, other

    cs.RO

    An Open-Source Gazebo Plugin for GNSS Multipath Signal Emulation in Virtual Urban Canyons

    Authors: Kartik Anand Pant, Zhanpeng Yang, James M Goppert, Inseok Hwang

    Abstract: One of the major errors affecting GNSS signals in urban canyons is GNSS multipath error. In this work, we develop a Gazebo plugin which utilizes a ray tracing technique to account for multipath effects in a virtual urban canyon environment using virtual satellites. This software plugin balances accuracy and computational complexity to run the simulation in real-time for both software-in-the-loop (… ▽ More

    Submitted 7 December, 2022; originally announced December 2022.

    Comments: 13 pages, 8 figures

  28. arXiv:2211.07734  [pdf, other

    physics.app-ph

    Superconducting Niobium Tip Electron Beam Source

    Authors: Cameron W. Johnson, Andreas K. Schmid, Marian Mankos, Robin Röpke, Nicole Kerker, Ing-Shouh Hwang, Ed K. Wong, D. Frank Ogletree, Andrew M. Minor, Alexander Stibor

    Abstract: Modern electron microscopy and spectroscopy is a key technology for studying the structure and composition of quantum and biological materials in fundamental and applied sciences. High-resolution spectroscopic techniques and aberration-corrected microscopes are often limited by the relatively large energy distribution of currently available beam sources. This can be improved by a monochromator, wi… ▽ More

    Submitted 14 November, 2022; originally announced November 2022.

  29. arXiv:2211.05203  [pdf, other

    eess.SY

    Data-driven Cyberattack Synthesis against Network Control Systems

    Authors: Omanshu Thapliyal, Inseok Hwang

    Abstract: Network Control Systems (NCSs) pose unique vulnerabilities to cyberattacks due to a heavy reliance on communication channels. These channels can be susceptible to eavesdropping, false data injection (FDI), and denial of service (DoS). As a result, smarter cyberattacks can employ a combination of techniques to cause degradation of the considered NCS performance. We consider a white-box cyberattack… ▽ More

    Submitted 9 November, 2022; originally announced November 2022.

    Comments: 10 pages, 5 figures

  30. arXiv:2211.03732  [pdf, other

    eess.SY

    Approximating Reachable Sets for Neural Network based Models in Real-Time via Optimal Control

    Authors: Omanshu Thapliyal, Inseok Hwang

    Abstract: In this paper, we present a data-driven framework for real-time estimation of reachable sets for control systems where the plant is modeled using neural networks (NNs). We utilize a running example of a quadrotor model that is learned using trajectory data via NNs. The NN learned offline, can be excited online to obtain linear approximations for reachability analysis. We use a dynamic mode decompo… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: 14 pages, 11 figures, journal paper that has been conditionally accepted

  31. arXiv:2211.03310  [pdf, other

    eess.SY

    Log-linear Dynamic Inversion Control with Provable Safety Guarantees in Lie Groups

    Authors: Li-Yu Lin, James Goppert, Inseok Hwang

    Abstract: In this paper, we use the derivative of the exponential map to derive the exact evolution of the logarithm of the tracking error for mixed-invariant systems, a class of systems capable of describing rigid body tracking problems in Lie groups. Additionally, we design a log-linear dynamic inversion-based control law to remove the nonlinearities due to spatial curvature and enhance the robustness of… ▽ More

    Submitted 13 August, 2023; v1 submitted 7 November, 2022; originally announced November 2022.

    Comments: 7 pages, 5 figures. Revision is submitted to IEEE TAC

  32. arXiv:2211.02291  [pdf, other

    cs.CV cs.AI cs.LG

    SelecMix: Debiased Learning by Contradicting-pair Sampling

    Authors: Inwoo Hwang, Sangjun Lee, Yunhyeok Kwak, Seong Joon Oh, Damien Teney, Jin-Hwa Kim, Byoung-Tak Zhang

    Abstract: Neural networks trained with ERM (empirical risk minimization) sometimes learn unintended decision rules, in particular when their training data is biased, i.e., when training labels are strongly correlated with undesirable features. To prevent a network from learning such features, recent methods augment training data such that examples displaying spurious correlations (i.e., bias-aligned example… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

    Comments: NeurIPS 2022

  33. arXiv:2211.01327  [pdf, other

    cs.SD cs.CL cs.LG eess.AS

    Predicting phoneme-level prosody latents using AR and flow-based Prior Networks for expressive speech synthesis

    Authors: Konstantinos Klapsas, Karolos Nikitaras, Nikolaos Ellinas, June Sig Sung, Inchul Hwang, Spyros Raptis, Aimilios Chalamandaris, Pirros Tsiakoulis

    Abstract: A large part of the expressive speech synthesis literature focuses on learning prosodic representations of the speech signal which are then modeled by a prior distribution during inference. In this paper, we compare different prior architectures at the task of predicting phoneme level prosodic representations extracted with an unsupervised FVAE model. We use both subjective and objective metrics t… ▽ More

    Submitted 2 November, 2022; originally announced November 2022.

    Comments: Submitted to ICASSP 2023

  34. arXiv:2211.00523  [pdf, other

    cs.SD cs.CL cs.LG eess.AS

    Learning utterance-level representations through token-level acoustic latents prediction for Expressive Speech Synthesis

    Authors: Karolos Nikitaras, Konstantinos Klapsas, Nikolaos Ellinas, Georgia Maniati, June Sig Sung, Inchul Hwang, Spyros Raptis, Aimilios Chalamandaris, Pirros Tsiakoulis

    Abstract: This paper proposes an Expressive Speech Synthesis model that utilizes token-level latent prosodic variables in order to capture and control utterance-level attributes, such as character acting voice and speaking style. Current works aim to explicitly factorize such fine-grained and utterance-level speech attributes into different representations extracted by modules that operate in the correspond… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: Submitted to ICASSP 2023

  35. arXiv:2211.00375  [pdf, other

    cs.SD cs.CL cs.LG eess.AS

    Generating Multilingual Gender-Ambiguous Text-to-Speech Voices

    Authors: Konstantinos Markopoulos, Georgia Maniati, Georgios Vamvoukakis, Nikolaos Ellinas, Georgios Vardaxoglou, Panos Kakoulidis, Junkwang Oh, Gunu Jho, Inchul Hwang, Aimilios Chalamandaris, Pirros Tsiakoulis, Spyros Raptis

    Abstract: The gender of any voice user interface is a key element of its perceived identity. Recently, there has been increasing interest in interfaces where the gender is ambiguous rather than clearly identifying as female or male. This work addresses the task of generating novel gender-ambiguous TTS voices in a multi-speaker, multilingual setting. This is accomplished by efficiently sampling from a latent… ▽ More

    Submitted 11 June, 2023; v1 submitted 1 November, 2022; originally announced November 2022.

    Comments: Accepted to INTERSPEECH 2023

  36. arXiv:2211.00342  [pdf, other

    cs.SD cs.CL cs.LG eess.AS

    Investigating Content-Aware Neural Text-To-Speech MOS Prediction Using Prosodic and Linguistic Features

    Authors: Alexandra Vioni, Georgia Maniati, Nikolaos Ellinas, June Sig Sung, Inchul Hwang, Aimilios Chalamandaris, Pirros Tsiakoulis

    Abstract: Current state-of-the-art methods for automatic synthetic speech evaluation are based on MOS prediction neural models. Such MOS prediction models include MOSNet and LDNet that use spectral features as input, and SSL-MOS that relies on a pretrained self-supervised learning model that directly uses the speech signal as input. In modern high-quality neural TTS systems, prosodic appropriateness with re… ▽ More

    Submitted 7 May, 2023; v1 submitted 1 November, 2022; originally announced November 2022.

    Comments: Proceedings of ICASSP 2023

  37. arXiv:2210.17264   

    cs.SD cs.CL cs.LG eess.AS

    Cross-lingual Text-To-Speech with Flow-based Voice Conversion for Improved Pronunciation

    Authors: Nikolaos Ellinas, Georgios Vamvoukakis, Konstantinos Markopoulos, Georgia Maniati, Panos Kakoulidis, June Sig Sung, Inchul Hwang, Spyros Raptis, Aimilios Chalamandaris, Pirros Tsiakoulis

    Abstract: This paper presents a method for end-to-end cross-lingual text-to-speech (TTS) which aims to preserve the target language's pronunciation regardless of the original speaker's language. The model used is based on a non-attentive Tacotron architecture, where the decoder has been replaced with a normalizing flow network conditioned on the speaker identity, allowing both TTS and voice conversion (VC)… ▽ More

    Submitted 27 February, 2024; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: Fundamental changes to the model described and experimental procedure

  38. arXiv:2210.10927  [pdf, other

    eess.SY

    A Novel Approach to Set-Membership Observer for Systems with Unknown Exogenous Inputs

    Authors: Marvin Jesse, Dawei Sun, Inseok Hwang

    Abstract: Motivated by the increasing need to monitor safety-critical systems subject to uncertainties, a novel set-membership approach is proposed to estimate the state of a dynamical system with unknown-but-bounded exogenous inputs. The proposed method decomposes the system into the strongly observable and weakly unobservable subsystem in which an unknown input observer and an ellipsoidal set-membership o… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

  39. arXiv:2208.13843  [pdf, ps, other

    eess.SY math.OC

    Provably Stabilizing Model-Free Q-Learning for Unknown Bilinear Systems

    Authors: Shanelle G. Clarke, Omanshu Thapliyal, Inseok Hwang

    Abstract: In this paper, we present a provably convergent Model-Free ${Q}$-Learning algorithm that learns a stabilizing control policy for an unknown Bilinear System from a single online run. Given an unknown bilinear system, we study the interplay between its equivalent control-affine linear time-varying and linear time-invariant representations to derive i) from Pontryagin's Minimum Principle, a pair of p… ▽ More

    Submitted 29 August, 2022; originally announced August 2022.

    Comments: 7 pages, 1 figure, Submitted to IEEE Control Systems Letters (L-CSS)

  40. arXiv:2208.04832  [pdf, other

    cs.AI cs.LG cs.NE

    On the Importance of Critical Period in Multi-stage Reinforcement Learning

    Authors: Junseok Park, Inwoo Hwang, Min Whoo Lee, Hyunseok Oh, Minsu Lee, Youngki Lee, Byoung-Tak Zhang

    Abstract: The initial years of an infant's life are known as the critical period, during which the overall development of learning performance is significantly impacted due to neural plasticity. In recent studies, an AI agent, with a deep neural network mimicking mechanisms of actual neurons, exhibited a learning period similar to human's critical period. Especially during this initial period, the appropria… ▽ More

    Submitted 9 August, 2022; originally announced August 2022.

    Comments: Accepted by the ICML Complex Feedback in Online Learning Workshop (Open Problems) 2022

  41. Sparse Ellipsometry: Portable Acquisition of Polarimetric SVBRDF and Shape with Unstructured Flash Photography

    Authors: Inseung Hwang, Daniel S. Jeon, Adolfo Muñoz, Diego Gutierrez, Xin Tong, Min H. Kim

    Abstract: Ellipsometry techniques allow to measure polarization information of materials, requiring precise rotations of optical components with different configurations of lights and sensors. This results in cumbersome capture devices, carefully calibrated in lab conditions, and in very long acquisition times, usually in the order of a few days per object. Recent techniques allow to capture polarimetric sp… ▽ More

    Submitted 8 February, 2023; v1 submitted 9 July, 2022; originally announced July 2022.

    Journal ref: ACM Transactions on Graphics 41, 4, Article 133 (July 2022)

  42. arXiv:2207.03318  [pdf, other

    eess.SY

    State Prediction of Human-in-the-Loop Multi-rotor System with Stochastic Human Behavior Model

    Authors: Joonwon Choi, Sooyung Byeon, Inseok Hwang

    Abstract: Reachability analysis is a widely used method to analyze the safety of a Human-in-the-Loop Cyber Physical System (HiLCPS). This strategy allows the HiLCPS to respond against an imminent threat in advance by predicting reachable states of the system. However, it could lead to an unnecessarily conservative reachable set if the prediction only relies on the system dynamics without explicitly consider… ▽ More

    Submitted 7 July, 2022; originally announced July 2022.

    Comments: This work has been submitted to IFAC for possible publication

  43. arXiv:2206.12455  [pdf, other

    cs.CV

    Ev-NeRF: Event Based Neural Radiance Field

    Authors: Inwoo Hwang, Junho Kim, Young Min Kim

    Abstract: We present Ev-NeRF, a Neural Radiance Field derived from event data. While event cameras can measure subtle brightness changes in high frame rates, the measurements in low lighting or extreme motion suffer from significant domain discrepancy with complex noise. As a result, the performance of event-based vision tasks does not transfer to challenging environments, where the event cameras are expect… ▽ More

    Submitted 5 March, 2023; v1 submitted 24 June, 2022; originally announced June 2022.

    Comments: Accepted to WACV 2023

  44. arXiv:2204.03811  [pdf

    cond-mat.mes-hall

    Observation of Mesoscopic Clathrate Structures in Ethanol-Water Mixtures

    Authors: Wei-Hao Hsu, Tzu-Chieh Yen, Chien-Chun Chen, Chih-Wen Yang, Chung-Kai Fang, Ing-Shouh Hwang

    Abstract: Water-alcohol mixtures exhibit many abnormal physicochemical properties, the origins of which remain controversial. Here we use transmission electron microscopy (TEM), nanoparticle tracking analysis (NTA), and atomic force microscopy (AFM) to study ethanol-water mixtures. TEM reveals mesoscopic clathrate structures with water molecules forming a crystalline matrix hosting a high density of tiny ce… ▽ More

    Submitted 7 April, 2022; originally announced April 2022.

  45. arXiv:2203.12247  [pdf, other

    cs.CV

    Ev-TTA: Test-Time Adaptation for Event-Based Object Recognition

    Authors: Junho Kim, Inwoo Hwang, Young Min Kim

    Abstract: We introduce Ev-TTA, a simple, effective test-time adaptation algorithm for event-based object recognition. While event cameras are proposed to provide measurements of scenes with fast motions or drastic illumination changes, many existing event-based recognition algorithms suffer from performance deterioration under extreme conditions due to significant domain shifts. Ev-TTA mitigates the severe… ▽ More

    Submitted 28 March, 2022; v1 submitted 23 March, 2022; originally announced March 2022.

    Comments: Accepted to CVPR 2022

  46. arXiv:2110.11863  [pdf, ps, other

    math.FA

    Operator-valued rational functions

    Authors: Raul E. Curto, In Sung Hwang, Woo Young Lee

    Abstract: In this paper we show that every inner divisor of the operator-valued coordinate function, $zI_E$, is a Blaschke-Potapov factor. We also introduce a notion of operator-valued "rational" function and then show that $Δ$ is two-sided inner and rational if and only if it can be represented as a finite Blaschke-Potapov product; this extends to operator-valued functions the well-known result proved by V… ▽ More

    Submitted 22 October, 2021; originally announced October 2021.

    MSC Class: 47

  47. arXiv:2110.02509  [pdf, other

    eess.SY eess.SP

    Design and Implementation of 5.8GHz RF Wireless PowerTransfer System

    Authors: Je Hyeon Park, Nguyen Minh Tran, Sa Il Hwang, Dong In Kim, Kae Won Choi

    Abstract: In this paper, we present a 5.8 GHz radio-frequency (RF) wireless power transfer (WPT) system that consists of 64 transmit antennas and 16 receive antennas. Unlike the inductive or resonant coupling-based near-field WPT, RF WPT has a great advantage in powering low-power internet of things (IoT) devices with its capability of long-range wireless power transfer. We also propose a beam scanning algo… ▽ More

    Submitted 6 October, 2021; originally announced October 2021.

  48. arXiv:2108.13022  [pdf

    cond-mat.mtrl-sci cond-mat.other physics.app-ph quant-ph

    Highly efficient nonvolatile magnetization switching and multi-level states by current in single van der Waals topological ferromagnet Fe3GeTe2

    Authors: Kaixuan Zhang, Youjin Lee, Matthew J. Coak, Junghyun Kim, Suhan Son, Inho Hwang, Dong-Su Ko, Youngtek Oh, Insu Jeon, Dohun Kim, Changgan Zeng, Hyun-Woo Lee, Je-Geun Park

    Abstract: Robust multi-level spin memory with the ability to write information electrically is a long-sought capability in spintronics, with great promise for applications. Here we achieve nonvolatile and highly energy-efficient magnetization switching in a single-material device formed of van-der-Waals topological ferromagnet Fe3GeTe2, whose magnetic information can be readily controlled by a tiny current.… ▽ More

    Submitted 30 August, 2021; originally announced August 2021.

    Comments: Accepted by Advanced Functional Materials; 28 pages, 5 main figures, 4 supporting figures

    Journal ref: Advanced Functional Materials 31, 2105992 (2021)

  49. arXiv:2108.12111  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall physics.app-ph

    Gigantic current control of coercive field and magnetic memory based on nm-thin ferromagnetic van der Waals Fe3GeTe2

    Authors: Kaixuan Zhang, Seungyun Han, Youjin Lee, Matthew J. Coak, Junghyun Kim, Inho Hwang, Suhan Son, Jeacheol Shin, Mijin Lim, Daegeun Jo, Kyoo Kim, Dohun Kim, Hyun-Woo Lee, Je-Geun Park

    Abstract: Controlling magnetic states by a small current is essential for the next-generation of energy-efficient spintronic devices. However, it invariably requires considerable energy to change a magnetic ground state of intrinsically quantum nature governed by fundamental Hamiltonian, once stabilized below a phase transition temperature. We report that surprisingly an in-plane current can tune the magnet… ▽ More

    Submitted 1 September, 2021; v1 submitted 27 August, 2021; originally announced August 2021.

    Comments: 61 pages, 4 main figures, 14 supporting figures

    Journal ref: Advanced Materials 33, 2004110 (2021)

  50. Unconventional hysteretic transition in a charge density wave

    Authors: B. Q. Lv, Alfred Zong, D. Wu, A. V. Rozhkov, Boris V. Fine, Su-Di Chen, Makoto Hashimoto, Dong-Hui Lu, M. Li, Y. -B. Huang, Jacob P. C. Ruff, Donald A. Walko, Z. H. Chen, Inhui Hwang, Yifan Su, Xiaozhe Shen, Xirui Wang, Fei Han, Hoi Chun Po, Yao Wang, Pablo Jarillo-Herrero, Xijie Wang, Hua Zhou, Cheng-Jun Sun, Haidan Wen , et al. (3 additional authors not shown)

    Abstract: Hysteresis underlies a large number of phase transitions in solids, giving rise to exotic metastable states that are otherwise inaccessible. Here, we report an unconventional hysteretic transition in a quasi-2D material, EuTe4. By combining transport, photoemission, diffraction, and x-ray absorption measurements, we observed that the hysteresis loop has a temperature width of more than 400 K, sett… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

    Journal ref: Phys. Rev. Lett. 128, 036401 (2022)