Showing 1–50 of 177 results for author: Huang, L

Searching in archive eess.
  1. arXiv:2409.03715  [pdf, other]

    cs.SD cs.AI eess.AS

    Applications and Advances of Artificial Intelligence in Music Generation: A Review

    Authors: Yanxu Chen, Linshu Huang, Tian Gou

    Abstract: In recent years, artificial intelligence (AI) has made significant progress in the field of music generation, driving innovation in music creation and applications. This paper provides a systematic review of the latest research advancements in AI music generation, covering key technologies, models, datasets, evaluation methods, and their practical applications across various fields. The main contr…

    Submitted 3 September, 2024; originally announced September 2024.

  2. arXiv:2408.16215  [pdf, ps, other]

    math.OC cs.LG cs.PF eess.SY

    Adversarial Network Optimization under Bandit Feedback: Maximizing Utility in Non-Stationary Multi-Hop Networks

    Authors: Yan Dai, Longbo Huang

    Abstract: Stochastic Network Optimization (SNO) concerns scheduling in stochastic queueing systems. It has been widely studied in network theory. Classical SNO algorithms require network conditions to be stationary with time, which fails to capture the non-stationary components in many real-world scenarios. Many existing algorithms also assume knowledge of network conditions before decision, which rules out…

    Submitted 28 August, 2024; originally announced August 2024.

  3. arXiv:2408.10680  [pdf, other]

    cs.CL cs.SD eess.AS

    Towards Rehearsal-Free Multilingual ASR: A LoRA-based Case Study on Whisper

    Authors: Tianyi Xu, Kaixun Huang, Pengcheng Guo, Yu Zhou, Longtao Huang, Hui Xue, Lei Xie

    Abstract: Pre-trained multilingual speech foundation models, like Whisper, have shown impressive performance across different languages. However, adapting these models to new or specific languages is computationally extensive and faces catastrophic forgetting problems. Addressing these issues, our study investigates strategies to enhance the model on new languages in the absence of original training data, w…

    Submitted 20 August, 2024; originally announced August 2024.

  4. arXiv:2408.08849  [pdf, other]

    eess.SP

    ECG-Chat: A Large ECG-Language Model for Cardiac Disease Diagnosis

    Authors: Yubao Zhao, Tian Zhang, Xu Wang, Puyu Han, Tong Chen, Linlin Huang, Youzhu Jin, Jiaju Kang

    Abstract: The success of Multimodal Large Language Models (MLLMs) in the medical auxiliary field shows great potential, allowing patients to engage in conversations using physiological signal data. However, general MLLMs perform poorly in cardiac disease diagnosis, particularly in the integration of ECG data analysis and long-text medical report generation, mainly due to the complexity of ECG data analysis…

    Submitted 16 August, 2024; originally announced August 2024.

  5. arXiv:2407.15903  [pdf, other]

    eess.IV

    Semantics Guided Disentangled GAN for Chest X-ray Image Rib Segmentation

    Authors: Lili Huang, Dexin Ma, Xiaowei Zhao, Chenglong Li, Haifeng Zhao, Jin Tang, Chuanfu Li

    Abstract: The label annotations for chest X-ray image rib segmentation are time consuming and laborious, and the labeling quality heavily relies on medical knowledge of annotators. To reduce the dependency on annotated data, existing works often utilize generative adversarial network (GAN) to generate training data. However, GAN-based methods overlook the nuanced information specific to individual organs, w…

    Submitted 22 July, 2024; originally announced July 2024.

  6. arXiv:2407.05259  [pdf, other]

    eess.IV cs.AI cs.CV cs.LG

    Multi-scale Conditional Generative Modeling for Microscopic Image Restoration

    Authors: Luzhe Huang, Xiongye Xiao, Shixuan Li, Jiawen Sun, Yi Huang, Aydogan Ozcan, Paul Bogdan

    Abstract: The advance of diffusion-based generative models in recent years has revolutionized state-of-the-art (SOTA) techniques in a wide variety of image analysis and synthesis tasks, whereas their adaptation on image restoration, particularly within computational microscopy remains theoretically and empirically underexplored. In this research, we introduce a multi-scale generative model that enhances con…

    Submitted 7 July, 2024; originally announced July 2024.

  7. arXiv:2407.04675  [pdf, other]

    eess.AS cs.SD

    Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition

    Authors: Ye Bai, Jingping Chen, Jitong Chen, Wei Chen, Zhuo Chen, Chuang Ding, Linhao Dong, Qianqian Dong, Yujiao Du, Kepan Gao, Lu Gao, Yi Guo, Minglun Han, Ting Han, Wenchao Hu, Xinying Hu, Yuxiang Hu, Deyu Hua, Lu Huang, Mingkun Huang, Youjia Huang, Jishuo Jin, Fanliu Kong, Zongwei Lan, Tianyu Li, et al. (30 additional authors not shown)

    Abstract: Modern automatic speech recognition (ASR) model is required to accurately transcribe diverse speech signals (from different domains, languages, accents, etc) given the specific contextual information in various application scenarios. Classic end-to-end models fused with extra language models perform well, but mainly in data matching scenarios and are gradually approaching a bottleneck. In this wor…

    Submitted 10 July, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

  8. arXiv:2407.04353  [pdf, other]

    eess.IV cs.CV

    Segmenting Medical Images: From UNet to Res-UNet and nnUNet

    Authors: Lina Huang, Alina Miron, Kate Hone, Yongmin Li

    Abstract: This study provides a comparative analysis of deep learning models including UNet, Res-UNet, Attention Res-UNet, and nnUNet, and evaluates their performance in brain tumour, polyp, and multi-class heart segmentation tasks. The analysis focuses on precision, accuracy, recall, Dice Similarity Coefficient (DSC), and Intersection over Union (IoU) to assess their clinical applicability. In brain tumour…

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 7 pages, 3 figures

  9. arXiv:2407.02160  [pdf, ps, other]

    eess.SP

    Intelligent Reflecting Surface-Assisted NLOS Sensing With OFDM Signals

    Authors: Jilin Wang, Jun Fang, Hongbin Li, Lei Huang

    Abstract: This work addresses the problem of intelligent reflecting surface (IRS) assisted target sensing in a non-line-of-sight (NLOS) scenario, where an IRS is employed to facilitate the radar/access point (AP) to sense the targets when the line-of-sight (LOS) path between the AP and the target is blocked by obstacles. To sense the targets, the AP transmits a train of uniformly-spaced orthogonal frequency…

    Submitted 2 July, 2024; originally announced July 2024.

  10. arXiv:2406.12456  [pdf, other]

    eess.IV cs.CV

    Deep-learning-based groupwise registration for motion correction of cardiac $T_1$ mapping

    Authors: Yi Zhang, Yidong Zhao, Lu Huang, Liming Xia, Qian Tao

    Abstract: Quantitative $T_1$ mapping by MRI is an increasingly important tool for clinical assessment of cardiovascular diseases. The cardiac $T_1$ map is derived by fitting a known signal model to a series of baseline images, while the quality of this map can be deteriorated by involuntary respiratory and cardiac motion. To correct motion, a template image is often needed to register all baseline images, b…

    Submitted 21 June, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

    Comments: MICCAI 2024. Contents may slightly differ from the camera-ready version

  11. arXiv:2406.08081  [pdf]

    eess.SP

    CLDTA: Contrastive Learning based on Diagonal Transformer Autoencoder for Cross-Dataset EEG Emotion Recognition

    Authors: Yuan Liao, Yuhong Zhang, Shenghuan Wang, Xiruo Zhang, Yiling Zhang, Wei Chen, Yuzhe Gu, Liya Huang

    Abstract: Recent advances in non-invasive EEG technology have broadened its application in emotion recognition, yielding a multitude of related datasets. Yet, deep learning models struggle to generalize across these datasets due to variations in acquisition equipment and emotional stimulus materials. To address the pressing need for a universal model that fluidly accommodates diverse EEG dataset formats and…

    Submitted 12 June, 2024; originally announced June 2024.

  12. arXiv:2406.07409  [pdf, other]

    stat.ML cs.IT cs.LG eess.SP math.OC

    Accelerating Ill-conditioned Hankel Matrix Recovery via Structured Newton-like Descent

    Authors: HanQin Cai, Longxiu Huang, Xiliang Lu, Juntao You

    Abstract: This paper studies the robust Hankel recovery problem, which simultaneously removes the sparse outliers and fulfills missing entries from the partial observation. We propose a novel non-convex algorithm, coined Hankel Structured Newton-Like Descent (HSNLD), to tackle the robust Hankel recovery problem. HSNLD is highly efficient with linear convergence, and its convergence rate is independent of th…

    Submitted 11 June, 2024; originally announced June 2024.

    MSC Class: 15A29; 15A83; 47B35; 90C17; 90C26; 90C53

  13. arXiv:2406.06534  [pdf, other]

    cs.CV eess.IV physics.optics

    Compressed Meta-Optical Encoder for Image Classification

    Authors: Anna Wirth-Singh, Jinlin Xiang, Minho Choi, Johannes E. Fröch, Luocheng Huang, Shane Colburn, Eli Shlizerman, Arka Majumdar

    Abstract: Optical and hybrid convolutional neural networks (CNNs) recently have become of increasing interest to achieve low-latency, low-power image classification and computer vision tasks. However, implementing optical nonlinearity is challenging, and omitting the nonlinear layers in a standard CNN comes at a significant reduction in accuracy. In this work, we use knowledge distillation to compress modif…

    Submitted 14 June, 2024; v1 submitted 22 April, 2024; originally announced June 2024.

  14. arXiv:2405.17716  [pdf, ps, other]

    eess.SP

    Soft Multipath Information-Based UWB Tracking in Cluttered Scenarios: Preliminaries and Validations

    Authors: Chenglong Li, Zukun Lu, Long Huang, Shaojie Ni, Guangfu Sun, Emmeric Tanghe, Wout Joseph

    Abstract: In this paper, we investigate ultra-wideband (UWB) localization and tracking in cluttered environments. Instead of mitigating the multipath, we exploit the specular reflections to enhance the localizability and improve the positioning accuracy. With the assistance of the multipath, it is also possible to achieve localization purposes using fewer anchors or when the line-of-sight propagations are b…

    Submitted 28 May, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

  15. arXiv:2404.18458  [pdf]

    eess.IV cs.CV cs.LG physics.med-ph

    Autonomous Quality and Hallucination Assessment for Virtual Tissue Staining and Digital Pathology

    Authors: Luzhe Huang, Yuzhu Li, Nir Pillar, Tal Keidar Haran, William Dean Wallace, Aydogan Ozcan

    Abstract: Histopathological staining of human tissue is essential in the diagnosis of various diseases. The recent advances in virtual tissue staining technologies using AI alleviate some of the costly and tedious steps involved in the traditional histochemical staining process, permitting multiplexed rapid staining of label-free tissue without using staining reagents, while also preserving tissue. However,…

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 37 Pages, 7 Figures

  16. arXiv:2404.13376  [pdf, other]

    eess.SY

    Cross-Forming Control and Fault Current Limiting for Grid-Forming Inverters

    Authors: Xiuqiang He, Maitraya Avadhut Desai, Linbin Huang, Florian Dörfler

    Abstract: This article proposes a "cross-forming" control concept for grid-forming inverters operating against grid faults. Cross-forming refers to voltage angle forming and current magnitude forming. It differs from classical grid-forming and grid-following paradigms that feature voltage magnitude-and-angle forming and voltage magnitude-and-angle following (or current magnitude-and-angle forming), respecti…

    Submitted 19 July, 2024; v1 submitted 20 April, 2024; originally announced April 2024.

  17. Saturation-Informed Current-Limiting Control for Grid-Forming Converters

    Authors: Maitraya Avadhut Desai, Xiuqiang He, Linbin Huang, Florian Dörfler

    Abstract: In this paper, we investigate the transient stability of a state-of-the-art grid-forming complex-droop control (i.e., dispatchable virtual oscillator control, dVOC) under current saturation. We quantify the saturation level of a converter by introducing the concept of degree of saturation (DoS), and we propose a provably stable current-limiting control with saturation-informed feedback, which feed…

    Submitted 1 July, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

    Journal ref: Electric Power Systems Research, 2024

  18. arXiv:2403.18275  [pdf, other]

    eess.SY

    Differentially Private Dual Gradient Tracking for Distributed Resource Allocation

    Authors: Wei Huo, Xiaomeng Chen, Lingying Huang, Karl Henrik Johansson, Ling Shi

    Abstract: This paper investigates privacy issues in distributed resource allocation over directed networks, where each agent holds a private cost function and optimizes its decision subject to a global coupling constraint through local interaction with other agents. Conventional methods for resource allocation over directed networks require all agents to transmit their original data to neighbors, which pose…

    Submitted 27 March, 2024; originally announced March 2024.

  19. arXiv:2403.17324  [pdf, ps, other]

    eess.SP

    Unsupervised Learning for Joint Beamforming Design in RIS-aided ISAC Systems

    Authors: Junjie Ye, Lei Huang, Zhen Chen, Peichang Zhang, Mohamed Rihan

    Abstract: It is critical to design efficient beamforming in reconfigurable intelligent surface (RIS)-aided integrated sensing and communication (ISAC) systems for enhancing spectrum utilization. However, conventional methods often have limitations, either incurring high computational complexity due to iterative algorithms or sacrificing performance when using heuristic methods. To achieve both low complexit…

    Submitted 15 May, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

    Comments: Accepted by IEEE Wireless Communications Letters

  20. arXiv:2403.16488  [pdf]

    eess.SY

    Ensuring Disturbance Rejection Performance by Synthesizing Grid-Following and Grid-Forming Inverters in Power Systems

    Authors: Fuyilong Ma, Huanhai Xin, Zhiyi Li, Linbin Huang

    Abstract: To satisfy dynamic requirements of power systems, it is imperative for grid-tied inverters to ensure good disturbance rejection performance (DRP) under variable grid conditions. This letter discovers and theoretically proves that for general networks, synthesizing grid-following (GFL) inverters and grid-forming (GFM) inverters can always more effectively ensure the DRP of multiple inverters, as co…

    Submitted 26 March, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

    Comments: 6 pages

  21. arXiv:2403.11626  [pdf, other]

    cs.GR cs.AI cs.CV cs.MM cs.SD eess.AS

    QEAN: Quaternion-Enhanced Attention Network for Visual Dance Generation

    Authors: Zhizhen Zhou, Yejing Huo, Guoheng Huang, An Zeng, Xuhang Chen, Lian Huang, Zinuo Li

    Abstract: The study of music-generated dance is a novel and challenging Image generation task. It aims to input a piece of music and seed motions, then generate natural dance movements for the subsequent music. Transformer-based methods face challenges in time series prediction tasks related to human movements and music due to their struggle in capturing the nonlinear relationship and temporal aspects. This…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted by The Visual Computer Journal

  22. arXiv:2403.06920  [pdf, other]

    eess.SP eess.SY

    Distributed Average Consensus via Noisy and Non-Coherent Over-the-Air Aggregation

    Authors: Huiwen Yang, Xiaomeng Chen, Lingying Huang, Subhrakanti Dey, Ling Shi

    Abstract: Over-the-air aggregation has attracted widespread attention for its potential advantages in task-oriented applications, such as distributed sensing, learning, and consensus. In this paper, we develop a communication-efficient distributed average consensus protocol by utilizing over-the-air aggregation, which exploits the superposition property of wireless channels rather than combat it. Noisy chan…

    Submitted 11 March, 2024; originally announced March 2024.

  23. arXiv:2403.06756  [pdf, other]

    eess.SP

    One-Bit Target Detection in Collocated MIMO Radar with Colored Background Noise

    Authors: Yu-Hang Xiao, David Ramírez, Lei Huang, Xiao Peng Li, Hing Cheung So

    Abstract: One-bit sampling has emerged as a promising technique in multiple-input multiple-output (MIMO) radar systems due to its ability to significantly reduce data volume and processing requirements. Nevertheless, current detection methods have not adequately addressed the impact of colored noise, which is frequently encountered in real scenarios. In this paper, we present a novel detection method that a…

    Submitted 26 April, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  24. arXiv:2401.17793  [pdf, other]

    eess.SY

    Optimal Dynamic Ancillary Services Provision Based on Local Power Grid Perception

    Authors: Verena Häberle, Xiuqiang He, Linbin Huang, Eduardo Prieto-Araujo, Florian Dörfler

    Abstract: In this paper, we propose a systematic closed-loop approach to provide optimal dynamic ancillary services with converter-interfaced generation systems based on local power grid perception. In particular, we structurally encode dynamic ancillary services such as fast frequency and voltage regulation in the form of a parametric transfer function matrix, which includes several parameters to define a…

    Submitted 28 August, 2024; v1 submitted 31 January, 2024; originally announced January 2024.

    Comments: 15 pages, 20 Figures

  25. arXiv:2401.05614  [pdf]

    cs.SD cs.MM eess.AS

    Self-Attention and Hybrid Features for Replay and Deep-Fake Audio Detection

    Authors: Lian Huang, Chi-Man Pun

    Abstract: Due to the successful application of deep learning, audio spoofing detection has made significant progress. Spoofed audio with speech synthesis or voice conversion can be well detected by many countermeasures. However, an automatic speaker verification system is still vulnerable to spoofing attacks such as replay or Deep-Fake audio. Deep-Fake audio means that the spoofed utterances are generated u…

    Submitted 10 January, 2024; originally announced January 2024.

  26. arXiv:2401.02662  [pdf, other]

    cs.NI eess.SP

    GainNet: Coordinates the Odd Couple of Generative AI and 6G Networks

    Authors: Ning Chen, Jie Yang, Zhipeng Cheng, Xuwei Fan, Zhang Liu, Bangzhen Huang, Yifeng Zhao, Lianfen Huang, Xiaojiang Du, Mohsen Guizani

    Abstract: The rapid expansion of AI-generated content (AIGC) reflects the iteration from assistive AI towards generative AI (GAI) with creativity. Meanwhile, the 6G networks will also evolve from the Internet-of-everything to the Internet-of-intelligence with hybrid heterogeneous network architectures. In the future, the interplay between GAI and the 6G will lead to new opportunities, where GAI can learn th…

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: 10 pages, 5 figures, 1 table

  27. arXiv:2401.00153  [pdf, other]

    eess.IV

    USFM: A Universal Ultrasound Foundation Model Generalized to Tasks and Organs towards Label Efficient Image Analysis

    Authors: Jing Jiao, Jin Zhou, Xiaokang Li, Menghua Xia, Yi Huang, Lihong Huang, Na Wang, Xiaofan Zhang, Shichong Zhou, Yuanyuan Wang, Yi Guo

    Abstract: Inadequate generality across different organs and tasks constrains the application of ultrasound (US) image analysis methods in smart healthcare. Building a universal US foundation model holds the potential to address these issues. Nevertheless, the development of such foundational models encounters intrinsic challenges in US analysis, i.e., insufficient databases, low quality, and ineffective fea…

    Submitted 2 January, 2024; v1 submitted 30 December, 2023; originally announced January 2024.

    Comments: Submitted to MedIA, 17 pages, 11 figures

  28. arXiv:2312.17528  [pdf]

    eess.SY

    Characterizing the Role of Complex Power in Small-Signal Synchronization Stability of Multi-Converter Power Systems

    Authors: Fuyilong Ma, Huanhai Xin, Zhiyi Li, Linbin Huang

    Abstract: Small-signal synchronization instability (SSI) may be triggered when a grid-connected converter is operated in weak grids. This problem is highly related to the active and reactive power (referred to as complex power) generation or consumption of the converter. Such an instability phenomenon manifests as power oscillations within the bandwidth frequency of phase-locked loop (PLL). However, in a mu…

    Submitted 13 February, 2024; v1 submitted 29 December, 2023; originally announced December 2023.

    Comments: 11 pages

  29. arXiv:2312.10687  [pdf, other]

    eess.AS cs.SD

    MM-TTS: Multi-modal Prompt based Style Transfer for Expressive Text-to-Speech Synthesis

    Authors: Wenhao Guan, Yishuang Li, Tao Li, Hukai Huang, Feng Wang, Jiayan Lin, Lingyan Huang, Lin Li, Qingyang Hong

    Abstract: The style transfer task in Text-to-Speech refers to the process of transferring style information into text content to generate corresponding speech with a specific style. However, most existing style transfer approaches are either based on fixed emotional labels or reference speech clips, which cannot achieve flexible style transfer. Recently, some methods have adopted text descriptions to guide…

    Submitted 31 January, 2024; v1 submitted 17 December, 2023; originally announced December 2023.

    Comments: Accepted at AAAI2024

  30. arXiv:2312.10343  [pdf, other]

    eess.SP cs.AR cs.LG cs.NE

    In-Sensor Radio Frequency Computing for Energy-Efficient Intelligent Radar

    Authors: Yang Sui, Minning Zhu, Lingyi Huang, Chung-Tse Michael Wu, Bo Yuan

    Abstract: Radio Frequency Neural Networks (RFNNs) have demonstrated advantages in realizing intelligent applications across various domains. However, as the model size of deep neural networks rapidly increases, implementing large-scale RFNN in practice requires an extensive number of RF interferometers and consumes a substantial amount of energy. To address this challenge, we propose to utilize low-rank dec…

    Submitted 16 December, 2023; originally announced December 2023.

  31. arXiv:2311.16433  [pdf, ps, other]

    eess.SP

    Energy Efficiency Optimization in Active Reconfigurable Intelligent Surface-Aided Integrated Sensing and Communication Systems

    Authors: Junjie Ye, Mohamed Rihan, Peichang Zhang, Lei Huang, Stefano Buzzi, Zhen Chen

    Abstract: Energy efficiency (EE) is a challenging task in integrated sensing and communication (ISAC) systems, where high spectral efficiency and low energy consumption appear as conflicting requirements. Although passive reconfigurable intelligent surface (RIS) has emerged as a promising technology for enhancing the EE of the ISAC system, the multiplicative fading feature hinders its effectiveness. This pa…

    Submitted 27 November, 2023; originally announced November 2023.

  32. arXiv:2311.08966  [pdf, other]

    cs.CL cs.SD eess.AS

    Improving Large-scale Deep Biasing with Phoneme Features and Text-only Data in Streaming Transducer

    Authors: Jin Qiu, Lu Huang, Boyu Li, Jun Zhang, Lu Lu, Zejun Ma

    Abstract: Deep biasing for the Transducer can improve the recognition performance of rare words or contextual entities, which is essential in practical applications, especially for streaming Automatic Speech Recognition (ASR). However, deep biasing with large-scale rare words remains challenging, as the performance drops significantly when more distractors exist and there are words with similar grapheme seq…

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: Submitted to ASRU 2023

  33. arXiv:2311.03815  [pdf, other]

    cs.NI eess.SP

    Integrated Sensing, Communication, and Computing for Cost-effective Multimodal Federated Perception

    Authors: Ning Chen, Zhipeng Cheng, Xuwei Fan, Bangzhen Huang, Yifeng Zhao, Lianfen Huang, Xiaojiang Du, Mohsen Guizani

    Abstract: Federated learning (FL) is a classic paradigm of 6G edge intelligence (EI), which alleviates privacy leaks and high communication pressure caused by traditional centralized data processing in the artificial intelligence of things (AIoT). The implementation of multimodal federated perception (MFP) services involves three sub-processes, including sensing-based multimodal data generation, communicati…

    Submitted 7 November, 2023; originally announced November 2023.

  34. arXiv:2310.16592  [pdf, other]

    cs.LG cs.DC eess.SP

    Over-the-air Federated Policy Gradient

    Authors: Huiwen Yang, Lingying Huang, Subhrakanti Dey, Ling Shi

    Abstract: In recent years, over-the-air aggregation has been widely considered in large-scale distributed learning, optimization, and sensing. In this paper, we propose the over-the-air federated policy gradient algorithm, where all agents simultaneously broadcast an analog signal carrying local information to a common wireless channel, and a central controller uses the received aggregated waveform to updat…

    Submitted 25 February, 2024; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: To appear at IEEE ICC 2024

  35. Quantitative Stability Conditions for Grid-Forming Converters With Complex Droop Control

    Authors: Xiuqiang He, Linbin Huang, Irina Subotić, Verena Häberle, Florian Dörfler

    Abstract: In this paper, we analytically study the transient stability of grid-connected converters with grid-forming complex droop control, also known as dispatchable virtual oscillator control. We prove theoretically that complex droop control, as a state-of-the-art grid-forming control, always possesses steady-state equilibria whereas classical droop control does not. We provide quantitative conditions f…

    Submitted 23 May, 2024; v1 submitted 15 October, 2023; originally announced October 2023.

    Journal ref: IEEE Transactions on Power Electronics, 2024

  36. arXiv:2310.06873  [pdf, other]

    eess.IV cs.CV

    A review of uncertainty quantification in medical image analysis: probabilistic and non-probabilistic methods

    Authors: Ling Huang, Su Ruan, Yucheng Xing, Mengling Feng

    Abstract: The comprehensive integration of machine learning healthcare models within clinical practice remains suboptimal, notwithstanding the proliferation of high-performing solutions reported in the literature. A predominant factor hindering widespread adoption pertains to an insufficiency of evidence affirming the reliability of the aforementioned models. Recently, uncertainty quantification methods hav…

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2210.03736 by other authors

  37. arXiv:2310.03985  [pdf, other]

    cs.CL cs.LG cs.SD eess.AS

    Dementia Assessment Using Mandarin Speech with an Attention-based Speech Recognition Encoder

    Authors: Zih-Jyun Lin, Yi-Ju Chen, Po-Chih Kuo, Likai Huang, Chaur-Jong Hu, Cheng-Yu Chen

    Abstract: Dementia diagnosis requires a series of different testing methods, which is complex and time-consuming. Early detection of dementia is crucial as it can prevent further deterioration of the condition. This paper utilizes a speech recognition model to construct a dementia assessment system tailored for Mandarin speakers during the picture description task. By training an attention-based speech reco…

    Submitted 15 December, 2023; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: Accepted to IEEE ICASSP 2024

  38. arXiv:2310.01552  [pdf, other]

    eess.SY

    Dynamic Ancillary Services: From Grid Codes to Transfer Function-Based Converter Control

    Authors: Verena Häberle, Linbin Huang, Xiuqiang He, Eduardo Prieto-Araujo, Florian Dörfler

    Abstract: Conventional grid-code specifications for dynamic ancillary services provision such as fast frequency and voltage regulation are typically defined by means of piece-wise linear step-response capability curves in the time domain. However, although the specification of such time-domain curves is straightforward, their practical implementation in a converter-based generation system is not immediate,…

    Submitted 28 August, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: 8 pages, 11 figures

  39. arXiv:2309.08037  [pdf, other]

    eess.SY

    Gain and Phase: Decentralized Stability Conditions for Power Electronics-Dominated Power Systems

    Authors: Linbin Huang, Dan Wang, Xiongfei Wang, Huanhai Xin, Ping Ju, Karl H. Johansson, Florian Dörfler

    Abstract: This paper proposes decentralized stability conditions for multi-converter systems based on the combination of the small gain theorem and the small phase theorem. Instead of directly computing the closed-loop dynamics, e.g., eigenvalues of the state-space matrix, or using the generalized Nyquist stability criterion, the proposed stability conditions are more scalable and computationally lighter, w…

    Submitted 10 January, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

  40. arXiv:2309.05919  [pdf, other]

    eess.IV cs.CV

    Deep evidential fusion with uncertainty quantification and contextual discounting for multimodal medical image segmentation

    Authors: Ling Huang, Su Ruan, Pierre Decazes, Thierry Denoeux

    Abstract: Single-modality medical images generally do not contain enough information to reach an accurate and reliable diagnosis. For this reason, physicians generally diagnose diseases based on multimodal medical images such as, e.g., PET/CT. The effective fusion of multimodal information is essential to reach a reliable decision and explain how the decision is made as well. In this paper, we propose a fus…

    Submitted 18 August, 2024; v1 submitted 11 September, 2023; originally announced September 2023.

  41. arXiv:2309.01321  [pdf, other]

    eess.SY math.OC

    Joint Oscillation Damping and Inertia Provision Service for Converter-Interfaced Generation

    Authors: Cheng Feng, Linbin Huang, Xiuqiang He, Yi Wang, Florian Dörfler, Qixin Chen

    Abstract: As renewable generation becomes more prevalent, traditional power systems dominated by synchronous generators are transitioning to systems dominated by converter-interfaced generation. These devices, with their weaker damping capabilities and lower inertia, compromise the system's ability to withstand disturbances, pose a threat to system stability, and lead to oscillations and poor frequency resp…

    Submitted 3 September, 2023; originally announced September 2023.

    Comments: Submitted to an IEEE PES journal for possible publication

  42. arXiv:2308.13933  [pdf, other]

    physics.optics eess.IV

    Illumination strategies for space-bandwidth-time product improvement in Fourier ptychography

    Authors: Haibo Xu, Cheng Li, Mingzhe Wei, Ziwen Zhou, Longqian Huang

    Abstract: Fourier ptychography (FP) is a promising technique for high-throughput imaging. Reconstruction algorithms and illumination paradigm are two key aspects of FP. In this review, we mainly focus on illumination strategies in FP. We derive the space-bandwidth-time product (SBP-T) for the characterization of FP performance. Based on the analysis of SBP-T, we categorize the illumination strategy in FP ef…

    Submitted 26 August, 2023; originally announced August 2023.

  43. arXiv:2308.11627  [pdf, other]

    eess.SP cs.AI cs.CV eess.IV eess.SY

    Non-Intrusive Electric Load Monitoring Approach Based on Current Feature Visualization for Smart Energy Management

    Authors: Yiwen Xu, Dengfeng Liu, Liangtao Huang, Zhiquan Lin, Tiesong Zhao, Sam Kwong

    Abstract: The state-of-the-art smart city has been calling for an economic but efficient energy management over large-scale network, especially for the electric power system. It is a critical issue to monitor, analyze and control electric loads of all users in system. In this paper, we employ the popular computer vision techniques of AI to design a non-invasive load monitoring method for smart electric ener…

    Submitted 8 August, 2023; originally announced August 2023.

  44. arXiv:2308.00920  [pdf]

    physics.med-ph cs.CV cs.LG eess.IV

    Virtual histological staining of unlabeled autopsy tissue

    Authors: Yuzhu Li, Nir Pillar, Jingxi Li, Tairan Liu, Di Wu, Songyu Sun, Guangdong Ma, Kevin de Haan, Luzhe Huang, Sepehr Hamidi, Anatoly Urisman, Tal Keidar Haran, William Dean Wallace, Jonathan E. Zuckerman, Aydogan Ozcan

    Abstract: Histological examination is a crucial step in an autopsy; however, the traditional histochemical staining of post-mortem samples faces multiple challenges, including the inferior staining quality due to autolysis caused by delayed fixation of cadaver tissue, as well as the resource-intensive nature of chemical staining procedures covering large tissue areas, which demand substantial labor, cost, a…

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: 24 Pages, 7 Figures

    Journal ref: Nature Communications (2024)

  45. arXiv:2306.14503  [pdf, other]

    eess.SY

    Sensor Selection for Remote State Estimation with QoS Requirement Constraints

    Authors: Huiwen Yang, Lingying Huang, Chao Yang, Yilin Mo, Ling Shi

    Abstract: In this paper, we study the sensor selection problem for remote state estimation under the Quality-of-Service (QoS) requirement constraints. Multiple sensors are employed to observe a linear time-invariant system, and their measurements should be transmitted to a remote estimator for state estimation. However, due to the limited communication resources and the QoS requirement constraints, only som…

    Submitted 26 June, 2023; originally announced June 2023.

  46. arXiv:2306.13558  [pdf, other]

    eess.SP

    One-Bit Spectrum Sensing for Cognitive Radio

    Authors: Pei-Wen Wu, Lei Huang, David Ramírez, Yu-Hang Xiao, Hing Cheung So

    Abstract: Spectrum sensing in cognitive radio necessitates effective monitoring of wide bandwidths, which requires high-rate sampling. Traditional spectrum sensing methods employing high-precision analog-to-digital converters (ADCs) result in increased power consumption and expensive hardware costs. In this paper, we explore blind spectrum sensing utilizing one-bit ADCs. We derive a closed-form detector bas…

    Submitted 23 June, 2023; originally announced June 2023.

  47. arXiv:2306.04076  [pdf, other]

    cs.CL cs.SD eess.AS

    Text-only Domain Adaptation using Unified Speech-Text Representation in Transducer

    Authors: Lu Huang, Boyu Li, Jun Zhang, Lu Lu, Zejun Ma

    Abstract: Domain adaptation using text-only corpus is challenging in end-to-end(E2E) speech recognition. Adaptation by synthesizing audio from text through TTS is resource-consuming. We present a method to learn Unified Speech-Text Representation in Conformer Transducer(USTR-CT) to enable fast domain adaptation using the text-only corpus. Different from the previous textogram method, an extra text encoder i…

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: Submitted to Interspeech 2023

  48. arXiv:2305.16789  [pdf, other]

    cs.LG cs.CV eess.SP

    Modulate Your Spectrum in Self-Supervised Learning

    Authors: Xi Weng, Yunhao Ni, Tengwei Song, Jie Luo, Rao Muhammad Anwer, Salman Khan, Fahad Shahbaz Khan, Lei Huang

    Abstract: Whitening loss offers a theoretical guarantee against feature collapse in self-supervised learning (SSL) with joint embedding architectures. Typically, it involves a hard whitening approach, transforming the embedding and applying loss to the whitened output. In this work, we introduce Spectral Transformation (ST), a framework to modulate the spectrum of embedding and to seek for functions beyond…

    Submitted 21 January, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted at ICLR 2024. The code is available at https://github.com/winci-ai/intl

  49. arXiv:2305.12852  [pdf]

    cs.CV cs.LG eess.IV physics.optics

    Cycle Consistency-based Uncertainty Quantification of Neural Networks in Inverse Imaging Problems

    Authors: Luzhe Huang, Jianing Li, Xiaofu Ding, Yijie Zhang, Hanlong Chen, Aydogan Ozcan

    Abstract: Uncertainty estimation is critical for numerous applications of deep neural networks and draws growing attention from researchers. Here, we demonstrate an uncertainty quantification approach for deep neural networks used in inverse problems based on cycle consistency. We build forward-backward cycles using the physical forward model available and a trained deep neural network solving the inverse p…

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: 28 Pages, 4 Figures, 1 Table

    Journal ref: Intelligent Computing, AAAS (2023)

  50. arXiv:2305.00192  [pdf, other]

    eess.SY

    MIMO Grid Impedance Identification of Three-Phase Power Systems: Parametric vs. Nonparametric Approaches

    Authors: Verena Häberle, Linbin Huang, Xiuqiang He, Eduardo Prieto-Araujo, Roy S. Smith, Florian Dörfler

    Abstract: A fast and accurate grid impedance measurement of three-phase power systems is crucial for online assessment of power system stability and adaptive control of grid-connected converters. Existing grid impedance measurement approaches typically rely on pointwise sinusoidal injections or sequential wideband perturbations to identify a nonparametric grid impedance curve via fast Fourier computations i…

    Submitted 29 November, 2023; v1 submitted 29 April, 2023; originally announced May 2023.

    Comments: 7 pages, 7 figures