
Showing 1–50 of 266 results for author: Liu, T

Searching in archive eess.
  1. arXiv:2407.11481  [pdf, other]

    cs.LG cs.AI eess.SP

    Multi-Channel Masked Autoencoder and Comprehensive Evaluations for Reconstructing 12-Lead ECG from Arbitrary Single-Lead ECG

    Authors: Jiarong Chen, Wanqing Wu, Tong Liu, Shenda Hong

    Abstract: In the context of cardiovascular diseases (CVD) that exhibit an elevated prevalence and mortality, the electrocardiogram (ECG) is a popular and standard diagnostic tool for doctors, commonly utilizing a 12-lead configuration in clinical practice. However, the 10 electrodes placed on the surface would cause a lot of inconvenience and discomfort, while the rapidly advancing wearable devices adopt th…

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: Accepted by KDD-AIDSH 2024

  2. arXiv:2407.05758  [pdf, other]

    eess.IV cs.AI cs.CV

    Potential of Multimodal Large Language Models for Data Mining of Medical Images and Free-text Reports

    Authors: Yutong Zhang, Yi Pan, Tianyang Zhong, Peixin Dong, Kangni Xie, Yuxiao Liu, Hanqi Jiang, Zhengliang Liu, Shijie Zhao, Tuo Zhang, Xi Jiang, Dinggang Shen, Tianming Liu, Xin Zhang

    Abstract: Medical images and radiology reports are crucial for diagnosing medical conditions, highlighting the importance of quantitative analysis for clinical decision-making. However, the diversity and cross-source heterogeneity of these data challenge the generalizability of current data-mining methods. Multimodal large language models (MLLMs) have recently transformed many domains, significantly affecti…

    Submitted 8 July, 2024; originally announced July 2024.

  3. arXiv:2406.16041  [pdf, ps, other]

    eess.SP

    Gridless Parameter Estimation in Partly Calibrated Rectangular Arrays

    Authors: Tianyi Liu, Sai Pavan Deram, Khaled Ardah, Martin Haardt, Marc E. Pfetsch, Marius Pesavento

    Abstract: Spatial frequency estimation from a mixture of noisy sinusoids finds applications in various fields. While subspace-based methods offer cost-effective super-resolution parameter estimation, they demand precise array calibration, posing challenges for large antennas. In contrast, sparsity-based approaches outperform subspace methods, especially in scenarios with limited snapshots or correlated sour…

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: 16 pages, 5 figures. This work has been submitted to the IEEE Transactions on Signal Processing for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  4. arXiv:2406.14799  [pdf, other]

    cs.RO eess.SY

    Capture Point Control in Thruster-Assisted Bipedal Locomotion

    Authors: Shreyansh Pitroda, Aditya Bondada, Kaushik Venkatesh Krishnamurthy, Adarsh Salagame, Chenghao Wang, Taoran Liu, Bibek Gupta, Eric Sihite, Reza Nemovi, Alireza Ramezani, Morteza Gharib

    Abstract: Despite major advancements in control design that are robust to unplanned disturbances, bipedal robots are still susceptible to falling over and struggle to negotiate rough terrains. By utilizing thrusters in our bipedal robot, we can perform additional posture manipulation and expand the modes of locomotion to enhance the robot's stability and ability to negotiate rough and difficult-to-navigate…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Submitted and to be presented at IEEE AIM 2024. arXiv admin note: substantial text overlap with arXiv:2103.15952

  5. arXiv:2406.14186  [pdf, other]

    eess.IV cs.CV

    CriDiff: Criss-cross Injection Diffusion Framework via Generative Pre-train for Prostate Segmentation

    Authors: Tingwei Liu, Miao Zhang, Leiye Liu, Jialong Zhong, Shuyao Wang, Yongri Piao, Huchuan Lu

    Abstract: Recently, the Diffusion Probabilistic Model (DPM)-based methods have achieved substantial success in the field of medical image segmentation. However, most of these methods fail to enable the diffusion model to learn edge features and non-edge features effectively and to inject them efficiently into the diffusion backbone. Additionally, the domain gap between the images features and the diffusion…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted in MICCAI 2024

  6. arXiv:2406.10283  [pdf, other]

    cs.CL cs.SD eess.AS

    Attentive Merging of Hidden Embeddings from Pre-trained Speech Model for Anti-spoofing Detection

    Authors: Zihan Pan, Tianchi Liu, Hardik B. Sailor, Qiongqiong Wang

    Abstract: Self-supervised learning (SSL) speech representation models, trained on large speech corpora, have demonstrated effectiveness in extracting hierarchical speech embeddings through multiple transformer layers. However, the behavior of these embeddings in specific tasks remains uncertain. This paper investigates the multi-layer behavior of the WavLM model in anti-spoofing and proposes an attentive me…

    Submitted 12 June, 2024; originally announced June 2024.

  7. arXiv:2406.07061  [pdf, other]

    eess.IV cs.CV

    Triage of 3D pathology data via 2.5D multiple-instance learning to guide pathologist assessments

    Authors: Gan Gao, Andrew H. Song, Fiona Wang, David Brenes, Rui Wang, Sarah S. L. Chow, Kevin W. Bishop, Lawrence D. True, Faisal Mahmood, Jonathan T. C. Liu

    Abstract: Accurate patient diagnoses based on human tissue biopsies are hindered by current clinical practice, where pathologists assess only a limited number of thin 2D tissue slices sectioned from 3D volumetric tissue. Recent advances in non-destructive 3D pathology, such as open-top light-sheet microscopy, enable comprehensive imaging of spatially heterogeneous tissue morphologies, offering the feasibili…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: CVPR CVMI 2024

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2024, pp. 6955-6965

  8. arXiv:2406.04997  [pdf, ps, other]

    eess.AS cs.LG

    On the social bias of speech self-supervised models

    Authors: Yi-Cheng Lin, Tzu-Quan Lin, Hsi-Che Lin, Andy T. Liu, Hung-yi Lee

    Abstract: Self-supervised learning (SSL) speech models have achieved remarkable performance in various tasks, yet the biased outcomes, especially affecting marginalized groups, raise significant concerns. Social bias refers to the phenomenon where algorithms potentially amplify disparate properties between social groups present in the data used for training. Bias in SSL models can perpetuate injustice by au…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted by INTERSPEECH 2024

  9. arXiv:2406.04679  [pdf, other]

    eess.IV cs.CV

    XctDiff: Reconstruction of CT Images with Consistent Anatomical Structures from a Single Radiographic Projection Image

    Authors: Qingze Bai, Tiange Liu, Zhi Liu, Yubing Tong, Drew Torigian, Jayaram Udupa

    Abstract: In this paper, we present XctDiff, an algorithm framework for reconstructing CT from a single radiograph, which decomposes the reconstruction process into two easily controllable tasks: feature extraction and CT reconstruction. Specifically, we first design a progressive feature extraction strategy that is able to extract robust 3D priors from radiographs. Then, we use the extracted prior informat…

    Submitted 13 June, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

  10. arXiv:2406.02483  [pdf, other]

    eess.AS cs.AI cs.SD

    How Do Neural Spoofing Countermeasures Detect Partially Spoofed Audio?

    Authors: Tianchi Liu, Lin Zhang, Rohan Kumar Das, Yi Ma, Ruijie Tao, Haizhou Li

    Abstract: Partially manipulating a sentence can greatly change its meaning. Recent work shows that countermeasures (CMs) trained on partially spoofed audio can effectively detect such spoofing. However, the current understanding of the decision-making process of CMs is limited. We utilize Grad-CAM and introduce a quantitative analysis metric to interpret CMs' decisions. We find that CMs prioritize the artif…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted at Interspeech 2024

  11. arXiv:2406.01795  [pdf, other]

    eess.IV

    Video Coding with Cross-Component Sample Offset

    Authors: Han Gao, Xin Zhao, Tianqi Liu, Shan Liu

    Abstract: Beyond the exploration of traditional spatial, temporal and subjective visual signal redundancy in image and video compression, recent research has focused on leveraging cross-color component redundancy to enhance coding efficiency. Cross-component coding approaches are motivated by the statistical correlations among different color components, such as those in the Y'CbCr color space, where luma (…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 10 pages

  12. arXiv:2406.01153  [pdf, other]

    eess.SY

    Safety-Critical Control of Euler-Lagrange Systems Subject to Multiple Obstacles and Velocity Constraints

    Authors: Zhi Liu, Si Wu, Tengfei Liu, Zhong-Ping Jiang

    Abstract: This paper studies the safety-critical control problem for Euler-Lagrange (EL) systems subject to multiple ball obstacles and velocity constraints in accordance with affordable velocity ranges. A key strategy is to exploit the underlying inner-outer-loop structure for the design of a new cascade controller for the class of EL systems. In particular, the outer-loop controller is developed based on…

    Submitted 3 June, 2024; originally announced June 2024.

  13. arXiv:2406.01058  [pdf, other]

    eess.SY

    Constructive Safety Control

    Authors: Si Wu, Tengfei Liu, Zhong-Ping Jiang

    Abstract: This paper proposes a constructive approach to safety control of nonlinear cascade systems subject to multiple state constraints. New design ingredients include a unified characterization of safety and stability for systematic designs of safety controllers, and a novel technique of reshaping the feasible sets of quadratically constrained quadratic programming induced from safety control. The propo…

    Submitted 3 June, 2024; originally announced June 2024.

  14. arXiv:2406.00993  [pdf]

    eess.SP cs.HC q-bio.OT

    Detection of Acetone as a Gas Biomarker for Diabetes Based on Gas Sensor Technology

    Authors: Jiaming Wei, Tong Liu, Jipeng Huang, Xiaowei Li, Yurui Qi, Gangyin Luo

    Abstract: With the continuous development and improvement of medical services, there is a growing demand for improving diabetes diagnosis. Exhaled breath analysis, characterized by its speed, convenience, and non-invasive nature, is leading the trend in diagnostic development. Studies have shown that the acetone levels in the breath of diabetes patients are higher than normal, making acetone a basis for dia…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 9 pages, 14 figures

  15. arXiv:2406.00753  [pdf, ps, other]

    math.OC eess.SY

    Singular Perturbation: When the Perturbation Parameter Becomes a State-Dependent Function

    Authors: Tengfei Liu, Zhong-Ping Jiang

    Abstract: This paper presents a new systematic framework for nonlinear singularly perturbed systems in which state-dependent perturbation functions are used instead of constant perturbation coefficients. Under this framework, general results are obtained for the global robust stability and input-to-state stability of nonlinear singularly perturbed systems. Interestingly, the proposed methodology provides in…

    Submitted 2 June, 2024; originally announced June 2024.

  16. arXiv:2405.18435  [pdf, other]

    eess.IV cs.CV

    QUBIQ: Uncertainty Quantification for Biomedical Image Segmentation Challenge

    Authors: Hongwei Bran Li, Fernando Navarro, Ivan Ezhov, Amirhossein Bayat, Dhritiman Das, Florian Kofler, Suprosanna Shit, Diana Waldmannstetter, Johannes C. Paetzold, Xiaobin Hu, Benedikt Wiestler, Lucas Zimmer, Tamaz Amiranashvili, Chinmay Prabhakar, Christoph Berger, Jonas Weidner, Michelle Alonso-Basant, Arif Rashid, Ujjwal Baid, Wesam Adel, Deniz Ali, Bhakti Baheti, Yingbin Bai, Ishaan Bhatt, Sabri Can Cetindag, et al. (55 additional authors not shown)

    Abstract: Uncertainty in medical image segmentation tasks, especially inter-rater variability, arising from differences in interpretations and annotations by various experts, presents a significant challenge in achieving consistent and reliable image segmentation. This variability not only reflects the inherent complexity and subjective nature of medical image interpretation but also directly impacts the de…

    Submitted 24 June, 2024; v1 submitted 19 March, 2024; originally announced May 2024.

    Comments: initial technical report

  17. arXiv:2405.16084  [pdf, other]

    cs.RO cs.HC eess.SY

    A Low-Cost Teleoperable Surgical Robot with a Macro-Micro Structure and a Continuum Tip for Open-Source Research

    Authors: Lachlan Scott, Tangyou Liu, Liao Wu

    Abstract: Surgical robotic systems equipped with microscale, high-dexterity manipulators have shown promising results in minimally invasive surgery (MIS). One barrier to the widespread adoption of such systems is the prohibitive cost of research and development efforts using current state-of-the-art equipment. To address this challenge, this paper proposes a low-cost and modifiable tendon-driven continuum m…

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 6 pages, 10 figures, accepted by AIM2024

  18. arXiv:2405.15259  [pdf, other]

    eess.SY

    Robust Economic Dispatch with Flexible Demand and Adjustable Uncertainty Set

    Authors: Tian Liu, Xiaoqi Tan, Su Wang, Danny H. K. Tsang

    Abstract: With more renewable energy sources (RES) integrated into the power system, the intermittency of RES places a heavy burden on the system. The uncertainty of RES is traditionally handled by controllable generators to balance the real time wind power deviation. As the demand side management develops, the flexibility of aggregate loads can be leveraged to mitigate the negative impact of the wind power…

    Submitted 4 June, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

  19. arXiv:2405.15187  [pdf, other]

    eess.SY

    Chance-Constrained Economic Dispatch with Flexible Loads and RES

    Authors: Tian Liu, Bo Sun, Xiaoqi Tan, Danny H. K. Tsang

    Abstract: With the increasing penetration of intermittent renewable energy sources (RESs), it becomes increasingly challenging to maintain the supply-demand balance of power systems by solely relying on the generation side. To combat the volatility led by the uncertain RESs, demand-side management by leveraging the multi-dimensional flexibility (MDF) has been recognized as an economic and efficient approach…

    Submitted 4 June, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

  20. arXiv:2405.06339  [pdf, other]

    eess.SP

    Performance Analysis of Uplink/Downlink Decoupled Access in Cellular-V2X Networks

    Authors: Luofang Jiao, Kai Yu, Jiacheng Chen, Tingting Liu, Haibo Zhou, Lin Cai

    Abstract: This paper firstly develops an analytical framework to investigate the performance of uplink (UL) / downlink (DL) decoupled access in cellular vehicle-to-everything (C-V2X) networks, in which a vehicle's UL/DL can be connected to different macro/small base stations (MBSs/SBSs) separately. Using the stochastic geometry analytical tool, the UL/DL decoupled access C-V2X is modeled as a Cox process, a…

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: 15 pages, 10 figures

    Journal ref: Jiao L, Yu K, Chen J, et al. Performance Analysis of Uplink/Downlink Decoupled Access in Cellular-V2X Networks[J]. IEEE Transactions on Mobile Computing, 2023

  21. arXiv:2405.05490  [pdf, other]

    cs.RO eess.SY

    Banking Turn of High-DOF Dynamic Morphing Wing Flight by Shifting Structure Response Using Optimization

    Authors: Bibek Gupta, Yogi Shah, Taoran Liu, Eric Sihite, Alireza Ramezani

    Abstract: The 3D flight control of a flapping wing robot is a very challenging problem. The robot stabilizes and controls its pose through the aerodynamic forces acting on the wing membrane which has complex dynamics and it is difficult to develop a control method to interact with such a complex system. Bats, in particular, are capable of performing highly agile aerial maneuvers such as tight banking and bo…

    Submitted 8 May, 2024; originally announced May 2024.

  22. arXiv:2405.02537  [pdf, other]

    eess.SY

    A Robust Data-Driven Iterative Control Method for Linear Systems with Bounded Disturbances

    Authors: Kaijian Hu, Tao Liu

    Abstract: This paper proposes a new robust data-driven control method for linear systems with bounded disturbances, where the system model and disturbances are unknown. Due to disturbances, accurately determining the true system becomes challenging using the collected dataset. Therefore, instead of designing controllers directly for the unknown true system, an available approach is to design controllers for…

    Submitted 3 May, 2024; originally announced May 2024.

  23. arXiv:2404.18713  [pdf, other]

    cs.RO cs.AI eess.SY

    Adaptive Reinforcement Learning for Robot Control

    Authors: Yu Tang Liu, Nilaksh Singh, Aamir Ahmad

    Abstract: Deep reinforcement learning (DRL) has shown remarkable success in simulation domains, yet its application in designing robot controllers remains limited, due to its single-task orientation and insufficient adaptability to environmental changes. To overcome these limitations, we present a novel adaptive agent that leverages transfer learning techniques to dynamically adapt policy in response to dif…

    Submitted 29 April, 2024; originally announced April 2024.

  24. arXiv:2404.16484  [pdf, other]

    cs.CV eess.IV

    Real-Time 4K Super-Resolution of Compressed AVIF Images. AIS 2024 Challenge Survey

    Authors: Marcos V. Conde, Zhijun Lei, Wen Li, Cosmin Stejerean, Ioannis Katsavounidis, Radu Timofte, Kihwan Yoon, Ganzorig Gankhuyag, Jiangtao Lv, Long Sun, Jinshan Pan, Jiangxin Dong, Jinhui Tang, Zhiyuan Li, Hao Wei, Chenyang Ge, Dongyang Zhang, Tianle Liu, Huaian Chen, Yi Jin, Menghan Zhou, Yiqiang Yan, Si Gao, Biao Wu, Shaoli Liu, et al. (50 additional authors not shown)

    Abstract: This paper introduces a novel benchmark as part of the AIS 2024 Real-Time Image Super-Resolution (RTSR) Challenge, which aims to upscale compressed images from 540p to 4K resolution (4x factor) in real-time on commercial GPUs. For this, we use a diverse test set containing a variety of 4K images ranging from digital art to gaming and photography. The images are compressed using the modern AVIF cod…

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: CVPR 2024, AI for Streaming (AIS) Workshop

  25. arXiv:2404.12887  [pdf, other]

    cs.CV eess.IV

    3D Multi-frame Fusion for Video Stabilization

    Authors: Zhan Peng, Xinyi Ye, Weiyue Zhao, Tianqi Liu, Huiqiang Sun, Baopu Li, Zhiguo Cao

    Abstract: In this paper, we present RStab, a novel framework for video stabilization that integrates 3D multi-frame fusion through volume rendering. Departing from conventional methods, we introduce a 3D multi-frame perspective to generate stabilized images, addressing the challenge of full-frame generation while preserving structure. The core of our approach lies in Stabilized Rendering (SR), a volume rend…

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: Accepted by CVPR 2024

  26. arXiv:2404.09385  [pdf, other]

    eess.AS cs.CL eess.SP

    A Large-Scale Evaluation of Speech Foundation Models

    Authors: Shu-wen Yang, Heng-Jui Chang, Zili Huang, Andy T. Liu, Cheng-I Lai, Haibin Wu, Jiatong Shi, Xuankai Chang, Hsiang-Sheng Tsai, Wen-Chin Huang, Tzu-hsun Feng, Po-Han Chi, Yist Y. Lin, Yung-Sung Chuang, Tzu-Hsien Huang, Wei-Cheng Tseng, Kushal Lakhotia, Shang-Wen Li, Abdelrahman Mohamed, Shinji Watanabe, Hung-yi Lee

    Abstract: The foundation model paradigm leverages a shared foundation model to achieve state-of-the-art (SOTA) performance for various tasks, requiring minimal downstream-specific modeling and data annotation. This approach has proven crucial in the field of Natural Language Processing (NLP). However, the speech processing community lacks a similar setup to explore the paradigm systematically. In this work,…

    Submitted 29 May, 2024; v1 submitted 14 April, 2024; originally announced April 2024.

    Comments: The extended journal version for SUPERB and SUPERB-SG. Published in IEEE/ACM TASLP. The Arxiv version is preferred

  27. arXiv:2404.00863  [pdf, other]

    eess.AS

    Voice Conversion Augmentation for Speaker Recognition on Defective Datasets

    Authors: Ruijie Tao, Zhan Shi, Yidi Jiang, Tianchi Liu, Haizhou Li

    Abstract: Modern speaker recognition system relies on abundant and balanced datasets for classification training. However, diverse defective datasets, such as partially-labelled, small-scale, and imbalanced datasets, are common in real-world applications. Previous works usually studied specific solutions for each scenario from the algorithm perspective. However, the root cause of these problems lies in data…

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: 5 pages

  28. arXiv:2403.06940  [pdf, other]

    eess.IV cs.LG q-bio.QM

    Conditional Score-Based Diffusion Model for Cortical Thickness Trajectory Prediction

    Authors: Qing Xiao, Siyeop Yoon, Hui Ren, Matthew Tivnan, Lichao Sun, Quanzheng Li, Tianming Liu, Yu Zhang, Xiang Li

    Abstract: Alzheimer's Disease (AD) is a neurodegenerative condition characterized by diverse progression rates among individuals, with changes in cortical thickness (CTh) closely linked to its progression. Accurately forecasting CTh trajectories can significantly enhance early diagnosis and intervention strategies, providing timely care. However, the longitudinal data essential for these studies often suffe…

    Submitted 11 March, 2024; originally announced March 2024.

  29. arXiv:2403.03527  [pdf]

    eess.IV

    LDSF: Lightweight Dual-Stream Framework for SAR Target Recognition by Coupling Local Electromagnetic Scattering Features and Global Visual Features

    Authors: Xuying Xiong, Xinyu Zhang, Weidong Jiang, Tianpeng Liu

    Abstract: Mainstream DNN-based SAR-ATR methods still face issues such as easy overfitting of a few training data, high computational overhead, and poor interpretability of the black-box model. Integrating physical knowledge into DNNs to improve performance and achieve a higher level of physical interpretability becomes the key to solving the above problems. This paper begins by focusing on the electromagnet…

    Submitted 6 March, 2024; originally announced March 2024.

  30. arXiv:2402.13588  [pdf, other]

    eess.SY

    PI-CoF: A Bilevel Optimization Framework for Solving Active Learning Problems using Physics-Information

    Authors: Liqiu Dong, Marta Zagorowska, Tong Liu, Alex Durkin, Mehmet Mercangöz

    Abstract: Physics informed neural networks (PINNs) have recently been proposed as surrogate models for solving process optimization problems. However, in an active learning setting collecting enough data for reliably training PINNs poses a challenge. This study proposes a broadly applicable method for incorporating physics information into existing machine learning (ML) models of any type. The proposed meth…

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: Submitted to The 8th IEEE Conference on Control Technology and Applications (CCTA) 2024, 6 pages

  31. arXiv:2401.14949  [pdf]

    eess.SY

    Renewable energy exporting consumption-oriented transfer limit switching control: A unsupervised learning-based method

    Authors: Gao Qiu, Haojin Peng, Youbo Liu, Tingjian Liu, Junyong Liu

    Abstract: A method for generating unsupervised conditional mapping rules for multi-inter-corridor transfer limits and their integration into unit commitment through banding-switching is proposed in this paper. The method starts by using Ant colony clustering(ACC) to identify different operating modes with renewable energy penetration. For each sub-pattern, coupling inter-corridors are determined using corre…

    Submitted 26 January, 2024; originally announced January 2024.

  32. arXiv:2401.13957  [pdf, other]

    cs.RO cs.HC eess.SY

    Automatic Tissue Traction with Haptics-Enabled Forceps for Minimally Invasive Surgery

    Authors: Tangyou Liu, Xiaoyi Wang, Jay Katupitiya, Jiaole Wang, Liao Wu

    Abstract: A common limitation of autonomous tissue manipulation in robotic minimally invasive surgery (MIS) is the absence of force sensing and control at the tool level. Recently, our team has developed haptics-enabled forceps that can simultaneously measure the grasping and pulling forces during tissue manipulation. Based on this design, here we further present a method to automate tissue traction with co…

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 12 pages, 12 figures, submitted to T-RO

  33. arXiv:2401.11090  [pdf, other]

    cs.GT eess.SY math.OC

    Sharing Energy in Wide Area: A Two-Layer Energy Sharing Scheme for Massive Prosumers

    Authors: Yifan Su, Peng Yang, Kai Kang, Zhaojian Wang, Ning Qi, Tonghua Liu, Feng Liu

    Abstract: The popularization of distributed energy resources transforms end-users from consumers into prosumers. Inspired by the sharing economy principle, energy sharing markets for prosumers are proposed to facilitate the utilization of renewable energy. This paper proposes a novel two-layer energy sharing market for massive prosumers, which can promote social efficiency by wider-area sharing. In this mar…

    Submitted 19 January, 2024; originally announced January 2024.

  34. arXiv:2401.07222  [pdf, ps, other]

    eess.SY

    Robust Data-Driven Predictive Control for Unknown Linear Time-Invariant Systems

    Authors: Kaijian Hu, Tao Liu

    Abstract: This paper presents a new robust data-driven predictive control scheme for unknown linear time-invariant systems by using input-state-output or input-output data based on whether the state is measurable. To remove the need for the persistently exciting (PE) condition of a sufficiently high order on pre-collected data, a set containing all systems capable of generating such data is constructed. The…

    Submitted 14 January, 2024; originally announced January 2024.

  35. arXiv:2401.02099  [pdf]

    cs.CV cs.SD eess.AS

    Oceanship: A Large-Scale Dataset for Underwater Audio Target Recognition

    Authors: Zeyu Li, Suncheng Xiang, Tong Yu, Jingsheng Gao, Jiacheng Ruan, Yanping Hu, Ting Liu, Yuzhuo Fu

    Abstract: The recognition of underwater audio plays a significant role in identifying a vessel while it is in motion. Underwater target recognition tasks have a wide range of applications in areas such as marine environmental protection, detection of ship radiated noise, underwater noise control, and coastal vessel dispatch. The traditional UATR task involves training a network to extract features from audi…

    Submitted 10 June, 2024; v1 submitted 4 January, 2024; originally announced January 2024.

    Comments: Accepted by ICIC 2024

  36. arXiv:2401.02081  [pdf, ps, other]

    cs.IT eess.SP

    Performance Trade-off and Joint Waveform Design for MIMO-OFDM DFRC Systems

    Authors: Tianchen Liu, Liang Wu, Bo An, Zaichen Zhang, Jian Dang, Jiangzhou Wang

    Abstract: Dual-functional radar-communication (DFRC) has attracted considerable attention. This paper considers the frequency-selective multipath fading environment and proposes DFRC waveform design strategies based on multiple-input and multiple-output (MIMO) and orthogonal frequency division multiplexing (OFDM) techniques. In the proposed waveform design strategies, the Cramer-Rao bound (CRB) of the radar…

    Submitted 4 January, 2024; originally announced January 2024.

  37. arXiv:2401.00816  [pdf, other]

    cs.CV cs.LG eess.IV

    GLIMPSE: Generalized Local Imaging with MLPs

    Authors: AmirEhsan Khorashadizadeh, Valentin Debarnot, Tianlin Liu, Ivan Dokmanić

    Abstract: Deep learning is the current de facto state of the art in tomographic imaging. A common approach is to feed the result of a simple inversion, for example the backprojection, to a convolutional neural network (CNN) which then computes the reconstruction. Despite strong results on 'in-distribution' test data similar to the training data, backprojection from sparse-view data delocalizes singularities…

    Submitted 20 June, 2024; v1 submitted 1 January, 2024; originally announced January 2024.

    Comments: 12 pages, 10 figures

  38. arXiv:2312.05256  [pdf, other]

    eess.IV cs.AI

    Holistic Evaluation of GPT-4V for Biomedical Imaging

    Authors: Zhengliang Liu, Hanqi Jiang, Tianyang Zhong, Zihao Wu, Chong Ma, Yiwei Li, Xiaowei Yu, Yutong Zhang, Yi Pan, Peng Shu, Yanjun Lyu, Lu Zhang, Junjie Yao, Peixin Dong, Chao Cao, Zhenxiang Xiao, Jiaqi Wang, Huan Zhao, Shaochen Xu, Yaonai Wei, Jingyuan Chen, Haixing Dai, Peilong Wang, Hao He, Zewei Wang, et al. (25 additional authors not shown)

    Abstract: In this paper, we present a large-scale evaluation probing GPT-4V's capabilities and limitations for biomedical image analysis. GPT-4V represents a breakthrough in artificial general intelligence (AGI) for computer vision, with applications in the biomedical domain. We assess GPT-4V's performance across 16 medical imaging categories, including radiology, oncology, ophthalmology, pathology, and mor…

    Submitted 10 November, 2023; originally announced December 2023.

  39. arXiv:2312.03620  [pdf, other]

    eess.AS cs.SD

    Golden Gemini is All You Need: Finding the Sweet Spots for Speaker Verification

    Authors: Tianchi Liu, Kong Aik Lee, Qiongqiong Wang, Haizhou Li

    Abstract: Previous studies demonstrate the impressive performance of residual neural networks (ResNet) in speaker verification. The ResNet models treat the time and frequency dimensions equally. They follow the default stride configuration designed for image recognition, where the horizontal and vertical axes exhibit similarities. This approach ignores the fact that time and frequency are asymmetric in spee…

    Submitted 24 April, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: Accepted to IEEE/ACM Transactions on Audio, Speech, and Language Processing. Open Access: https://ieeexplore.ieee.org/abstract/document/10497864

  40. arXiv:2311.18096  [pdf, other]

    eess.SY

    Data-Driven Kalman Filter using Maximum Likelihood Optimization

    Authors: Peihu Duan, Tao Liu, Yu Xing, Karl Henrik Johansson

    Abstract: This paper investigates the state estimation problem for unknown linear systems with process and measurement noise. A novel data-driven Kalman filter (DDKF) that combines model identification with state estimation is developed using pre-collected input-output data and uncertain initial state information of the unknown system. Specifically, the state estimation problem is first formulated as a non-…

    Submitted 29 November, 2023; originally announced November 2023.

  41. arXiv:2311.16604  [pdf, other]

    eess.AS cs.LG

    LC4SV: A Denoising Framework Learning to Compensate for Unseen Speaker Verification Models

    Authors: Chi-Chang Lee, Hong-Wei Chen, Chu-Song Chen, Hsin-Min Wang, Tsung-Te Liu, Yu Tsao

    Abstract: The performance of speaker verification (SV) models may drop dramatically in noisy environments. A speech enhancement (SE) module can be used as a front-end strategy. However, existing SE methods may fail to bring performance improvements to downstream SV systems due to artifacts in the predicted signals of SE models. To compensate for artifacts, we propose a generic denoising framework named LC4S…

    Submitted 28 November, 2023; originally announced November 2023.

  42. arXiv:2311.15153  [pdf, other]

    cs.CV eess.IV

    Predicting Gradient is Better: Exploring Self-Supervised Learning for SAR ATR with a Joint-Embedding Predictive Architecture

    Authors: Weijie Li, Yang Wei, Tianpeng Liu, Yuenan Hou, Yuxuan Li, Zhen Liu, Yongxiang Liu, Li Liu

    Abstract: The growing Synthetic Aperture Radar (SAR) data has the potential to build a foundation model through Self-Supervised Learning (SSL) methods, which can achieve various SAR Automatic Target Recognition (ATR) tasks with pre-training in large-scale unlabeled data and fine-tuning in small labeled samples. SSL aims to construct supervision signals directly from the data, which minimizes the need for ex…

    Submitted 28 March, 2024; v1 submitted 25 November, 2023; originally announced November 2023.

    Comments: Our codes at https://github.com/waterdisappear/SAR-JEPA

  43. arXiv:2311.11281  [pdf, other]

    eess.SY cs.LG

    Multi-Timescale Control and Communications with Deep Reinforcement Learning -- Part I: Communication-Aware Vehicle Control

    Authors: Tong Liu, Lei Lei, Kan Zheng, Xuemin Shen

    Abstract: An intelligent decision-making system enabled by Vehicle-to-Everything (V2X) communications is essential to achieve safe and efficient autonomous driving (AD), where two types of decisions have to be made at different timescales, i.e., vehicle control and radio resource allocation (RRA) decisions. The interplay between RRA and vehicle control necessitates their collaborative design. In this two-pa…

    Submitted 19 November, 2023; originally announced November 2023.

  44. arXiv:2311.11280  [pdf, other]

    eess.SY cs.LG

    Multi-Timescale Control and Communications with Deep Reinforcement Learning -- Part II: Control-Aware Radio Resource Allocation

    Authors: Lei Lei, Tong Liu, Kan Zheng, Xuemin Shen

    Abstract: In Part I of this two-part paper (Multi-Timescale Control and Communications with Deep Reinforcement Learning -- Part I: Communication-Aware Vehicle Control), we decomposed the multi-timescale control and communications (MTCC) problem in Cellular Vehicle-to-Everything (C-V2X) system into a communication-aware Deep Reinforcement Learning (DRL)-based platoon control (PC) sub-problem and a control-aw…

    Submitted 19 November, 2023; originally announced November 2023.

  45. arXiv:2311.11086  [pdf]

    eess.IV cs.CV

    LightBTSeg: A lightweight breast tumor segmentation model using ultrasound images via dual-path joint knowledge distillation

    Authors: Hongjiang Guo, Shengwen Wang, Hao Dang, Kangle Xiao, Yaru Yang, Wenpei Liu, Tongtong Liu, Yiying Wan

    Abstract: The accurate segmentation of breast tumors is an important prerequisite for lesion detection, which has significant clinical value for breast tumor research. The mainstream deep learning-based methods have achieved a breakthrough. However, these high-performance segmentation methods are formidable to implement in clinical scenarios since they always embrace high computation complexity, massive par…

    Submitted 18 November, 2023; originally announced November 2023.

    Comments: 7 pages, 7 figures, conference

  46. arXiv:2311.09850  [pdf, other]

    cs.IT eess.SP

    Semantic-Relay-Aided Text Transmission: Placement Optimization and Bandwidth Allocation

    Authors: Tianyu Liu, Changsheng You, Zeyang Hu, Chenyu Wu, Yi Gong, Kaibin Huang

    Abstract: Semantic communication has emerged as a promising technology to break the Shannon limit by extracting the meaning of source data and sending relevant semantic information only. However, some mobile devices may have limited computation and storage resources, which renders it difficult to deploy and implement the resource-demanding deep learning based semantic encoder/decoder. To tackle this challen…

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: 6 pages, 4 figures, accepted for IEEE Global Communication Conference (GLOBECOM) 2023 Workshop

  47. arXiv:2311.08415  [pdf]

    eess.IV physics.optics

    Scanning phase imaging without accurate positioning system

    Authors: Tao Liu, Bingyang Wang, JiangTao Zhao, Fu rong Chen, Fucai Zhang

    Abstract: Ptychography, a high-resolution phase imaging technique using precise in-plane translation information, has been widely applied in modern synchrotron radiation sources across the globe. A key requirement for successful ptychographic reconstruction is the precise knowledge of the scanning positions, which are typically obtained by a physical interferometric positioning system. Whereas high-throughp…

    Submitted 31 October, 2023; originally announced November 2023.

    Comments: 9 pages, 4 figures

  48. arXiv:2311.06854  [pdf, ps, other]

    eess.SP

    Multiuser Resource Allocation for Semantic-Relay-Aided Text Transmissions

    Authors: Zeyang Hu, Tianyu Liu, Changsheng You, Zhaohui Yang, Mingzhe Chen

    Abstract: Semantic communication (SemCom) is an emerging technology that extracts useful meaning from data and sends only relevant semantic information. Thus, it has the great potential to improve the spectrum efficiency of conventional wireless systems with bit transmissions, especially in low signal-to-noise ratio (SNR) and small bandwidth regions. However, the existing works have mostly overlooked the co…

    Submitted 12 November, 2023; originally announced November 2023.

    Comments: 6 pages, 3 figures, accepted for IEEE Global Communication Conference (GLOBECOM) 2023 Workshop on Semantic Communication for 6G

  49. arXiv:2311.03557  [pdf, other]

    cs.LG cs.CV eess.IV

    Spatio-Temporal Similarity Measure based Multi-Task Learning for Predicting Alzheimer's Disease Progression using MRI Data

    Authors: Xulong Wang, Yu Zhang, Menghui Zhou, Tong Liu, Jun Qi, Po Yang

    Abstract: Identifying and utilising various biomarkers for tracking Alzheimer's disease (AD) progression have received many recent attentions and enable helping clinicians make the prompt decisions. Traditional progression models focus on extracting morphological biomarkers in regions of interest (ROIs) from MRI/PET images, such as regional average cortical thickness and regional volume. They are effective…

    Submitted 6 November, 2023; originally announced November 2023.

  50. arXiv:2311.03501  [pdf, ps, other]

    eess.SP

    Joint Sparse Estimation with Cardinality Constraint via Mixed-Integer Semidefinite Programming

    Authors: Tianyi Liu, Frederic Matter, Alexander Sorg, Marc E. Pfetsch, Martin Haardt, Marius Pesavento

    Abstract: The multiple measurement vectors (MMV) problem refers to the joint estimation of a row-sparse signal matrix from multiple realizations of mixtures with a known dictionary. As a generalization of the standard sparse representation problem for a single measurement, this problem is fundamental in various applications in signal processing, e.g., spectral analysis and direction-of-arrival (DOA) estimat…

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: 12 pages, 6 figures. Submitted to the IEEE Transactions on Signal Processing