Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–19 of 19 results for author: Lyu, S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2408.04300  [pdf, other

    eess.IV cs.CV

    An Explainable Non-local Network for COVID-19 Diagnosis

    Authors: Jingfu Yang, Peng Huang, Jing Hu, Shu Hu, Siwei Lyu, Xin Wang, Jun Guo, Xi Wu

    Abstract: The CNN has achieved excellent results in the automatic classification of medical images. In this study, we propose a novel deep residual 3D attention non-local network (NL-RAN) to classify CT images included COVID-19, common pneumonia, and normal to perform rapid and explainable COVID-19 diagnosis. We built a deep residual 3D attention non-local network that could achieve end-to-end training. The… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

  2. arXiv:2406.16943  [pdf, other

    eess.SP cs.AI cs.HC cs.LG

    EarDA: Towards Accurate and Data-Efficient Earable Activity Sensing

    Authors: Shengzhe Lyu, Yongliang Chen, Di Duan, Renqi Jia, Weitao Xu

    Abstract: In the realm of smart sensing with the Internet of Things, earable devices are empowered with the capability of multi-modality sensing and intelligence of context-aware computing, leading to its wide usage in Human Activity Recognition (HAR). Nonetheless, unlike the movements captured by Inertial Measurement Unit (IMU) sensors placed on the upper or lower body, those motion signals obtained from e… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: accepted by 2024 IEEE Coupling of Sensing & Computing in AIoT Systems (CSCAIoT)

  3. arXiv:2405.00135  [pdf, other

    cs.IT eess.SP

    Improving Channel Resilience for Task-Oriented Semantic Communications: A Unified Information Bottleneck Approach

    Authors: Shuai Lyu, Yao Sun, Linke Guo, Xiaoyong Yuan, Fang Fang, Lan Zhang, Xianbin Wang

    Abstract: Task-oriented semantic communications (TSC) enhance radio resource efficiency by transmitting task-relevant semantic information. However, current research often overlooks the inherent semantic distinctions among encoded features. Due to unavoidable channel variations from time and frequency-selective fading, semantically sensitive feature units could be more susceptible to erroneous inference if… ▽ More

    Submitted 30 April, 2024; originally announced May 2024.

    Comments: This work has been submitted to the IEEE Communications Letters

  4. arXiv:2311.06712  [pdf, other

    eess.IV

    PuzzleTuning: Explicitly Bridge Pathological and Natural Image with Puzzles

    Authors: Tianyi Zhang, Shangqing Lyu, Yanli Lei, Sicheng Chen, Nan Ying, Yufang He, Yu Zhao, Yunlu Feng, Hwee Kuan Lee, Guanglei Zhang

    Abstract: Pathological image analysis is a crucial field in computer vision. Due to the annotation scarcity in the pathological field, pre-training with self-supervised learning (SSL) is widely applied to learn on unlabeled images. However, the current SSL-based pathological pre-training: (1) does not explicitly explore the essential focuses of the pathological field, and (2) does not effectively bridge wit… ▽ More

    Submitted 22 April, 2024; v1 submitted 11 November, 2023; originally announced November 2023.

    Comments: 13 pages, 9 figures, 8 tables

  5. arXiv:2311.05836  [pdf, other

    eess.IV cs.CV cs.LG

    UMedNeRF: Uncertainty-aware Single View Volumetric Rendering for Medical Neural Radiance Fields

    Authors: Jing Hu, Qinrui Fan, Shu Hu, Siwei Lyu, Xi Wu, Xin Wang

    Abstract: In the field of clinical medicine, computed tomography (CT) is an effective medical imaging modality for the diagnosis of various pathologies. Compared with X-ray images, CT images can provide more information, including multi-planar slices and three-dimensional structures for clinical diagnosis. However, CT imaging requires patients to be exposed to large doses of ionizing radiation for a long ti… ▽ More

    Submitted 1 March, 2024; v1 submitted 9 November, 2023; originally announced November 2023.

  6. arXiv:2310.17902  [pdf

    eess.IV

    CPIA Dataset: A Comprehensive Pathological Image Analysis Dataset for Self-supervised Learning Pre-training

    Authors: Nan Ying, Yanli Lei, Tianyi Zhang, Shangqing Lyu, Chunhui Li, Sicheng Chen, Zeyu Liu, Yu Zhao, Guanglei Zhang

    Abstract: Pathological image analysis is a crucial field in computer-aided diagnosis, where deep learning is widely applied. Transfer learning using pre-trained models initialized on natural images has effectively improved the downstream pathological performance. However, the lack of sophisticated domain-specific pathological initialization hinders their potential. Self-supervised learning (SSL) enables pre… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

  7. arXiv:2307.14491  [pdf, other

    cs.MM cs.SD eess.AS

    A Unified Framework for Modality-Agnostic Deepfakes Detection

    Authors: Cai Yu, Peng Chen, Jiahe Tian, Jin Liu, Jiao Dai, Xi Wang, Yesheng Chai, Shan Jia, Siwei Lyu, Jizhong Han

    Abstract: As AI-generated content (AIGC) thrives, deepfakes have expanded from single-modality falsification to cross-modal fake content creation, where either audio or visual components can be manipulated. While using two unimodal detectors can detect audio-visual deepfakes, cross-modal forgery clues could be overlooked. Existing multimodal deepfake detection methods typically establish correspondence betw… ▽ More

    Submitted 24 October, 2023; v1 submitted 26 July, 2023; originally announced July 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  8. arXiv:2305.05813  [pdf, other

    cs.CV cs.LG eess.IV

    Change Detection Methods for Remote Sensing in the Last Decade: A Comprehensive Review

    Authors: Guangliang Cheng, Yunmeng Huang, Xiangtai Li, Shuchang Lyu, Zhaoyang Xu, Qi Zhao, Shiming Xiang

    Abstract: Change detection is an essential and widely utilized task in remote sensing that aims to detect and analyze changes occurring in the same geographical area over time, which has broad applications in urban development, agricultural surveys, and land cover monitoring. Detecting changes in remote sensing images is a complex challenge due to various factors, including variations in image quality, nois… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: 21 pages, 4 figures, 10 tables

  9. arXiv:2304.13085  [pdf, other

    cs.SD cs.MM eess.AS

    AI-Synthesized Voice Detection Using Neural Vocoder Artifacts

    Authors: Chengzhe Sun, Shan Jia, Shuwei Hou, Siwei Lyu

    Abstract: Advancements in AI-synthesized human voices have created a growing threat of impersonation and disinformation, making it crucial to develop methods to detect synthetic human voices. This study proposes a new approach to identifying synthetic human voices by detecting artifacts of vocoders in audio signals. Most DeepFake audio synthesis models use a neural vocoder, a neural network that generates w… ▽ More

    Submitted 27 April, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

    Comments: Paper accepted in CVPRW 2023. Codes and data can be found at https://github.com/csun22/Synthetic-Voice-Detection-Vocoder-Artifacts. arXiv admin note: substantial text overlap with arXiv:2302.09198

  10. arXiv:2302.09198  [pdf, other

    cs.SD cs.MM eess.AS

    Exposing AI-Synthesized Human Voices Using Neural Vocoder Artifacts

    Authors: Chengzhe Sun, Shan Jia, Shuwei Hou, Ehab AlBadawy, Siwei Lyu

    Abstract: The advancements of AI-synthesized human voices have introduced a growing threat of impersonation and disinformation. It is therefore of practical importance to developdetection methods for synthetic human voices. This work proposes a new approach to detect synthetic human voices based on identifying artifacts of neural vocoders in audio signals. A neural vocoder is a specially designed neural net… ▽ More

    Submitted 27 April, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

    Comments: Dataset and codes will be available at https://github.com/csun22/LibriVoc-Dataset

  11. arXiv:2112.13513  [pdf

    eess.IV cs.CV cs.LG

    MSHT: Multi-stage Hybrid Transformer for the ROSE Image Analysis of Pancreatic Cancer

    Authors: Tianyi Zhang, Yunlu Feng, Yu Zhao, Guangda Fan, Aiming Yang, Shangqin Lyu, Peng Zhang, Fan Song, Chenbin Ma, Yangyang Sun, Youdan Feng, Guanglei Zhang

    Abstract: Pancreatic cancer is one of the most malignant cancers in the world, which deteriorates rapidly with very high mortality. The rapid on-site evaluation (ROSE) technique innovates the workflow by immediately analyzing the fast stained cytopathological images with on-site pathologists, which enables faster diagnosis in this time-pressured process. However, the wider expansion of ROSE diagnosis has be… ▽ More

    Submitted 27 December, 2021; originally announced December 2021.

    Comments: 12 pages, 10 figures

  12. arXiv:2112.07415  [pdf, ps, other

    eess.IV cs.AI cs.CV

    Stochastic Planner-Actor-Critic for Unsupervised Deformable Image Registration

    Authors: Ziwei Luo, Jing Hu, Xin Wang, Shu Hu, Bin Kong, Youbing Yin, Qi Song, Xi Wu, Siwei Lyu

    Abstract: Large deformations of organs, caused by diverse shapes and nonlinear shape changes, pose a significant challenge for medical image registration. Traditional registration methods need to iteratively optimize an objective function via a specific deformation model along with meticulous parameter tuning, but which have limited capabilities in registering images with large deformations. While deep lear… ▽ More

    Submitted 30 April, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

    Comments: Accepted by AAAI 2022

  13. arXiv:2112.03099  [pdf, other

    cs.SD cs.CL eess.AS

    VocBench: A Neural Vocoder Benchmark for Speech Synthesis

    Authors: Ehab A. AlBadawy, Andrew Gibiansky, Qing He, Jilong Wu, Ming-Ching Chang, Siwei Lyu

    Abstract: Neural vocoders, used for converting the spectral representations of an audio signal to the waveforms, are a commonly used component in speech synthesis pipelines. It focuses on synthesizing waveforms from low-dimensional representation, such as Mel-Spectrograms. In recent years, different approaches have been introduced to develop such vocoders. However, it becomes more challenging to assess thes… ▽ More

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: To appear in icassp 2022

  14. arXiv:2109.06638  [pdf, other

    cs.CV cs.LG eess.IV

    Learnable Discrete Wavelet Pooling (LDW-Pooling) For Convolutional Networks

    Authors: Bor-Shiun Wang, Jun-Wei Hsieh, Ming-Ching Chang, Ping-Yang Chen, Lipeng Ke, Siwei Lyu

    Abstract: Pooling is a simple but essential layer in modern deep CNN architectures for feature aggregation and extraction. Typical CNN design focuses on the conv layers and activation functions, while leaving the pooling layers with fewer options. We introduce the Learning Discrete Wavelet Pooling (LDW-Pooling) that can be applied universally to replace standard pooling operations to better extract features… ▽ More

    Submitted 20 October, 2021; v1 submitted 13 September, 2021; originally announced September 2021.

    Comments: Accepted by BMVC 2021

  15. arXiv:2002.02909  [pdf, other

    cs.CV cs.LG eess.IV

    Domain Embedded Multi-model Generative Adversarial Networks for Image-based Face Inpainting

    Authors: Xian Zhang, Xin Wang, Bin Kong, Canghong Shi, Youbing Yin, Qi Song, Siwei Lyu, Jiancheng Lv, Canghong Shi, Xiaojie Li

    Abstract: Prior knowledge of face shape and structure plays an important role in face inpainting. However, traditional face inpainting methods mainly focus on the generated image resolution of the missing portion without consideration of the special particularities of the human face explicitly and generally produce discordant facial parts. To solve this problem, we present a domain embedded multi-model gene… ▽ More

    Submitted 20 June, 2020; v1 submitted 5 February, 2020; originally announced February 2020.

  16. arXiv:2001.05763  [pdf, ps, other

    cs.IT eess.SP

    GMD-Based Hybrid Beamforming for Large Reconfigurable Intelligent Surface Assisted Millimeter-Wave Massive MIMO

    Authors: Keke Ying, Zhen Gao, Shanxiang Lyu, Yongpeng Wu, Hua Wang, Mohamed-Slim Alouini

    Abstract: Reconfigurable intelligent surface (RIS) is considered to be an energy-efficient approach to reshape the wireless environment for improved throughput. Its passive feature greatly reduces the energy consumption, which makes RIS a promising technique for enabling the future smart city. Existing beamforming designs for RIS mainly focus on optimizing the spectral efficiency for single carrier systems.… ▽ More

    Submitted 16 January, 2020; v1 submitted 16 January, 2020; originally announced January 2020.

    Comments: 8 pages, 6 figures, accepted by IEEE Access.This is an initial attempt to discuss the broadband hybrid beamforming for RIS-assisted mmWave hybrid MIMO systems

  17. On Low-complexity Lattice Reduction Algorithms for Large-scale MIMO Detection: the Blessing of Sequential Reduction

    Authors: Shanxiang Lyu, Jinming Wen, Jian Weng, Cong Ling

    Abstract: Lattice reduction is a popular preprocessing strategy in multiple-input multiple-output (MIMO) detection. In a quest for developing a low-complexity reduction algorithm for large-scale problems, this paper investigates a new framework called sequential reduction (SR), which aims to reduce the lengths of all basis vectors. The performance upper bounds of the strongest reduction in SR are given when… ▽ More

    Submitted 12 December, 2019; originally announced December 2019.

    Comments: Sequential reduction is not in the LLL family, but is generalizing greedy reduction (Nguyen and Stehle) and element-based reduction (Zhou and Ma)

    Journal ref: IEEE Transactions on Signal Processing, Online ISSN: 1941-0476

  18. arXiv:1909.12962  [pdf, other

    cs.CR cs.CV eess.IV

    Celeb-DF: A Large-scale Challenging Dataset for DeepFake Forensics

    Authors: Yuezun Li, Xin Yang, Pu Sun, Honggang Qi, Siwei Lyu

    Abstract: AI-synthesized face-swapping videos, commonly known as DeepFakes, is an emerging problem threatening the trustworthiness of online information. The need to develop and evaluate DeepFake detection algorithms calls for large-scale datasets. However, current DeepFake datasets suffer from low visual quality and do not resemble DeepFake videos circulated on the Internet. We present a new large-scale ch… ▽ More

    Submitted 16 March, 2020; v1 submitted 27 September, 2019; originally announced September 2019.

  19. Dynamic Modularity Approach to Adaptive Inner/Outer Loop Control of Robotic Systems

    Authors: Hanlei Wang, Wei Ren, Chien Chern Cheah, Yongchun Xie, Shangke Lyu

    Abstract: Modern applications of robotics typically involve a robot control system with an inner PI (proportional-integral) or PID (proportional-integral-derivative) control loop and an outer user-specified control loop. The existing outer loop controllers, however, do not take into consideration the dynamic effects of robots and their effectiveness relies on the ad hoc assumption that the inner PI or PID c… ▽ More

    Submitted 7 January, 2017; v1 submitted 17 March, 2016; originally announced March 2016.

    Comments: This version is mainly for including the experimental results

    Journal ref: IEEE Transactions on Automatic Control, 65(6): 2760-2767, 2020