Showing 1–25 of 25 results for author: Yan, P

Searching in archive eess. Search in all archives.
  1. arXiv:2405.18533 [pdf, other]

    eess.IV cs.CV

    Cardiovascular Disease Detection from Multi-View Chest X-rays with BI-Mamba

    Authors: Zefan Yang, Jiajin Zhang, Ge Wang, Mannudeep K. Kalra, Pingkun Yan

    Abstract: Accurate prediction of cardiovascular disease (CVD) risk in medical imaging is central to effective patient health management. Previous studies have demonstrated that imaging features in computed tomography (CT) can help predict CVD risk. However, CT entails notable radiation exposure, which may result in adverse health effects for patients. In contrast, chest X-ray emits significantly lower level…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Early accepted paper for MICCAI 2024

  2. arXiv:2403.00274 [pdf, other]

    cs.CV cs.SD eess.AS

    CustomListener: Text-guided Responsive Interaction for User-friendly Listening Head Generation

    Authors: Xi Liu, Ying Guo, Cheng Zhen, Tong Li, Yingying Ao, Pengfei Yan

    Abstract: Listening head generation aims to synthesize a non-verbal responsive listener head by modeling the correlation between the speaker and the listener in dynamic conversion. The applications of listener agent generation in virtual interaction have promoted many works achieving the diverse and fine-grained motion generation. However, they can only manipulate motions through simple emotional labels, but…

    Submitted 29 March, 2024; v1 submitted 29 February, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  3. arXiv:2312.06462 [pdf, other]

    cs.CV cs.AI cs.SD eess.AS

    Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-Visual Segmentation

    Authors: Qi Yang, Xing Nie, Tong Li, Pengfei Gao, Ying Guo, Cheng Zhen, Pengfei Yan, Shiming Xiang

    Abstract: Recently, an audio-visual segmentation (AVS) task has been introduced, aiming to group pixels with sounding objects within a given video. This task necessitates a first-ever audio-driven pixel-level understanding of the scene, posing significant challenges. In this paper, we propose an innovative audio-visual transformer framework, termed COMBO, an acronym for COoperation of Multi-order Bilateral…

    Submitted 7 April, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: CVPR 2024 Highlight. 13 pages, 10 figures

  4. arXiv:2311.03679 [pdf, other]

    cs.CV eess.IV

    Unsupervised convolutional neural network fusion approach for change detection in remote sensing images

    Authors: Weidong Yan, Pei Yan, Li Cao

    Abstract: With the rapid development of deep learning, a variety of change detection methods based on deep learning have emerged in recent years. However, these methods usually require a large number of training samples to train the network model, so it is very expensive. In this paper, we introduce a completely unsupervised shallow convolutional neural network (USCNN) fusion approach for change detection.…

    Submitted 6 November, 2023; originally announced November 2023.

  5. arXiv:2309.01207 [pdf, other]

    eess.IV cs.CV cs.LG

    Spectral Adversarial MixUp for Few-Shot Unsupervised Domain Adaptation

    Authors: Jiajin Zhang, Hanqing Chao, Amit Dhurandhar, Pin-Yu Chen, Ali Tajer, Yangyang Xu, Pingkun Yan

    Abstract: Domain shift is a common problem in clinical applications, where the training images (source domain) and the test images (target domain) are under different distributions. Unsupervised Domain Adaptation (UDA) techniques have been proposed to adapt models trained in the source domain to the target domain. However, those methods require a large number of images from the target domain for model train…

    Submitted 3 September, 2023; originally announced September 2023.

    Comments: Accepted by MICCAI 2023

  6. arXiv:2307.14634 [pdf, other]

    cs.AI cs.CR cs.CV cs.LG eess.IV

    Fact-Checking of AI-Generated Reports

    Authors: Razi Mahmood, Ge Wang, Mannudeep Kalra, Pingkun Yan

    Abstract: With advances in generative artificial intelligence (AI), it is now possible to produce realistic-looking automated reports for preliminary reads of radiology images. This can expedite clinical workflows, improve accuracy and reduce overall costs. However, it is also well-known that such models often hallucinate, leading to false findings in the generated reports. In this paper, we propose a new m…

    Submitted 27 July, 2023; originally announced July 2023.

    Comments: 10 pages, 3 figures, 3 tables

  7. arXiv:2304.12513 [pdf, other]

    eess.IV

    A fast and flexible algorithm for microstructure reconstruction combining simulated annealing and deep learning

    Authors: Zhenchuan Ma, Xiaohai He, Pengcheng Yan, Fan Zhang, Qizhi Teng

    Abstract: The microstructure analyses of porous media have considerable research value for the study of macroscopic properties. As the premise of conducting these analyses, the accurate reconstruction of microstructure digital model is also an important component of the research. Computational reconstruction algorithms of microstructure have attracted much attention due to their low cost and excellent perfo…

    Submitted 24 April, 2023; originally announced April 2023.

  8. arXiv:2304.02649 [pdf, other]

    eess.IV cs.AI cs.CV

    Specialty-Oriented Generalist Medical AI for Chest CT Screening

    Authors: Chuang Niu, Qing Lyu, Christopher D. Carothers, Parisa Kaviani, Josh Tan, Pingkun Yan, Mannudeep K. Kalra, Christopher T. Whitlow, Ge Wang

    Abstract: Modern medical records include a vast amount of multimodal free text clinical data and imaging data from radiology, cardiology, and digital pathology. Fully mining such big data requires multitasking; otherwise, occult but important aspects may be overlooked, adversely affecting clinical management and population healthcare. Despite remarkable successes of AI in individual tasks with single-modal…

    Submitted 24 April, 2024; v1 submitted 3 April, 2023; originally announced April 2023.

  9. Intelligent detect for substation insulator defects based on CenterMask

    Authors: Bo Ye, Feng Li, Mingxuan Li, Peipei Yan, Huiting Yang, Lihua Wang

    Abstract: With the development of intelligent operation and maintenance of substations, the daily inspection of substations needs to process massive video and image data. This puts forward higher requirements on the processing speed and accuracy of defect detection. Based on the end-to-end learning paradigm, this paper proposes an intelligent detection method for substation insulator defects based on Center…

    Submitted 30 August, 2022; originally announced August 2022.

    Comments: 3 figures, 1 table

    Journal ref: Frontiers in Energy Research 10 (2022) 985600

  10. arXiv:2208.06127 [pdf, other]

    cs.SD cs.LG eess.AS

    An investigation on selecting audio pre-trained models for audio captioning

    Authors: Peiran Yan, Shengchen Li

    Abstract: Audio captioning is a task that generates a description of audio based on its content. Pre-trained models are widely used in audio captioning due to high complexity. Unless a comprehensive system is re-trained, it is hard to determine how well pre-trained models contribute to an audio captioning system. To prevent the time-consuming and energy-consuming process of retraining, it is necessary to propose a p…

    Submitted 12 August, 2022; originally announced August 2022.

    Comments: 5 pages, 7 figures

  11. arXiv:2207.05231 [pdf, other]

    eess.IV cs.CV

    Regression Metric Loss: Learning a Semantic Representation Space for Medical Images

    Authors: Hanqing Chao, Jiajin Zhang, Pingkun Yan

    Abstract: Regression plays an essential role in many medical imaging applications for estimating various clinical risk or measurement scores. While training strategies and loss functions have been studied for the deep neural networks in medical image classification tasks, options for regression tasks are very limited. One of the key challenges is that the high-dimensional feature representation learned by e…

    Submitted 11 July, 2022; originally announced July 2022.

    Comments: Accepted by MICCAI 2022

  12. Federated Multi-organ Segmentation with Inconsistent Labels

    Authors: Xuanang Xu, Hannah H. Deng, Jaime Gateno, Pingkun Yan

    Abstract: Federated learning is an emerging paradigm allowing large-scale decentralized learning without sharing data across different data owners, which helps address the concern of data privacy in medical image analysis. However, the requirement for label consistency across clients by the existing methods largely narrows its application scope. In practice, each clinical site may only annotate certain orga…

    Submitted 25 May, 2023; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: v1: 10 pages, 5 figures; v2: 14 pages, 5 figures, accepted by IEEE Transactions on Medical Imaging (TMI), published version available at https://doi.org/10.1109/TMI.2023.3270140, source code available at https://github.com/DIAL-RPI/Fed-MENU

  13. Multiscale reconstruction of porous media based on multiple dictionaries learning

    Authors: Pengcheng Yan, Qizhi Teng, Xiaohai He, Zhenchuan Ma, Ningning Zhang

    Abstract: Digital modeling of the microstructure is important for studying the physical and transport properties of porous media. Multiscale modeling for porous media can accurately characterize macro-pores and micro-pores in a large-FoV (field of view) high-resolution three-dimensional pore structure model. This paper proposes a multiscale reconstruction algorithm based on multiple dictionaries learning, i…

    Submitted 16 May, 2022; originally announced May 2022.

  14. arXiv:2204.02450 [pdf, ps, other]

    eess.IV cs.CV

    Federated Cross Learning for Medical Image Segmentation

    Authors: Xuanang Xu, Hannah H. Deng, Tianyi Chen, Tianshu Kuang, Joshua C. Barber, Daeseung Kim, Jaime Gateno, James J. Xia, Pingkun Yan

    Abstract: Federated learning (FL) can collaboratively train deep learning models using isolated patient data owned by different hospitals for various clinical applications, including medical image segmentation. However, a major problem of FL is its performance degradation when dealing with data that are not independently and identically distributed (non-iid), which is often the case in medical images. In th…

    Submitted 22 May, 2023; v1 submitted 5 April, 2022; originally announced April 2022.

    Comments: v1: 10 pages, 4 figures; v2: 12 pages, 5 figures, accepted by Medical Imaging with Deep Learning (MIDL) 2023 conference, camera-ready version available at https://openreview.net/forum?id=DrZbwobH_zo, source code available at https://github.com/DIAL-RPI/FedCross

  15. arXiv:2203.13118 [pdf, other]

    eess.IV cs.CV

    X-ray Dissectography Improves Lung Nodule Detection

    Authors: Chuang Niu, Giridhar Dasegowda, Pingkun Yan, Mannudeep K. Kalra, Ge Wang

    Abstract: Although radiographs are the most frequently used worldwide due to their cost-effectiveness and widespread accessibility, the structural superposition along the x-ray paths often renders suspicious or concerning lung nodules difficult to detect. In this study, we apply "X-ray dissectography" to dissect lungs digitally from a few radiographic projections, suppress the interference of irrelevant str…

    Submitted 24 March, 2022; originally announced March 2022.

  16. arXiv:2107.06449 [pdf, other]

    eess.IV cs.CV

    End-to-end Ultrasound Frame to Volume Registration

    Authors: Hengtao Guo, Xuanang Xu, Sheng Xu, Bradford J. Wood, Pingkun Yan

    Abstract: Fusing intra-operative 2D transrectal ultrasound (TRUS) image with pre-operative 3D magnetic resonance (MR) volume to guide prostate biopsy can significantly increase the yield. However, such a multimodal 2D/3D registration problem is a very challenging task. In this paper, we propose an end-to-end frame-to-volume registration network (FVR-Net), which can efficiently bridge the previous research g…

    Submitted 13 July, 2021; originally announced July 2021.

    Comments: Early accepted by MICCAI-2021

  17. arXiv:2103.13557 [pdf, other]

    eess.IV cs.CV

    Task-Oriented Low-Dose CT Image Denoising

    Authors: Jiajin Zhang, Hanqing Chao, Xuanang Xu, Chuang Niu, Ge Wang, Pingkun Yan

    Abstract: The extensive use of medical CT has raised a public concern over the radiation dose to the patient. Reducing the radiation dose leads to increased CT image noise and artifacts, which can adversely affect not only the radiologists' judgement but also the performance of downstream medical image analysis tasks. Various low-dose CT denoising methods, especially the recent deep learning based approaches…

    Submitted 10 July, 2021; v1 submitted 24 March, 2021; originally announced March 2021.

    Comments: Paper accepted by MICCAI-2021

  18. arXiv:2102.09615 [pdf, other]

    eess.IV cs.CV

    Noise Entangled GAN For Low-Dose CT Simulation

    Authors: Chuang Niu, Ge Wang, Pingkun Yan, Juergen Hahn, Youfang Lai, Xun Jia, Arjun Krishna, Klaus Mueller, Andreu Badal, Kyle J. Myers, Rongping Zeng

    Abstract: We propose a Noise Entangled GAN (NE-GAN) for simulating low-dose computed tomography (CT) images from a higher dose CT image. First, we present two schemes to generate a clean CT image and a noise image from the high-dose CT image. Then, given these generated images, an NE-GAN is proposed to simulate different levels of low-dose CT images, where the level of generated noise can be continuously co…

    Submitted 18 February, 2021; originally announced February 2021.

  19. Deep Learning Predicts Cardiovascular Disease Risks from Lung Cancer Screening Low Dose Computed Tomography

    Authors: Hanqing Chao, Hongming Shan, Fatemeh Homayounieh, Ramandeep Singh, Ruhani Doda Khera, Hengtao Guo, Timothy Su, Ge Wang, Mannudeep K. Kalra, Pingkun Yan

    Abstract: Cancer patients have a higher risk of cardiovascular disease (CVD) mortality than the general population. Low dose computed tomography (LDCT) for lung cancer screening offers an opportunity for simultaneous CVD risk estimation in at-risk patients. Our deep learning CVD risk prediction model, trained with 30,286 LDCTs from the National Lung Cancer Screening Trial, achieved an area under the curve (…

    Submitted 29 March, 2021; v1 submitted 16 August, 2020; originally announced August 2020.

  20. arXiv:2008.01846 [pdf]

    eess.IV cs.CV cs.LG

    Stabilizing Deep Tomographic Reconstruction

    Authors: Weiwen Wu, Dianlin Hu, Wenxiang Cong, Hongming Shan, Shaoyu Wang, Chuang Niu, Pingkun Yan, Hengyong Yu, Varut Vardhanabhuti, Ge Wang

    Abstract: Tomographic image reconstruction with deep learning is an emerging field, but a recent landmark study reveals that several deep reconstruction networks are unstable for computed tomography (CT) and magnetic resonance imaging (MRI). Specifically, three kinds of instabilities were reported: (1) strong image artefacts from tiny perturbations, (2) small features missing in a deeply reconstructed image…

    Submitted 13 September, 2021; v1 submitted 4 August, 2020; originally announced August 2020.

    Comments: 78 pages, 30 figures, 149 references

  21. arXiv:2007.10416 [pdf, other]

    eess.IV cs.CV

    Integrative Analysis for COVID-19 Patient Outcome Prediction

    Authors: Hanqing Chao, Xi Fang, Jiajin Zhang, Fatemeh Homayounieh, Chiara D. Arru, Subba R. Digumarthy, Rosa Babaei, Hadi K. Mobin, Iman Mohseni, Luca Saba, Alessandro Carriero, Zeno Falaschi, Alessio Pasche, Ge Wang, Mannudeep K. Kalra, Pingkun Yan

    Abstract: While image analysis of chest computed tomography (CT) for COVID-19 diagnosis has been intensively studied, little work has been performed for image-based patient outcome prediction. Management of high-risk patients with early intervention is a key to lower the fatality rate of COVID-19 pneumonia, as a majority of patients recover naturally. Therefore, an accurate prediction of disease progression…

    Submitted 16 September, 2020; v1 submitted 20 July, 2020; originally announced July 2020.

    Comments: This paper has been accepted by Medical Image Analysis. The source code of this work is available at https://github.com/DIAL-RPI/COVID19-ICUPrediction

  22. arXiv:2006.07694 [pdf, other]

    cs.CV eess.IV

    Sensorless Freehand 3D Ultrasound Reconstruction via Deep Contextual Learning

    Authors: Hengtao Guo, Sheng Xu, Bradford Wood, Pingkun Yan

    Abstract: Transrectal ultrasound (US) is the most commonly used imaging modality to guide prostate biopsy and its 3D volume provides even richer context information. Current methods for 3D volume reconstruction from freehand US scans require external tracking devices to provide spatial position for every frame. In this paper, we propose a deep contextual learning network (DCL-Net), which can efficiently exp…

    Submitted 13 June, 2020; originally announced June 2020.

    Comments: Provisionally accepted by MICCAI 2020

  23. arXiv:1910.11456 [pdf, other]

    eess.IV cs.CV

    Unified Multi-scale Feature Abstraction for Medical Image Segmentation

    Authors: Xi Fang, Bo Du, Sheng Xu, Bradford J. Wood, Pingkun Yan

    Abstract: Automatic medical image segmentation, an essential component of medical image analysis, plays an important role in computer-aided diagnosis. For example, locating and segmenting the liver can be very helpful in liver cancer diagnosis and treatment. The state-of-the-art models in medical image segmentation are variants of the encoder-decoder architecture such as fully convolutional network (FCN) and U…

    Submitted 24 October, 2019; originally announced October 2019.

    Comments: Abstract of SPIE Medical Imaging (Oral)

  24. arXiv:1908.10245 [pdf]

    eess.SP

    Feature Exploration for Knowledge-guided and Data-driven Approach Based Cuffless Blood Pressure Measurement

    Authors: Xiaorong Ding, Bryan P Yan, Yuan-Ting Zhang, Jing Liu, Peng Su, Ni Zhao

    Abstract: This study explores extended feature space that is indicative of blood pressure (BP) changes for better estimation of continuous BP in an unobtrusive way. A total of 222 features were extracted from noninvasively acquired electrocardiogram (ECG) and photoplethysmogram (PPG) signals with the subject undergoing coronary angiography and/or percutaneous coronary intervention, during which intra-arteri…

    Submitted 27 August, 2019; originally announced August 2019.

    Comments: 4 pages, 4 figures, 2 tables

  25. arXiv:1903.02026 [pdf, other]

    q-bio.QM cs.CV eess.IV

    Deep Learning in Medical Image Registration: A Survey

    Authors: Grant Haskins, Uwe Kruger, Pingkun Yan

    Abstract: The establishment of image correspondence through robust image registration is critical to many clinical tasks such as image fusion, organ atlas creation, and tumor growth monitoring, and is a very challenging problem. Since the beginning of the recent deep learning renaissance, the medical imaging research community has developed deep learning based approaches and achieved the state-of-the-art in…

    Submitted 21 January, 2020; v1 submitted 5 March, 2019; originally announced March 2019.

    Comments: Accepted for publication by Machine Vision and Applications on January 8, 2020