Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 83 results for author: Niu, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.10289  [pdf, other

    cs.CL cs.AI cs.IR

    VeraCT Scan: Retrieval-Augmented Fake News Detection with Justifiable Reasoning

    Authors: Cheng Niu, Yang Guan, Yuanhao Wu, Juno Zhu, Juntong Song, Randy Zhong, Kaihua Zhu, Siliang Xu, Shizhe Diao, Tong Zhang

    Abstract: The proliferation of fake news poses a significant threat not only by disseminating misleading information but also by undermining the very foundations of democracy. The recent advance of generative artificial intelligence has further exacerbated the challenge of distinguishing genuine news from fabricated stories. In response to this challenge, we introduce VeraCT Scan, a novel retrieval-augmente… ▽ More

    Submitted 24 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2405.13037  [pdf, other

    cs.CL cs.AI

    Enhancing Dialogue State Tracking Models through LLM-backed User-Agents Simulation

    Authors: Cheng Niu, Xingguang Wang, Xuxin Cheng, Juntong Song, Tong Zhang

    Abstract: Dialogue State Tracking (DST) is designed to monitor the evolving dialogue state in the conversations and plays a pivotal role in developing task-oriented dialogue systems. However, obtaining the annotated data for the DST task is usually a costly endeavor. In this paper, we focus on employing LLMs to generate dialogue data to reduce dialogue collection and annotation costs. Specifically, GPT-4 is… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

  3. arXiv:2405.06586  [pdf, other

    cs.CV

    Enhancing Weakly Supervised Semantic Segmentation with Multi-modal Foundation Models: An End-to-End Approach

    Authors: Elham Ravanbakhsh, Cheng Niu, Yongqing Liang, J. Ramanujam, Xin Li

    Abstract: Semantic segmentation is a core computer vision problem, but the high costs of data annotation have hindered its wide application. Weakly-Supervised Semantic Segmentation (WSSS) offers a cost-efficient workaround to extensive labeling in comparison to fully-supervised methods by using partial or incomplete labels. Existing WSSS methods have difficulties in learning the boundaries of objects leadin… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  4. arXiv:2404.10984  [pdf, other

    cs.LG

    Graph Continual Learning with Debiased Lossless Memory Replay

    Authors: Chaoxi Niu, Guansong Pang, Ling Chen

    Abstract: Real-life graph data often expands continually, rendering the learning of graph neural networks (GNNs) on static graph data impractical. Graph continual learning (GCL) tackles this problem by continually adapting GNNs to the expanded graph of the current task while maintaining the performance over the graph of previous tasks. Memory replay-based methods, which aim to replay data of previous tasks… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: 12 pages

  5. arXiv:2404.06041  [pdf, ps, other

    cs.SE

    On Evaluating the Efficiency of Source Code Generated by LLMs

    Authors: Changan Niu, Ting Zhang, Chuanyi Li, Bin Luo, Vincent Ng

    Abstract: Recent years have seen the remarkable capabilities of large language models (LLMs) for code generation. Different from existing work that evaluate the correctness of the code generated by LLMs, we propose to further evaluate its efficiency. More efficient code can lead to higher performance and execution efficiency of programs and software completed by LLM-assisted programming. First, we evaluate… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 1st special event of AI Foundation Models and Software Engineering (FORGE 2024)

  6. arXiv:2403.12331  [pdf, other

    physics.med-ph cs.CV

    Deep Few-view High-resolution Photon-counting Extremity CT at Halved Dose for a Clinical Trial

    Authors: Mengzhou Li, Chuang Niu, Ge Wang, Maya R Amma, Krishna M Chapagain, Stefan Gabrielson, Andrew Li, Kevin Jonker, Niels de Ruiter, Jennifer A Clark, Phil Butler, Anthony Butler, Hengyong Yu

    Abstract: The latest X-ray photon-counting computed tomography (PCCT) for extremity allows multi-energy high-resolution (HR) imaging for tissue characterization and material decomposition. However, both radiation dose and imaging speed need improvement for contrast-enhanced and other studies. Despite the success of deep learning methods for 2D few-view reconstruction, applying them to HR volumetric reconstr… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: 9 figures, 5 tables

  7. arXiv:2403.06128  [pdf, other

    eess.IV cs.CV

    Low-dose CT Denoising with Language-engaged Dual-space Alignment

    Authors: Zhihao Chen, Tao Chen, Chenhui Wang, Chuang Niu, Ge Wang, Hongming Shan

    Abstract: While various deep learning methods were proposed for low-dose computed tomography (CT) denoising, they often suffer from over-smoothing, blurring, and lack of explainability. To alleviate these issues, we propose a plug-and-play Language-Engaged Dual-space Alignment loss (LEDA) to optimize low-dose CT denoising models. Our idea is to leverage large language models (LLMs) to align denoised CT and… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

    Comments: 11 pages, 6 figures

  8. arXiv:2401.00396  [pdf, other

    cs.CL

    RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models

    Authors: Cheng Niu, Yuanhao Wu, Juno Zhu, Siliang Xu, Kashun Shum, Randy Zhong, Juntong Song, Tong Zhang

    Abstract: Retrieval-augmented generation (RAG) has become a main technique for alleviating hallucinations in large language models (LLMs). Despite the integration of RAG, LLMs may still present unsupported or contradictory claims to the retrieved contents. In order to develop effective hallucination prevention strategies under RAG, it is important to create benchmark datasets that can measure the extent of… ▽ More

    Submitted 17 May, 2024; v1 submitted 30 December, 2023; originally announced January 2024.

  9. arXiv:2312.15663  [pdf, other

    cs.CV cs.AI

    IQAGPT: Image Quality Assessment with Vision-language and ChatGPT Models

    Authors: Zhihao Chen, Bin Hu, Chuang Niu, Tao Chen, Yuxin Li, Hongming Shan, Ge Wang

    Abstract: Large language models (LLMs), such as ChatGPT, have demonstrated impressive capabilities in various tasks and attracted an increasing interest as a natural language interface across many domains. Recently, large vision-language models (VLMs) like BLIP-2 and GPT-4 have been intensively investigated, which learn rich vision-language correlation from image-text pairs. However, despite these developme… ▽ More

    Submitted 25 December, 2023; originally announced December 2023.

    Comments: 14 pages, 9 figures

  10. arXiv:2310.06949  [pdf, other

    eess.IV cs.LG physics.med-ph

    Diffusion Prior Regularized Iterative Reconstruction for Low-dose CT

    Authors: Wenjun Xia, Yongyi Shi, Chuang Niu, Wenxiang Cong, Ge Wang

    Abstract: Computed tomography (CT) involves a patient's exposure to ionizing radiation. To reduce the radiation dose, we can either lower the X-ray photon count or down-sample projection views. However, either of the ways often compromises image quality. To address this challenge, here we introduce an iterative reconstruction algorithm regularized by a diffusion prior. Drawing on the exceptional imaging pro… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

  11. arXiv:2309.09602  [pdf, other

    cs.CL cs.AI cs.LG

    Proposition from the Perspective of Chinese Language: A Chinese Proposition Classification Evaluation Benchmark

    Authors: Conghui Niu, Mengyang Hu, Lin Bo, Xiaoli He, Dong Yu, Pengyuan Liu

    Abstract: Existing propositions often rely on logical constants for classification. Compared with Western languages that lean towards hypotaxis such as English, Chinese often relies on semantic or logical understanding rather than logical connectives in daily expressions, exhibiting the characteristics of parataxis. However, existing research has rarely paid attention to this issue. And accurately classifyi… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

  12. arXiv:2309.04828  [pdf, other

    cs.SE

    FAIR: Flow Type-Aware Pre-Training of Compiler Intermediate Representations

    Authors: Changan Niu, Chuanyi Li, Vincent Ng, David Lo, Bin Luo

    Abstract: While the majority of existing pre-trained models from code learn source code features such as code tokens and abstract syntax trees, there are some other works that focus on learning from compiler intermediate representations (IRs). Existing IR-based models typically utilize IR features such as instructions, control and data flow graphs (CDFGs), call graphs, etc. However, these methods confuse va… ▽ More

    Submitted 9 September, 2023; originally announced September 2023.

    Comments: ICSE 2024 First Cycle

  13. arXiv:2308.16863  [pdf, other

    eess.IV cs.CV

    Self-pruning Graph Neural Network for Predicting Inflammatory Disease Activity in Multiple Sclerosis from Brain MR Images

    Authors: Chinmay Prabhakar, Hongwei Bran Li, Johannes C. Paetzold, Timo Loehr, Chen Niu, Mark Mühlau, Daniel Rueckert, Benedikt Wiestler, Bjoern Menze

    Abstract: Multiple Sclerosis (MS) is a severe neurological disease characterized by inflammatory lesions in the central nervous system. Hence, predicting inflammatory disease activity is crucial for disease assessment and treatment. However, MS lesions can occur throughout the brain and vary in shape, size and total count among patients. The high variance in lesion load and locations makes it challenging fo… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

  14. arXiv:2308.12526  [pdf, other

    eess.AS cs.LG cs.SD

    UNISOUND System for VoxCeleb Speaker Recognition Challenge 2023

    Authors: Yu Zheng, Yajun Zhang, Chuanying Niu, Yibin Zhan, Yanhua Long, Dongxing Xu

    Abstract: This report describes the UNISOUND submission for Track1 and Track2 of VoxCeleb Speaker Recognition Challenge 2023 (VoxSRC 2023). We submit the same system on Track 1 and Track 2, which is trained with only VoxCeleb2-dev. Large-scale ResNet and RepVGG architectures are developed for the challenge. We propose a consistency-aware score calibration method, which leverages the stability of audio voice… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

  15. arXiv:2307.09765  [pdf, other

    cs.SE

    Are We Ready to Embrace Generative AI for Software Q&A?

    Authors: Bowen Xu, Thanh-Dat Nguyen, Thanh Le-Cong, Thong Hoang, Jiakun Liu, Kisub Kim, Chen Gong, Changan Niu, Chenyu Wang, Bach Le, David Lo

    Abstract: Stack Overflow, the world's largest software Q&A (SQA) website, is facing a significant traffic drop due to the emergence of generative AI techniques. ChatGPT is banned by Stack Overflow after only 6 days from its release. The main reason provided by the official Stack Overflow is that the answers generated by ChatGPT are of low quality. To verify this, we conduct a comparative evaluation of human… ▽ More

    Submitted 12 August, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: Accepted by the New Ideas and Emerging Results (NIER) track at The IEEE/ACM Automated Software Engineering (ASE) Conference

  16. arXiv:2307.00755  [pdf, other

    cs.LG cs.CV

    Graph-level Anomaly Detection via Hierarchical Memory Networks

    Authors: Chaoxi Niu, Guansong Pang, Ling Chen

    Abstract: Graph-level anomaly detection aims to identify abnormal graphs that exhibit deviant structures and node attributes compared to the majority in a graph set. One primary challenge is to learn normal patterns manifested in both fine-grained and holistic views of graphs for identifying graphs that are abnormal in part or in whole. To tackle this challenge, we propose a novel approach called Hierarchic… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: Accepted to ECML-PKDD 2023

  17. arXiv:2306.08489  [pdf, ps, other

    stat.ML cs.LG math.SP

    Analysis and Approximate Inference of Large Random Kronecker Graphs

    Authors: Zhenyu Liao, Yuanqian Xia, Chengmei Niu, Yong Xiao

    Abstract: Random graph models are playing an increasingly important role in various fields ranging from social networks, telecommunication systems, to physiologic and biological networks. Within this landscape, the random Kronecker graph model, emerges as a prominent framework for scrutinizing intricate real-world networks. In this paper, we investigate large random Kronecker graphs, i.e., the number of gra… ▽ More

    Submitted 5 February, 2024; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: 27 pages, 5 figures, 2 tables

  18. arXiv:2304.02649  [pdf, other

    eess.IV cs.AI cs.CV

    Specialty-Oriented Generalist Medical AI for Chest CT Screening

    Authors: Chuang Niu, Qing Lyu, Christopher D. Carothers, Parisa Kaviani, Josh Tan, Pingkun Yan, Mannudeep K. Kalra, Christopher T. Whitlow, Ge Wang

    Abstract: Modern medical records include a vast amount of multimodal free text clinical data and imaging data from radiology, cardiology, and digital pathology. Fully mining such big data requires multitasking; otherwise, occult but important aspects may be overlooked, adversely affecting clinical management and population healthcare. Despite remarkable successes of AI in individual tasks with single-modal… ▽ More

    Submitted 24 April, 2024; v1 submitted 3 April, 2023; originally announced April 2023.

  19. arXiv:2303.12861  [pdf, other

    eess.IV cs.LG eess.SP physics.bio-ph

    Parallel Diffusion Model-based Sparse-view Cone-beam Breast CT

    Authors: Wenjun Xia, Hsin Wu Tseng, Chuang Niu, Wenxiang Cong, Xiaohua Zhang, Shaohua Liu, Ruola Ning, Srinivasan Vedantham, Ge Wang

    Abstract: Breast cancer is the most prevalent cancer among women worldwide, and early detection is crucial for reducing its mortality rate and improving quality of life. Dedicated breast computed tomography (CT) scanners offer better image quality than mammography and tomosynthesis in general but at higher radiation dose. To enable breast CT for cancer screening, the challenge is to minimize the radiation d… ▽ More

    Submitted 28 January, 2024; v1 submitted 22 March, 2023; originally announced March 2023.

  20. arXiv:2303.10361  [pdf, other

    cs.LG

    DC-CCL: Device-Cloud Collaborative Controlled Learning for Large Vision Models

    Authors: Yucheng Ding, Chaoyue Niu, Fan Wu, Shaojie Tang, Chengfei Lyu, Guihai Chen

    Abstract: Many large vision models have been deployed on the cloud for real-time services. Meanwhile, fresh samples are continuously generated on the served mobile device. How to leverage the device-side samples to improve the cloud-side large model becomes a practical requirement, but falls into the dilemma of no raw sample up-link and no large model down-link. Specifically, the user may opt out of sharing… ▽ More

    Submitted 18 March, 2023; originally announced March 2023.

  21. arXiv:2303.09038  [pdf, other

    cs.CL cs.AI physics.med-ph

    Translating Radiology Reports into Plain Language using ChatGPT and GPT-4 with Prompt Learning: Promising Results, Limitations, and Potential

    Authors: Qing Lyu, Josh Tan, Michael E. Zapadka, Janardhana Ponnatapura, Chuang Niu, Kyle J. Myers, Ge Wang, Christopher T. Whitlow

    Abstract: The large language model called ChatGPT has drawn extensively attention because of its human-like expression and reasoning abilities. In this study, we investigate the feasibility of using ChatGPT in experiments on using ChatGPT to translate radiology reports into plain language for patients and healthcare providers so that they are educated for improved healthcare. Radiology reports from 62 low-d… ▽ More

    Submitted 28 March, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

  22. arXiv:2302.10630  [pdf, other

    eess.IV cs.CV physics.med-ph

    LIT-Former: Linking In-plane and Through-plane Transformers for Simultaneous CT Image Denoising and Deblurring

    Authors: Zhihao Chen, Chuang Niu, Qi Gao, Ge Wang, Hongming Shan

    Abstract: This paper studies 3D low-dose computed tomography (CT) imaging. Although various deep learning methods were developed in this context, typically they focus on 2D images and perform denoising due to low-dose and deblurring for super-resolution separately. Up to date, little work was done for simultaneous in-plane denoising and through-plane deblurring, which is important to obtain high-quality 3D… ▽ More

    Submitted 7 January, 2024; v1 submitted 21 February, 2023; originally announced February 2023.

    Comments: 15 pages, 12 figures

    Journal ref: IEEE Transactions on Medical Imaging, 2024

  23. arXiv:2302.04030  [pdf, other

    cs.SE cs.AI

    CrossCodeBench: Benchmarking Cross-Task Generalization of Source Code Models

    Authors: Changan Niu, Chuanyi Li, Vincent Ng, Bin Luo

    Abstract: Despite the recent advances showing that a model pre-trained on large-scale source code data is able to gain appreciable generalization capability, it still requires a sizeable amount of data on the target task for fine-tuning. And the effectiveness of the model generalization is largely affected by the size and quality of the fine-tuning data, which is detrimental for target tasks with limited or… ▽ More

    Submitted 10 February, 2023; v1 submitted 8 February, 2023; originally announced February 2023.

    Comments: ICSE 2023

  24. arXiv:2302.04026  [pdf, ps, other

    cs.SE

    An Empirical Comparison of Pre-Trained Models of Source Code

    Authors: Changan Niu, Chuanyi Li, Vincent Ng, Dongxiao Chen, Jidong Ge, Bin Luo

    Abstract: While a large number of pre-trained models of source code have been successfully developed and applied to a variety of software engineering (SE) tasks in recent years, our understanding of these pre-trained models is arguably fairly limited. With the goal of advancing our understanding of these models, we perform the first systematic empirical comparison of 19 recently-developed pre-trained models… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

    Comments: ICSE 2023

  25. arXiv:2302.03916  [pdf, other

    cs.LG

    QS-ADN: Quasi-Supervised Artifact Disentanglement Network for Low-Dose CT Image Denoising by Local Similarity Among Unpaired Data

    Authors: Yuhui Ruan, Qiao Yuan, Chuang Niu, Chen Li, Yudong Yao, Ge Wang, Yueyang Teng

    Abstract: Deep learning has been successfully applied to low-dose CT (LDCT) image denoising for reducing potential radiation risk. However, the widely reported supervised LDCT denoising networks require a training set of paired images, which is expensive to obtain and cannot be perfectly simulated. Unsupervised learning utilizes unpaired data and is highly desirable for LDCT denoising. As an example, an art… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

  26. arXiv:2301.13340  [pdf, other

    cs.LG cs.SI

    Affinity Uncertainty-based Hard Negative Mining in Graph Contrastive Learning

    Authors: Chaoxi Niu, Guansong Pang, Ling Chen

    Abstract: Hard negative mining has shown effective in enhancing self-supervised contrastive learning (CL) on diverse data types, including graph CL (GCL). The existing hardness-aware CL methods typically treat negative instances that are most similar to the anchor instance as hard negatives, which helps improve the CL performance, especially on image data. However, this approach often fails to identify the… ▽ More

    Submitted 6 January, 2024; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: Accepted to TNNLS

  27. arXiv:2211.06276  [pdf, other

    cs.CV

    One-Time Model Adaptation to Heterogeneous Clients: An Intra-Client and Inter-Image Attention Design

    Authors: Yikai Yan, Chaoyue Niu, Fan Wu, Qinya Li, Shaojie Tang, Chengfei Lyu, Guihai Chen

    Abstract: The mainstream workflow of image recognition applications is first training one global model on the cloud for a wide range of classes and then serving numerous clients, each with heterogeneous images from a small subset of classes to be recognized. From the cloud-client discrepancies on the range of image classes, the recognition model is desired to have strong adaptiveness, intuitively by concent… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

  28. arXiv:2211.01163  [pdf, other

    cs.IR cs.AI cs.LG

    On-Device Model Fine-Tuning with Label Correction in Recommender Systems

    Authors: Yucheng Ding, Chaoyue Niu, Fan Wu, Shaojie Tang, Chengfei Lyu, Guihai Chen

    Abstract: To meet the practical requirements of low latency, low cost, and good privacy in online intelligent services, more and more deep learning models are offloaded from the cloud to mobile devices. To further deal with cross-device data heterogeneity, the offloaded models normally need to be fine-tuned with each individual user's local samples before being put into real-time inference. In this work, we… ▽ More

    Submitted 21 October, 2022; originally announced November 2022.

  29. Quad-Net: Quad-domain Network for CT Metal Artifact Reduction

    Authors: Zilong Li, Qi Gao, Yaping Wu, Chuang Niu, Junping Zhang, Meiyun Wang, Ge Wang, Hongming Shan

    Abstract: Metal implants and other high-density objects in patients introduce severe streaking artifacts in CT images, compromising image quality and diagnostic performance. Although various methods were developed for CT metal artifact reduction over the past decades, including the latest dual-domain deep networks, remaining metal artifacts are still clinically challenging in many cases. Here we extend the… ▽ More

    Submitted 31 May, 2023; v1 submitted 24 July, 2022; originally announced July 2022.

    Journal ref: IEEE Transactions on Medical Imaging, 2024

  30. arXiv:2207.07743  [pdf, other

    cs.LG cs.AI cs.CV

    HOME: High-Order Mixed-Moment-based Embedding for Representation Learning

    Authors: Chuang Niu, Ge Wang

    Abstract: Minimum redundancy among different elements of an embedding in a latent space is a fundamental requirement or major preference in representation learning to capture intrinsic informational structures. Current self-supervised learning methods minimize a pair-wise covariance matrix to reduce the feature redundancy and produce promising results. However, such representation features of multiple varia… ▽ More

    Submitted 15 July, 2022; originally announced July 2022.

  31. arXiv:2206.06461  [pdf, other

    cs.CV cs.AI

    Self-Supervised Representation Learning With MUlti-Segmental Informational Coding (MUSIC)

    Authors: Chuang Niu, Ge Wang

    Abstract: Self-supervised representation learning maps high-dimensional data into a meaningful embedding space, where samples of similar semantic contents are close to each other. Most of the recent representation learning methods maximize cosine similarity or minimize the distance between the embedding features of different views from the same sample usually on the $l2$ normalized unit hypersphere. To prev… ▽ More

    Submitted 13 June, 2022; originally announced June 2022.

  32. arXiv:2205.14833  [pdf, other

    cs.LG cs.DC eess.SY

    Walle: An End-to-End, General-Purpose, and Large-Scale Production System for Device-Cloud Collaborative Machine Learning

    Authors: Chengfei Lv, Chaoyue Niu, Renjie Gu, Xiaotang Jiang, Zhaode Wang, Bin Liu, Ziqi Wu, Qiulin Yao, Congyu Huang, Panos Huang, Tao Huang, Hui Shu, Jinde Song, Bin Zou, Peng Lan, Guohuan Xu, Fei Wu, Shaojie Tang, Fan Wu, Guihai Chen

    Abstract: To break the bottlenecks of mainstream cloud-based machine learning (ML) paradigm, we adopt device-cloud collaborative ML and build the first end-to-end and general-purpose system, called Walle, as the foundation. Walle consists of a deployment platform, distributing ML tasks to billion-scale devices in time; a data pipeline, efficiently preparing task input; and a compute container, providing a c… ▽ More

    Submitted 29 May, 2022; originally announced May 2022.

    Comments: Accepted by OSDI 2022

  33. arXiv:2205.11739  [pdf, ps, other

    cs.SE cs.AI

    Deep Learning Meets Software Engineering: A Survey on Pre-Trained Models of Source Code

    Authors: Changan Niu, Chuanyi Li, Bin Luo, Vincent Ng

    Abstract: Recent years have seen the successful application of deep learning to software engineering (SE). In particular, the development and use of pre-trained models of source code has enabled state-of-the-art results to be achieved on a wide variety of SE tasks. This paper provides an overview of this rapidly advancing field of research and reflects on future research directions.

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: IJCAI 2022: Survey Track

  34. Unsupervised Contrastive Learning based Transformer for Lung Nodule Detection

    Authors: Chuang Niu, Ge Wang

    Abstract: Early detection of lung nodules with computed tomography (CT) is critical for the longer survival of lung cancer patients and better quality of life. Computer-aided detection/diagnosis (CAD) is proven valuable as a second or concurrent reader in this context. However, accurate detection of lung nodules remains a challenge for such CAD systems and even radiologists due to not only the variability i… ▽ More

    Submitted 29 April, 2022; originally announced May 2022.

  35. arXiv:2203.13118  [pdf, other

    eess.IV cs.CV

    X-ray Dissectography Improves Lung Nodule Detection

    Authors: Chuang Niu, Giridhar Dasegowda, Pingkun Yan, Mannudeep K. Kalra, Ge Wang

    Abstract: Although radiographs are the most frequently used worldwide due to their cost-effectiveness and widespread accessibility, the structural superposition along the x-ray paths often renders suspicious or concerning lung nodules difficult to detect. In this study, we apply "X-ray dissectography" to dissect lungs digitally from a few radiographic projections, suppress the interference of irrelevant str… ▽ More

    Submitted 24 March, 2022; originally announced March 2022.

  36. arXiv:2203.11722  [pdf, other

    eess.IV cs.CV cs.LG

    Convolutional Neural Network to Restore Low-Dose Digital Breast Tomosynthesis Projections in a Variance Stabilization Domain

    Authors: Rodrigo de Barros Vimieiro, Chuang Niu, Hongming Shan, Lucas Rodrigues Borges, Ge Wang, Marcelo Andrade da Costa Vieira

    Abstract: Digital breast tomosynthesis (DBT) exams should utilize the lowest possible radiation dose while maintaining sufficiently good image quality for accurate medical diagnosis. In this work, we propose a convolution neural network (CNN) to restore low-dose (LD) DBT projections to achieve an image quality equivalent to a standard full-dose (FD) acquisition. The proposed network architecture benefits fr… ▽ More

    Submitted 22 March, 2022; originally announced March 2022.

    Comments: 12 pages, 9 figures

  37. arXiv:2201.12763  [pdf, other

    cs.CV

    RIM-Net: Recursive Implicit Fields for Unsupervised Learning of Hierarchical Shape Structures

    Authors: Chengjie Niu, Manyi Li, Kai Xu, Hao Zhang

    Abstract: We introduce RIM-Net, a neural network which learns recursive implicit fields for unsupervised inference of hierarchical shape structures. Our network recursively decomposes an input 3D shape into two parts, resulting in a binary tree hierarchy. Each level of the tree corresponds to an assembly of shape parts, represented as implicit functions, to reconstruct the input shape. At each node of the t… ▽ More

    Submitted 28 March, 2022; v1 submitted 30 January, 2022; originally announced January 2022.

  38. arXiv:2201.10382  [pdf, other

    cs.LG cs.AI

    On-Device Learning with Cloud-Coordinated Data Augmentation for Extreme Model Personalization in Recommender Systems

    Authors: Renjie Gu, Chaoyue Niu, Yikai Yan, Fan Wu, Shaojie Tang, Rongfeng Jia, Chengfei Lyu, Guihai Chen

    Abstract: Data heterogeneity is an intrinsic property of recommender systems, making models trained over the global data on the cloud, which is the mainstream in industry, non-optimal to each individual user's local data distribution. To deal with data heterogeneity, model personalization with on-device learning is a potential solution. However, on-device training using a user's small size of local samples… ▽ More

    Submitted 23 January, 2022; originally announced January 2022.

  39. arXiv:2201.01549  [pdf, other

    cs.SE

    SPT-Code: Sequence-to-Sequence Pre-Training for Learning Source Code Representations

    Authors: Changan Niu, Chuanyi Li, Vincent Ng, Jidong Ge, Liguo Huang, Bin Luo

    Abstract: Recent years have seen the successful application of large pre-trained models to code representation learning, resulting in substantial improvements on many code-related downstream tasks. But there are issues surrounding their application to SE tasks. First, the majority of the pre-trained models focus on pre-training only the encoder of the Transformer. For generation tasks that are addressed usi… ▽ More

    Submitted 25 May, 2022; v1 submitted 5 January, 2022; originally announced January 2022.

    Comments: ICSE 2022: Technical Track

  40. arXiv:2111.15040  [pdf, other

    physics.med-ph cs.AI

    X-ray Dissectography Enables Stereotography to Improve Diagnostic Performance

    Authors: Chuang Niu, Ge Wang

    Abstract: X-ray imaging is the most popular medical imaging technology. While x-ray radiography is rather cost-effective, tissue structures are superimposed along the x-ray paths. On the other hand, computed tomography (CT) reconstructs internal structures but CT increases radiation dose, is complicated and expensive. Here we propose "x-ray dissectography" to extract a target organ/tissue digitally from few… ▽ More

    Submitted 29 November, 2021; originally announced November 2021.

  41. arXiv:2111.09649  [pdf

    cs.CE

    HRnV-Calc: A software package for heart rate n-variability and heart rate variability analysis

    Authors: Chenglin Niu, Dagang Guo, Marcus Eng Hock Ong, Zhi Xiong Koh, Andrew Fu Wah Ho, Zhiping Lin, Chengyu Liu, Gari D. Clifford, Nan Liu

    Abstract: Objective: Heart rate variability (HRV) has been proven to be an important indicator of physiological status for numerous applications. Despite the progress and active developments made in HRV metric research over the last few decades, the representation of the heartbeat sequence upon which HRV is based has received relatively little attention. The recently introduced heart rate n-variability (HRn… ▽ More

    Submitted 18 November, 2021; originally announced November 2021.

  42. arXiv:2111.08227  [pdf, other

    cs.LG physics.med-ph

    Phase function estimation from a diffuse optical image via deep learning

    Authors: Yuxuan Liang, Chuang Niu, Chen Wei, Shenghan Ren, Wenxiang Cong, Ge Wang

    Abstract: The phase function is a key element of a light propagation model for Monte Carlo (MC) simulation, which is usually fitted with an analytic function with associated parameters. In recent years, machine learning methods were reported to estimate the parameters of the phase function of a particular form such as the Henyey-Greenstein phase function but, to our knowledge, no studies have been performed… ▽ More

    Submitted 15 November, 2021; originally announced November 2021.

    Comments: 16 pages, 8 figures

  43. arXiv:2109.07704  [pdf, other

    cs.LG cs.AI

    Federated Submodel Optimization for Hot and Cold Data Features

    Authors: Yucheng Ding, Chaoyue Niu, Fan Wu, Shaojie Tang, Chengfei Lv, Yanghe Feng, Guihai Chen

    Abstract: We study practical data characteristics underlying federated learning, where non-i.i.d. data from clients have sparse features, and a certain client's local data normally involves only a small part of the full model, called a submodel. Due to data sparsity, the classical federated averaging (FedAvg) algorithm or its variants will be severely slowed down, because when updating the global model, eac… ▽ More

    Submitted 5 April, 2023; v1 submitted 15 September, 2021; originally announced September 2021.

  44. arXiv:2106.09834  [pdf

    eess.IV cs.CV cs.LG

    AI-Enabled Ultra-Low-Dose CT Reconstruction

    Authors: Weiwen Wu, Chuang Niu, Shadi Ebrahimian, Hengyong Yu, Mannu Kalra, Ge Wang

    Abstract: By the ALARA (As Low As Reasonably Achievable) principle, ultra-low-dose CT reconstruction is a holy grail to minimize cancer risks and genetic damages, especially for children. With the development of medical CT technologies, the iterative algorithms are widely used to reconstruct decent CT images from a low-dose scan. Recently, artificial intelligence (AI) techniques have shown a great promise i… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: 19 pages, 10 figures, 1 table, 44 references

    MSC Class: 68T07

  45. arXiv:2103.13557  [pdf, other

    eess.IV cs.CV

    Task-Oriented Low-Dose CT Image Denoising

    Authors: Jiajin Zhang, Hanqing Chao, Xuanang Xu, Chuang Niu, Ge Wang, Pingkun Yan

    Abstract: The extensive use of medical CT has raised a public concern over the radiation dose to the patient. Reducing the radiation dose leads to increased CT image noise and artifacts, which can adversely affect not only the radiologists judgement but also the performance of downstream medical image analysis tasks. Various low-dose CT denoising methods, especially the recent deep learning based approaches… ▽ More

    Submitted 10 July, 2021; v1 submitted 24 March, 2021; originally announced March 2021.

    Comments: Paper accepted by MICCAI-2021

  46. arXiv:2103.10493  [pdf, ps, other

    cs.CV

    Image Synthesis for Data Augmentation in Medical CT using Deep Reinforcement Learning

    Authors: Arjun Krishna, Kedar Bartake, Chuang Niu, Ge Wang, Youfang Lai, Xun Jia, Klaus Mueller

    Abstract: Deep learning has shown great promise for CT image reconstruction, in particular to enable low dose imaging and integrated diagnostics. These merits, however, stand at great odds with the low availability of diverse image data which are needed to train these neural networks. We propose to overcome this bottleneck via a deep reinforcement learning (DRL) approach that is integrated with a style-tran… ▽ More

    Submitted 21 March, 2021; v1 submitted 18 March, 2021; originally announced March 2021.

    Comments: Fully3D 2021

  47. SPICE: Semantic Pseudo-labeling for Image Clustering

    Authors: Chuang Niu, Hongming Shan, Ge Wang

    Abstract: The similarity among samples and the discrepancy between clusters are two crucial aspects of image clustering. However, current deep clustering methods suffer from the inaccurate estimation of either feature similarity or semantic discrepancy. In this paper, we present a Semantic Pseudo-labeling-based Image ClustEring (SPICE) framework, which divides the clustering network into a feature model for… ▽ More

    Submitted 14 January, 2022; v1 submitted 16 March, 2021; originally announced March 2021.

    Journal ref: IEEE Transactions on Image Processing, 2022

  48. arXiv:2102.09615  [pdf, other

    eess.IV cs.CV

    Noise Entangled GAN For Low-Dose CT Simulation

    Authors: Chuang Niu, Ge Wang, Pingkun Yan, Juergen Hahn, Youfang Lai, Xun Jia, Arjun Krishna, Klaus Mueller, Andreu Badal, KyleJ. Myers, Rongping Zeng

    Abstract: We propose a Noise Entangled GAN (NE-GAN) for simulating low-dose computed tomography (CT) images from a higher dose CT image. First, we present two schemes to generate a clean CT image and a noise image from the high-dose CT image. Then, given these generated images, an NE-GAN is proposed to simulate different levels of low-dose CT images, where the level of generated noise can be continuously co… ▽ More

    Submitted 18 February, 2021; originally announced February 2021.

  49. arXiv:2012.10936  [pdf, other

    cs.LG cs.DC

    Toward Understanding the Influence of Individual Clients in Federated Learning

    Authors: Yihao Xue, Chaoyue Niu, Zhenzhe Zheng, Shaojie Tang, Chengfei Lv, Fan Wu, Guihai Chen

    Abstract: Federated learning allows mobile clients to jointly train a global model without sending their private data to a central server. Extensive works have studied the performance guarantee of the global model, however, it is still unclear how each individual client influences the collaborative training process. In this work, we defined a new notion, called {\em Fed-Influence}, to quantify this influenc… ▽ More

    Submitted 12 April, 2021; v1 submitted 20 December, 2020; originally announced December 2020.

    Comments: Accepted at AAAI 2021

    ACM Class: I.2.6; G.3

  50. arXiv:2012.02907   

    cs.RO

    Depth estimation on embedded computers for robot swarms in forest

    Authors: Chaoyue Niu, Danesh Tarapore, Klaus-Peter Zauner

    Abstract: Robot swarms to date are not prepared for autonomous navigation such as path planning and obstacle detection in forest floor, unable to achieve low-cost. The development of depth sensing and embedded computing hardware paves the way for swarm of terrestrial robots. The goal of this research is to improve this situation by developing low cost vision system for small ground robots to rapidly perceiv… ▽ More

    Submitted 19 March, 2021; v1 submitted 4 December, 2020; originally announced December 2020.

    Comments: the depth estimation models cannot perform well in a forest