Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 104 results for author: Guan, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.15389  [pdf, other

    cs.LG cs.CR cs.DC

    Poisoning with A Pill: Circumventing Detection in Federated Learning

    Authors: Hanxi Guo, Hao Wang, Tao Song, Tianhang Zheng, Yang Hua, Haibing Guan, Xiangyu Zhang

    Abstract: Without direct access to the client's data, federated learning (FL) is well-known for its unique strength in data privacy protection among existing distributed machine learning techniques. However, its distributive and iterative nature makes FL inherently vulnerable to various poisoning attacks. To counteract these threats, extensive defenses have been proposed to filter out malicious clients, usi… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

  2. Loki: A System for Serving ML Inference Pipelines with Hardware and Accuracy Scaling

    Authors: Sohaib Ahmad, Hui Guan, Ramesh K. Sitaraman

    Abstract: The rapid adoption of machine learning (ML) has underscored the importance of serving ML models with high throughput and resource efficiency. Traditional approaches to managing increasing query demands have predominantly focused on hardware scaling, which involves increasing server count or computing power. However, this strategy can often be impractical due to limitations in the available budget… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  3. arXiv:2407.01245  [pdf, other

    cs.AI cs.CY

    SINKT: A Structure-Aware Inductive Knowledge Tracing Model with Large Language Model

    Authors: Lingyue Fu, Hao Guan, Kounianhua Du, Jianghao Lin, Wei Xia, Weinan Zhang, Ruiming Tang, Yasheng Wang, Yong Yu

    Abstract: Knowledge Tracing (KT) aims to determine whether students will respond correctly to the next question, which is a crucial task in intelligent tutoring systems (ITS). In educational KT scenarios, transductive ID-based methods often face severe data sparsity and cold start problems, where interactions between individual students and questions are sparse, and new questions and concepts consistently a… ▽ More

    Submitted 23 July, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

  4. arXiv:2406.08810  [pdf, other

    cs.CV

    Few-Shot Anomaly Detection via Category-Agnostic Registration Learning

    Authors: Chaoqin Huang, Haoyan Guan, Aofan Jiang, Yanfeng Wang, Michael Spratling, Xinchao Wang, Ya Zhang

    Abstract: Most existing anomaly detection methods require a dedicated model for each category. Such a paradigm, despite its promising results, is computationally expensive and inefficient, thereby failing to meet the requirements for real-world applications. Inspired by how humans detect anomalies, by comparing a query image to known normal ones, this paper proposes a novel few-shot anomaly detection (FSAD)… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  5. arXiv:2406.08334  [pdf, other

    cs.DC cs.AI cs.LG cs.PF

    ProTrain: Efficient LLM Training via Memory-Aware Techniques

    Authors: Hanmei Yang, Jin Zhou, Yao Fu, Xiaoqun Wang, Ramine Roane, Hui Guan, Tongping Liu

    Abstract: It is extremely memory-hungry to train Large Language Models (LLM). To solve this problem, existing work exploits the combination of CPU and GPU for the training process, such as ZeRO-Offload. Such a technique largely democratizes billion-scale model training, making it possible to train with few consumer graphics cards. However, based on our observation, existing frameworks often provide coarse-g… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  6. arXiv:2406.00684  [pdf, other

    cs.CV cs.CL

    Deciphering Oracle Bone Language with Diffusion Models

    Authors: Haisu Guan, Huanxin Yang, Xinyu Wang, Shengwei Han, Yongge Liu, Lianwen Jin, Xiang Bai, Yuliang Liu

    Abstract: Originating from China's Shang Dynasty approximately 3,000 years ago, the Oracle Bone Script (OBS) is a cornerstone in the annals of linguistic history, predating many established writing systems. Despite the discovery of thousands of inscriptions, a vast expanse of OBS remains undeciphered, casting a veil of mystery over this ancient language. The emergence of modern AI technologies presents a no… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: ACL2024 main conference long paper

  7. arXiv:2406.00552  [pdf, other

    cs.LG cs.DC

    Graph Neural Network Training Systems: A Performance Comparison of Full-Graph and Mini-Batch

    Authors: Saurabh Bajaj, Hui Guan, Marco Serafini

    Abstract: Graph Neural Networks (GNNs) have gained significant attention in recent years due to their ability to learn representations of graph structured data. Two common methods for training GNNs are mini-batch training and full-graph training. Since these two methods require different training pipelines and systems optimizations, two separate categories of GNN training systems emerged, each tailored for… ▽ More

    Submitted 8 June, 2024; v1 submitted 1 June, 2024; originally announced June 2024.

    Comments: 12 pages, 1 appendix, 8 Figures, 16 Tables, Graph Neural Network, Graph Neural Networks, Full-graph training, Mini-batch training, full-batch training, distributed training, performance, epoch time, time to accuracy, accuracy

  8. arXiv:2405.19931  [pdf, other

    cs.CV cs.AI cs.LG

    Exploring Diffusion Models' Corruption Stage in Few-Shot Fine-tuning and Mitigating with Bayesian Neural Networks

    Authors: Xiaoyu Wu, Jiaru Zhang, Yang Hua, Bohan Lyu, Hao Wang, Tao Song, Haibing Guan

    Abstract: Few-shot fine-tuning of Diffusion Models (DMs) is a key advancement, significantly reducing training costs and enabling personalized AI applications. However, we explore the training dynamics of DMs and observe an unanticipated phenomenon: during the training process, image fidelity initially improves, then unexpectedly deteriorates with the emergence of noisy patterns, only to recover later with… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Preprint. Under review

  9. arXiv:2405.16297  [pdf, other

    cs.LG physics.ao-ph physics.comp-ph

    LUCIE: A Lightweight Uncoupled ClImate Emulator with long-term stability and physical consistency for O(1000)-member ensembles

    Authors: Haiwen Guan, Troy Arcomano, Ashesh Chattopadhyay, Romit Maulik

    Abstract: We present LUCIE, a $1000$- member ensemble data-driven atmospheric emulator that remains stable during autoregressive inference for thousands of years without a drifting climatology. LUCIE has been trained on $9.5$ years of coarse-resolution ERA5 data with $4$ prognostic variables on a single A100 GPU for $2.4$ h. Owing to the cheap computational cost of inference, $1000$ model ensembles are exec… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  10. arXiv:2405.15551  [pdf, other

    cs.LG

    Thinking Forward: Memory-Efficient Federated Finetuning of Language Models

    Authors: Kunjal Panchal, Nisarg Parikh, Sunav Choudhary, Lijun Zhang, Yuriy Brun, Hui Guan

    Abstract: Finetuning large language models (LLMs) in federated learning (FL) settings has become important as it allows resource-constrained devices to finetune a model using private data. However, finetuning LLMs using backpropagation requires excessive memory (especially from intermediate activations) for resource-constrained devices. While Forward-mode Auto-Differentiation (AD) can reduce memory footprin… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  11. arXiv:2405.09713  [pdf, other

    cs.CV cs.AI cs.CL

    SOK-Bench: A Situated Video Reasoning Benchmark with Aligned Open-World Knowledge

    Authors: Andong Wang, Bo Wu, Sunli Chen, Zhenfang Chen, Haotian Guan, Wei-Ning Lee, Li Erran Li, Chuang Gan

    Abstract: Learning commonsense reasoning from visual contexts and scenes in real-world is a crucial step toward advanced artificial intelligence. However, existing video reasoning benchmarks are still inadequate since they were mainly designed for factual or situated reasoning and rarely involve broader knowledge in the real world. Our work aims to delve deeper into reasoning evaluations, specifically withi… ▽ More

    Submitted 16 May, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

    Comments: CVPR

  12. arXiv:2405.00074  [pdf, other

    cs.LG cs.SE

    PAODING: A High-fidelity Data-free Pruning Toolkit for Debloating Pre-trained Neural Networks

    Authors: Mark Huasong Meng, Hao Guan, Liuhuo Wan, Sin Gee Teo, Guangdong Bai, Jin Song Dong

    Abstract: We present PAODING, a toolkit to debloat pretrained neural network models through the lens of data-free pruning. To preserve the model fidelity, PAODING adopts an iterative process, which dynamically measures the effect of deleting a neuron to identify candidates that have the least impact to the output layer. Our evaluation shows that PAODING can significantly reduce the model size, generalize on… ▽ More

    Submitted 30 April, 2024; originally announced May 2024.

    Comments: 3 pages

  13. arXiv:2404.01133  [pdf, other

    cs.CV

    CityGaussian: Real-time High-quality Large-Scale Scene Rendering with Gaussians

    Authors: Yang Liu, He Guan, Chuanchen Luo, Lue Fan, Naiyan Wang, Junran Peng, Zhaoxiang Zhang

    Abstract: The advancement of real-time 3D scene reconstruction and novel view synthesis has been significantly propelled by 3D Gaussian Splatting (3DGS). However, effectively training large-scale 3DGS and rendering it in real-time across various scales remains challenging. This paper introduces CityGaussian (CityGS), which employs a novel divide-and-conquer training approach and Level-of-Detail (LoD) strate… ▽ More

    Submitted 17 July, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    Comments: Accepted by ECCV2024; Project Page: https://dekuliutesla.github.io/citygs/

  14. arXiv:2403.11162  [pdf, other

    cs.CV cs.AI cs.CR cs.CY cs.LG

    CGI-DM: Digital Copyright Authentication for Diffusion Models via Contrasting Gradient Inversion

    Authors: Xiaoyu Wu, Yang Hua, Chumeng Liang, Jiaru Zhang, Hao Wang, Tao Song, Haibing Guan

    Abstract: Diffusion Models (DMs) have evolved into advanced image generation tools, especially for few-shot generation where a pretrained model is fine-tuned on a small set of images to capture a specific style or object. Despite their success, concerns exist about potential copyright violations stemming from the use of unauthorized data in this process. In response, we present Contrasting Gradient Inversio… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  15. arXiv:2403.01849  [pdf, other

    cs.CV cs.AI cs.LG

    One Prompt Word is Enough to Boost Adversarial Robustness for Pre-trained Vision-Language Models

    Authors: Lin Li, Haoyan Guan, Jianing Qiu, Michael Spratling

    Abstract: Large pre-trained Vision-Language Models (VLMs) like CLIP, despite having remarkable generalization ability, are highly vulnerable to adversarial examples. This work studies the adversarial robustness of VLMs from the novel perspective of the text prompt instead of the extensively studied model weights (frozen in this work). We first show that the effectiveness of both adversarial attack and defen… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: CVPR2024

  16. GraphMini: Accelerating Graph Pattern Matching Using Auxiliary Graphs

    Authors: Juelin Liu, Sandeep Polisetty, Hui Guan, Marco Serafini

    Abstract: Graph pattern matching is a fundamental problem encountered by many common graph mining tasks and the basic building block of several graph mining systems. This paper explores for the first time how to proactively prune graphs to speed up graph pattern matching by leveraging the structure of the query pattern and the input graph. We propose building auxiliary graphs, which are different pruned… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  17. arXiv:2402.18252  [pdf, other

    cs.CL cs.AI

    Towards Generalist Prompting for Large Language Models by Mental Models

    Authors: Haoxiang Guan, Jiyan He, Shuxin Zheng, En-Hong Chen, Weiming Zhang, Nenghai Yu

    Abstract: Large language models (LLMs) have demonstrated impressive performance on many tasks. However, to achieve optimal performance, specially designed prompting methods are still needed. These methods either rely on task-specific few-shot examples that require a certain level of domain knowledge, or are designed to be simple but only perform well on a few types of tasks. In this work, we attempt to intr… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  18. arXiv:2402.16001  [pdf

    cs.CV

    Cross-Resolution Land Cover Classification Using Outdated Products and Transformers

    Authors: Huan Ni, Yubin Zhao, Haiyan Guan, Cheng Jiang, Yongshi Jie, Xing Wang, Yiyang Shen

    Abstract: Large-scale high-resolution land cover classification is a prerequisite for constructing Earth system models and addressing ecological and resource issues. Advancements in satellite sensor technology have led to an improvement in spatial resolution and wider coverage areas. Nevertheless, the lack of high-resolution labeled data is still a challenge, hindering the largescale application of land cov… ▽ More

    Submitted 5 March, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

  19. arXiv:2402.13631  [pdf, other

    cs.CV

    Delving into Dark Regions for Robust Shadow Detection

    Authors: Huankang Guan, Ke Xu, Rynson W. H. Lau

    Abstract: Shadow detection is a challenging task as it requires a comprehensive understanding of shadow characteristics and global/local illumination conditions. We observe from our experiment that state-of-the-art deep methods tend to have higher error rates in differentiating shadow pixels from non-shadow pixels in dark regions (ie, regions with low-intensity values). Our key insight to this problem is th… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  20. arXiv:2401.15365  [pdf, other

    cs.CV

    An open dataset for oracle bone script recognition and decipherment

    Authors: Pengjie Wang, Kaile Zhang, Xinyu Wang, Shengwei Han, Yongge Liu, Jinpeng Wan, Haisu Guan, Zhebin Kuang, Lianwen Jin, Xiang Bai, Yuliang Liu

    Abstract: Oracle Bone Script (OBS), one of the earliest known forms of ancient Chinese writing, holds invaluable insights into the humanities and geography of the Shang Dynasty, dating back 3,000 years. The immense historical and cultural significance of these writings cannot be overstated. However, the passage of time has obscured much of their meaning, presenting a significant challenge in deciphering the… ▽ More

    Submitted 5 June, 2024; v1 submitted 27 January, 2024; originally announced January 2024.

  21. arXiv:2401.12467  [pdf, other

    cs.AI

    An open dataset for the evolution of oracle bone characters: EVOBC

    Authors: Haisu Guan, Jinpeng Wan, Yuliang Liu, Pengjie Wang, Kaile Zhang, Zhebin Kuang, Xinyu Wang, Xiang Bai, Lianwen Jin

    Abstract: The earliest extant Chinese characters originate from oracle bone inscriptions, which are closely related to other East Asian languages. These inscriptions hold immense value for anthropology and archaeology. However, deciphering oracle bone script remains a formidable challenge, with only approximately 1,600 of the over 4,500 extant characters elucidated to date. Further scholarly investigation i… ▽ More

    Submitted 13 February, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

  22. arXiv:2401.12393  [pdf, other

    cs.DB cs.AI

    A Learning-based Declarative Privacy-Preserving Framework for Federated Data Management

    Authors: Hong Guan, Summer Gautier, Deepti Gupta, Rajan Hari Ambrish, Yancheng Wang, Harsha Lakamsani, Dhanush Giriyan, Saajan Maslanka, Chaowei Xiao, Yingzhen Yang, Jia Zou

    Abstract: It is challenging to balance the privacy and accuracy for federated query processing over multiple private data silos. In this work, we will demonstrate an end-to-end workflow for automating an emerging privacy-preserving technique that uses a deep learning model trained using the Differentially-Private Stochastic Gradient Descent (DP-SGD) algorithm to replace portions of actual data to answer a q… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  23. arXiv:2401.04247  [pdf, other

    cs.CV cs.AI

    Robust Image Watermarking using Stable Diffusion

    Authors: Lijun Zhang, Xiao Liu, Antoni Viros Martin, Cindy Xiong Bearfield, Yuriy Brun, Hui Guan

    Abstract: Watermarking images is critical for tracking image provenance and claiming ownership. With the advent of generative models, such as stable diffusion, able to create fake but realistic images, watermarking has become particularly important, e.g., to make generated images reliably identifiable. Unfortunately, the very same stable diffusion technology can remove watermarks injected using existing met… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: 15 pages, 14 figures

  24. arXiv:2312.16151  [pdf, other

    cs.CV

    Large-scale Long-tailed Disease Diagnosis on Radiology Images

    Authors: Qiaoyu Zheng, Weike Zhao, Chaoyi Wu, Xiaoman Zhang, Lisong Dai, Hengyu Guan, Yuehua Li, Ya Zhang, Yanfeng Wang, Weidi Xie

    Abstract: Developing a generalist radiology diagnosis system can greatly enhance clinical diagnostics. In this paper, we introduce RadDiag, a foundational model supporting 2D and 3D inputs across various modalities and anatomies, using a transformer-based fusion module for comprehensive disease diagnosis. Due to patient privacy concerns and the lack of large-scale radiology diagnosis datasets, we utilize hi… ▽ More

    Submitted 16 June, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

  25. arXiv:2312.12484  [pdf, other

    cs.CR cs.DC cs.LG

    SkyMask: Attack-agnostic Robust Federated Learning with Fine-grained Learnable Masks

    Authors: Peishen Yan, Hao Wang, Tao Song, Yang Hua, Ruhui Ma, Ningxin Hu, Mohammad R. Haghighat, Haibing Guan

    Abstract: Federated Learning (FL) is becoming a popular paradigm for leveraging distributed data and preserving data privacy. However, due to the distributed characteristic, FL systems are vulnerable to Byzantine attacks that compromised clients attack the global model by uploading malicious model updates. With the development of layer-level and parameter-level fine-grained attacks, the attacks' stealthines… ▽ More

    Submitted 18 July, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: Accepted by ECCV2024

  26. arXiv:2311.17967  [pdf, other

    cs.CV astro-ph.IM cs.LG

    Discovering Galaxy Features via Dataset Distillation

    Authors: Haowen Guan, Xuan Zhao, Zishi Wang, Zhiyang Li, Julia Kempe

    Abstract: In many applications, Neural Nets (NNs) have classification performance on par or even exceeding human capacity. Moreover, it is likely that NNs leverage underlying features that might differ from those humans perceive to classify. Can we "reverse-engineer" pertinent features to enhance our scientific understanding? Here, we apply this idea to the notoriously difficult task of galaxy classificatio… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: Accepted to NeurIPS Workshop on Machine Learning and the Physical Sciences, 2023

  27. arXiv:2311.14975  [pdf, other

    cs.LG cs.DC

    Eliminating Domain Bias for Federated Learning in Representation Space

    Authors: Jianqing Zhang, Yang Hua, Jian Cao, Hao Wang, Tao Song, Zhengui Xue, Ruhui Ma, Haibing Guan

    Abstract: Recently, federated learning (FL) is popular for its privacy-preserving and collaborative learning abilities. However, under statistically heterogeneous scenarios, we observe that biased data domains on clients cause a representation bias phenomenon and further degenerate generic representations during local training, i.e., the representation degeneration phenomenon. To address these issues, we pr… ▽ More

    Submitted 25 November, 2023; originally announced November 2023.

    Comments: Accepted by NeurIPS 2023, 24 pages

  28. arXiv:2310.19112  [pdf, other

    cs.CV cs.AI cs.LG

    Efficient IoT Inference via Context-Awareness

    Authors: Mohammad Mehdi Rastikerdar, Jin Huang, Shiwei Fang, Hui Guan, Deepak Ganesan

    Abstract: While existing strategies to execute deep learning-based classification on low-power platforms assume the models are trained on all classes of interest, this paper posits that adopting context-awareness i.e. narrowing down a classification task to the current deployment context consisting of only recent inference queries can substantially enhance performance in resource-constrained environments. W… ▽ More

    Submitted 3 December, 2023; v1 submitted 29 October, 2023; originally announced October 2023.

    Comments: 12 pages, 8 figures

  29. arXiv:2309.04755  [pdf, other

    cs.CE cs.AI eess.SP physics.flu-dyn

    Towards Real-time Training of Physics-informed Neural Networks: Applications in Ultrafast Ultrasound Blood Flow Imaging

    Authors: Haotian Guan, Jinping Dong, Wei-Ning Lee

    Abstract: Physics-informed Neural Network (PINN) is one of the most preeminent solvers of Navier-Stokes equations, which are widely used as the governing equation of blood flow. However, current approaches, relying on full Navier-Stokes equations, are impractical for ultrafast Doppler ultrasound, the state-of-the-art technique for depiction of complex blood flow dynamics \emph{in vivo} through acquired thou… ▽ More

    Submitted 9 September, 2023; originally announced September 2023.

  30. arXiv:2309.01957  [pdf, other

    cs.DB

    Automatic Data Transformation Using Large Language Model: An Experimental Study on Building Energy Data

    Authors: Ankita Sharma, Xuanmao Li, Hong Guan, Guoxin Sun, Liang Zhang, Lanjun Wang, Kesheng Wu, Lei Cao, Erkang Zhu, Alexander Sim, Teresa Wu, Jia Zou

    Abstract: Existing approaches to automatic data transformation are insufficient to meet the requirements in many real-world scenarios, such as the building sector. First, there is no convenient interface for domain experts to provide domain knowledge easily. Second, they require significant training data collection overheads. Third, the accuracy suffers from complicated schema changes. To bridge this gap, w… ▽ More

    Submitted 6 September, 2023; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: 10 pages, 7 figures

    Journal ref: 2023 IEEE International Conference on Big Data

  31. arXiv:2308.10279  [pdf, other

    cs.LG cs.CR cs.DC

    GPFL: Simultaneously Learning Global and Personalized Feature Information for Personalized Federated Learning

    Authors: Jianqing Zhang, Yang Hua, Hao Wang, Tao Song, Zhengui Xue, Ruhui Ma, Jian Cao, Haibing Guan

    Abstract: Federated Learning (FL) is popular for its privacy-preserving and collaborative learning capabilities. Recently, personalized FL (pFL) has received attention for its ability to address statistical heterogeneity and achieve personalization in FL. However, from the perspective of feature extraction, most existing pFL methods only focus on extracting global or personalized feature information during… ▽ More

    Submitted 13 October, 2023; v1 submitted 20 August, 2023; originally announced August 2023.

    Comments: Accepted by ICCV2023

  32. arXiv:2308.02681  [pdf, other

    cs.CY

    MARTA Reach: Piloting an On-Demand Multimodal Transit System in Atlanta

    Authors: Pascal Van Hentenryck, Connor Riley, Anthony Trasatti, Hongzhao Guan, Tejas Santanam, Jorge A. Huertas, Kevin Dalmeijer, Kari Watkins, Juwon Drake, Samson Baskin

    Abstract: This paper reports on the results of the six-month pilot MARTA Reach, which aimed to demonstrate the potential value of On-Demand Multimodal Transit Systems (ODMTS) in the city of Atlanta, Georgia. ODMTS take a transit-centric view by integrating on-demand services and traditional fixed routes in order to address the first/last mile problem. ODMTS combine fixed routes and on-demand shuttle service… ▽ More

    Submitted 23 September, 2023; v1 submitted 4 August, 2023; originally announced August 2023.

  33. arXiv:2307.11339  [pdf, other

    cs.DC

    Chrion: Optimizing Recurrent Neural Network Inference by Collaboratively Utilizing CPUs and GPUs

    Authors: Zinuo Cai, Hao Wang, Tao Song, Yang Hua, Ruhui Ma, Haibing Guan

    Abstract: Deploying deep learning models in cloud clusters provides efficient and prompt inference services to accommodate the widespread application of deep learning. These clusters are usually equipped with host CPUs and accelerators with distinct responsibilities to handle serving requests, i.e. generalpurpose CPUs for input preprocessing and domain-specific GPUs for forward computation. Recurrent neural… ▽ More

    Submitted 21 July, 2023; originally announced July 2023.

  34. FedCP: Separating Feature Information for Personalized Federated Learning via Conditional Policy

    Authors: Jianqing Zhang, Yang Hua, Hao Wang, Tao Song, Zhengui Xue, Ruhui Ma, Haibing Guan

    Abstract: Recently, personalized federated learning (pFL) has attracted increasing attention in privacy protection, collaborative learning, and tackling statistical heterogeneity among clients, e.g., hospitals, mobile smartphones, etc. Most existing pFL methods focus on exploiting the global information and personalized information in the client-level model parameters while neglecting that data is the sourc… ▽ More

    Submitted 28 October, 2023; v1 submitted 1 July, 2023; originally announced July 2023.

    Comments: Accepted by KDD 2023

  35. arXiv:2306.08907  [pdf

    q-bio.BM cs.LG

    MCPI: Integrating Multimodal Data for Enhanced Prediction of Compound Protein Interactions

    Authors: Li Zhang, Wenhao Li, Haotian Guan, Zhiquan He, Mingjun Cheng, Han Wang

    Abstract: The identification of compound-protein interactions (CPI) plays a critical role in drug screening, drug repurposing, and combination therapy studies. The effectiveness of CPI prediction relies heavily on the features extracted from both compounds and target proteins. While various prediction methods employ different feature combinations, both molecular-based and network-based models encounter the… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: 12 pages, 9 figures

  36. arXiv:2306.05980  [pdf, other

    cs.CV eess.IV

    Federated Learning for Medical Image Analysis: A Survey

    Authors: Hao Guan, Pew-Thian Yap, Andrea Bozoki, Mingxia Liu

    Abstract: Machine learning in medical imaging often faces a fundamental dilemma, namely, the small sample size problem. Many recent studies suggest using multi-domain data pooled from different acquisition sites/centers to improve statistical power. However, medical images from different sites cannot be easily shared to build large datasets for model training due to privacy protection reasons. As a promisin… ▽ More

    Submitted 7 July, 2024; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: 35 pages, 6 figures

    Journal ref: Pattern Recognition, volume 151, 2024

  37. arXiv:2306.04286  [pdf, other

    cs.SD cs.AI eess.AS

    A Mask Free Neural Network for Monaural Speech Enhancement

    Authors: Liang Liu, Haixin Guan, Jinlong Ma, Wei Dai, Guangyong Wang, Shaowei Ding

    Abstract: In speech enhancement, the lack of clear structural characteristics in the target speech phase requires the use of conservative and cumbersome network frameworks. It seems difficult to achieve competitive performance using direct methods and simple network architectures. However, we propose the MFNet, a direct and simple network that can not only map speech but also map reverse noise. This network… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

  38. arXiv:2305.12066  [pdf, other

    cs.LG

    Multi-Task Models Adversarial Attacks

    Authors: Lijun Zhang, Xiao Liu, Kaleel Mahmood, Caiwen Ding, Hui Guan

    Abstract: Multi-Task Learning (MTL) involves developing a singular model, known as a multi-task model, to concurrently perform multiple tasks. While the security of single-task models has been thoroughly studied, multi-task models pose several critical security questions, such as 1) their vulnerability to single-task adversarial attacks, 2) the possibility of designing attacks that target multiple tasks, an… ▽ More

    Submitted 27 December, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: 19 pages, 6 figures

  39. arXiv:2304.13030  [pdf, other

    cs.CV

    CompletionFormer: Depth Completion with Convolutions and Vision Transformers

    Authors: Zhang Youmin, Guo Xianda, Poggi Matteo, Zhu Zheng, Huang Guan, Mattoccia Stefano

    Abstract: Given sparse depths and the corresponding RGB images, depth completion aims at spatially propagating the sparse measurements throughout the whole image to get a dense depth prediction. Despite the tremendous progress of deep-learning-based depth completion methods, the locality of the convolutional layer or graph model makes it hard for the network to model the long-range relationship between pixe… ▽ More

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: Accepted by CVPR 2023. Code: https://github.com/youmi-zym/CompletionFormer. Project: https://youmi-zym.github.io/projects/CompletionFormer/

  40. arXiv:2304.06840  [pdf, other

    cs.LG cs.AI

    Structured Pruning for Multi-Task Deep Neural Networks

    Authors: Siddhant Garg, Lijun Zhang, Hui Guan

    Abstract: Although multi-task deep neural network (DNN) models have computation and storage benefits over individual single-task DNN models, they can be further optimized via model compression. Numerous structured pruning methods are already developed that can readily achieve speedups in single-task models, but the pruning of multi-task networks has not yet been extensively studied. In this work, we investi… ▽ More

    Submitted 13 April, 2023; originally announced April 2023.

  41. arXiv:2303.13775  [pdf, other

    cs.DC cs.LG

    GSplit: Scaling Graph Neural Network Training on Large Graphs via Split-Parallelism

    Authors: Sandeep Polisetty, Juelin Liu, Kobi Falus, Yi Ren Fung, Seung-Hwan Lim, Hui Guan, Marco Serafini

    Abstract: Graph neural networks (GNNs), an emerging class of machine learning models for graphs, have gained popularity for their superior performance in various graph analytical tasks. Mini-batch training is commonly used to train GNNs on large graphs, and data parallelism is the standard approach to scale mini-batch training across multiple GPUs. One of the major performance costs in GNN training is the l… ▽ More

    Submitted 27 June, 2024; v1 submitted 23 March, 2023; originally announced March 2023.

  42. Rethinking the backbone architecture for tiny object detection

    Authors: Jinlai Ning, Haoyan Guan, Michael Spratling

    Abstract: Tiny object detection has become an active area of research because images with tiny targets are common in several important real-world scenarios. However, existing tiny object detection methods use standard deep neural networks as their backbone architecture. We argue that such backbones are inappropriate for detecting tiny objects as they are designed for the classification of larger objects, an… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

    Journal ref: In Proceedings of the 18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 5: VISAPP2023, pages 103-114

  43. arXiv:2303.09013  [pdf

    cs.RO cs.LG

    Self-Inspection Method of Unmanned Aerial Vehicles in Power Plants Using Deep Q-Network Reinforcement Learning

    Authors: Haoran Guan

    Abstract: For the purpose of inspecting power plants, autonomous robots can be built using reinforcement learning techniques. The method replicates the environment and employs a simple reinforcement learning (RL) algorithm. This strategy might be applied in several sectors, including the electricity generation sector. A pre-trained model with perception, planning, and action is suggested by the research. To… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: Submitted to the International Conference on Artificial Intelligence, Robotics, and Control (AIRC 2023)

  44. arXiv:2302.04578  [pdf, other

    cs.CV cs.AI cs.CR cs.LG

    Adversarial Example Does Good: Preventing Painting Imitation from Diffusion Models via Adversarial Examples

    Authors: Chumeng Liang, Xiaoyu Wu, Yang Hua, Jiaru Zhang, Yiming Xue, Tao Song, Zhengui Xue, Ruhui Ma, Haibing Guan

    Abstract: Recently, Diffusion Models (DMs) boost a wave in AI for Art yet raise new copyright concerns, where infringers benefit from using unauthorized paintings to train DMs to generate novel paintings in a similar style. To address these emerging copyright violations, in this paper, we are the first to explore and propose to utilize adversarial examples for DMs to protect human-created artworks. Specific… ▽ More

    Submitted 6 June, 2023; v1 submitted 9 February, 2023; originally announced February 2023.

    Comments: Accepted by ICML2023 (Oral)

  45. arXiv:2302.04430  [pdf, other

    cs.DB cs.LG cs.PF

    A Comparison of Decision Forest Inference Platforms from A Database Perspective

    Authors: Hong Guan, Mahidhar Reddy Dwarampudi, Venkatesh Gunda, Hong Min, Lei Yu, Jia Zou

    Abstract: Decision forest, including RandomForest, XGBoost, and LightGBM, is one of the most popular machine learning techniques used in many industrial scenarios, such as credit card fraud detection, ranking, and business intelligence. Because the inference process is usually performance-critical, a number of frameworks were developed and dedicated for decision forest inference, such as ONNX, TreeLite from… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

  46. arXiv:2302.00564  [pdf, other

    cs.LG stat.ML

    Automatically Marginalized MCMC in Probabilistic Programming

    Authors: Jinlin Lai, Javier Burroni, Hui Guan, Daniel Sheldon

    Abstract: Hamiltonian Monte Carlo (HMC) is a powerful algorithm to sample latent variables from Bayesian models. The advent of probabilistic programming languages (PPLs) frees users from writing inference algorithms and lets users focus on modeling. However, many models are difficult for HMC to solve directly, and often require tricks like model reparameterization. We are motivated by the fact that many of… ▽ More

    Submitted 1 June, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

    Comments: Accepted to the 40th International Conference on Machine Learning (ICML 2023)

  47. FedALA: Adaptive Local Aggregation for Personalized Federated Learning

    Authors: Jianqing Zhang, Yang Hua, Hao Wang, Tao Song, Zhengui Xue, Ruhui Ma, Haibing Guan

    Abstract: A key challenge in federated learning (FL) is the statistical heterogeneity that impairs the generalization of the global model on each client. To address this, we propose a method Federated learning with Adaptive Local Aggregation (FedALA) by capturing the desired information in the global model for client models in personalized FL. The key component of FedALA is an Adaptive Local Aggregation (AL… ▽ More

    Submitted 17 September, 2023; v1 submitted 2 December, 2022; originally announced December 2022.

    Comments: Accepted by AAAI 2023

  48. arXiv:2211.15281  [pdf, other

    cs.LG

    Flow: Per-Instance Personalized Federated Learning Through Dynamic Routing

    Authors: Kunjal Panchal, Sunav Choudhary, Nisarg Parikh, Lijun Zhang, Hui Guan

    Abstract: Personalization in Federated Learning (FL) aims to modify a collaboratively trained global model according to each client. Current approaches to personalization in FL are at a coarse granularity, i.e. all the input instances of a client use the same personalized model. This ignores the fact that some instances are more accurately handled by the global model due to better generalizability. To addre… ▽ More

    Submitted 10 February, 2024; v1 submitted 28 November, 2022; originally announced November 2022.

    Comments: 37th Annual Conference on Neural Information Processing Systems (NeurIPS), 2023

  49. arXiv:2211.08615  [pdf, other

    cs.CV

    GLFF: Global and Local Feature Fusion for AI-synthesized Image Detection

    Authors: Yan Ju, Shan Jia, Jialing Cai, Haiying Guan, Siwei Lyu

    Abstract: With the rapid development of deep generative models (such as Generative Adversarial Networks and Diffusion models), AI-synthesized images are now of such high quality that humans can hardly distinguish them from pristine ones. Although existing detection methods have shown high performance in specific evaluation settings, e.g., on images from seen models or on images without real-world post-proce… ▽ More

    Submitted 4 September, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

    Comments: 13 pages, 6 figures, 8 tables

  50. arXiv:2210.12055  [pdf, other

    cs.CV

    Query Semantic Reconstruction for Background in Few-Shot Segmentation

    Authors: Haoyan Guan, Michael Spratling

    Abstract: Few-shot segmentation (FSS) aims to segment unseen classes using a few annotated samples. Typically, a prototype representing the foreground class is extracted from annotated support image(s) and is matched to features representing each pixel in the query image. However, models learnt in this way are insufficiently discriminatory, and often produce false positives: misclassifying background pixels… ▽ More

    Submitted 21 December, 2022; v1 submitted 21 October, 2022; originally announced October 2022.