Showing 1–50 of 1,367 results for author: Cao, X

  1. arXiv:2410.12987  [pdf, other]

    astro-ph.CO

    Galaxy Mass Modelling from Multi-Wavelength JWST Strong Lens Analysis: Dark Matter Substructure, Angular Mass Complexity, or Both?

    Authors: Samuel C. Lange, Aristeidis Amvrosiadis, James W. Nightingale, Qiuhan He, Carlos S. Frenk, Andrew Robertson, Shaun Cole, Richard Massey, Xiaoyue Cao, Ran Li, Kaihao Wang

    Abstract: We analyze two galaxy-scale strong gravitational lenses, SPT0418-47 and SPT2147-50, using JWST NIRCam imaging across multiple filters. To account for angular complexity in the lens mass distribution, we introduce multipole perturbations with orders $m=1, 3, 4$. Our results show strong evidence for angular mass complexity in SPT2147, with multipole strengths of 0.3-1.7 $\%$ for $m=3, 4$ and 2.4-9.5…

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: 16 pages, 11 figures, submitted to MNRAS

  2. arXiv:2410.12138  [pdf, other]

    cs.LG cs.CL

    Preference Optimization with Multi-Sample Comparisons

    Authors: Chaoqi Wang, Zhuokai Zhao, Chen Zhu, Karthik Abinav Sankararaman, Michal Valko, Xuefei Cao, Zhaorun Chen, Madian Khabsa, Yuxin Chen, Hao Ma, Sinong Wang

    Abstract: Recent advancements in generative models, particularly large language models (LLMs) and diffusion models, have been driven by extensive pretraining on large datasets followed by post-training. However, current post-training methods such as reinforcement learning from human feedback (RLHF) and direct alignment from preference methods (DAP) primarily utilize single-sample comparisons. These approach…

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: preprint

  3. arXiv:2410.10365  [pdf, other]

    cs.LG cs.AI

    SpeGCL: Self-supervised Graph Spectrum Contrastive Learning without Positive Samples

    Authors: Yuntao Shou, Xiangyong Cao, Deyu Meng

    Abstract: Graph Contrastive Learning (GCL) excels at managing noise and fluctuations in input data, making it popular in various fields (e.g., social networks and knowledge graphs). Our study finds that the difference in high-frequency information between augmented graphs is greater than that in low-frequency information. However, most existing GCL methods focus mainly on the time domain (low-frequency inf…

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: 13 pages, 3 figures

  4. arXiv:2410.08688  [pdf, other]

    cs.CV cs.AI

    Chain-of-Restoration: Multi-Task Image Restoration Models are Zero-Shot Step-by-Step Universal Image Restorers

    Authors: Jin Cao, Deyu Meng, Xiangyong Cao

    Abstract: Despite previous works typically targeting isolated degradation types, recent research has increasingly focused on addressing composite degradations which involve a complex interplay of multiple different isolated degradations. Recognizing the challenges posed by the exponential number of possible degradation combinations, we propose Universal Image Restoration (UIR), a new task setting that requi…

    Submitted 11 October, 2024; originally announced October 2024.

    Comments: 11 pages, 9 figures

  5. arXiv:2410.07272  [pdf, other]

    cs.LG

    Boosting the Performance of Decentralized Federated Learning via Catalyst Acceleration

    Authors: Qinglun Li, Miao Zhang, Yingqi Liu, Quanjun Yin, Li Shen, Xiaochun Cao

    Abstract: Decentralized Federated Learning has emerged as an alternative to centralized architectures due to its faster training, privacy preservation, and reduced communication overhead. In decentralized communication, the server aggregation phase in Centralized Federated Learning shifts to the client side, which means that clients connect with each other in a peer-to-peer manner. However, compared to the…

    Submitted 9 October, 2024; originally announced October 2024.

    Comments: arXiv admin note: text overlap with arXiv:2410.06482

  6. arXiv:2410.07051  [pdf, other]

    cs.IT quant-ph

    Exponents for Shared Randomness-Assisted Channel Simulation

    Authors: Aadil Oufkir, Michael X. Cao, Hao-Chung Cheng, Mario Berta

    Abstract: We determine the exact error and strong converse exponents of shared randomness-assisted channel simulation in worst case total-variation distance. Namely, we find that these exponents can be written as simple optimizations over the Rényi channel mutual information. Strikingly, and in stark contrast to channel coding, there are no critical rates, allowing a tight characterization for arbitrary rat…

    Submitted 9 October, 2024; originally announced October 2024.

    Comments: 27+6 pages

  7. arXiv:2410.06719  [pdf, other]

    cs.CV cs.AI

    Suppress Content Shift: Better Diffusion Features via Off-the-Shelf Generation Techniques

    Authors: Benyuan Meng, Qianqian Xu, Zitai Wang, Zhiyong Yang, Xiaochun Cao, Qingming Huang

    Abstract: Diffusion models are powerful generative models, and this capability can also be applied to discrimination. The inner activations of a pre-trained diffusion model can serve as features for discriminative tasks, namely, diffusion feature. We discover that diffusion feature has been hindered by a hidden yet universal phenomenon that we call content shift. To be specific, there are content difference…

    Submitted 10 October, 2024; v1 submitted 9 October, 2024; originally announced October 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2410.03558

  8. Experimental coherent-state quantum secret sharing with finite pulses

    Authors: Yuan-Zhuo Wang, Xiao-Ran Sun, Xiao-Yu Cao, Hua-Lei Yin, Zeng-Bing Chen

    Abstract: Quantum secret sharing (QSS) plays a significant role in multiparty quantum communication and is a crucial component of future quantum multiparty computing networks. Therefore, it is highly valuable to develop a QSS protocol that offers both information-theoretic security and validation in real optical systems under a finite-key regime. In this work, we propose a three-user QSS protocol based on p…

    Submitted 8 October, 2024; originally announced October 2024.

    Comments: 14 pages, 5 figures, 5 tables

    Journal ref: Physical Review Applied 22, 044018 (2024)

  9. arXiv:2410.04662  [pdf]

    eess.SY

    Path Planning and Robust Path Tracking Control of an Automated Parallel Parking Maneuver

    Authors: Xincheng Cao, Levent Guvenc

    Abstract: Self driving vehicles should be able to perform parallel parking or a similar maneuver successfully. With this motivation, the S shaped maneuverability test of the Ohio driver license examination is chosen here for automatic execution by a self driving vehicle with drive by wire capability and longitudinal and lateral controls. The Ohio maneuverability test requires the driver to start within an a…

    Submitted 6 October, 2024; originally announced October 2024.

    Comments: 12 pages, 19 figures

  10. arXiv:2410.04313  [pdf]

    cs.RO eess.SY

    Vehicle-in-Virtual-Environment Method for ADAS and Connected and Automated Driving Function Development/Demonstration/Evaluation

    Authors: Xincheng Cao, Haochong Chen, Bilin Aksun-Guvenc, Levent Guvenc

    Abstract: The current approach for new Advanced Driver Assistance System (ADAS) and Connected and Automated Driving (CAD) function development involves a significant amount of public road testing which is inefficient due to the number of miles that need to be driven for rare and extreme events to take place, thereby being very costly also, and unsafe as the rest of the road users become involuntary test subjec…

    Submitted 5 October, 2024; originally announced October 2024.

    Comments: 8 pages, 16 figures

  11. arXiv:2410.04260  [pdf, other]

    math.OC cs.AI cs.RO

    Pareto Control Barrier Function for Inner Safe Set Maximization Under Input Constraints

    Authors: Xiaoyang Cao, Zhe Fu, Alexandre M. Bayen

    Abstract: This article introduces the Pareto Control Barrier Function (PCBF) algorithm to maximize the inner safe set of dynamical systems under input constraints. Traditional Control Barrier Functions (CBFs) ensure safety by maintaining system trajectories within a safe set but often fail to account for realistic input constraints. To address this problem, we leverage the Pareto multi-task learning framewo…

    Submitted 5 October, 2024; originally announced October 2024.

    Comments: Submitted to ACC 2025

  12. arXiv:2410.03558  [pdf, other]

    cs.CV cs.AI

    Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features

    Authors: Benyuan Meng, Qianqian Xu, Zitai Wang, Xiaochun Cao, Qingming Huang

    Abstract: Diffusion models are initially designed for image generation. Recent research shows that the internal signals within their backbones, named activations, can also serve as dense features for various discriminative tasks such as semantic segmentation. Given numerous activations, selecting a small yet effective subset poses a fundamental problem. To this end, the early study of this field performs a…

    Submitted 10 October, 2024; v1 submitted 4 October, 2024; originally announced October 2024.

  13. arXiv:2410.01768  [pdf, other]

    cs.CV

    SegEarth-OV: Towards Traning-Free Open-Vocabulary Segmentation for Remote Sensing Images

    Authors: Kaiyu Li, Ruixun Liu, Xiangyong Cao, Deyu Meng, Zhi Wang

    Abstract: Remote sensing image plays an irreplaceable role in fields such as agriculture, water resources, military, and disaster relief. Pixel-level interpretation is a critical aspect of remote sensing image applications; however, a prevalent limitation remains the need for extensive manual annotation. For this, we try to introduce open-vocabulary semantic segmentation (OVSS) into the remote sensing conte…

    Submitted 2 October, 2024; originally announced October 2024.

  14. arXiv:2410.01226  [pdf, other]

    cs.CV

    Towards Native Generative Model for 3D Head Avatar

    Authors: Yiyu Zhuang, Yuxiao He, Jiawei Zhang, Yanwen Wang, Jiahe Zhu, Yao Yao, Siyu Zhu, Xun Cao, Hao Zhu

    Abstract: Creating 3D head avatars is a significant yet challenging task for many application scenarios. Previous studies have set out to learn 3D human head generative models using massive 2D image data. Although these models are highly generalizable for human appearance, their result models are not 360$^\circ$-renderable, and the predicted 3D geometry is unreliable. Therefore, such results cannot be used i…

    Submitted 2 October, 2024; originally announced October 2024.

  15. arXiv:2409.20453  [pdf, other]

    eess.SP

    E-Healthcare Systems: Integrated Sensing, Computing, and Semantic Communication with Physical Layer Security

    Authors: Yinchao Yang, Zhaohui Yang, Weijie Yuan, Fan Liu, Xiaowen Cao, Chongwen Huang, Zhaoyang Zhang, Mohammad Shikh-Bahaei

    Abstract: This paper introduces an integrated sensing, computing, and semantic communication (ISCSC) framework tailored for smart healthcare systems. The framework is evaluated in the context of smart healthcare, optimising the transmit beamforming matrix and semantic extraction ratio for improved data rates, sensing accuracy, and general data protection regulation (GDPR) compliance, while considering IoRT…

    Submitted 30 September, 2024; originally announced September 2024.

    Comments: This paper has been accepted by GLOBECOM 2024

  16. arXiv:2409.19679  [pdf, other]

    cs.CV

    SemiDDM-Weather: A Semi-supervised Learning Framework for All-in-one Adverse Weather Removal

    Authors: Fang Long, Wenkang Su, Zixuan Li, Lei Cai, Mingjie Li, Yuan-Gen Wang, Xiaochun Cao

    Abstract: Adverse weather removal aims to restore clear vision under adverse weather conditions. Existing methods are mostly tailored for specific weather types and rely heavily on extensive labeled data. In dealing with these two limitations, this paper presents a pioneering semi-supervised all-in-one adverse weather removal framework built on the teacher-student network with a Denoising Diffusion Model (D…

    Submitted 29 September, 2024; originally announced September 2024.

  17. arXiv:2409.19526  [pdf, other]

    cs.CR cs.AI cs.CV cs.LG

    Efficient Backdoor Defense in Multimodal Contrastive Learning: A Token-Level Unlearning Method for Mitigating Threats

    Authors: Kuanrong Liu, Siyuan Liang, Jiawei Liang, Pengwen Dai, Xiaochun Cao

    Abstract: Multimodal contrastive learning uses various data modalities to create high-quality features, but its reliance on extensive data sources on the Internet makes it vulnerable to backdoor attacks. These attacks insert malicious behaviors during training, which are activated by specific triggers during inference, posing significant security risks. Despite existing countermeasures through fine-tuning t…

    Submitted 28 September, 2024; originally announced September 2024.

  18. arXiv:2409.19388  [pdf, ps, other]

    math.AP

    Finite-time blow-up in fully parabolic quasilinear Keller-Segel systems with supercritical exponents

    Authors: Xinru Cao, Mario Fuest

    Abstract: We examine the possibility of finite-time blow-up of solutions to the fully parabolic quasilinear Keller--Segel model \begin{align}\tag{$\star$}\label{prob:star} \begin{cases} u_t = \nabla \cdot ((u+1)^{m-1}\nabla u - u(u+1)^{q-1}\nabla v) & \text{in $Ω\times (0, T)$}, \\ v_t = Δv - v + u & \text{in $Ω\times (0, T)$} \end{cases} \end{align} in a ball $Ω\subset \mathbb R^n$ with $n\geq 2$.…

    Submitted 28 September, 2024; originally announced September 2024.

    Comments: 23 pages

    MSC Class: 35B44 (primary); 35B33; 35K45; 35K59; 92C17 (secondary)

  19. arXiv:2409.19042  [pdf, other]

    eess.AS cs.SD

    Probing mental health information in speech foundation models

    Authors: Marc de Gennes, Adrien Lesage, Martin Denais, Xuan-Nga Cao, Simon Chang, Pierre Van Remoortere, Cyrille Dakhlia, Rachid Riad

    Abstract: Non-invasive methods for diagnosing mental health conditions, such as speech analysis, offer promising potential in modern medicine. Recent advancements in machine learning, particularly speech foundation models, have shown significant promise in detecting mental health states by capturing diverse features. This study investigates which pretext tasks in these models best transfer to mental health…

    Submitted 27 September, 2024; originally announced September 2024.

    Comments: 6 pages, 4 figures

  20. arXiv:2409.17681  [pdf, other]

    cs.NI cs.CY

    Computation Pre-Offloading for MEC-Enabled Vehicular Networks via Trajectory Prediction

    Authors: Ting Zhang, Bo Yang, Zhiwen Yu, Xuelin Cao, George C. Alexandropoulos, Yan Zhang, Chau Yuen

    Abstract: Task offloading is of paramount importance to efficiently orchestrate vehicular wireless networks, necessitating the availability of information regarding the current network status and computational resources. However, due to the mobility of the vehicles and the limited computational resources for performing task offloading in near-real-time, such schemes may require high latency, thus, become ev…

    Submitted 26 September, 2024; originally announced September 2024.

  21. arXiv:2409.17634  [pdf, other]

    cs.CV cs.AI

    P4Q: Learning to Prompt for Quantization in Visual-language Models

    Authors: Huixin Sun, Runqi Wang, Yanjing Li, Xianbin Cao, Xiaolong Jiang, Yao Hu, Baochang Zhang

    Abstract: Large-scale pre-trained Vision-Language Models (VLMs) have gained prominence in various visual and multimodal tasks, yet the deployment of VLMs on downstream application platforms remains challenging due to their prohibitive requirements of training samples and computing resources. Fine-tuning and quantization of VLMs can substantially reduce the sample and computation costs, which are in urgent n…

    Submitted 26 September, 2024; originally announced September 2024.

  22. arXiv:2409.17601  [pdf, other]

    cs.CV cs.AI

    TA-Cleaner: A Fine-grained Text Alignment Backdoor Defense Strategy for Multimodal Contrastive Learning

    Authors: Yuan Xun, Siyuan Liang, Xiaojun Jia, Xinwei Liu, Xiaochun Cao

    Abstract: Pre-trained large models for multimodal contrastive learning, such as CLIP, have been widely recognized in the industry as highly susceptible to data-poisoned backdoor attacks. This poses significant risks to downstream model training. In response to such potential threats, finetuning offers a simpler and more efficient defense choice compared to retraining large models with augmented data. In the…

    Submitted 7 October, 2024; v1 submitted 26 September, 2024; originally announced September 2024.

  23. arXiv:2409.17058  [pdf, other]

    cs.CV

    Degradation-Guided One-Step Image Super-Resolution with Diffusion Priors

    Authors: Aiping Zhang, Zongsheng Yue, Renjing Pei, Wenqi Ren, Xiaochun Cao

    Abstract: Diffusion-based image super-resolution (SR) methods have achieved remarkable success by leveraging large pre-trained text-to-image diffusion models as priors. However, these methods still face two challenges: the requirement for dozens of sampling steps to achieve satisfactory results, which limits efficiency in real scenarios, and the neglect of degradation models, which are critical auxiliary in…

    Submitted 25 September, 2024; originally announced September 2024.

    Comments: The code is available at https://github.com/ArcticHare105/S3Diff

  24. arXiv:2409.15968  [pdf, other]

    cs.CV

    Adversarial Backdoor Defense in CLIP

    Authors: Junhao Kuang, Siyuan Liang, Jiawei Liang, Kuanrong Liu, Xiaochun Cao

    Abstract: Multimodal contrastive pretraining, exemplified by models like CLIP, has been found to be vulnerable to backdoor attacks. While current backdoor defense methods primarily employ conventional data augmentation to create augmented samples aimed at feature alignment, these methods fail to capture the distinct features of backdoor samples, resulting in suboptimal defense performance. Observations reve…

    Submitted 24 September, 2024; originally announced September 2024.

  25. arXiv:2409.15316  [pdf, other]

    cs.HC

    Towards Social AI: A Survey on Understanding Social Interactions

    Authors: Sangmin Lee, Minzhi Li, Bolin Lai, Wenqi Jia, Fiona Ryan, Xu Cao, Ozgur Kara, Bikram Boote, Weiyan Shi, Diyi Yang, James M. Rehg

    Abstract: Social interactions form the foundation of human societies. Artificial intelligence has made significant progress in certain areas, but enabling machines to seamlessly understand social interactions remains an open challenge. It is important to address this gap by endowing machines with social capabilities. We identify three key capabilities needed for effective social understanding: 1) understand…

    Submitted 30 September, 2024; v1 submitted 5 September, 2024; originally announced September 2024.

  26. arXiv:2409.13349  [pdf, other]

    cs.CV cs.AI

    ID-Guard: A Universal Framework for Combating Facial Manipulation via Breaking Identification

    Authors: Zuomin Qu, Wei Lu, Xiangyang Luo, Qian Wang, Xiaochun Cao

    Abstract: The misuse of deep learning-based facial manipulation poses a potential threat to civil rights. To prevent this fraud at its source, proactive defense technology was proposed to disrupt the manipulation process by adding invisible adversarial perturbations into images, making the forged output unconvincing to the observer. However, their non-directional disruption of the output may result in the r…

    Submitted 20 September, 2024; originally announced September 2024.

  27. arXiv:2409.13268  [pdf, other]

    cs.CV

    JoyHallo: Digital human model for Mandarin

    Authors: Sheng Shi, Xuyang Cao, Jun Zhao, Guoxin Wang

    Abstract: In audio-driven video generation, creating Mandarin videos presents significant challenges. Collecting comprehensive Mandarin datasets is difficult, and the complex lip movements in Mandarin further complicate model training compared to English. In this study, we collected 29 hours of Mandarin speech video from JD Health International Inc. employees, resulting in the jdh-Hallo dataset. This datase…

    Submitted 20 September, 2024; originally announced September 2024.

  28. arXiv:2409.13123  [pdf, ps, other]

    math.DG

    Geometry and Analysis of Gradient Ricci Solitons in Dimension Four

    Authors: Xiaodong Cao, Hung Tran

    Abstract: [Dedicated to Richard S. Hamilton on forty years of Ricci flow] Gradient Ricci solitons have garnered significant attention both as self-similar solutions and singularity models of the Ricci flow. This survey article starts with a list of examples; it also provides some geometric aspects of gradient Ricci solitons, including various asymptotic behaviors; finally, it discusses some recent results o…

    Submitted 19 September, 2024; originally announced September 2024.

    Comments: survey paper

  29. arXiv:2409.12470  [pdf, other]

    cs.CV eess.IV

    HSIGene: A Foundation Model For Hyperspectral Image Generation

    Authors: Li Pang, Datao Tang, Shuang Xu, Deyu Meng, Xiangyong Cao

    Abstract: Hyperspectral image (HSI) plays a vital role in various fields such as agriculture and environmental monitoring. However, due to the expensive acquisition cost, the number of hyperspectral images is limited, degenerating the performance of downstream tasks. Although some recent studies have attempted to employ diffusion models to synthesize HSIs, they still struggle with the scarcity of HSIs, affe…

    Submitted 19 September, 2024; originally announced September 2024.

  30. arXiv:2409.12448  [pdf, other]

    cs.CV

    Infrared Small Target Detection in Satellite Videos: A New Dataset and A Novel Recurrent Feature Refinement Framework

    Authors: Xinyi Ying, Li Liu, Zaipin Lin, Yangsi Shi, Yingqian Wang, Ruojing Li, Xu Cao, Boyang Li, Shilin Zhou

    Abstract: Multi-frame infrared small target (MIRST) detection in satellite videos is a long-standing, fundamental yet challenging task for decades, and the challenges can be summarized as: First, extremely small target size, highly complex clutters & noises, various satellite motions result in limited feature representation, high false alarms, and difficult motion analyses. Second, the lack of large-scale p…

    Submitted 4 October, 2024; v1 submitted 18 September, 2024; originally announced September 2024.

  31. arXiv:2409.10223  [pdf, other]

    math.DS

    A cytokine-enhanced viral infection model with CTL immune response, distributed delay and saturation incidence

    Authors: Xiaodong Cao, Songbo Hou, Xiaoqing Kong

    Abstract: In this paper, we propose a delayed cytokine-enhanced viral infection model incorporating saturation incidence and immune response. We compute the basic reproduction numbers and introduce a convex cone to discuss the impact of non-negative initial data on solutions. By defining appropriate Lyapunov functionals and employing LaSalle's invariance principle, we investigate the stability of three equi…

    Submitted 16 September, 2024; originally announced September 2024.

    Comments: 20 pages

    MSC Class: 60H10; 92D30

  32. arXiv:2409.06411  [pdf, other]

    cs.LG cs.CL

    Length Desensitization in Directed Preference Optimization

    Authors: Wei Liu, Yang Bai, Chengcheng Han, Rongxiang Weng, Jun Xu, Xuezhi Cao, Jingang Wang, Xunliang Cai

    Abstract: Direct Preference Optimization (DPO) is widely utilized in the Reinforcement Learning from Human Feedback (RLHF) phase to align Large Language Models (LLMs) with human preferences, thereby enhancing both their harmlessness and efficacy. However, it has been observed that DPO tends to over-optimize for verbosity, which can detrimentally affect both performance and user experience. In this paper, we…

    Submitted 10 September, 2024; originally announced September 2024.

    Comments: 21 pages, 9 figures

  33. arXiv:2409.06324  [pdf, other]

    cs.CV

    SDF-Net: A Hybrid Detection Network for Mediastinal Lymph Node Detection on Contrast CT Images

    Authors: Jiuli Xiong, Lanzhuju Mei, Jiameng Liu, Dinggang Shen, Zhong Xue, Xiaohuan Cao

    Abstract: Accurate lymph node detection and quantification are crucial for cancer diagnosis and staging on contrast-enhanced CT images, as they impact treatment planning and prognosis. However, detecting lymph nodes in the mediastinal area poses challenges due to their low contrast, irregular shapes and dispersed distribution. In this paper, we propose a Swin-Det Fusion Network (SDF-Net) to effectively dete…

    Submitted 10 September, 2024; originally announced September 2024.

    Comments: 10 pages, 4 figures

  34. arXiv:2409.06323  [pdf, other]

    cs.LG cs.AI cs.SI

    LAMP: Learnable Meta-Path Guided Adversarial Contrastive Learning for Heterogeneous Graphs

    Authors: Siqing Li, Jin-Duk Park, Wei Huang, Xin Cao, Won-Yong Shin, Zhiqiang Xu

    Abstract: Heterogeneous graph neural networks (HGNNs) have significantly propelled the information retrieval (IR) field. Still, the effectiveness of HGNNs heavily relies on high-quality labels, which are often expensive to acquire. This challenge has shifted attention towards Heterogeneous Graph Contrastive Learning (HGCL), which usually requires pre-defined meta-paths. However, our findings reveal that met…

    Submitted 10 September, 2024; originally announced September 2024.

    Comments: 19 pages, 7 figures

  35. arXiv:2409.03450  [pdf, other]

    cond-mat.quant-gas nlin.PS

    Dissipative Nonlinear Thouless Pumping of Temporal Solitons

    Authors: Xuzhen Cao, Chunyu Jia, Ying Hu, Zhaoxin Liang

    Abstract: The interplay between topology and soliton is a central topic in nonlinear topological physics. So far, most studies have been confined to conservative settings. Here, we explore Thouless pumping of dissipative temporal solitons in a nonconservative one-dimensional optical system with gain and spectral filtering, described by the paradigmatic complex Ginzburg-Landau equation. Two dissipatively ind…

    Submitted 5 September, 2024; originally announced September 2024.

    Comments: 6 pages, 3 figures

  36. arXiv:2409.01012  [pdf, other]

    cs.IR cs.LG

    Improved Diversity-Promoting Collaborative Metric Learning for Recommendation

    Authors: Shilong Bao, Qianqian Xu, Zhiyong Yang, Yuan He, Xiaochun Cao, Qingming Huang

    Abstract: Collaborative Metric Learning (CML) has recently emerged as a popular method in recommendation systems (RS), closing the gap between metric learning and collaborative filtering. Following the convention of RS, existing practices exploit unique user representation in their model design. This paper focuses on a challenging scenario where a user has multiple categories of interests. Under this settin…

    Submitted 2 September, 2024; originally announced September 2024.

    Comments: arXiv admin note: text overlap with arXiv:2209.15292

  37. arXiv:2409.00410  [pdf, other]

    cs.CV

    A Hybrid Transformer-Mamba Network for Single Image Deraining

    Authors: Shangquan Sun, Wenqi Ren, Juxiang Zhou, Jianhou Gan, Rui Wang, Xiaochun Cao

    Abstract: Existing deraining Transformers employ self-attention mechanisms with fixed-range windows or along channel dimensions, limiting the exploitation of non-local receptive fields. In response to this issue, we introduce a novel dual-branch hybrid Transformer-Mamba network, denoted as TransMamba, aimed at effectively capturing long-range rain-related dependencies. Based on the prior of distinct spectra…

    Submitted 31 August, 2024; originally announced September 2024.

    Comments: 12 pages, 9 figures

  38. arXiv:2408.17333  [pdf, other]

    math.NA math.AP

    Subspace Diffusion Posterior Sampling for Travel-Time Tomography

    Authors: Xiang Cao, Xiaoqun Zhang

    Abstract: Diffusion models have been widely studied as effective generative tools for solving inverse problems. The main ideas focus on performing the reverse sampling process conditioned on noisy measurements, using well-established numerical solvers for gradient updates. Although diffusion-based sampling methods can produce high-quality reconstructions, challenges persist in nonlinear PDE-based inverse pr…

    Submitted 30 August, 2024; originally announced August 2024.

    Comments: 20 pages, 8 figures, 2 tables

  39. arXiv:2408.15825  [pdf, other]

    hep-ph

    A comprehensive study of axion photoproduction off the nucleon in chiral effective field theory

    Authors: Xiong-Hui Cao, Zhi-Hui Guo

    Abstract: We calculate the amplitudes of the axion photoproduction off the nucleon, i.e., $γN \to a N$, within the framework of chiral effective field theory. Several different types of contributions are simultaneously included in our calculation, namely the nucleon exchanges up to next-to-leading order, the $aγγ$ vertex and the vector meson exchanges in the $t$-channel. We utilize the existing hadronic inp…

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: 25 pages, 9 figures

  40. arXiv:2408.15020  [pdf, other]

    cs.CV

    Hierarchical Graph Interaction Transformer with Dynamic Token Clustering for Camouflaged Object Detection

    Authors: Siyuan Yao, Hao Sun, Tian-Zhu Xiang, Xiao Wang, Xiaochun Cao

    Abstract: Camouflaged object detection (COD) aims to identify the objects that seamlessly blend into the surrounding backgrounds. Due to the intrinsic similarity between the camouflaged objects and the background region, it is extremely challenging to precisely distinguish the camouflaged objects by existing approaches. In this paper, we propose a hierarchical graph interaction network termed HGINet for cam…

    Submitted 21 September, 2024; v1 submitted 27 August, 2024; originally announced August 2024.

    Comments: Accepted by IEEE Transactions on Image Processing

  41. arXiv:2408.13862  [pdf, ps, other]

    nucl-th

    Bubble $^{36}$Ar and Its New Breathing Modes

    Authors: Ge Ren, Chun-Wang Ma, Xi-Guang Cao, Yu-Gang Ma

    Abstract: The bubble nuclei are important components of exotic nuclear structures characterized by special depletions of central densities. Focusing on bubble structures of $^{36}$Ar, the characterizations of bubble nuclei were explored with the framework of the extended quantum molecular dynamics model. Three density distribution modes were uncovered for the first time, i.e. micro-bubble, bubble, and clust…

    Submitted 25 August, 2024; originally announced August 2024.

  42. arXiv:2408.13793  [pdf, ps, other]

    math.AP physics.optics

    Optical Inversion Using Plasmonic Contrast Agents

    Authors: Xinlin Cao, Ahcene Ghandriche, Mourad Sini

    Abstract: We describe a new method to reconstruct the permittivity distribution, of an object to image, from the remotely measured electromagnetic field. We propose to use the remote fields measured before and after injecting locally in the medium plasmonic nano-particles. Such a technique is known in the framework of imaging using contrast agents where, in optical imaging, the nano-particles play the role…

    Submitted 25 August, 2024; originally announced August 2024.

  43. arXiv:2408.13050  [pdf, other]

    cond-mat.str-el

    Vision Transformer Neural Quantum States for Impurity Models

    Authors: Xiaodong Cao, Zhicheng Zhong, Yi Lu

    Abstract: Transformer neural networks, known for their ability to recognize complex patterns in high-dimensional data, offer a promising framework for capturing many-body correlations in quantum systems. We employ an adapted Vision Transformer (ViT) architecture to model quantum impurity models, optimizing it with a subspace expansion scheme that surpasses conventional variational Monte Carlo in both accura…

    Submitted 23 August, 2024; originally announced August 2024.

  44. arXiv:2408.09689  [pdf, other]

    hep-ph nucl-th

    Gravitational form factor $D$ of charmonium from shear stress

    Authors: Tianyang Hu, Xianghui Cao, Siqi Xu, Yang Li, Xingbo Zhao, James P. Vary

    Abstract: Based on our recent analysis of the hadronic matrix element of the stress-energy tensor in covariant light front dynamics, we extract the charmonium gravitational form factor $D(Q^2)$ from shear stress $T^{12}$. This is in contrast to our recent work using the (light-front) energy density $T^{+-}$. Indeed, by comparing these two currents, we identify terms that are responsible for the violation of…

    Submitted 18 August, 2024; originally announced August 2024.

    Comments: 9 pages, 4 figures

  45. arXiv:2408.09535  [pdf, other]

    hep-ph hep-th nucl-th

    Dissecting a strongly coupled scalar nucleon

    Authors: Xianghui Cao, Yang Li, James P. Vary

    Abstract: We continue our investigation of the stress within a strongly coupled scalar nucleon, and now dissect the gravitational form factors into contributions from its constituents, the (mock) nucleon and the (mock) pion. The computation is based on a non-perturbative solution of the scalar Yukawa model in the light-front Hamiltonian formalism with a Fock sector expansion including up to one nucleon and…

    Submitted 18 August, 2024; originally announced August 2024.

    Comments: 11 pages, 4 figures

  46. arXiv:2408.09144  [pdf, other]

    cs.CV

    SSNeRF: Sparse View Semi-supervised Neural Radiance Fields with Augmentation

    Authors: Xiao Cao, Beibei Lin, Bo Wang, Zhiyong Huang, Robby T. Tan

    Abstract: Sparse view NeRF is challenging because limited input images lead to an under constrained optimization problem for volume rendering. Existing methods address this issue by relying on supplementary information, such as depth maps. However, generating this supplementary information accurately remains problematic and often leads to NeRF producing images with undesired artifacts. To address these arti…

    Submitted 17 August, 2024; originally announced August 2024.

  47. arXiv:2408.08703  [pdf, other]

    cs.CV

    TsCA: On the Semantic Consistency Alignment via Conditional Transport for Compositional Zero-Shot Learning

    Authors: Miaoge Li, Jingcai Guo, Richard Yi Da Xu, Dongsheng Wang, Xiaofeng Cao, Song Guo

    Abstract: Compositional Zero-Shot Learning (CZSL) aims to recognize novel \textit{state-object} compositions by leveraging the shared knowledge of their primitive components. Despite considerable progress, effectively calibrating the bias between semantically similar multimodal representations, as well as generalizing pre-trained knowledge to novel compositional contexts, remains an enduring challenge. In t…

    Submitted 22 August, 2024; v1 submitted 16 August, 2024; originally announced August 2024.

    Comments: 12 pages, 8 figures

  48. arXiv:2408.08091  [pdf, other]

    cs.CV

    HAIR: Hypernetworks-based All-in-One Image Restoration

    Authors: Jin Cao, Yi Cao, Li Pang, Deyu Meng, Xiangyong Cao

    Abstract: Image restoration aims to recover a high-quality clean image from its degraded version. Recent progress in image restoration has demonstrated the effectiveness of All-in-One image restoration models in addressing various unknown degradations simultaneously. However, these existing methods typically utilize the same parameters to tackle images with different types of degradation, forcing the model…

    Submitted 15 October, 2024; v1 submitted 15 August, 2024; originally announced August 2024.

  49. arXiv:2408.07666  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV

    Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities

    Authors: Enneng Yang, Li Shen, Guibing Guo, Xingwei Wang, Xiaochun Cao, Jie Zhang, Dacheng Tao

    Abstract: Model merging is an efficient empowerment technique in the machine learning community that does not require the collection of raw training data and does not require expensive computation. As model merging becomes increasingly prevalent across various fields, it is crucial to understand the available model merging techniques comprehensively. However, there is a significant gap in the literature reg…

    Submitted 5 September, 2024; v1 submitted 14 August, 2024; originally announced August 2024.

  50. arXiv:2408.06701  [pdf, other]

    cs.NI cs.LG

    DiffSG: A Generative Solver for Network Optimization with Diffusion Model

    Authors: Ruihuai Liang, Bo Yang, Zhiwen Yu, Bin Guo, Xuelin Cao, Mérouane Debbah, H. Vincent Poor, Chau Yuen

    Abstract: Diffusion generative models, famous for their performance in image generation, are popular in various cross-domain applications. However, their use in the communication community has been mostly limited to auxiliary tasks like data modeling and feature extraction. These models hold greater promise for fundamental problems in network optimization compared to traditional machine learning methods. Di…

    Submitted 13 August, 2024; originally announced August 2024.

    Comments: 8 pages, 5 figures