
Showing 1–50 of 557 results for author: Xiang, Y

  1. arXiv:2407.19216  [pdf, other]

    cs.CR cs.AI cs.SE

    EaTVul: ChatGPT-based Evasion Attack Against Software Vulnerability Detection

    Authors: Shigang Liu, Di Cao, Junae Kim, Tamas Abraham, Paul Montague, Seyit Camtepe, Jun Zhang, Yang Xiang

    Abstract: Recently, deep learning has demonstrated promising results in enhancing the accuracy of vulnerability detection and identifying vulnerabilities in software. However, these techniques are still vulnerable to attacks. Adversarial examples can exploit vulnerabilities within deep neural networks, posing a significant threat to system security. This study showcases the susceptibility of deep learning m…

    Submitted 27 July, 2024; originally announced July 2024.

  2. arXiv:2407.19043  [pdf, other]

    cond-mat.mtrl-sci

    A phase field model for deformation-induced amorphization

    Authors: Yuntong Huang, Shuyang Dai, Chuqi Chen, Yang Xiang

    Abstract: Amorphization by severe plastic deformation has been observed in various crystalline materials. However, developing a quantitative and comprehensive theory for strain-induced amorphization remains challenging due to the complex nature of microstructural evolutions and deformation mechanisms. We propose a phase field model coupled with elastic-plastic theory to study the strain-induced amorphizatio…

    Submitted 26 July, 2024; originally announced July 2024.

    Comments: 47 pages, 10 figures

  3. arXiv:2407.17057  [pdf, other]

    eess.SP

    Efficient Sensing Parameter Estimation with Direct Clutter Mitigation in Perceptive Mobile Networks

    Authors: Hang Li, Hongming Yang, Qinghua Guo, J. Andrew Zhang, Yang Xiang, Yashan Pang

    Abstract: In this work, we investigate sensing parameter estimation in the presence of clutter in perceptive mobile networks (PMNs) that integrate radar sensing into mobile communications. Performing clutter suppression before sensing parameter estimation is generally desirable as the number of sensing parameters can be significantly reduced. However, existing methods require high-complexity clutter mitigat…

    Submitted 24 July, 2024; originally announced July 2024.

  4. arXiv:2407.13911  [pdf, other]

    cs.CV cs.LG

    Continual Distillation Learning

    Authors: Qifan Zhang, Yunhui Guo, Yu Xiang

    Abstract: We study the problem of Continual Distillation Learning (CDL) that considers Knowledge Distillation (KD) in the Continual Learning (CL) setup. A teacher model and a student model need to learn a sequence of tasks, and the knowledge of the teacher model will be distilled to the student to improve the student model. We introduce a novel method named CDL-Prompt that utilizes prompt-based continual le…

    Submitted 18 July, 2024; originally announced July 2024.

  5. arXiv:2407.11529  [pdf, other]

    eess.IV cs.AI cs.CV

    Cross-Phase Mutual Learning Framework for Pulmonary Embolism Identification on Non-Contrast CT Scans

    Authors: Bizhe Bai, Yan-Jie Zhou, Yujian Hu, Tony C. W. Mok, Yilang Xiang, Le Lu, Hongkun Zhang, Minfeng Xu

    Abstract: Pulmonary embolism (PE) is a life-threatening condition where rapid and accurate diagnosis is imperative yet difficult due to predominantly atypical symptomatology. Computed tomography pulmonary angiography (CTPA) is acknowledged as the gold standard imaging tool in clinics, yet it can be contraindicated for emergency department (ED) patients and represents an onerous procedure, thus necessitating…

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: Early accept by MICCAI 2024

  6. arXiv:2407.07681  [pdf]

    physics.optics physics.bio-ph

    Localizing axial dense emitters based on single-helix point spread function and compressed sensing

    Authors: Hanzhe Wu, Danni Chen, YiHong Ji, Gan Xiang, Heng Li, Bin Yu, JunLe Qu

    Abstract: Among the approaches in three-dimensional (3D) single molecule localization microscopy, there are several point spread function (PSF) engineering approaches, in which depth information of molecules is encoded in 2D images. Usually, the molecules are excited sparsely in each raw image. The consequence is that the temporal resolution has to be sacrificed. In order to improve temporal resolution and e…

    Submitted 10 July, 2024; originally announced July 2024.

  7. arXiv:2407.07289  [pdf, other]

    cs.CV

    Deformable Feature Alignment and Refinement for Moving Infrared Dim-small Target Detection

    Authors: Dengyan Luo, Yanping Xiang, Hu Wang, Luping Ji, Shuai Li, Mao Ye

    Abstract: The detection of moving infrared dim-small targets has been a challenging and prevalent research topic. The current state-of-the-art methods are mainly based on ConvLSTM to aggregate information from adjacent frames to facilitate the detection of the current frame. However, these methods implicitly utilize motion information only in the training stage and fail to explicitly explore motion compensa…

    Submitted 9 July, 2024; originally announced July 2024.

  8. arXiv:2407.03945  [pdf, other]

    math.NA cs.LG

    A fast neural hybrid Newton solver adapted to implicit methods for nonlinear dynamics

    Authors: Tianyu Jin, Georg Maierhofer, Katharina Schratz, Yang Xiang

    Abstract: The use of implicit time-stepping schemes for the numerical approximation of solutions to stiff nonlinear time-evolution equations brings well-known advantages including, typically, better stability behaviour and corresponding support of larger time steps, and better structure preservation properties. However, this comes at the price of having to solve a nonlinear equation at every time step of th…

    Submitted 4 July, 2024; originally announced July 2024.

  9. arXiv:2407.03390  [pdf, other]

    cond-mat.mes-hall physics.optics

    Observation of Co-propagating Chiral Zero Modes in Magnetic Photonic Crystals

    Authors: Zhongfu Li, Shaojie Ma, Shuwei Li, Oubo You, Yachao Liu, Qingdong Yang, Yuanjiang Xiang, Peiheng Zhou, Shuang Zhang

    Abstract: Topological singularities, such as Weyl points and Dirac points, can give rise to unidirectional propagation channels known as chiral zero modes (CZMs) when subject to a magnetic field. These CZMs are responsible for intriguing phenomena like the chiral anomaly in quantum systems. The propagation direction of each CZM is determined by both the applied magnetic field and the topological charge of t…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 6 pages, 5 figures

  10. arXiv:2407.02280  [pdf, other]

    cs.CV cs.AI

    FedIA: Federated Medical Image Segmentation with Heterogeneous Annotation Completeness

    Authors: Yangyang Xiang, Nannan Wu, Li Yu, Xin Yang, Kwang-Ting Cheng, Zengqiang Yan

    Abstract: Federated learning has emerged as a compelling paradigm for medical image segmentation, particularly in light of increasing privacy concerns. However, most of the existing research relies on relatively stringent assumptions regarding the uniformity and completeness of annotations across clients. Contrary to this, this paper highlights a prevalent challenge in medical practice: incomplete annotatio…

    Submitted 3 July, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

    Comments: Early accepted by MICCAI 2024

  11. arXiv:2406.17969  [pdf, other]

    cs.CL cs.AI

    Encourage or Inhibit Monosemanticity? Revisit Monosemanticity from a Feature Decorrelation Perspective

    Authors: Hanqi Yan, Yanzheng Xiang, Guangyi Chen, Yifei Wang, Lin Gui, Yulan He

    Abstract: To better interpret the intrinsic mechanism of large language models (LLMs), recent studies focus on monosemanticity on its basic units. A monosemantic neuron is dedicated to a single and specific concept, which forms a one-to-one correlation between neurons and concepts. Despite extensive research in monosemanticity probing, it remains unclear whether monosemanticity is beneficial or harmful to m…

    Submitted 25 June, 2024; originally announced June 2024.

  12. arXiv:2406.15222  [pdf]

    eess.IV cs.AI cs.CV

    Rapid and Accurate Diagnosis of Acute Aortic Syndrome using Non-contrast CT: A Large-scale, Retrospective, Multi-center and AI-based Study

    Authors: Yujian Hu, Yilang Xiang, Yan-Jie Zhou, Yangyan He, Shifeng Yang, Xiaolong Du, Chunlan Den, Youyao Xu, Gaofeng Wang, Zhengyao Ding, Jingyong Huang, Wenjun Zhao, Xuejun Wu, Donglin Li, Qianqian Zhu, Zhenjiang Li, Chenyang Qiu, Ziheng Wu, Yunjun He, Chen Tian, Yihui Qiu, Zuodong Lin, Xiaolong Zhang, Yuan He, Zhenpeng Yuan, et al. (15 additional authors not shown)

    Abstract: Chest pain symptoms are highly prevalent in emergency departments (EDs), where acute aortic syndrome (AAS) is a catastrophic cardiovascular emergency with a high fatality rate, especially when timely and accurate treatment is not administered. However, current triage practices in the ED can cause up to approximately half of patients with AAS to have an initially missed diagnosis or be misdiagnosed…

    Submitted 16 July, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

  13. arXiv:2406.12889  [pdf]

    cond-mat.mtrl-sci

    Wide-bandgap semiconductor of three-dimensional unconventional stoichiometric NaCl2 crystal

    Authors: Siyan Gao, Junlin Jia, Xu Wang, Yue-Yu Zhang, Yijie Xiang, Pei Li, Ruobing Yi, Xuchang Su, Guosheng Shi, Feifei Qin, Yi-Feng Zheng, Lei Chen, Yu Qiang, Junjie Zhang, Lei Zhang, Haiping Fang

    Abstract: The expanding applications call for novel new-generation wide-bandgap semiconductors. Here, we show that a compound only composed of the ordinary elements Na and Cl, namely three-dimensional NaCl2 crystal, is a wide-bandgap semiconductor. This finding benefits from the breaking of conventional stoichiometry frameworks in the theoretical design, leading to the discovery of three-dimensional XY2 (X…

    Submitted 3 June, 2024; originally announced June 2024.

  14. arXiv:2406.07232  [pdf, other]

    cs.CL cs.AI

    DUAL-REFLECT: Enhancing Large Language Models for Reflective Translation through Dual Learning Feedback Mechanisms

    Authors: Andong Chen, Lianzhang Lou, Kehai Chen, Xuefeng Bai, Yang Xiang, Muyun Yang, Tiejun Zhao, Min Zhang

    Abstract: Recently, large language models (LLMs) enhanced by self-reflection have achieved promising performance on machine translation. The key idea is guiding LLMs to generate translation with human-like feedback. However, existing self-reflection methods lack effective feedback information, limiting the translation performance. To address this, we introduce a DUAL-REFLECT framework, leveraging the dual l…

    Submitted 21 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024 main conference

  15. arXiv:2406.07036  [pdf, other]

    cs.CL cs.AI

    Paying More Attention to Source Context: Mitigating Unfaithful Translations from Large Language Model

    Authors: Hongbin Zhang, Kehai Chen, Xuefeng Bai, Yang Xiang, Min Zhang

    Abstract: Large language models (LLMs) have showcased impressive multilingual machine translation ability. However, unlike encoder-decoder style models, decoder-only LLMs lack an explicit alignment between source and target contexts. Analyzing contribution scores during generation processes revealed that LLMs can be biased towards previously generated tokens over corresponding source tokens, leading to unfa…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted by ACL2024 Findings

  16. arXiv:2406.06843  [pdf, other]

    cs.CV

    HO-Cap: A Capture System and Dataset for 3D Reconstruction and Pose Tracking of Hand-Object Interaction

    Authors: Jikai Wang, Qifan Zhang, Yu-Wei Chao, Bowen Wen, Xiaohu Guo, Yu Xiang

    Abstract: We introduce a data capture system and a new dataset named HO-Cap that can be used to study 3D reconstruction and pose tracking of hands and objects in videos. The capture system uses multiple RGB-D cameras and a HoloLens headset for data collection, avoiding the use of expensive 3D scanners or mocap systems. We propose a semi-automatic method to obtain annotations of shape and pose of hands and o…

    Submitted 16 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  17. arXiv:2406.03880  [pdf, other]

    cs.LG cs.AI

    Memorization in deep learning: A survey

    Authors: Jiaheng Wei, Yanjun Zhang, Leo Yu Zhang, Ming Ding, Chao Chen, Kok-Leong Ong, Jun Zhang, Yang Xiang

    Abstract: Deep Learning (DL) powered by Deep Neural Networks (DNNs) has revolutionized various domains, yet understanding the intricacies of DNN decision-making and learning processes remains a significant challenge. Recent investigations have uncovered an interesting memorization phenomenon in which DNNs tend to memorize specific details from examples rather than learning general patterns, affecting model…

    Submitted 6 June, 2024; originally announced June 2024.

  18. arXiv:2406.02630  [pdf, other]

    cs.CR cs.AI

    AI Agents Under Threat: A Survey of Key Security Challenges and Future Pathways

    Authors: Zehang Deng, Yongjian Guo, Changzhou Han, Wanlun Ma, Junwu Xiong, Sheng Wen, Yang Xiang

    Abstract: An Artificial Intelligence (AI) agent is a software entity that autonomously performs tasks or makes decisions based on pre-defined objectives and data inputs. AI agents, capable of perceiving user inputs, reasoning and planning tasks, and executing actions, have seen remarkable advancements in algorithm development and task performance. However, the security challenges they pose remain under-expl…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: ACM Computing Survey

  19. arXiv:2405.19437  [pdf, ps, other]

    math.PR math-ph physics.bio-ph

    Quantitative hydrodynamics for a generalized contact model

    Authors: Julian Amorim, Milton Jara, Yangrui Xiang

    Abstract: We derive a quantitative version of the hydrodynamic limit for an interacting particle system inspired by integrate-and-fire neuron models. More precisely, we show that the $L^2$-speed of convergence of the empirical density of states in a generalized contact process defined over a $d$-dimensional torus of size $n$ is of the optimal order $\mathcal O(n^{d/2})$. In addition, we show that the typica…

    Submitted 29 May, 2024; originally announced May 2024.

  20. arXiv:2405.17859  [pdf, other]

    cs.CV cs.RO

    Adapting Pre-Trained Vision Models for Novel Instance Detection and Segmentation

    Authors: Yangxiao Lu, Jishnu Jaykumar P, Yunhui Guo, Nicholas Ruozzi, Yu Xiang

    Abstract: Novel Instance Detection and Segmentation (NIDS) aims at detecting and segmenting novel object instances given a few examples of each instance. We propose a unified framework (NIDS-Net) comprising object proposal generation, embedding creation for both instance templates and proposal regions, and embedding matching for instance label assignment. Leveraging recent advancements in large vision metho…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 22 pages, 9 figures, Code is available at: https://github.com/YoungSean/NIDS-Net

  21. arXiv:2405.16594  [pdf, ps, other]

    stat.ML cs.LG

    Training-Conditional Coverage Bounds under Covariate Shift

    Authors: Mehrdad Pournaderi, Yu Xiang

    Abstract: Training-conditional coverage guarantees in conformal prediction concern the concentration of the error distribution, conditional on the training data, below some nominal level. The conformal prediction methodology has recently been generalized to the covariate shift setting, namely, the covariate distribution changes between the training and test data. In this paper, we study the training-conditi…

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:2404.13731

  22. arXiv:2405.15258  [pdf, other]

    cs.CR

    Leakage-Resilient and Carbon-Neutral Aggregation Featuring the Federated AI-enabled Critical Infrastructure

    Authors: Zehang Deng, Ruoxi Sun, Minhui Xue, Sheng Wen, Seyit Camtepe, Surya Nepal, Yang Xiang

    Abstract: AI-enabled critical infrastructures (ACIs) integrate artificial intelligence (AI) technologies into various essential systems and services that are vital to the functioning of society, offering significant implications for efficiency, security and resilience. While adopting decentralized AI approaches (such as federated learning technology) in ACIs is plausible, private and sensitive data are stil…

    Submitted 24 May, 2024; originally announced May 2024.

  23. arXiv:2405.14099  [pdf, other]

    cs.LG math.NA

    Automatic Differentiation is Essential in Training Neural Networks for Solving Differential Equations

    Authors: Chuqi Chen, Yahong Yang, Yang Xiang, Wenrui Hao

    Abstract: Neural network-based approaches have recently shown significant promise in solving partial differential equations (PDEs) in science and engineering, especially in scenarios featuring complex domains or the incorporation of empirical data. One advantage of the neural network method for PDEs lies in its automatic differentiation (AD), which necessitates only the sample points themselves, unlike trad…

    Submitted 22 May, 2024; originally announced May 2024.

  24. arXiv:2405.12114  [pdf, other]

    cs.CV math.NA

    A New Cross-Space Total Variation Regularization Model for Color Image Restoration with Quaternion Blur Operator

    Authors: Zhigang Jia, Yuelian Xiang, Meixiang Zhao, Tingting Wu, Michael K. Ng

    Abstract: The cross-channel deblurring problem in color image processing is difficult to solve due to the complex coupling and structural blurring of color pixels. Until now, there are few efficient algorithms that can reduce color infection in deblurring process. To solve this challenging problem, we present a novel cross-space total variation (CSTV) regularization model for color image deblurring by intro…

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: 15 pages, 10 figures

  25. arXiv:2405.10616  [pdf, other]

    cs.CL cs.LG

    Feature-based Low-Rank Compression of Large Language Models via Bayesian Optimization

    Authors: Yixin Ji, Yang Xiang, Juntao Li, Wei Chen, Zhongyi Liu, Kehai Chen, Min Zhang

    Abstract: In recent years, large language models (LLMs) have driven advances in natural language processing. Still, their growing scale has increased the computational burden, necessitating a balance between efficiency and performance. Low-rank compression, a promising technique, reduces non-essential parameters by decomposing weight matrices into products of two low-rank matrices. Yet, its application in L…

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: Accepted by 2024 ACL findings

  26. arXiv:2405.09298  [pdf]

    eess.IV cs.CV

    Deep Blur Multi-Model (DeepBlurMM) -- a strategy to mitigate the impact of image blur on deep learning model performance in histopathology image analysis

    Authors: Yujie Xiang, Bojing Liu, Mattias Rantalainen

    Abstract: AI-based analysis of histopathology whole slide images (WSIs) is central in computational pathology. However, image quality, including unsharp areas of WSIs, impacts model performance. We investigate the impact of blur and propose a multi-model approach to mitigate negative impact of unsharp image areas. In this study, we use a simulation approach, evaluating model performance under varying levels…

    Submitted 23 May, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

    ACM Class: I.4; J.3

  27. arXiv:2405.06902   

    cs.LG stat.ML

    Causal Inference from Slowly Varying Nonstationary Processes

    Authors: Kang Du, Yu Xiang

    Abstract: Causal inference from observational data following the restricted structural causal models (SCM) framework hinges largely on the asymmetry between cause and effect from the data generating mechanisms, such as non-Gaussianity or non-linearity. This methodology can be adapted to stationary time series, yet inferring causal relationships from nonstationary time series remains a challenging task. In t…

    Submitted 29 May, 2024; v1 submitted 11 May, 2024; originally announced May 2024.

    Comments: This work was intended as a replacement of arXiv:2012.13025 and any subsequent updates will appear there

  28. arXiv:2405.05498  [pdf, other]

    cs.SD eess.AS

    The RoyalFlush Automatic Speech Diarization and Recognition System for In-Car Multi-Channel Automatic Speech Recognition Challenge

    Authors: Jingguang Tian, Shuaishuai Ye, Shunfei Chen, Yang Xiang, Zhaohui Yin, Xinhui Hu, Xinkang Xu

    Abstract: This paper presents our system submission for the In-Car Multi-Channel Automatic Speech Recognition (ICMC-ASR) Challenge, which focuses on speaker diarization and speech recognition in complex multi-speaker scenarios. To address these challenges, we develop end-to-end speaker diarization models that notably decrease the diarization error rate (DER) by 49.58\% compared to the official baseline on t…

    Submitted 8 May, 2024; originally announced May 2024.

  29. arXiv:2405.04858  [pdf, other]

    cs.CV

    Pedestrian Attribute Recognition as Label-balanced Multi-label Learning

    Authors: Yibo Zhou, Hai-Miao Hu, Yirong Xiang, Xiaokang Zhang, Haotian Wu

    Abstract: Rooting in the scarcity of most attributes, realistic pedestrian attribute datasets exhibit unduly skewed data distribution, from which two types of model failures are delivered: (1) label imbalance: model predictions lean greatly towards the side of majority labels; (2) semantics imbalance: model is easily overfitted on the under-represented attributes due to their insufficient semantic diversity…

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: Accepted as ICML2024 main conference paper

  30. arXiv:2405.01570  [pdf]

    cond-mat.supr-con

    Superconductivity of Bulk Abnormal Magic-stoichiometric Na3Cl Salt Crystals at Normal Pressure

    Authors: Shuqiang He, Yi-Feng Zheng, Guosheng Shi, Yi-Jie Xiang, Meihui Xiao, Qituan Zhang, Yue-Yu Zhang, Haiping Fang

    Abstract: The identification of new materials with superconducting properties is the pursuit in the realm of superconductivity research. Here, excitedly, we show that the simplest salt daily used can be made a superconductor at normal pressure only by adjusting its stoichiometry of Na and Cl as Na3Cl at normal pressure based on first-principles calculations. This bulk stable abnormal Na-Cl stoichiometric cr…

    Submitted 17 April, 2024; originally announced May 2024.

  31. arXiv:2405.00273  [pdf, other]

    cs.CL cs.HC

    Social Life Simulation for Non-Cognitive Skills Learning

    Authors: Zihan Yan, Yaohong Xiang, Yun Huang

    Abstract: Non-cognitive skills are crucial for personal and social life well-being, and such skill development can be supported by narrative-based (e.g., storytelling) technologies. While generative AI enables interactive and role-playing storytelling, little is known about how users engage with and perceive the use of AI in social life simulation for non-cognitive skills learning. Additionally, the benefit…

    Submitted 19 July, 2024; v1 submitted 30 April, 2024; originally announced May 2024.

  32. arXiv:2405.00026  [pdf]

    cs.CE cs.AI

    Enhancing Credit Card Fraud Detection A Neural Network and SMOTE Integrated Approach

    Authors: Mengran Zhu, Ye Zhang, Yulu Gong, Changxin Xu, Yafei Xiang

    Abstract: Credit card fraud detection is a critical challenge in the financial sector, demanding sophisticated approaches to accurately identify fraudulent transactions. This research proposes an innovative methodology combining Neural Networks (NN) and Synthetic Minority Over-sampling Technique (SMOTE) to enhance the detection performance. The study addresses the inherent imbalance in credit card transact…

    Submitted 26 February, 2024; originally announced May 2024.

  33. PromptCL: Improving Event Representation via Prompt Template and Contrastive Learning

    Authors: Yubo Feng, Lishuang Li, Yi Xiang, Xueyang Qin

    Abstract: The representation of events in text plays a significant role in various NLP tasks. Recent research demonstrates that contrastive learning has the ability to improve event comprehension capabilities of Pre-trained Language Models (PLMs) and enhance the performance of event representation learning. However, the efficacy of event representation learning based on contrastive learning and PLMs is limi…

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: NLPCC 2023 Best Student Paper

    Journal ref: Natural Language Processing and Chinese Computing (NLPCC 2023)

  34. arXiv:2404.17738  [pdf]

    cond-mat.mes-hall cond-mat.mtrl-sci

    Ultimate charge transport regimes in doping-controlled graphene laminates: phonon-assisted processes revealed by the linear magnetoresistance

    Authors: Mohsen Moazzami Gudarzi, Sergey Slizovskiy, Boyang Mao, Endre Tóvári, Gergo Pinter, David Sanderson, Maryana Asaad, Ying Xiang, Zhiyuan Wang, Jianqiang Guo, Ben F. Spencer, Alexandra A. Geim, Vladimir I. Fal'ko, Andrey V. Kretinin

    Abstract: Understanding and controlling the electrical properties of solution-processed 2D materials is key to further printed electronics progress. Here we demonstrate that the thermolysis of the aromatic intercalants utilized in nanosheet exfoliation for graphene laminates opens the route to achieving high intrinsic mobility and simultaneously controlling doping type ($n$- and $p$-) and concentration over…

    Submitted 26 April, 2024; originally announced April 2024.

  35. arXiv:2404.15245  [pdf, other]

    stat.ME cs.LG

    Mining Invariance from Nonlinear Multi-Environment Data: Binary Classification

    Authors: Austin Goddard, Kang Du, Yu Xiang

    Abstract: Making predictions in an unseen environment given data from multiple training environments is a challenging task. We approach this problem from an invariance perspective, focusing on binary classification to shed light on general nonlinear data generation mechanisms. We identify a unique form of invariance that exists solely in a binary setting that allows us to train models invariant over environ…

    Submitted 3 July, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: Accepted to the 2024 International Symposium on Information Theory (ISIT)

  36. arXiv:2404.13731  [pdf, ps, other]

    stat.ML cs.LG

    Training-Conditional Coverage Bounds for Uniformly Stable Learning Algorithms

    Authors: Mehrdad Pournaderi, Yu Xiang

    Abstract: The training-conditional coverage performance of the conformal prediction is known to be empirically sound. Recently, there have been efforts to support this observation with theoretical guarantees. The training-conditional coverage bounds for jackknife+ and full-conformal prediction regions have been established via the notion of $(m,n)$-stability by Liang and Barber [2023]. Although this notion…

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: Accepted to the ISIT 2024 workshop on Information-Theoretic Methods for Trustworthy Machine Learning (IT-TML)

  37. arXiv:2404.12715  [pdf, other]

    cs.CL

    Ensemble Learning for Heterogeneous Large Language Models with Deep Parallel Collaboration

    Authors: Yichong Huang, Xiaocheng Feng, Baohang Li, Yang Xiang, Hui Wang, Bing Qin, Ting Liu

    Abstract: Large language models (LLMs) exhibit complementary strengths in various tasks, motivating the research of LLM ensembling. However, existing work focuses on training an extra reward model or fusion model to select or combine all candidate answers, posing a great challenge to the generalization on unseen data distributions. Besides, prior methods use textual responses as communication media, ignorin…

    Submitted 30 May, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

    Comments: 16 pages, 9 figures, 9 tables

  38. arXiv:2404.11667  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Deep Dependency Networks and Advanced Inference Schemes for Multi-Label Classification

    Authors: Shivvrat Arya, Yu Xiang, Vibhav Gogate

    Abstract: We present a unified framework called deep dependency networks (DDNs) that combines dependency networks and deep learning architectures for multi-label classification, with a particular emphasis on image and video data. The primary advantage of dependency networks is their ease of training, in contrast to other probabilistic graphical models like Markov networks. In particular, when combined with…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: Will appear in AISTATS 2024. arXiv admin note: substantial text overlap with arXiv:2302.00633

  39. Local pairing versus bulk superconductivity intertwined by the charge density wave order in Cs(V$_{1-x}$Ta$_{x}$)$_{3}$Sb$_{5}$

    Authors: Jinyulin Li, Qing Li, Jinjin Liu, Ying Xiang, Huan Yang, Zhiwei Wang, Yugui Yao, Hai-Hu Wen

    Abstract: There is a common belief that superconductivity and charge density wave (CDW) order accommodate homogenously in real space but compete with each other for the effective density of states in momentum space in CDW superconductors. By measuring resistivity along the $c$-axis in Cs(V$_{1-x}$Ta$_{x}$)$_{3}$Sb$_{5}$, we observe strong superconducting fluctuation behavior coexisting with the CDW order in…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: 9 pages, 5 figures

    Journal ref: Phys. Rev. Materials 8, 014801 (2024)

  40. Cost-effective company response policy for product co-creation in company-sponsored online community

    Authors: Jiamin Hu, Lu-Xing Yang, Xiaofan Yang, Kaifan Huang, Gang Li, Yong Xiang

    Abstract: Product co-creation based on company-sponsored online community has come to be a paradigm of developing new products collaboratively with customers. In such a product co-creation campaign, the sponsoring company needs to interact intensively with active community members about the design scheme of the product. We call the collection of the rates of the company's response to active community member…

    Submitted 14 April, 2024; originally announced April 2024.

  41. arXiv:2404.08690  [pdf, other]

    cs.CL cs.AI cs.CR cs.LG

    Towards Building a Robust Toxicity Predictor

    Authors: Dmitriy Bespalov, Sourav Bhabesh, Yi Xiang, Liutong Zhou, Yanjun Qi

    Abstract: Recent NLP literature pays little attention to the robustness of toxicity language predictors, while these systems are most likely to be used in adversarial contexts. This paper presents a novel adversarial attack, \texttt{ToxicTrap}, introducing small word-level perturbations to fool SOTA text classifiers to predict toxic text samples as benign. ToxicTrap exploits greedy based search strategies t…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: ACL 2023

  42. arXiv:2404.06452  [pdf, other]

    cs.RO eess.SY

    PAAM: A Framework for Coordinated and Priority-Driven Accelerator Management in ROS 2

    Authors: Daniel Enright, Yecheng Xiang, Hyunjong Choi, Hyoseung Kim

    Abstract: This paper proposes a Priority-driven Accelerator Access Management (PAAM) framework for multi-process robotic applications built on top of the Robot Operating System (ROS) 2 middleware platform. The framework addresses the issue of predictable execution of time- and safety-critical callback chains that require hardware accelerators such as GPUs and TPUs. PAAM provides a standalone ROS executor th…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 14 Pages, 14 Figures

  43. arXiv:2404.04357  [pdf, other]

    math.OC

    Why does the two-timescale Q-learning converge to different mean field solutions? A unified convergence analysis

    Authors: Jing An, Jianfeng Lu, Yue Wu, Yang Xiang

    Abstract: We revisit the unified two-timescale Q-learning algorithm as initially introduced by Angiuli et al. This algorithm demonstrates efficacy in solving mean field game (MFG) and mean field control (MFC) problems, simply by tuning the ratio of two learning rates for the mean field distribution and the Q-functions, respectively. In this paper, we provide a comprehensive theoretical…

    Submitted 28 May, 2024; v1 submitted 5 April, 2024; originally announced April 2024.

    Comments: 34 pages. Updated version for submission. We added more numerical results and fixed several minor mistakes

  44. arXiv:2403.16523  [pdf, other]

    stat.ML cs.AI cs.LG

    Causal Discovery from Poisson Branching Structural Causal Model Using High-Order Cumulant with Path Analysis

    Authors: Jie Qiao, Yu Xiang, Zhengming Chen, Ruichu Cai, Zhifeng Hao

    Abstract: Count data naturally arise in many fields, such as finance, neuroscience, and epidemiology, and discovering causal structure among count data is a crucial task in various scientific and industrial scenarios. One of the most common characteristics of count data is the inherent branching structure described by a binomial thinning operator and an independent Poisson distribution that captures both br…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Accepted by AAAI-2024

  45. arXiv:2403.13258  [pdf, other]

    cs.CV

    SAMCT: Segment Any CT Allowing Labor-Free Task-Indicator Prompts

    Authors: Xian Lin, Yangyang Xiang, Zhehao Wang, Kwang-Ting Cheng, Zengqiang Yan, Li Yu

    Abstract: Segment anything model (SAM), a foundation model with superior versatility and generalization across diverse segmentation tasks, has attracted widespread attention in medical imaging. However, it has been proved that SAM would encounter severe performance degradation due to the lack of medical knowledge in training and local feature encoding. Though several SAM-based models have been proposed for…

    Submitted 19 March, 2024; originally announced March 2024.

  46. arXiv:2403.12504  [pdf, other]

    cs.RO

    TON-VIO: Online Time Offset Modeling Networks for Robust Temporal Alignment in High Dynamic Motion VIO

    Authors: Chaoran Xiong, Guoqing Liu, Qi Wu, Songpengcheng Xia, Tong Hua, Kehui Ma, Zhen Sun, Yan Xiang, Ling Pei

    Abstract: Temporal misalignment (time offset) between sensors is common in low-cost visual-inertial odometry (VIO) systems. Such temporal misalignment introduces inconsistent constraints for state estimation, leading to a significant positioning drift, especially in high dynamic motion scenarios. In this article, we focus on online temporal calibration to reduce the positioning drift caused by the time offse…

    Submitted 19 March, 2024; originally announced March 2024.

  47. arXiv:2403.11544  [pdf, ps, other]

    cs.LG

    RL in Markov Games with Independent Function Approximation: Improved Sample Complexity Bound under the Local Access Model

    Authors: Junyi Fan, Yuxuan Han, Jialin Zeng, Jian-Feng Cai, Yang Wang, Yang Xiang, Jiheng Zhang

    Abstract: Efficiently learning equilibria with large state and action spaces in general-sum Markov games while overcoming the curse of multi-agency is a challenging problem. Recent works have attempted to solve this problem by employing independent linear function classes to approximate the marginal $Q$-value for each agent. However, existing sample complexity bounds under such a framework have a suboptimal…

    Submitted 19 March, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted at the 27th International Conference on Artificial Intelligence and Statistics (AISTATS 2024)

  48. arXiv:2403.09841  [pdf, other]

    cs.RO

    MultiGripperGrasp: A Dataset for Robotic Grasping from Parallel Jaw Grippers to Dexterous Hands

    Authors: Luis Felipe Casas Murrilo, Ninad Khargonkar, Balakrishnan Prabhakaran, Yu Xiang

    Abstract: We introduce a large-scale dataset named MultiGripperGrasp for robotic grasping. Our dataset contains 30.4M grasps from 11 grippers for 345 objects. These grippers range from two-finger grippers to five-finger grippers, including a human hand. All grasps in the dataset are verified in Isaac Sim to classify them as successful and unsuccessful grasps. Additionally, the object fall-off time for each…

    Submitted 14 March, 2024; originally announced March 2024.

  49. arXiv:2403.08822  [pdf]

    cs.LG cs.CL

    LoRA-SP: Streamlined Partial Parameter Adaptation for Resource-Efficient Fine-Tuning of Large Language Models

    Authors: Yichao Wu, Yafei Xiang, Shuning Huo, Yulu Gong, Penghao Liang

    Abstract: In addressing the computational and memory demands of fine-tuning Large Language Models (LLMs), we propose LoRA-SP (Streamlined Partial Parameter Adaptation), a novel approach utilizing randomized half-selective parameter freezing within the Low-Rank Adaptation (LoRA) framework. This method efficiently balances pre-trained knowledge retention and adaptability for task-specific optimizations. Through a…

    Submitted 28 February, 2024; originally announced March 2024.

  50. arXiv:2403.06174  [pdf, other]

    cs.LG cs.AI

    Domain Adversarial Active Learning for Domain Generalization Classification

    Authors: Jianting Chen, Ling Ding, Yunxiao Yang, Zaiyuan Di, Yang Xiang

    Abstract: Domain generalization models aim to learn cross-domain knowledge from source domain data, to improve performance on unknown target domains. Recent research has demonstrated that diverse and rich source domain samples can enhance domain generalization capability. This paper argues that the impact of each sample on the model's generalization ability varies. Despite its small scale, a high-quality da…

    Submitted 10 March, 2024; originally announced March 2024.