Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 101–150 of 1,175 results for author: He, S

.
  1. arXiv:2402.12851  [pdf, other

    cs.CL

    MoELoRA: Contrastive Learning Guided Mixture of Experts on Parameter-Efficient Fine-Tuning for Large Language Models

    Authors: Tongxu Luo, Jiahe Lei, Fangyu Lei, Weihao Liu, Shizhu He, Jun Zhao, Kang Liu

    Abstract: Fine-tuning is often necessary to enhance the adaptability of Large Language Models (LLM) to downstream tasks. Nonetheless, the process of updating billions of parameters demands significant computational resources and training time, which poses a substantial obstacle to the widespread application of large-scale models in various scenarios. To address this issue, Parameter-Efficient Fine-Tuning (P… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  2. arXiv:2402.12271  [pdf, other

    cs.DC cs.LG

    Secure Federated Learning Across Heterogeneous Cloud and High-Performance Computing Resources -- A Case Study on Federated Fine-tuning of LLaMA 2

    Authors: Zilinghan Li, Shilan He, Pranshu Chaturvedi, Volodymyr Kindratenko, Eliu A Huerta, Kibaek Kim, Ravi Madduri

    Abstract: Federated learning enables multiple data owners to collaboratively train robust machine learning models without transferring large or sensitive local datasets by only sharing the parameters of the locally trained models. In this paper, we elaborate on the design of our Advanced Privacy-Preserving Federated Learning (APPFL) framework, which streamlines end-to-end secure and reliable federated learn… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  3. arXiv:2402.12219  [pdf, other

    cs.CL cs.AI cs.LG

    Reformatted Alignment

    Authors: Run-Ze Fan, Xuefeng Li, Haoyang Zou, Junlong Li, Shwai He, Ethan Chern, Jiewen Hu, Pengfei Liu

    Abstract: The quality of finetuning data is crucial for aligning large language models (LLMs) with human values. Current methods to improve data quality are either labor-intensive or prone to factual errors caused by LLM hallucinations. This paper explores elevating the quality of existing instruction data to better align with human values, introducing a simple and effective approach named ReAlign, which re… ▽ More

    Submitted 17 April, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: Homepage: https://gair-nlp.github.io/ReAlign/

  4. arXiv:2402.12099   

    cs.CV

    Human Video Translation via Query Warping

    Authors: Haiming Zhu, Yangyang Xu, Shengfeng He

    Abstract: In this paper, we present QueryWarp, a novel framework for temporally coherent human motion video translation. Existing diffusion-based video editing approaches that rely solely on key and value tokens to ensure temporal consistency, which scarifies the preservation of local and structural regions. In contrast, we aim to consider complementary query priors by constructing the temporal correlations… ▽ More

    Submitted 21 May, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: This is not a complete paper and the methods and results have not been updated. We decided to withdraw and make further improvements

  5. arXiv:2402.11139  [pdf, other

    cs.LG cs.AI

    LiGNN: Graph Neural Networks at LinkedIn

    Authors: Fedor Borisyuk, Shihai He, Yunbo Ouyang, Morteza Ramezani, Peng Du, Xiaochen Hou, Chengming Jiang, Nitin Pasumarthy, Priya Bannur, Birjodh Tiwana, Ping Liu, Siddharth Dangi, Daqi Sun, Zhoutao Pei, Xiao Shi, Sirou Zhu, Qianqi Shen, Kuang-Hsuan Lee, David Stein, Baolei Li, Haichao Wei, Amol Ghoting, Souvik Ghosh

    Abstract: In this paper, we present LiGNN, a deployed large-scale Graph Neural Networks (GNNs) Framework. We share our insight on developing and deployment of GNNs at large scale at LinkedIn. We present a set of algorithmic improvements to the quality of GNN representation learning including temporal graph architectures with long term losses, effective cold start solutions via graph densification, ID embedd… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  6. arXiv:2402.10940  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Neural machine translation of clinical procedure codes for medical diagnosis and uncertainty quantification

    Authors: Pei-Hung Chung, Shuhan He, Norawit Kijpaisalratana, Abdel-badih el Ariss, Byung-Jun Yoon

    Abstract: A Clinical Decision Support System (CDSS) is designed to enhance clinician decision-making by combining system-generated recommendations with medical expertise. Given the high costs, intensive labor, and time-sensitive nature of medical treatments, there is a pressing need for efficient decision support, especially in complex emergency scenarios. In these scenarios, where information can be limite… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  7. arXiv:2402.10464  [pdf, other

    cs.LG cs.NI

    FedKit: Enabling Cross-Platform Federated Learning for Android and iOS

    Authors: Sichang He, Beilong Tang, Boyan Zhang, Jiaoqi Shao, Xiaomin Ouyang, Daniel Nata Nugraha, Bing Luo

    Abstract: We present FedKit, a federated learning (FL) system tailored for cross-platform FL research on Android and iOS devices. FedKit pipelines cross-platform FL development by enabling model conversion, hardware-accelerated training, and cross-platform model aggregation. Our FL workflow supports flexible machine learning operations (MLOps) in production, facilitating continuous model delivery and traini… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: This work has been accepted for demonstration on IEEE International Conference on Computer Communications (INFOCOM) 2024

  8. arXiv:2402.10151  [pdf, other

    cs.CL

    ControlLM: Crafting Diverse Personalities for Language Models

    Authors: Yixuan Weng, Shizhu He, Kang Liu, Shengping Liu, Jun Zhao

    Abstract: As language models continue to scale in size and capability, they display an array of emerging behaviors, both beneficial and concerning. This heightens the need to control model behaviors. We hope to be able to control the personality traits of language models at the inference-time so as to have various character features, on top of which the requirements of different types of tasks can be met. P… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 17 pages

  9. arXiv:2402.10110  [pdf, other

    cs.CL cs.AI cs.LG

    Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning

    Authors: Ming Li, Lichang Chen, Jiuhai Chen, Shwai He, Jiuxiang Gu, Tianyi Zhou

    Abstract: Instruction tuning is critical to large language models (LLMs) for achieving better instruction following and task adaptation capabilities but its success heavily relies on the training data quality. Many recent methods focus on improving the data quality but often overlook the compatibility of the data with the student model being finetuned. This paper introduces Selective Reflection-Tuning, a no… ▽ More

    Submitted 7 June, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: ACL2024 (findings), Camera-ready

  10. arXiv:2402.07939  [pdf, other

    cs.HC cs.AI cs.CL

    UFO: A UI-Focused Agent for Windows OS Interaction

    Authors: Chaoyun Zhang, Liqun Li, Shilin He, Xu Zhang, Bo Qiao, Si Qin, Minghua Ma, Yu Kang, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang

    Abstract: We introduce UFO, an innovative UI-Focused agent to fulfill user requests tailored to applications on Windows OS, harnessing the capabilities of GPT-Vision. UFO employs a dual-agent framework to meticulously observe and analyze the graphical user interface (GUI) and control information of Windows applications. This enables the agent to seamlessly navigate and operate within individual applications… ▽ More

    Submitted 23 May, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

  11. arXiv:2402.05798  [pdf, other

    cs.HC

    Visual Harmony: Text-Visual Interplay in Circular Infographics

    Authors: Shuqi He, Yuqing Chen, Yuxin Xia, Yichun Li, Hai-Ning Liang, Lingyun Yu

    Abstract: Infographics are visual representations designed for efficient and effective communication of data and knowledge. One crucial aspect of infographic design is the interplay between text and visual elements, particularly in circular visualizations where the textual descriptions can either be embedded within the graphics or placed adjacent to the visual representation. While several studies have exam… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  12. arXiv:2402.05438  [pdf, other

    math.ST stat.ME

    Penalized spline estimation of principal components for sparse functional data: rates of convergence

    Authors: Shiyuan He, Jianhua Z. Huang, Kejun He

    Abstract: This paper gives a comprehensive treatment of the convergence rates of penalized spline estimators for simultaneously estimating several leading principal component functions, when the functional data is sparsely observed. The penalized spline estimators are defined as the solution of a penalized empirical risk minimization problem, where the loss function belongs to a general class of loss functi… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  13. arXiv:2402.01723  [pdf, other

    cs.CL cs.AI

    An Empirical Study on Large Language Models in Accuracy and Robustness under Chinese Industrial Scenarios

    Authors: Zongjie Li, Wenying Qiu, Pingchuan Ma, Yichen Li, You Li, Sijia He, Baozheng Jiang, Shuai Wang, Weixi Gu

    Abstract: Recent years have witnessed the rapid development of large language models (LLMs) in various domains. To better serve the large number of Chinese users, many commercial vendors in China have adopted localization strategies, training and providing local LLMs specifically customized for Chinese users. Furthermore, looking ahead, one of the key future applications of LLMs will be practical deployment… ▽ More

    Submitted 26 January, 2024; originally announced February 2024.

  14. arXiv:2402.00530  [pdf, other

    cs.CL

    Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning

    Authors: Ming Li, Yong Zhang, Shwai He, Zhitao Li, Hongyu Zhao, Jianzong Wang, Ning Cheng, Tianyi Zhou

    Abstract: Instruction tuning is critical to improve LLMs but usually suffers from low-quality and redundant data. Data filtering for instruction tuning has proved important in improving both the efficiency and performance of the tuning process. But it also leads to extra cost and computation due to the involvement of LLMs in this process. To reduce the filtering cost, we study Superfiltering: Can we use a s… ▽ More

    Submitted 7 June, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: ACL2024 main, Camera-ready

  15. arXiv:2402.00268  [pdf, other

    hep-th cond-mat.stat-mech quant-ph

    Relation between timelike and spacelike entanglement entropy

    Authors: Wu-zhong Guo, Song He, Yu-Xuan Zhang

    Abstract: In this study, we establish a connection between timelike and spacelike entanglement entropy. Specifically, for a diverse range of states, the timelike entanglement entropy is uniquely determined by a linear combination of the spacelike entanglement entropy and its first-order temporal derivative. This framework reveals that the imaginary component of the timelike entanglement entropy primarily or… ▽ More

    Submitted 31 January, 2024; originally announced February 2024.

    Comments: 5+8 pages, 1 figure

  16. arXiv:2401.15852  [pdf, ps, other

    math.AG math.CV

    The Spectral base and quotients of bounded symmetric domains

    Authors: Siqi He, Jie Liu, Ngaiming Mok

    Abstract: In this article, we explore Higgs bundles on a projective manifold $X$, focusing on their spectral bases, a concept introduced by T.Chen and B.Ngô. The spectral base is a specific closed subscheme within the space of symmetric differentials. We observe that if the spectral base vanishes, then any reductive representation $ρ: π_1(X) \to \text{GL}_r(\mathbb{C})$ is both rigid and integral. Additiona… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

    Comments: 21 pages

    MSC Class: 14J60; 53C35

  17. arXiv:2401.15123  [pdf, other

    cs.LG cs.AI

    Large Language Model Guided Knowledge Distillation for Time Series Anomaly Detection

    Authors: Chen Liu, Shibo He, Qihang Zhou, Shizhong Li, Wenchao Meng

    Abstract: Self-supervised methods have gained prominence in time series anomaly detection due to the scarcity of available annotations. Nevertheless, they typically demand extensive training data to acquire a generalizable representation map, which conflicts with scenarios of a few available samples, thereby limiting their performance. To overcome the limitation, we propose \textbf{AnomalyLLM}, a knowledge… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: 12 pages, 5 figures

  18. arXiv:2401.13714  [pdf, other

    cs.CV cs.LG

    Value-Driven Mixed-Precision Quantization for Patch-Based Inference on Microcontrollers

    Authors: Wei Tao, Shenglin He, Kai Lu, Xiaoyang Qu, Guokuan Li, Jiguang Wan, Jianzong Wang, Jing Xiao

    Abstract: Deploying neural networks on microcontroller units (MCUs) presents substantial challenges due to their constrained computation and memory resources. Previous researches have explored patch-based inference as a strategy to conserve memory without sacrificing model accuracy. However, this technique suffers from severe redundant computation overhead, leading to a substantial increase in execution lat… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: Accepted by the 27th Design, Automation and Test in Europe Conference (DATE 2024)

  19. arXiv:2401.11235  [pdf, other

    cs.LG cs.AI

    TreeMIL: A Multi-instance Learning Framework for Time Series Anomaly Detection with Inexact Supervision

    Authors: Chen Liu, Shibo He, Haoyu Liu, Shizhong Li

    Abstract: Time series anomaly detection (TSAD) plays a vital role in various domains such as healthcare, networks, and industry. Considering labels are crucial for detection but difficult to obtain, we turn to TSAD with inexact supervision: only series-level labels are provided during the training phase, while point-level anomalies are predicted during the testing phase. Previous works follow a traditional… ▽ More

    Submitted 20 January, 2024; originally announced January 2024.

    Comments: This paper has been accepted by IEEE ICASSP 2024

  20. Irrelevant and marginal deformed BMS field theories

    Authors: Song He, Xin-Cheng Mao

    Abstract: In this study, we investigate various deformations within the framework of Bondi-van der Burg-Metzner-Sachs invariant field theory (BMSFT). Specifically, we explore the impact of Bondi-van der Burg-Metzner-Sachs (BMS) symmetry on the theory by introducing key deformations, namely, $T \overline{T}$, $JT_μ$, and $\sqrt{T \overline{T}}$ deformations. In the context of generic seed theories possessing… ▽ More

    Submitted 27 March, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: 54 pages, 0 figure

    Journal ref: JHEP 04 (2024) 138

  21. arXiv:2401.05483  [pdf, other

    hep-th hep-ph

    NLSM $\subset$ Tr$(φ^3)$

    Authors: Nima Arkani-Hamed, Qu Cao, Jin Dong, Carolina Figueiredo, Song He

    Abstract: Scattering amplitudes for the simplest theory of colored scalar particles - the Tr($Φ^3$) theory - have recently been the subject of active investigations. In this letter we describe an unanticipated wider implication of this work: the Tr($Φ^3$) theory secretly contains Non-linear Sigma Model (NLSM) amplitudes to all loop orders. The NLSM amplitudes are obtained from Tr$(Φ^3)$ amplitudes by a uniq… ▽ More

    Submitted 15 April, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: 10 pages, 13 figures. Addition of new material including a derivation of the results from a simple Lagrangian, identifying the symmetry breaking pattern, as well as some further discussions

  22. arXiv:2401.04723  [pdf, other

    stat.ME

    Spatio-temporal data fusion for the analysis of in situ and remote sensing data using the INLA-SPDE approach

    Authors: Shiyu He, Samuel W. K. Wong

    Abstract: We propose a Bayesian hierarchical model to address the challenge of spatial misalignment in spatio-temporal data obtained from in situ and satellite sources. The model is fit using the INLA-SPDE approach, which provides efficient computation. Our methodology combines the different data sources in a "fusion"" model via the construction of projection matrices in both spatial and temporal domains. T… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

    Comments: 23 pages, 7 figures

  23. arXiv:2401.02880  [pdf, other

    cs.CR

    Lotto: Secure Participant Selection against Adversarial Servers in Federated Learning

    Authors: Zhifeng Jiang, Peng Ye, Shiqi He, Wei Wang, Ruichuan Chen, Bo Li

    Abstract: In Federated Learning (FL), common privacy-enhancing techniques, such as secure aggregation and distributed differential privacy, rely on the critical assumption of an honest majority among participants to withstand various attacks. In practice, however, servers are not always trusted, and an adversarial server can strategically select compromised clients to create a dishonest majority, thereby un… ▽ More

    Submitted 6 March, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    Comments: This article has been accepted to USENIX Security '24

  24. arXiv:2401.01718  [pdf

    physics.plasm-ph physics.atom-ph physics.comp-ph physics.flu-dyn physics.optics

    RHDLPP: A multigroup radiation hydrodynamics code for laser-produced plasmas

    Authors: Qi Min, Ziyang Xu, Siqi He, Haidong Lu, Xingbang Liu, Ruizi Shen, Yanhong Wu, Qikun Pan, Chongxiao Zhao, Fei Chen, Maogen Su, Chenzhong Dong

    Abstract: We introduce the RHDLPP, a flux-limited multigroup radiation hydrodynamics numerical code designed for simulating laser-produced plasmas in diverse environments. The code bifurcates into two packages: RHDLPP-LTP for low-temperature plasmas generated by moderate-intensity nanosecond lasers, and RHDLPP-HTP for high-temperature, high-density plasmas formed by high-intensity laser pulses. The core rad… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

  25. arXiv:2401.00667  [pdf, other

    stat.ME stat.CO

    Channelling Multimodality Through a Unimodalizing Transport: Warp-U Sampler and Stochastic Bridge Sampling

    Authors: Fei Ding, David E. Jones, Shiyuan He, Xiao-Li Meng

    Abstract: Monte Carlo integration is fundamental in scientific and statistical computation, but requires reliable samples from the target distribution, which poses a substantial challenge in the case of multi-modal distributions. Existing methods often involve time-consuming tuning, and typically lack tailored estimators for efficient use of the samples. This paper adapts the Warp-U transformation [Wang et… ▽ More

    Submitted 1 January, 2024; originally announced January 2024.

  26. arXiv:2401.00041  [pdf, other

    hep-th hep-ph

    Scalar-Scaffolded Gluons and the Combinatorial Origins of Yang-Mills Theory

    Authors: Nima Arkani-Hamed, Qu Cao, Jin Dong, Carolina Figueiredo, Song He

    Abstract: We present a new formulation for Yang-Mills scattering amplitudes in any number of dimensions and at any loop order, based on the same combinatorial and binary-geometric ideas in kinematic space recently used to give an all-order description of Tr $φ^3$ theory. We propose that in a precise sense the amplitudes for a suitably "stringy" form of these two theories are identical, up to a simple shift… ▽ More

    Submitted 29 December, 2023; originally announced January 2024.

    Comments: 92 pages, 37 figures

  27. arXiv:2312.17591  [pdf, other

    cs.CL

    Towards Faithful Explanations for Text Classification with Robustness Improvement and Explanation Guided Training

    Authors: Dongfang Li, Baotian Hu, Qingcai Chen, Shan He

    Abstract: Feature attribution methods highlight the important input tokens as explanations to model predictions, which have been widely applied to deep neural networks towards trustworthy AI. However, recent works show that explanations provided by these methods face challenges of being faithful and robust. In this paper, we propose a method with Robustness improvement and Explanation Guided training toward… ▽ More

    Submitted 29 December, 2023; originally announced December 2023.

  28. arXiv:2312.16282  [pdf, other

    hep-th hep-ph

    Hidden zeros for particle/string amplitudes and the unity of colored scalars, pions and gluons

    Authors: Nima Arkani-Hamed, Qu Cao, Jin Dong, Carolina Figueiredo, Song He

    Abstract: Recent years have seen the emergence of a new understanding of scattering amplitudes in the simplest theory of colored scalar particles - the Tr$(φ^3)$ theory - based on combinatorial and geometric ideas in the kinematic space of scattering data. In this paper we report a surprise: far from the toy model it appears to be, the ''stringy'' Tr$(φ^3)$ amplitudes secretly contain the scattering amplitu… ▽ More

    Submitted 1 May, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

    Comments: Added reference to early work of Gliozzi et. al. giving a different derivation of zeros for string amplitudes from monodromy relations, corrected typos

  29. arXiv:2312.16218  [pdf, other

    cs.CV

    Hyper-VolTran: Fast and Generalizable One-Shot Image to 3D Object Structure via HyperNetworks

    Authors: Christian Simon, Sen He, Juan-Manuel Perez-Rua, Mengmeng Xu, Amine Benhalloum, Tao Xiang

    Abstract: Solving image-to-3D from a single view is an ill-posed problem, and current neural reconstruction methods addressing it through diffusion models still rely on scene-specific optimization, constraining their generalization capability. To overcome the limitations of existing approaches regarding generalization and consistency, we introduce a novel neural rendering technique. Our approach employs the… ▽ More

    Submitted 5 January, 2024; v1 submitted 24 December, 2023; originally announced December 2023.

  30. arXiv:2312.15633  [pdf, other

    cs.CV eess.IV

    MuLA-GAN: Multi-Level Attention GAN for Enhanced Underwater Visibility

    Authors: Ahsan Baidar Bakht, Zikai Jia, Muhayy ud Din, Waseem Akram, Lyes Saad Soud, Lakmal Seneviratne, Defu Lin, Shaoming He, Irfan Hussain

    Abstract: The underwater environment presents unique challenges, including color distortions, reduced contrast, and blurriness, hindering accurate analysis. In this work, we introduce MuLA-GAN, a novel approach that leverages the synergistic power of Generative Adversarial Networks (GANs) and Multi-Level Attention mechanisms for comprehensive underwater image enhancement. The integration of Multi-Level Atte… ▽ More

    Submitted 25 December, 2023; originally announced December 2023.

  31. arXiv:2312.15484  [pdf, other

    hep-th

    On constructibility of AdS supergluon amplitudes

    Authors: Qu Cao, Song He, Yichao Tang

    Abstract: We prove that all tree-level $n$-point supergluon (scalar) amplitudes in AdS$_5$ can be recursively constructed, using factorization and flat-space limit. Our method is greatly facilitated by a natural R-symmetry basis for planar color-ordered amplitudes, which reduces the latter to "partial amplitudes" with simpler pole structures and factorization properties. Given the $n$-point scalar amplitude… ▽ More

    Submitted 14 January, 2024; v1 submitted 24 December, 2023; originally announced December 2023.

    Comments: 5 pages, 4 figures, major revision from v2 including new ancillary file

  32. arXiv:2312.13875  [pdf, other

    stat.ML cs.LG stat.ME

    Best Arm Identification in Batched Multi-armed Bandit Problems

    Authors: Shengyu Cao, Simai He, Ruoqing Jiang, Jin Xu, Hongsong Yuan

    Abstract: Recently multi-armed bandit problem arises in many real-life scenarios where arms must be sampled in batches, due to limited time the agent can wait for the feedback. Such applications include biological experimentation and online marketing. The problem is further complicated when the number of arms is large and the number of batches is small. We consider pure exploration in a batched multi-armed… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

  33. arXiv:2312.11988  [pdf, other

    cs.SE cs.AI cs.PL

    Xpert: Empowering Incident Management with Query Recommendations via Large Language Models

    Authors: Yuxuan Jiang, Chaoyun Zhang, Shilin He, Zhihao Yang, Minghua Ma, Si Qin, Yu Kang, Yingnong Dang, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang

    Abstract: Large-scale cloud systems play a pivotal role in modern IT infrastructure. However, incidents occurring within these systems can lead to service disruptions and adversely affect user experience. To swiftly resolve such incidents, on-call engineers depend on crafting domain-specific language (DSL) queries to analyze telemetry data. However, writing these queries can be challenging and time-consumin… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: Accepted as a reseach paper at ICSE 2024

  34. arXiv:2312.11549  [pdf, other

    cs.LG

    Label-Free Multivariate Time Series Anomaly Detection

    Authors: Qihang Zhou, Shibo He, Haoyu Liu, Jiming Chen, Wenchao Meng

    Abstract: Anomaly detection in multivariate time series (MTS) has been widely studied in one-class classification (OCC) setting. The training samples in OCC are assumed to be normal, which is difficult to guarantee in practical situations. Such a case may degrade the performance of OCC-based anomaly detection methods which fit the training distribution as the normal distribution. In this paper, we propose M… ▽ More

    Submitted 6 February, 2024; v1 submitted 16 December, 2023; originally announced December 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2208.02108

  35. arXiv:2312.10979  [pdf, ps, other

    cs.SD eess.AS

    3S-TSE: Efficient Three-Stage Target Speaker Extraction for Real-Time and Low-Resource Applications

    Authors: Shulin He, Jinjiang liu, Hao Li, Yang Yang, Fei Chen, Xueliang Zhang

    Abstract: Target speaker extraction (TSE) aims to isolate a specific voice from multiple mixed speakers relying on a registerd sample. Since voiceprint features usually vary greatly, current end-to-end neural networks require large model parameters which are computational intensive and impractical for real-time applications, espetially on resource-constrained platforms. In this paper, we address the TSE tas… ▽ More

    Submitted 4 January, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: Accepted to ICASSP 2024

  36. arXiv:2312.09716  [pdf, other

    cs.CV

    Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval

    Authors: Zhe Ma, Jianfeng Dong, Shouling Ji, Zhenguang Liu, Xuhong Zhang, Zonghui Wang, Sifeng He, Feng Qian, Xiaobo Zhang, Lei Yang

    Abstract: Visual retrieval aims to search for the most relevant visual items, e.g., images and videos, from a candidate gallery with a given query item. Accuracy and efficiency are two competing objectives in retrieval tasks. Instead of crafting a new method pursuing further improvement on accuracy, in this paper we propose a multi-teacher distillation framework Whiten-MTD, which is able to transfer knowled… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI 2024

  37. CAT: A Causally Graph Attention Network for Trimming Heterophilic Graph

    Authors: Silu He, Qinyao Luo, Xinsha Fu, Ling Zhao, Ronghua Du, Haifeng Li

    Abstract: Local Attention-guided Message Passing Mechanism (LAMP) adopted in Graph Attention Networks (GATs) is designed to adaptively learn the importance of neighboring nodes for better local aggregation on the graph, which can bring the representations of similar neighbors closer effectively, thus showing stronger discrimination ability. However, existing GATs suffer from a significant discrimination abi… ▽ More

    Submitted 17 June, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: 25 pages, 18 figures, 5 tables

    Journal ref: Information Science 2024

  38. arXiv:2312.05062  [pdf, ps, other

    eess.IV

    Deep Learning Enabled Semantic Communication Systems for Video Transmission

    Authors: Zhenguo Zhang, Qianqian Yang, Shibo He, Jiming Chen

    Abstract: Semantic communication has emerged as a promising approach for improving efficient transmission in the next generation of wireless networks. Inspired by the success of semantic communication in different areas, we aim to provide a new semantic communication scheme from the semantic level. In this paper, we propose a novel DL-based semantic communication system for video transmission, which compact… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  39. arXiv:2312.04557  [pdf, other

    cs.CV

    GenTron: Diffusion Transformers for Image and Video Generation

    Authors: Shoufa Chen, Mengmeng Xu, Jiawei Ren, Yuren Cong, Sen He, Yanping Xie, Animesh Sinha, Ping Luo, Tao Xiang, Juan-Manuel Perez-Rua

    Abstract: In this study, we explore Transformer-based diffusion models for image and video generation. Despite the dominance of Transformer architectures in various fields due to their flexibility and scalability, the visual generative domain primarily utilizes CNN-based U-Net architectures, particularly in diffusion-based models. We introduce GenTron, a family of Generative models employing Transformer-bas… ▽ More

    Submitted 2 June, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: CVPR2024 Camera Ready. Website: https://www.shoufachen.com/gentron_website/

  40. arXiv:2312.02679  [pdf, other

    hep-th

    Entanglement and Pseudo Entanglement Dynamics versus Fusion in CFT

    Authors: Song He, Yu-Xuan Zhang, Long Zhao, Zi-Xuan Zhao

    Abstract: The fusion rules and operator product expansion (OPE) serve as crucial tools in the study of operator algebras within conformal field theory (CFT). Building upon the vision of using entanglement to explore the connections between fusion coefficients and OPE coefficients, we employ the replica method and Schmidt decomposition method to investigate the time evolution of entanglement entropy (EE) and… ▽ More

    Submitted 29 June, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: 29 pages, 4 figures, published version

  41. arXiv:2311.17541  [pdf, other

    cs.AI

    TaskWeaver: A Code-First Agent Framework

    Authors: Bo Qiao, Liqun Li, Xu Zhang, Shilin He, Yu Kang, Chaoyun Zhang, Fangkai Yang, Hang Dong, Jue Zhang, Lu Wang, Minghua Ma, Pu Zhao, Si Qin, Xiaoting Qin, Chao Du, Yong Xu, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang

    Abstract: Large Language Models (LLMs) have shown impressive abilities in natural language understanding and generation, leading to their widespread use in applications such as chatbots and virtual assistants. However, existing LLM frameworks face limitations in handling domain-specific data analytics tasks with rich data structures. Moreover, they struggle with flexibility to meet diverse user requirements… ▽ More

    Submitted 19 June, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

  42. arXiv:2311.14008  [pdf, ps, other

    math.DG

    A note on rational homology vanishing theorem for hypersurfaces in aspherical manifolds

    Authors: Shihang He, Jintian Zhu

    Abstract: In this note, we generalize Gromov's reduction \cite{Gro20} from the aspherical conjecture to the generalized filling radius conjecture to the smooth $\mathbb Q$-homology vanishing conjecture for hypersurface. In particular, we can show that any continuous map from a closed $4$-manifold admitting positive scalar curvature to an aspherical $5$-manifold induces zero map in $H_4(\cdot,\mathbb Q)$. As… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: 11 pages, all comments are welcome

  43. arXiv:2311.13535  [pdf, other

    cs.CV

    DiffusionMat: Alpha Matting as Sequential Refinement Learning

    Authors: Yangyang Xu, Shengfeng He, Wenqi Shao, Kwan-Yee K. Wong, Yu Qiao, Ping Luo

    Abstract: In this paper, we introduce DiffusionMat, a novel image matting framework that employs a diffusion model for the transition from coarse to refined alpha mattes. Diverging from conventional methods that utilize trimaps merely as loose guidance for alpha matte prediction, our approach treats image matting as a sequential refinement learning process. This process begins with the addition of noise to… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

  44. arXiv:2311.13381  [pdf, other

    cs.LG cs.AI cs.DC

    Confidant: Customizing Transformer-based LLMs via Collaborative Edge Training

    Authors: Yuhao Chen, Yuxuan Yan, Qianqian Yang, Yuanchao Shu, Shibo He, Jiming Chen

    Abstract: Transformer-based large language models (LLMs) have demonstrated impressive capabilities in a variety of natural language processing (NLP) tasks. Nonetheless, it is challenging to deploy and fine-tune LLMs on mobile edge devices with limited computing, memory, and energy budgets. In this paper, we propose Confidant, a multi-backend collaborative training framework for customizing state-of-the-art… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

    Comments: 6 pages, 7 figures; Submitted to HotMobile 2024

  45. arXiv:2311.11669  [pdf, other

    cs.CV

    PMP-Swin: Multi-Scale Patch Message Passing Swin Transformer for Retinal Disease Classification

    Authors: Zhihan Yang, Zhiming Cheng, Tengjin Weng, Shucheng He, Yaqi Wang, Xin Ye, Shuai Wang

    Abstract: Retinal disease is one of the primary causes of visual impairment, and early diagnosis is essential for preventing further deterioration. Nowadays, many works have explored Transformers for diagnosing diseases due to their strong visual representation capabilities. However, retinal diseases exhibit milder forms and often present with overlapping signs, which pose great difficulties for accurate mu… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: 9 pages, 7 figures

  46. arXiv:2311.09861  [pdf, other

    cs.CL cs.AI

    ConceptPsy:A Benchmark Suite with Conceptual Comprehensiveness in Psychology

    Authors: Junlei Zhang, Hongliang He, Nirui Song, Zhanchao Zhou, Shuyuan He, Shuai Zhang, Huachuan Qiu, Anqi Li, Yong Dai, Lizhi Ma, Zhenzhong Lan

    Abstract: The critical field of psychology necessitates a comprehensive benchmark to enhance the evaluation and development of domain-specific Large Language Models (LLMs). Existing MMLU-type benchmarks, such as C-EVAL and CMMLU, include psychology-related subjects, but their limited number of questions and lack of systematic concept sampling strategies mean they cannot cover the concepts required in psycho… ▽ More

    Submitted 16 June, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Under Review

  47. arXiv:2311.09636  [pdf, ps, other

    hep-th gr-qc

    Holographic torus correlators in $\text{AdS}_3$ gravity coupled to scalar field

    Authors: Song He, Yun-Ze Li, Yunda Zhang

    Abstract: This paper investigates holographic torus correlators of generic operators at conformal infinity and a finite cutoff within AdS$_3$ gravity coupled with a free scalar field. Using a near-boundary analysis and solving the gravitational boundary value problem, we solve Einstein's equation and calculate mixed correlators for massless and massive coupled scalar fields. The conformal ward identity on t… ▽ More

    Submitted 24 May, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: 51 pages. Match the published version

  48. arXiv:2311.07514  [pdf, other

    cs.CV

    VGSG: Vision-Guided Semantic-Group Network for Text-based Person Search

    Authors: Shuting He, Hao Luo, Wei Jiang, Xudong Jiang, Henghui Ding

    Abstract: Text-based Person Search (TBPS) aims to retrieve images of target pedestrian indicated by textual descriptions. It is essential for TBPS to extract fine-grained local features and align them crossing modality. Existing methods utilize external tools or heavy cross-modal interaction to achieve explicit alignment of cross-modal fine-grained features, which is inefficient and time-consuming. In this… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: Accepted to IEEE TIP

  49. arXiv:2311.07039  [pdf, other

    eess.SY

    Time-Optimal Control for High-Order Chain-of-Integrators Systems with Full State Constraints and Arbitrary Terminal States (Extended Version)

    Authors: Yunan Wang, Chuxiong Hu, Zeyang Li, Shize Lin, Suqin He, Yu Zhu

    Abstract: Time-optimal control for high-order chain-of-integrators systems with full state constraints and arbitrarily given terminal states remains a challenging problem in the optimal control theory domain, yet to be resolved. To enhance further comprehension of the problem, this paper establishes a novel notation system and theoretical framework, providing the switching manifold for high-order problems i… ▽ More

    Submitted 28 March, 2024; v1 submitted 12 November, 2023; originally announced November 2023.

  50. arXiv:2311.07032  [pdf, other

    cs.CL cs.AI

    ExpNote: Black-box Large Language Models are Better Task Solvers with Experience Notebook

    Authors: Wangtao Sun, Xuanqing Yu, Shizhu He, Jun Zhao, Kang Liu

    Abstract: Black-box Large Language Models (LLMs) have shown great power in solving various tasks and are considered general problem solvers. However, LLMs still fail in many specific tasks although understand the task instruction. In this paper, we focus on the problem of boosting the ability of black-box LLMs to solve downstream tasks. We propose ExpNote, an automated framework to help LLMs better adapt to… ▽ More

    Submitted 12 November, 2023; originally announced November 2023.

    Comments: EMNLP 2023 findings