Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 224 results for author: Cui, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.07400  [pdf, other

    cs.LG cs.LO

    Guiding LLM Temporal Logic Generation with Explicit Separation of Data and Control

    Authors: William Murphy, Nikolaus Holzer, Nathan Koenig, Leyi Cui, Raven Rothkopf, Feitong Qiao, Mark Santolucito

    Abstract: Temporal logics are powerful tools that are widely used for the synthesis and verification of reactive systems. The recent progress on Large Language Models (LLMs) has the potential to make the process of writing such specifications more accessible. However, writing specifications in temporal logics remains challenging for all but the most expert users. A key question in using LLMs for temporal lo… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  2. arXiv:2406.02027  [pdf, other

    cs.LG cs.AI cs.CR cs.CV

    Inference Attacks in Machine Learning as a Service: A Taxonomy, Review, and Promising Directions

    Authors: Feng Wu, Lei Cui, Shaowen Yao, Shui Yu

    Abstract: The prosperity of machine learning has also brought people's concerns about data privacy. Among them, inference attacks can implement privacy breaches in various MLaaS scenarios and model training/prediction phases. Specifically, inference attacks can perform privacy inference on undisclosed target training sets based on outputs of the target model, including but not limited to statistics, members… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  3. arXiv:2405.14742  [pdf, other

    cs.LG cs.AI

    HC-GAE: The Hierarchical Cluster-based Graph Auto-Encoder for Graph Representation Learning

    Authors: Zhuo Xu, Lu Bai, Lixin Cui, Ming Li, Yue Wang, Edwin R. Hancock

    Abstract: Graph Auto-Encoders (GAEs) are powerful tools for graph representation learning. In this paper, we develop a novel Hierarchical Cluster-based GAE (HC-GAE), that can learn effective structural characteristics for graph data analysis. To this end, during the encoding process, we commence by utilizing the hard node assignment to decompose a sample graph into a family of separated subgraphs. We compre… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  4. arXiv:2405.12689  [pdf, other

    cs.CL cs.AI

    Spotting AI's Touch: Identifying LLM-Paraphrased Spans in Text

    Authors: Yafu Li, Zhilin Wang, Leyang Cui, Wei Bi, Shuming Shi, Yue Zhang

    Abstract: AI-generated text detection has attracted increasing attention as powerful language models approach human-level generation. Limited work is devoted to detecting (partially) AI-paraphrased texts. However, AI paraphrasing is commonly employed in various application scenarios for text refinement and diversity. To this end, we propose a novel detection framework, paraphrased text span detection (PTD),… ▽ More

    Submitted 29 May, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

    Comments: ACL 2024 Findings

  5. arXiv:2405.10218  [pdf, other

    cs.LG cs.AI

    ENADPool: The Edge-Node Attention-based Differentiable Pooling for Graph Neural Networks

    Authors: Zhehan Zhao, Lu Bai, Lixin Cui, Ming Li, Yue Wang, Lixiang Xu, Edwin R. Hancock

    Abstract: Graph Neural Networks (GNNs) are powerful tools for graph classification. One important operation for GNNs is the downsampling or pooling that can learn effective embeddings from the node representations. In this paper, we propose a new hierarchical pooling operation, namely the Edge-Node Attention-based Differentiable Pooling (ENADPool), for GNNs to learn effective graph representations. Unlike t… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  6. arXiv:2405.08054  [pdf, other

    cs.GR cs.CV

    Coin3D: Controllable and Interactive 3D Assets Generation with Proxy-Guided Conditioning

    Authors: Wenqi Dong, Bangbang Yang, Lin Ma, Xiao Liu, Liyuan Cui, Hujun Bao, Yuewen Ma, Zhaopeng Cui

    Abstract: As humans, we aspire to create media content that is both freely willed and readily controlled. Thanks to the prominent development of generative techniques, we now can easily utilize 2D diffusion methods to synthesize images controlled by raw sketch or designated human poses, and even progressively edit/regenerate local regions with masked inpainting. However, similar workflows in 3D modeling tas… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: Project webpage: https://zju3dv.github.io/coin3d

  7. arXiv:2405.02008  [pdf, other

    cs.CV

    DiffMap: Enhancing Map Segmentation with Map Prior Using Diffusion Model

    Authors: Peijin Jia, Tuopu Wen, Ziang Luo, Mengmeng Yang, Kun Jiang, Zhiquan Lei, Xuewei Tang, Ziyuan Liu, Le Cui, Kehua Sheng, Bo Zhang, Diange Yang

    Abstract: Constructing high-definition (HD) maps is a crucial requirement for enabling autonomous driving. In recent years, several map segmentation algorithms have been developed to address this need, leveraging advancements in Bird's-Eye View (BEV) perception. However, existing models still encounter challenges in producing realistic and consistent semantic map layouts. One prominent issue is the limited… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  8. arXiv:2404.03622  [pdf, other

    cs.CL

    Mind's Eye of LLMs: Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models

    Authors: Wenshan Wu, Shaoguang Mao, Yadong Zhang, Yan Xia, Li Dong, Lei Cui, Furu Wei

    Abstract: Large language models (LLMs) have exhibited impressive performance in language comprehension and various reasoning tasks. However, their abilities in spatial reasoning, a crucial aspect of human cognition, remain relatively unexplored. Human possess a remarkable ability to create mental images of unseen objects and actions through a process known as the Mind's Eye, enabling the imagination of the… ▽ More

    Submitted 24 May, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

  9. A Change of Scenery: Transformative Insights from Retrospective VR Embodied Perspective-Taking of Conflict With a Close Other

    Authors: Seraphina Yong, Leo Cui, Evan Suma Rosenberg, Svetlana Yarosh

    Abstract: Close relationships are irreplaceable social resources, yet prone to high-risk conflict. Building on findings from the fields of HCI, virtual reality, and behavioral therapy, we evaluate the unexplored potential of retrospective VR-embodied perspective-taking to fundamentally influence conflict resolution in close others. We develop a biographically-accurate Retrospective Embodied Perspective-Taki… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 18 pages, 5 figures, Proceedings of the 2024 CHI Conference on Human Factors in Computing Systems

  10. arXiv:2403.18479  [pdf, other

    cs.IR

    Lightweight Embeddings for Graph Collaborative Filtering

    Authors: Xurong Liang, Tong Chen, Lizhen Cui, Yang Wang, Meng Wang, Hongzhi Yin

    Abstract: Graph neural networks (GNNs) are currently one of the most performant collaborative filtering methods. Meanwhile, owing to the use of an embedding table to represent each user/item as a distinct vector, GNN-based recommenders have inherited the long-standing defect of parameter inefficiency. As a common practice for scalable embeddings, parameter sharing enables the use of fewer embedding vectors… ▽ More

    Submitted 28 March, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: Accepted by SIGIR '24

  11. arXiv:2403.18249  [pdf, other

    cs.CL cs.SI

    Exploring the Deceptive Power of LLM-Generated Fake News: A Study of Real-World Detection Challenges

    Authors: Yanshen Sun, Jianfeng He, Limeng Cui, Shuo Lei, Chang-Tien Lu

    Abstract: Recent advancements in Large Language Models (LLMs) have enabled the creation of fake news, particularly in complex fields like healthcare. Studies highlight the gap in the deceptive power of LLM-generated fake news with and without human assistance, yet the potential of prompting techniques has not been fully explored. Thus, this work aims to determine whether prompting strategies can effectively… ▽ More

    Submitted 8 April, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

  12. arXiv:2403.16443  [pdf, other

    cs.CL cs.AI cs.SE

    CodeS: Natural Language to Code Repository via Multi-Layer Sketch

    Authors: Daoguang Zan, Ailun Yu, Wei Liu, Dong Chen, Bo Shen, Wei Li, Yafen Yao, Yongshun Gong, Xiaolin Chen, Bei Guan, Zhiguang Yang, Yongji Wang, Qianxiang Wang, Lizhen Cui

    Abstract: The impressive performance of large language models (LLMs) on code-related tasks has shown the potential of fully automated software development. In light of this, we introduce a new software engineering task, namely Natural Language to code Repository (NL2Repo). This task aims to generate an entire code repository from its natural language requirements. To address this task, we propose a simple y… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: https://github.com/NL2Code/CodeS

  13. arXiv:2403.16227  [pdf, other

    cs.CV

    Dual-modal Prior Semantic Guided Infrared and Visible Image Fusion for Intelligent Transportation System

    Authors: Jing Li, Lu Bai, Bin Yang, Chang Li, Lingfei Ma, Lixin Cui, Edwin R. Hancock

    Abstract: Infrared and visible image fusion (IVF) plays an important role in intelligent transportation system (ITS). The early works predominantly focus on boosting the visual appeal of the fused result, and only several recent approaches have tried to combine the high-level vision task with IVF. However, they prioritize the design of cascaded structure to seek unified suitable features and fit different t… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  14. arXiv:2403.16133  [pdf, other

    cs.AI cs.LG

    SSHPool: The Separated Subgraph-based Hierarchical Pooling

    Authors: Zhuo Xu, Lixin Cui, Yue Wang, Hangyuan Du, Lu Bai, Edwin R. Hancock

    Abstract: In this paper, we develop a novel local graph pooling method, namely the Separated Subgraph-based Hierarchical Pooling (SSHPool), for graph classification. To this end, we commence by assigning the nodes of a sample graph into different clusters, resulting in a family of separated subgraphs. We individually employ a local graph convolution units as the local structure to further compress each subg… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  15. arXiv:2403.16130  [pdf, other

    cs.LG cs.AI

    AKBR: Learning Adaptive Kernel-based Representations for Graph Classification

    Authors: Feifei Qian, Lixin Cui, Yue Wang, Hangyuan Du, Lu Bai, Edwin R. Hancock

    Abstract: In this paper, we propose a new model to learn Adaptive Kernel-based Representations (AKBR) for graph classification. Unlike state-of-the-art R-convolution graph kernels that are defined by merely counting any pair of isomorphic substructures between graphs and cannot provide an end-to-end learning mechanism for the classifier, the proposed AKBR approach aims to define an end-to-end representation… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  16. arXiv:2403.06021  [pdf, other

    cs.IR cs.LG

    Hierarchical Query Classification in E-commerce Search

    Authors: Bing He, Sreyashi Nag, Limeng Cui, Suhang Wang, Zheng Li, Rahul Goutam, Zhen Li, Haiyang Zhang

    Abstract: E-commerce platforms typically store and structure product information and search data in a hierarchy. Efficiently categorizing user search queries into a similar hierarchical structure is paramount in enhancing user experience on e-commerce platforms as well as news curation and academic research. The significance of this task is amplified when dealing with sensitive query categorization or criti… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: Published at: the ACM Web Conference 2024 in the industry track (WWW'24)

  17. arXiv:2403.02693  [pdf, other

    cs.MM eess.IV

    Optimizing Mobile-Friendly Viewport Prediction for Live 360-Degree Video Streaming

    Authors: Lei Zhang, Tao Long, Weizhen Xu, Laizhong Cui, Jiangchuan Liu

    Abstract: Viewport prediction is the crucial task for adaptive 360-degree video streaming, as the bitrate control algorithms usually require the knowledge of the user's viewing portions of the frames. Various methods are studied and adopted for viewport prediction from less accurate statistic tools to highly calibrated deep neural networks. Conventionally, it is difficult to implement sophisticated deep lea… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 14 pages

  18. arXiv:2403.01244  [pdf, other

    cs.CL cs.AI

    Mitigating Catastrophic Forgetting in Large Language Models with Self-Synthesized Rehearsal

    Authors: Jianheng Huang, Leyang Cui, Ante Wang, Chengyi Yang, Xinting Liao, Linfeng Song, Junfeng Yao, Jinsong Su

    Abstract: Large language models (LLMs) suffer from catastrophic forgetting during continual learning. Conventional rehearsal-based methods rely on previous training data to retain the model's ability, which may not be feasible in real-world applications. When conducting continual learning based on a publicly-released LLM checkpoint, the availability of the original training data may be non-existent. To addr… ▽ More

    Submitted 25 May, 2024; v1 submitted 2 March, 2024; originally announced March 2024.

    Comments: ACL 2024 main, long paper

  19. arXiv:2402.19255  [pdf, other

    cs.CL

    GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers

    Authors: Qintong Li, Leyang Cui, Xueliang Zhao, Lingpeng Kong, Wei Bi

    Abstract: Large language models (LLMs) have achieved impressive performance across various mathematical reasoning benchmarks. However, there are increasing debates regarding whether these models truly understand and apply mathematical knowledge or merely rely on shortcuts for mathematical reasoning. One essential and frequently occurring evidence is that when the math questions are slightly changed, LLMs ca… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

  20. arXiv:2402.17532  [pdf, other

    cs.CL

    Retrieval is Accurate Generation

    Authors: Bowen Cao, Deng Cai, Leyang Cui, Xuxin Cheng, Wei Bi, Yuexian Zou, Shuming Shi

    Abstract: Standard language models generate text by selecting tokens from a fixed, finite, and standalone vocabulary. We introduce a novel method that selects context-aware phrases from a collection of supporting documents. One of the most significant challenges for this paradigm shift is determining the training oracles, because a string of text can be segmented in various ways and each segment can be retr… ▽ More

    Submitted 16 March, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: ICLR 2024

  21. arXiv:2402.16978  [pdf, other

    math.OC cs.LG

    An inexact Bregman proximal point method and its acceleration version for unbalanced optimal transport

    Authors: Xiang Chen, Faqiang Wang, Jun Liu, Li Cui

    Abstract: The Unbalanced Optimal Transport (UOT) problem plays increasingly important roles in computational biology, computational imaging and deep learning. Scaling algorithm is widely used to solve UOT due to its convenience and good convergence properties. However, this algorithm has lower accuracy for large regularization parameters, and due to stability issues, small regularization parameters can easi… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  22. arXiv:2402.15865  [pdf, other

    cs.CV eess.IV

    HIR-Diff: Unsupervised Hyperspectral Image Restoration Via Improved Diffusion Models

    Authors: Li Pang, Xiangyu Rui, Long Cui, Hongzhong Wang, Deyu Meng, Xiangyong Cao

    Abstract: Hyperspectral image (HSI) restoration aims at recovering clean images from degraded observations and plays a vital role in downstream tasks. Existing model-based methods have limitations in accurately modeling the complex image characteristics with handcraft priors, and deep learning-based methods suffer from poor generalization ability. To alleviate these issues, this paper proposes an unsupervis… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

  23. arXiv:2402.03815  [pdf, other

    cs.LG

    Expediting In-Network Federated Learning by Voting-Based Consensus Model Compression

    Authors: Xiaoxin Su, Yipeng Zhou, Laizhong Cui, Song Guo

    Abstract: Recently, federated learning (FL) has gained momentum because of its capability in preserving data privacy. To conduct model training by FL, multiple clients exchange model updates with a parameter server via Internet. To accelerate the communication speed, it has been explored to deploy a programmable switch (PS) in lieu of the parameter server to coordinate clients. The challenge to deploy the P… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: To appear in 2024 IEEE International Conference on Computer Communications(INFOCOM 2024)

  24. arXiv:2402.03770  [pdf, other

    cs.LG

    Fed-CVLC: Compressing Federated Learning Communications with Variable-Length Codes

    Authors: Xiaoxin Su, Yipeng Zhou, Laizhong Cui, John C. S. Lui, Jiangchuan Liu

    Abstract: In Federated Learning (FL) paradigm, a parameter server (PS) concurrently communicates with distributed participating clients for model collection, update aggregation, and model distribution over multiple rounds, without touching private data owned by individual clients. FL is appealing in preserving data privacy; yet the communication between the PS and scattered clients can be a severe bottlenec… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: To appear in 2024 IEEE International Conference on Computer Communications(INFOCOM 2024)

  25. arXiv:2401.17630  [pdf, other

    cs.IR

    Towards Personalized Privacy: User-Governed Data Contribution for Federated Recommendation

    Authors: Liang Qu, Wei Yuan, Ruiqi Zheng, Lizhen Cui, Yuhui Shi, Hongzhi Yin

    Abstract: Federated recommender systems (FedRecs) have gained significant attention for their potential to protect user's privacy by keeping user privacy data locally and only communicating model parameters/gradients to the server. Nevertheless, the currently existing architecture of FedRecs assumes that all users have the same 0-privacy budget, i.e., they do not upload any data to the server, thus overlook… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

  26. arXiv:2401.13448  [pdf, other

    cs.IR

    Decentralized Collaborative Learning with Adaptive Reference Data for On-Device POI Recommendation

    Authors: Ruiqi Zheng, Liang Qu, Tong Chen, Lizhen Cui, Yuhui Shi, Hongzhi Yin

    Abstract: In Location-based Social Networks, Point-of-Interest (POI) recommendation helps users discover interesting places. There is a trend to move from the cloud-based model to on-device recommendations for privacy protection and reduced server reliance. Due to the scarcity of local user-item interactions on individual devices, solely relying on local instances is not adequate. Collaborative Learning (CL… ▽ More

    Submitted 24 January, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

  27. arXiv:2401.11913  [pdf, other

    cs.CV cs.AI

    Large receptive field strategy and important feature extraction strategy in 3D object detection

    Authors: Leichao Cui, Xiuxian Li, Min Meng, Guangyu Jia

    Abstract: The enhancement of 3D object detection is pivotal for precise environmental perception and improved task execution capabilities in autonomous driving. LiDAR point clouds, offering accurate depth information, serve as a crucial information for this purpose. Our study focuses on key challenges in 3D target detection. To tackle the challenge of expanding the receptive field of a 3D convolutional kern… ▽ More

    Submitted 10 March, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

  28. arXiv:2401.10768  [pdf, other

    cs.CL

    Knowledge Verification to Nip Hallucination in the Bud

    Authors: Fanqi Wan, Xinting Huang, Leyang Cui, Xiaojun Quan, Wei Bi, Shuming Shi

    Abstract: While large language models (LLMs) have demonstrated exceptional performance across various tasks following human alignment, they may still generate responses that sound plausible but contradict factual knowledge, a phenomenon known as \emph{hallucination}. In this paper, we demonstrate the feasibility of mitigating hallucinations by verifying and minimizing the inconsistency between external know… ▽ More

    Submitted 16 April, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: Work in progress

  29. arXiv:2401.09331  [pdf, other

    cs.CV cs.RO

    Event-Based Visual Odometry on Non-Holonomic Ground Vehicles

    Authors: Wanting Xu, Si'ao Zhang, Li Cui, Xin Peng, Laurent Kneip

    Abstract: Despite the promise of superior performance under challenging conditions, event-based motion estimation remains a hard problem owing to the difficulty of extracting and tracking stable features from event streams. In order to robustify the estimation, it is generally believed that fusion with other sensors is a requirement. In this work, we demonstrate reliable, purely event-based visual odometry… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: Accepted by 3DV 2024

  30. arXiv:2401.08294  [pdf, other

    cs.CL

    Inferflow: an Efficient and Highly Configurable Inference Engine for Large Language Models

    Authors: Shuming Shi, Enbo Zhao, Deng Cai, Leyang Cui, Xinting Huang, Huayang Li

    Abstract: We present Inferflow, an efficient and highly configurable inference engine for large language models (LLMs). With Inferflow, users can serve most of the common transformer models by simply modifying some lines in corresponding configuration files, without writing a single line of source code. Compared with most existing inference engines, Inferflow has some key features. First, by implementing a… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: Technical report of Inferflow

  31. arXiv:2401.04942  [pdf, other

    cs.CV

    Latency-aware Road Anomaly Segmentation in Videos: A Photorealistic Dataset and New Metrics

    Authors: Beiwen Tian, Huan-ang Gao, Leiyao Cui, Yupeng Zheng, Lan Luo, Baofeng Wang, Rong Zhi, Guyue Zhou, Hao Zhao

    Abstract: In the past several years, road anomaly segmentation is actively explored in the academia and drawing growing attention in the industry. The rationale behind is straightforward: if the autonomous car can brake before hitting an anomalous object, safety is promoted. However, this rationale naturally calls for a temporally informed setting while existing methods and benchmarks are designed in an unr… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

  32. arXiv:2401.04143  [pdf, other

    cs.CV

    RHOBIN Challenge: Reconstruction of Human Object Interaction

    Authors: Xianghui Xie, Xi Wang, Nikos Athanasiou, Bharat Lal Bhatnagar, Chun-Hao P. Huang, Kaichun Mo, Hao Chen, Xia Jia, Zerui Zhang, Liangxian Cui, Xiao Lin, Bingqiao Qian, Jie Xiao, Wenfei Yang, Hyeongjin Nam, Daniel Sungho Jung, Kihoon Kim, Kyoung Mu Lee, Otmar Hilliges, Gerard Pons-Moll

    Abstract: Modeling the interaction between humans and objects has been an emerging research direction in recent years. Capturing human-object interaction is however a very challenging task due to heavy occlusion and complex dynamics, which requires understanding not only 3D human pose, and object pose but also the interaction between them. Reconstruction of 3D humans and objects has been two separate resear… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

    Comments: 14 pages, 5 tables, 7 figure. Technical report of the CVPR'23 workshop: RHOBIN challenge (https://rhobin-challenge.github.io/)

  33. arXiv:2312.15710  [pdf, other

    cs.CL cs.AI

    Alleviating Hallucinations of Large Language Models through Induced Hallucinations

    Authors: Yue Zhang, Leyang Cui, Wei Bi, Shuming Shi

    Abstract: Despite their impressive capabilities, large language models (LLMs) have been observed to generate responses that include inaccurate or fabricated information, a phenomenon commonly known as ``hallucination''. In this work, we propose a simple \textit{Induce-then-Contrast} Decoding (ICD) strategy to alleviate hallucinations. We first construct a factually weak LLM by inducing hallucinations from t… ▽ More

    Submitted 11 March, 2024; v1 submitted 25 December, 2023; originally announced December 2023.

    Comments: Work in progress

  34. arXiv:2312.13596  [pdf, ps, other

    cs.LG cs.AI

    Anchoring Path for Inductive Relation Prediction in Knowledge Graphs

    Authors: Zhixiang Su, Di Wang, Chunyan Miao, Lizhen Cui

    Abstract: Aiming to accurately predict missing edges representing relations between entities, which are pervasive in real-world Knowledge Graphs (KGs), relation prediction plays a critical role in enhancing the comprehensiveness and utility of KGs. Recent research focuses on path-based methods due to their inductive and explainable properties. However, these methods face a great challenge when lots of reaso… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

  35. arXiv:2312.09006  [pdf, other

    cs.LG cs.DC

    FedSSA: Semantic Similarity-based Aggregation for Efficient Model-Heterogeneous Personalized Federated Learning

    Authors: Liping Yi, Han Yu, Zhuan Shi, Gang Wang, Xiaoguang Liu, Lizhen Cui, Xiaoxiao Li

    Abstract: Federated learning (FL) is a privacy-preserving collaboratively machine learning paradigm. Traditional FL requires all data owners (a.k.a. FL clients) to train the same local model. This design is not well-suited for scenarios involving data and/or system heterogeneity. Model-Heterogeneous Personalized FL (MHPFL) has emerged to address this challenge. Existing MHPFL approaches often rely on a publ… ▽ More

    Submitted 19 April, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: Accepted by Proceedings of the 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024)

  36. arXiv:2312.07899  [pdf

    q-bio.QM cs.AI cs.CV cs.LG

    Morphological Profiling for Drug Discovery in the Era of Deep Learning

    Authors: Qiaosi Tang, Ranjala Ratnayake, Gustavo Seabra, Zhe Jiang, Ruogu Fang, Lina Cui, Yousong Ding, Tamer Kahveci, Jiang Bian, Chenglong Li, Hendrik Luesch, Yanjun Li

    Abstract: Morphological profiling is a valuable tool in phenotypic drug discovery. The advent of high-throughput automated imaging has enabled the capturing of a wide range of morphological features of cells or organisms in response to perturbations at the single-cell resolution. Concurrently, significant advances in machine learning and deep learning, especially in computer vision, have led to substantial… ▽ More

    Submitted 15 January, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: 44 pages, 5 figure, 5 tables

  37. arXiv:2311.16465  [pdf, other

    cs.CV

    TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering

    Authors: Jingye Chen, Yupan Huang, Tengchao Lv, Lei Cui, Qifeng Chen, Furu Wei

    Abstract: The diffusion model has been proven a powerful generative model in recent years, yet remains a challenge in generating visual text. Several methods alleviated this issue by incorporating explicit text position and content as guidance on where and what text to render. However, these methods still suffer from several drawbacks, such as limited flexibility and automation, constrained capability of la… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

  38. arXiv:2311.09802  [pdf, other

    cs.AI cs.CL

    Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs

    Authors: Sen Yang, Xin Li, Leyang Cui, Lidong Bing, Wai Lam

    Abstract: Though prompting LLMs with various reasoning structures produces reasoning proofs along with answers, these proofs are not ensured to be causal and reliable due to the inherent defects of LLMs. Tracking such deficiencies, we present a neuro-symbolic integration method, in which a neural LLM is used to represent the knowledge of the problem while an LLM-free symbolic solver is adopted to do deliber… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

  39. arXiv:2311.07324  [pdf, other

    cs.LG

    DAGC: Data-Volume-Aware Adaptive Sparsification Gradient Compression for Distributed Machine Learning in Mobile Computing

    Authors: Rongwei Lu, Yutong Jiang, Yinan Mao, Chen Tang, Bin Chen, Laizhong Cui, Zhi Wang

    Abstract: Distributed machine learning (DML) in mobile environments faces significant communication bottlenecks. Gradient compression has emerged as an effective solution to this issue, offering substantial benefits in environments with limited bandwidth and metered data. Yet, they encounter severe performance drop in non-IID environments due to a one-size-fits-all compression approach, which does not accou… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  40. arXiv:2310.20381  [pdf, other

    cs.CV cs.AI

    A Systematic Evaluation of GPT-4V's Multimodal Capability for Medical Image Analysis

    Authors: Yingshu Li, Yunyi Liu, Zhanyu Wang, Xinyu Liang, Lei Wang, Lingqiao Liu, Leyang Cui, Zhaopeng Tu, Longyue Wang, Luping Zhou

    Abstract: This work conducts an evaluation of GPT-4V's multimodal capability for medical image analysis, with a focus on three representative tasks of radiology report generation, medical visual question answering, and medical visual grounding. For the evaluation, a set of prompts is designed for each task to induce the corresponding capability of GPT-4V to produce sufficiently good outputs. Three evaluatio… ▽ More

    Submitted 30 January, 2024; v1 submitted 31 October, 2023; originally announced October 2023.

  41. arXiv:2310.19740  [pdf, other

    cs.CL

    Collaborative Evaluation: Exploring the Synergy of Large Language Models and Humans for Open-ended Generation Evaluation

    Authors: Qintong Li, Leyang Cui, Lingpeng Kong, Wei Bi

    Abstract: Humans are widely involved in the evaluation of open-ended natural language generation tasks (NLG) that demand creativity, as automatic metrics often exhibit weak correlations with human judgments. Large language models (LLMs) recently have emerged as a scalable and cost-effective alternative to human evaluations. However, both humans and LLMs have limitations, i.e., inherent subjectivity and unre… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: We release our resources at \url{https://github.com/qtli/CoEval}

  42. arXiv:2310.14274  [pdf, other

    cs.LG

    Robust Visual Imitation Learning with Inverse Dynamics Representations

    Authors: Siyuan Li, Xun Wang, Rongchang Zuo, Kewu Sun, Lingfei Cui, Jishiyu Ding, Peng Liu, Zhe Ma

    Abstract: Imitation learning (IL) has achieved considerable success in solving complex sequential decision-making problems. However, current IL methods mainly assume that the environment for learning policies is the same as the environment for collecting expert datasets. Therefore, these methods may fail to work when there are slight differences between the learning and expert environments, especially for c… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

  43. arXiv:2310.13345  [pdf, other

    cs.CR

    An LLM can Fool Itself: A Prompt-Based Adversarial Attack

    Authors: Xilie Xu, Keyi Kong, Ning Liu, Lizhen Cui, Di Wang, Jingfeng Zhang, Mohan Kankanhalli

    Abstract: The wide-ranging applications of large language models (LLMs), especially in safety-critical domains, necessitate the proper evaluation of the LLM's adversarial robustness. This paper proposes an efficient tool to audit the LLM's adversarial robustness via a prompt-based adversarial attack (PromptAttack). PromptAttack converts adversarial textual attacks into an attack prompt that can cause the vi… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

  44. arXiv:2310.07821  [pdf, other

    cs.CL

    Non-autoregressive Text Editing with Copy-aware Latent Alignments

    Authors: Yu Zhang, Yue Zhang, Leyang Cui, Guohong Fu

    Abstract: Recent work has witnessed a paradigm shift from Seq2Seq to Seq2Edit in the field of text editing, with the aim of addressing the slow autoregressive inference problem posed by the former. Despite promising results, Seq2Edit approaches still face several challenges such as inflexibility in generation and difficulty in generalizing to other languages. In this work, we propose a novel non-autoregress… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023

  45. arXiv:2310.07299  [pdf, other

    cs.CL cs.AI

    RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation

    Authors: Yue Zhang, Leyang Cui, Enbo Zhao, Wei Bi, Shuming Shi

    Abstract: Grammatical Error Correction (GEC) systems play a vital role in assisting people with their daily writing tasks. However, users may sometimes come across a GEC system that initially performs well but fails to correct errors when the inputs are slightly modified. To ensure an ideal user experience, a reliable GEC system should have the ability to provide consistent and accurate suggestions when enc… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023 (main conference, long paper)

  46. arXiv:2310.05341  [pdf, other

    cs.CV cs.AI

    A Critical Look at Classic Test-Time Adaptation Methods in Semantic Segmentation

    Authors: Chang'an Yi, Haotian Chen, Yifan Zhang, Yonghui Xu, Lizhen Cui

    Abstract: Test-time adaptation (TTA) aims to adapt a model, initially trained on training data, to potential distribution shifts in the test data. Most existing TTA studies, however, focus on classification tasks, leaving a notable gap in the exploration of TTA for semantic segmentation. This pronounced emphasis on classification might lead numerous newcomers and engineers to mistakenly assume that classic… ▽ More

    Submitted 11 October, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

  47. arXiv:2310.00919  [pdf, other

    eess.IV cs.CV cs.LG

    BAAF: A Benchmark Attention Adaptive Framework for Medical Ultrasound Image Segmentation Tasks

    Authors: Gongping Chen, Lei Zhao, Xiaotao Yin, Liang Cui, Jianxun Zhang, Yu Dai

    Abstract: The AI-based assisted diagnosis programs have been widely investigated on medical ultrasound images. Complex scenario of ultrasound image, in which the coupled interference of internal and external factors is severe, brings a unique challenge for localize the object region automatically and precisely in ultrasound images. In this study, we seek to propose a more general and robust Benchmark Attent… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

  48. arXiv:2309.17415  [pdf, other

    cs.CL

    Intuitive or Dependent? Investigating LLMs' Behavior Style to Conflicting Prompts

    Authors: Jiahao Ying, Yixin Cao, Kai Xiong, Yidong He, Long Cui, Yongbin Liu

    Abstract: This study investigates the behaviors of Large Language Models (LLMs) when faced with conflicting prompts versus their internal memory. This will not only help to understand LLMs' decision mechanism but also benefit real-world applications, such as retrieval-augmented generation (RAG). Drawing on cognitive theory, we target the first scenario of decision-making styles where there is no superiority… ▽ More

    Submitted 20 February, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

  49. arXiv:2309.12641  [pdf, other

    cs.CV

    Global Context Aggregation Network for Lightweight Saliency Detection of Surface Defects

    Authors: Feng Yan, Xiaoheng Jiang, Yang Lu, Lisha Cui, Shupan Li, Jiale Cao, Mingliang Xu, Dacheng Tao

    Abstract: Surface defect inspection is a very challenging task in which surface defects usually show weak appearances or exist under complex backgrounds. Most high-accuracy defect detection methods require expensive computation and storage overhead, making them less practical in some resource-constrained defect detection applications. Although some lightweight methods have achieved real-time inference speed… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

  50. arXiv:2309.11419  [pdf, other

    cs.CL cs.CV

    Kosmos-2.5: A Multimodal Literate Model

    Authors: Tengchao Lv, Yupan Huang, Jingye Chen, Lei Cui, Shuming Ma, Yaoyao Chang, Shaohan Huang, Wenhui Wang, Li Dong, Weiyao Luo, Shaoxiang Wu, Guoxin Wang, Cha Zhang, Furu Wei

    Abstract: We present Kosmos-2.5, a multimodal literate model for machine reading of text-intensive images. Pre-trained on large-scale text-intensive images, Kosmos-2.5 excels in two distinct yet cooperative transcription tasks: (1) generating spatially-aware text blocks, where each block of text is assigned its spatial coordinates within the image, and (2) producing structured text output that captures styl… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.