
Showing 1–50 of 378 results for author: Cui, L

  1. arXiv:2407.14530  [pdf, other]

    cs.DB cs.AI

    FuncEvalGMN: Evaluating Functional Correctness of SQL via Graph Matching Network

    Authors: Yi Zhan, Yang Sun, Han Weng, Longjie Cui, Guifeng Wang, Jiajun Xie, Yu Tian, Xiaoming Yin, Boyi Liu, Dongchi Huang

    Abstract: In this paper, we propose a novel graph-based methodology to evaluate the functional correctness of SQL generation. Conventional metrics for assessing SQL code generation, such as matching-based and execution-based methods (e.g., exact set match and execution accuracy), are subject to two primary limitations. Firstly, the former fails to effectively assess functional correctness, as different SQL…

    Submitted 8 July, 2024; originally announced July 2024.

  2. arXiv:2407.13605  [pdf, other]

    cs.LG

    Physics-guided Active Sample Reweighting for Urban Flow Prediction

    Authors: Wei Jiang, Tong Chen, Guanhua Ye, Wentao Zhang, Lizhen Cui, Zi Huang, Hongzhi Yin

    Abstract: Urban flow prediction is a spatio-temporal modeling task that estimates the throughput of transportation services like buses, taxis, and ride-sharing, where data-driven models have become the most popular solution in the past decade. Meanwhile, the implicitly learned mapping from historical observations to the prediction targets tends to over-simplify the dynamics of real-world urban flows, lead…

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: This paper is accepted by Proceedings of the 33rd ACM International Conference on Information and Knowledge Management (CIKM '24)

  3. arXiv:2407.09860  [pdf, other]

    quant-ph cond-mat.stat-mech

    Quantum Vicsek Model for Active Matter

    Authors: Hong Yuan, L. X. Cui, L. T. Chen, C. P. Sun

    Abstract: We propose a quantum analog of the Vicsek model, consisting of an ensemble of overdamped spin-$1/2$ particles with ferromagnetic couplings, driven by a uniformly polarized magnetic field. The spontaneous magnetization of the spin components breaks the $SO(3)$ (or $SO(2)$) symmetry, inducing an ordered phase of flocking. We derive the hydrodynamic equations, similar to those formulated by Toner and…

    Submitted 13 July, 2024; originally announced July 2024.

  4. arXiv:2406.19698  [pdf, other]

    math.CO

    Optimal radio labeling for the Cartesian product of square mesh networks and stars

    Authors: Linlin Cui, Feng Li

    Abstract: As the most critical component in the communication process, channels have a great impact on the communication quality of network. With the continuous expansion of network scale, the limited channel resources lead to the limitation of communication network scale. Therefore, achieving reasonable channel assignment and utilization becomes an extremely challenging problem. In order to solve this issu…

    Submitted 28 June, 2024; originally announced June 2024.

  5. arXiv:2406.18962  [pdf, other]

    cs.IR

    Multi-modal Food Recommendation using Clustering and Self-supervised Learning

    Authors: Yixin Zhang, Xin Zhou, Qianwen Meng, Fanglin Zhu, Yonghui Xu, Zhiqi Shen, Lizhen Cui

    Abstract: Food recommendation systems serve as pivotal components in the realm of digital lifestyle services, designed to assist users in discovering recipes and food items that resonate with their unique dietary predilections. Typically, multi-modal descriptions offer an exhaustive profile for each recipe, thereby ensuring recommendations that are both personalized and accurate. Our preliminary investigati…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: Working paper

  6. arXiv:2406.17335  [pdf, other]

    cs.IR cs.LG

    A Thorough Performance Benchmarking on Lightweight Embedding-based Recommender Systems

    Authors: Hung Vinh Tran, Tong Chen, Quoc Viet Hung Nguyen, Zi Huang, Lizhen Cui, Hongzhi Yin

    Abstract: Since the creation of the Web, recommender systems (RSs) have been an indispensable mechanism in information filtering. State-of-the-art RSs primarily depend on categorical features, which are encoded by embedding vectors, resulting in excessively large embedding tables. To prevent over-parameterized embedding tables from harming scalability, both academia and industry have seen increasing efforts in c…

    Submitted 25 June, 2024; originally announced June 2024.

  7. arXiv:2406.17312  [pdf, other]

    cs.CL

    Not All Preference Pairs Are Created Equal: A Recipe for Annotation-Efficient Iterative Preference Learning

    Authors: Sen Yang, Leyang Cui, Deng Cai, Xinting Huang, Shuming Shi, Wai Lam

    Abstract: Iterative preference learning, though yielding superior performances, requires online annotated preference labels. In this work, we study strategies to select worth-annotating response pairs for cost-efficient annotation while achieving competitive or even better performances compared with the random selection baseline for iterative preference learning. Built on assumptions regarding uncertainty a…

    Submitted 25 June, 2024; originally announced June 2024.

  8. arXiv:2406.16377  [pdf, other]

    cs.CL cs.AI

    On the Transformations across Reward Model, Parameter Update, and In-Context Prompt

    Authors: Deng Cai, Huayang Li, Tingchen Fu, Siheng Li, Weiwen Xu, Shuaiyi Li, Bowen Cao, Zhisong Zhang, Xinting Huang, Leyang Cui, Yan Wang, Lemao Liu, Taro Watanabe, Shuming Shi

    Abstract: Despite the general capabilities of pre-trained large language models (LLMs), they still need further adaptation to better serve practical applications. In this paper, we demonstrate the interchangeability of three popular and distinct adaptation tools: parameter updating, reward modeling, and in-context prompting. This interchangeability establishes a triangular framework with six transformation…

    Submitted 24 June, 2024; originally announced June 2024.

  9. arXiv:2406.07400  [pdf, other]

    cs.LG cs.LO

    Guiding LLM Temporal Logic Generation with Explicit Separation of Data and Control

    Authors: William Murphy, Nikolaus Holzer, Nathan Koenig, Leyi Cui, Raven Rothkopf, Feitong Qiao, Mark Santolucito

    Abstract: Temporal logics are powerful tools that are widely used for the synthesis and verification of reactive systems. The recent progress on Large Language Models (LLMs) has the potential to make the process of writing such specifications more accessible. However, writing specifications in temporal logics remains challenging for all but the most expert users. A key question in using LLMs for temporal lo…

    Submitted 11 June, 2024; originally announced June 2024.

  10. arXiv:2406.02027  [pdf, other]

    cs.LG cs.AI cs.CR cs.CV

    Inference Attacks: A Taxonomy, Survey, and Promising Directions

    Authors: Feng Wu, Lei Cui, Shaowen Yao, Shui Yu

    Abstract: The prosperity of machine learning has also raised concerns about data privacy. Among them, inference attacks can implement privacy breaches in various MLaaS scenarios and model training/prediction phases. Specifically, inference attacks can perform privacy inference on undisclosed target training sets based on outputs of the target model, including but not limited to statistics, members…

    Submitted 27 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  11. arXiv:2405.14742  [pdf, other]

    cs.LG cs.AI

    HC-GAE: The Hierarchical Cluster-based Graph Auto-Encoder for Graph Representation Learning

    Authors: Zhuo Xu, Lu Bai, Lixin Cui, Ming Li, Yue Wang, Edwin R. Hancock

    Abstract: Graph Auto-Encoders (GAEs) are powerful tools for graph representation learning. In this paper, we develop a novel Hierarchical Cluster-based GAE (HC-GAE), that can learn effective structural characteristics for graph data analysis. To this end, during the encoding process, we commence by utilizing the hard node assignment to decompose a sample graph into a family of separated subgraphs. We compre…

    Submitted 23 May, 2024; originally announced May 2024.

  12. arXiv:2405.12689  [pdf, other]

    cs.CL cs.AI

    Spotting AI's Touch: Identifying LLM-Paraphrased Spans in Text

    Authors: Yafu Li, Zhilin Wang, Leyang Cui, Wei Bi, Shuming Shi, Yue Zhang

    Abstract: AI-generated text detection has attracted increasing attention as powerful language models approach human-level generation. Limited work is devoted to detecting (partially) AI-paraphrased texts. However, AI paraphrasing is commonly employed in various application scenarios for text refinement and diversity. To this end, we propose a novel detection framework, paraphrased text span detection (PTD),…

    Submitted 29 May, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

    Comments: ACL 2024 Findings

  13. arXiv:2405.10218  [pdf, other]

    cs.LG cs.AI

    ENADPool: The Edge-Node Attention-based Differentiable Pooling for Graph Neural Networks

    Authors: Zhehan Zhao, Lu Bai, Lixin Cui, Ming Li, Yue Wang, Lixiang Xu, Edwin R. Hancock

    Abstract: Graph Neural Networks (GNNs) are powerful tools for graph classification. One important operation for GNNs is the downsampling or pooling that can learn effective embeddings from the node representations. In this paper, we propose a new hierarchical pooling operation, namely the Edge-Node Attention-based Differentiable Pooling (ENADPool), for GNNs to learn effective graph representations. Unlike t…

    Submitted 16 May, 2024; originally announced May 2024.

  14. arXiv:2405.09808  [pdf, other]

    quant-ph physics.optics

    Phase Retrieval from the Hong-Ou-Mandel Dip to Characterize the Phase Spectrum of Independent Pulses at the Single-Photon Level

    Authors: Yuhang Lei, Wen Zhao, Liang Cui, Xiaoying Li

    Abstract: Measuring the phase spectrum at the single-photon level is essential for the full characterization of the temporal-spectral mode of quantum sources. We present a phase retrieval algorithm-based method to recover the phase spectrum difference between two independent pulses from their Hong-Ou-Mandel interference pattern and intensity spectra. Our confirmatory experiment with coherent state pulses co…

    Submitted 16 May, 2024; originally announced May 2024.

  15. arXiv:2405.08054  [pdf, other]

    cs.GR cs.CV

    Coin3D: Controllable and Interactive 3D Assets Generation with Proxy-Guided Conditioning

    Authors: Wenqi Dong, Bangbang Yang, Lin Ma, Xiao Liu, Liyuan Cui, Hujun Bao, Yuewen Ma, Zhaopeng Cui

    Abstract: As humans, we aspire to create media content that is both freely willed and readily controlled. Thanks to the prominent development of generative techniques, we now can easily utilize 2D diffusion methods to synthesize images controlled by raw sketch or designated human poses, and even progressively edit/regenerate local regions with masked inpainting. However, similar workflows in 3D modeling tas…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: Project webpage: https://zju3dv.github.io/coin3d

  16. arXiv:2405.06426  [pdf, other]

    physics.plasm-ph

    Generation of Ultra-Collimated Polarized Attosecond $γ$-Rays via Beam Instabilities

    Authors: Li-Jie Cui, Ke-Jia Wei, Chong Lv, Feng Wan, Yousef I. Salamin, Lei-Feng Cao, Jian-Xing Li

    Abstract: Polarized attosecond $γ$-rays may offer excitation and hyperfine tracking of reactions relevant to nuclear physics, astrophysics, high-energy physics, etc. However, unfortunately, generation of a feasible and easy-to-deploy source is still a great challenge. Here, we put forward a novel method for producing ultra-collimated high-brilliance polarized attosecond $γ$-rays via the interaction of an un…

    Submitted 10 May, 2024; originally announced May 2024.

  17. arXiv:2405.04270  [pdf, other]

    astro-ph.GA

    Very Long Baseline Array Observations of Parsec-scale Radio Emission in Dual Active Galactic Nuclei

    Authors: Wancheng Xu, Lang Cui, Xiang Liu, Tao An, Hongmin Cao, Pengfei Jiang, Luis C. Ho, Ning Chang, Xiaolong Yang, Yuling Shen, Guiping Tan, Zhenhua Han, Junhui Fan, Ming Zhang

    Abstract: It is believed that dual active galactic nuclei (dual AGN) form during galaxy mergers. Studying dual-AGN emission can provide valuable insights into galaxy merging and evolution. To investigate parsec-scale radio emission properties, we observed eight radio components of four selected dual-AGN systems using the Very Long Baseline Array (VLBA) at 5 GHz in multiple-phase-center mode. Among them…

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 15 pages, 4 figures

  18. arXiv:2405.02008  [pdf, other]

    cs.CV

    DiffMap: Enhancing Map Segmentation with Map Prior Using Diffusion Model

    Authors: Peijin Jia, Tuopu Wen, Ziang Luo, Mengmeng Yang, Kun Jiang, Zhiquan Lei, Xuewei Tang, Ziyuan Liu, Le Cui, Kehua Sheng, Bo Zhang, Diange Yang

    Abstract: Constructing high-definition (HD) maps is a crucial requirement for enabling autonomous driving. In recent years, several map segmentation algorithms have been developed to address this need, leveraging advancements in Bird's-Eye View (BEV) perception. However, existing models still encounter challenges in producing realistic and consistent semantic map layouts. One prominent issue is the limited…

    Submitted 3 May, 2024; originally announced May 2024.

  19. arXiv:2404.16343  [pdf, other]

    astro-ph.GA astro-ph.HE

    Magnetically Driven Relativistic Jet in the High-Redshift Blazar OH 471

    Authors: S. Guo, T. An, Y. Liu, Y. Sotnikova, A. Volvach, T. Mufakharov, L. Chen, L. Cui, A. Wang, Z. Xu, Y. Zhang, W. Xu, Y. A. Kovalev, Y. Y. Kovalev, M. Kharinov, A. Erkenov, T. Semenova, L. Volvach

    Abstract: Context: Understanding the mechanisms that launch and shape powerful relativistic jets from supermassive black holes (SMBHs) in high-redshift active galactic nuclei (AGN) is crucial for probing the co-evolution of SMBHs and galaxies over cosmic time. Aims: We study the high-redshift ($z=3.396$) blazar OH 471 to explore the jet launching mechanism in the early Universe. Methods: Using multi-f…

    Submitted 20 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: 16 pages, 7 figures, 3 tables

    Journal ref: A&A 685, L11 (2024)

  20. arXiv:2404.06020  [pdf, other]

    astro-ph.HE gr-qc

    Tests of the Kerr Hypothesis with MAXI J1803-298 Using Different RELXILL_NK Flavors

    Authors: Jie Liao, M. Ghasemi-Nodehi, Lang Cui, Ashutosh Tripathi, Yong-Feng Huang, Xiang Liu

    Abstract: Iron line spectroscopy has been one of the leading methods not only for measuring the spins of accreting black holes but also for testing fundamental physics. Based on such a method, we present an analysis of a dataset observed simultaneously by NuSTAR and NICER for the black hole binary candidate MAXI J1803-298, which shows prominent relativistic reflection features. Various relxill_nk flavors a…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: Accepted by ApJ

  21. arXiv:2404.03622  [pdf, other]

    cs.CL

    Mind's Eye of LLMs: Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models

    Authors: Wenshan Wu, Shaoguang Mao, Yadong Zhang, Yan Xia, Li Dong, Lei Cui, Furu Wei

    Abstract: Large language models (LLMs) have exhibited impressive performance in language comprehension and various reasoning tasks. However, their abilities in spatial reasoning, a crucial aspect of human cognition, remain relatively unexplored. Humans possess a remarkable ability to create mental images of unseen objects and actions through a process known as the Mind's Eye, enabling the imagination of the…

    Submitted 24 May, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

  22. A Change of Scenery: Transformative Insights from Retrospective VR Embodied Perspective-Taking of Conflict With a Close Other

    Authors: Seraphina Yong, Leo Cui, Evan Suma Rosenberg, Svetlana Yarosh

    Abstract: Close relationships are irreplaceable social resources, yet prone to high-risk conflict. Building on findings from the fields of HCI, virtual reality, and behavioral therapy, we evaluate the unexplored potential of retrospective VR-embodied perspective-taking to fundamentally influence conflict resolution in close others. We develop a biographically-accurate Retrospective Embodied Perspective-Taki…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 18 pages, 5 figures, Proceedings of the 2024 CHI Conference on Human Factors in Computing Systems

  23. arXiv:2403.18479  [pdf, other]

    cs.IR

    Lightweight Embeddings for Graph Collaborative Filtering

    Authors: Xurong Liang, Tong Chen, Lizhen Cui, Yang Wang, Meng Wang, Hongzhi Yin

    Abstract: Graph neural networks (GNNs) are currently one of the most performant collaborative filtering methods. Meanwhile, owing to the use of an embedding table to represent each user/item as a distinct vector, GNN-based recommenders have inherited the long-standing defect of parameter inefficiency. As a common practice for scalable embeddings, parameter sharing enables the use of fewer embedding vectors…

    Submitted 28 March, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: Accepted by SIGIR '24

  24. arXiv:2403.18249  [pdf, other]

    cs.CL cs.SI

    Exploring the Deceptive Power of LLM-Generated Fake News: A Study of Real-World Detection Challenges

    Authors: Yanshen Sun, Jianfeng He, Limeng Cui, Shuo Lei, Chang-Tien Lu

    Abstract: Recent advancements in Large Language Models (LLMs) have enabled the creation of fake news, particularly in complex fields like healthcare. Studies highlight the gap in the deceptive power of LLM-generated fake news with and without human assistance, yet the potential of prompting techniques has not been fully explored. Thus, this work aims to determine whether prompting strategies can effectively…

    Submitted 8 April, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

  25. arXiv:2403.16443  [pdf, other]

    cs.CL cs.AI cs.SE

    CodeS: Natural Language to Code Repository via Multi-Layer Sketch

    Authors: Daoguang Zan, Ailun Yu, Wei Liu, Dong Chen, Bo Shen, Wei Li, Yafen Yao, Yongshun Gong, Xiaolin Chen, Bei Guan, Zhiguang Yang, Yongji Wang, Qianxiang Wang, Lizhen Cui

    Abstract: The impressive performance of large language models (LLMs) on code-related tasks has shown the potential of fully automated software development. In light of this, we introduce a new software engineering task, namely Natural Language to code Repository (NL2Repo). This task aims to generate an entire code repository from its natural language requirements. To address this task, we propose a simple y…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: https://github.com/NL2Code/CodeS

  26. arXiv:2403.16227  [pdf, other]

    cs.CV

    Dual-modal Prior Semantic Guided Infrared and Visible Image Fusion for Intelligent Transportation System

    Authors: Jing Li, Lu Bai, Bin Yang, Chang Li, Lingfei Ma, Lixin Cui, Edwin R. Hancock

    Abstract: Infrared and visible image fusion (IVF) plays an important role in intelligent transportation system (ITS). The early works predominantly focus on boosting the visual appeal of the fused result, and only several recent approaches have tried to combine the high-level vision task with IVF. However, they prioritize the design of cascaded structure to seek unified suitable features and fit different t…

    Submitted 24 March, 2024; originally announced March 2024.

  27. arXiv:2403.16133  [pdf, other]

    cs.AI cs.LG

    SSHPool: The Separated Subgraph-based Hierarchical Pooling

    Authors: Zhuo Xu, Lixin Cui, Yue Wang, Hangyuan Du, Lu Bai, Edwin R. Hancock

    Abstract: In this paper, we develop a novel local graph pooling method, namely the Separated Subgraph-based Hierarchical Pooling (SSHPool), for graph classification. To this end, we commence by assigning the nodes of a sample graph into different clusters, resulting in a family of separated subgraphs. We individually employ a local graph convolution unit as the local structure to further compress each subg…

    Submitted 24 March, 2024; originally announced March 2024.

  28. arXiv:2403.16130  [pdf, other]

    cs.LG cs.AI

    AKBR: Learning Adaptive Kernel-based Representations for Graph Classification

    Authors: Feifei Qian, Lixin Cui, Yue Wang, Hangyuan Du, Lu Bai, Edwin R. Hancock

    Abstract: In this paper, we propose a new model to learn Adaptive Kernel-based Representations (AKBR) for graph classification. Unlike state-of-the-art R-convolution graph kernels that are defined by merely counting any pair of isomorphic substructures between graphs and cannot provide an end-to-end learning mechanism for the classifier, the proposed AKBR approach aims to define an end-to-end representation…

    Submitted 24 March, 2024; originally announced March 2024.

  29. arXiv:2403.06021  [pdf, other]

    cs.IR cs.LG

    Hierarchical Query Classification in E-commerce Search

    Authors: Bing He, Sreyashi Nag, Limeng Cui, Suhang Wang, Zheng Li, Rahul Goutam, Zhen Li, Haiyang Zhang

    Abstract: E-commerce platforms typically store and structure product information and search data in a hierarchy. Efficiently categorizing user search queries into a similar hierarchical structure is paramount in enhancing user experience on e-commerce platforms as well as news curation and academic research. The significance of this task is amplified when dealing with sensitive query categorization or criti…

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: Published at: the ACM Web Conference 2024 in the industry track (WWW'24)

  30. VLBI Astrometry of Radio Stars to Link Radio and Optical Celestial Reference Frames: Observing Strategies

    Authors: Jingdong Zhang, Bo Zhang, Shuangjing Xu, Niu Liu, Wen Chen, Hao Ding, Pengfei Jiang, Yan Sun, Jinqing Wang, Lang Cui, Shiming Wen, Xiaofeng Mai, Jinling Li, Fengchun Shu, Yidan Huang

    Abstract: The Gaia celestial reference frame (Gaia-CRF) will benefit from a close assessment with independent methods, such as Very Long Baseline Interferometry (VLBI) measurements of radio stars at bright magnitudes. However, obtaining full astrometric parameters for each radio star through VLBI measurements demands a significant amount of observation time. This study proposes an efficient observing strate…

    Submitted 26 March, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: 9 pages, 4 figures, accepted for publication in the Monthly Notices of the Royal Astronomical Society (MNRAS)

  31. arXiv:2403.02693  [pdf, other]

    cs.MM eess.IV

    Optimizing Mobile-Friendly Viewport Prediction for Live 360-Degree Video Streaming

    Authors: Lei Zhang, Tao Long, Weizhen Xu, Laizhong Cui, Jiangchuan Liu

    Abstract: Viewport prediction is the crucial task for adaptive 360-degree video streaming, as the bitrate control algorithms usually require the knowledge of the user's viewing portions of the frames. Various methods are studied and adopted for viewport prediction from less accurate statistical tools to highly calibrated deep neural networks. Conventionally, it is difficult to implement sophisticated deep lea…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 14 pages

  32. arXiv:2403.01244  [pdf, other]

    cs.CL cs.AI

    Mitigating Catastrophic Forgetting in Large Language Models with Self-Synthesized Rehearsal

    Authors: Jianheng Huang, Leyang Cui, Ante Wang, Chengyi Yang, Xinting Liao, Linfeng Song, Junfeng Yao, Jinsong Su

    Abstract: Large language models (LLMs) suffer from catastrophic forgetting during continual learning. Conventional rehearsal-based methods rely on previous training data to retain the model's ability, which may not be feasible in real-world applications. When conducting continual learning based on a publicly-released LLM checkpoint, the availability of the original training data may be non-existent. To addr…

    Submitted 25 May, 2024; v1 submitted 2 March, 2024; originally announced March 2024.

    Comments: ACL 2024 main, long paper

  33. arXiv:2402.19255  [pdf, other]

    cs.CL

    GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers

    Authors: Qintong Li, Leyang Cui, Xueliang Zhao, Lingpeng Kong, Wei Bi

    Abstract: Large language models (LLMs) have achieved impressive performance across various mathematical reasoning benchmarks. However, there are increasing debates regarding whether these models truly understand and apply mathematical knowledge or merely rely on shortcuts for mathematical reasoning. One essential and frequently occurring evidence is that when the math questions are slightly changed, LLMs ca…

    Submitted 1 July, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: ACL 2024

  34. arXiv:2402.17532  [pdf, other]

    cs.CL

    Retrieval is Accurate Generation

    Authors: Bowen Cao, Deng Cai, Leyang Cui, Xuxin Cheng, Wei Bi, Yuexian Zou, Shuming Shi

    Abstract: Standard language models generate text by selecting tokens from a fixed, finite, and standalone vocabulary. We introduce a novel method that selects context-aware phrases from a collection of supporting documents. One of the most significant challenges for this paradigm shift is determining the training oracles, because a string of text can be segmented in various ways and each segment can be retr…

    Submitted 16 March, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: ICLR 2024

  35. arXiv:2402.16978  [pdf, other]

    math.OC cs.LG

    An inexact Bregman proximal point method and its acceleration version for unbalanced optimal transport

    Authors: Xiang Chen, Faqiang Wang, Jun Liu, Li Cui

    Abstract: The Unbalanced Optimal Transport (UOT) problem plays increasingly important roles in computational biology, computational imaging and deep learning. Scaling algorithm is widely used to solve UOT due to its convenience and good convergence properties. However, this algorithm has lower accuracy for large regularization parameters, and due to stability issues, small regularization parameters can easi…

    Submitted 26 February, 2024; originally announced February 2024.

  36. arXiv:2402.15865  [pdf, other]

    cs.CV eess.IV

    HIR-Diff: Unsupervised Hyperspectral Image Restoration Via Improved Diffusion Models

    Authors: Li Pang, Xiangyu Rui, Long Cui, Hongzhong Wang, Deyu Meng, Xiangyong Cao

    Abstract: Hyperspectral image (HSI) restoration aims at recovering clean images from degraded observations and plays a vital role in downstream tasks. Existing model-based methods have limitations in accurately modeling the complex image characteristics with handcraft priors, and deep learning-based methods suffer from poor generalization ability. To alleviate these issues, this paper proposes an unsupervis…

    Submitted 24 February, 2024; originally announced February 2024.

  37. arXiv:2402.03815  [pdf, other]

    cs.LG

    Expediting In-Network Federated Learning by Voting-Based Consensus Model Compression

    Authors: Xiaoxin Su, Yipeng Zhou, Laizhong Cui, Song Guo

    Abstract: Recently, federated learning (FL) has gained momentum because of its capability in preserving data privacy. To conduct model training by FL, multiple clients exchange model updates with a parameter server via Internet. To accelerate the communication speed, it has been explored to deploy a programmable switch (PS) in lieu of the parameter server to coordinate clients. The challenge to deploy the P…

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: To appear in 2024 IEEE International Conference on Computer Communications (INFOCOM 2024)

  38. arXiv:2402.03770  [pdf, other]

    cs.LG

    Fed-CVLC: Compressing Federated Learning Communications with Variable-Length Codes

    Authors: Xiaoxin Su, Yipeng Zhou, Laizhong Cui, John C. S. Lui, Jiangchuan Liu

    Abstract: In Federated Learning (FL) paradigm, a parameter server (PS) concurrently communicates with distributed participating clients for model collection, update aggregation, and model distribution over multiple rounds, without touching private data owned by individual clients. FL is appealing in preserving data privacy; yet the communication between the PS and scattered clients can be a severe bottlenec…

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: To appear in 2024 IEEE International Conference on Computer Communications (INFOCOM 2024)

  39. arXiv:2402.02360  [pdf, other]

    astro-ph.HE

    On the Broadening of the Pulse Width of FRB 20121102A due to Propagation and Instrumental Effects

    Authors: Jia-Peng Wei, Yong-Feng Huang, Lang Cui, Xiang Liu, Jin-Jun Geng, Xue-Feng Wu

    Abstract: The pulse widths of fast radio bursts are always broadened due to the scattering of the plasma medium through which the electromagnetic wave passes. The recorded pulse width will be further affected by the radio telescopes since the sampling time and the bandwidth cannot be infinitely small. In this study, we focus on the pulse widths of the 3287 bursts detected from FRB 20121102A as of October 20…

    Submitted 4 February, 2024; originally announced February 2024.

  40. Gravitational Wave Emission from Close-in Strange Quark Planets Around Strange Stars with Magnetic Interactions

    Authors: Xiao-Li Zhang, Ze-Cheng Zou, Yong-Feng Huang, Hao-Xuan Gao, Pei Wang, Lang Cui, Xiang Liu

    Abstract: According to the strange quark matter hypothesis, strange planets may exist, which are planetary mass objects composed of almost equal numbers of up, down and strange quarks. A strange planet can revolve around its host strange star in a very close-in orbit. When it finally merges with the host, strong gravitational wave emissions will be generated. Here the gravitational waveforms are derived for…

    Submitted 7 June, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Journal ref: Monthly Notices of the Royal Astronomical Society (MNRAS), 531:3905, 2024

  41. arXiv:2401.17630  [pdf, other]

    cs.IR

    Towards Personalized Privacy: User-Governed Data Contribution for Federated Recommendation

    Authors: Liang Qu, Wei Yuan, Ruiqi Zheng, Lizhen Cui, Yuhui Shi, Hongzhi Yin

    Abstract: Federated recommender systems (FedRecs) have gained significant attention for their potential to protect users' privacy by keeping private user data local and only communicating model parameters/gradients to the server. Nevertheless, the currently existing architecture of FedRecs assumes that all users have the same 0-privacy budget, i.e., they do not upload any data to the server, thus overlook…

    Submitted 31 January, 2024; originally announced January 2024.

  42. arXiv:2401.13448  [pdf, other]

    cs.IR

    Decentralized Collaborative Learning with Adaptive Reference Data for On-Device POI Recommendation

    Authors: Ruiqi Zheng, Liang Qu, Tong Chen, Lizhen Cui, Yuhui Shi, Hongzhi Yin

    Abstract: In Location-based Social Networks, Point-of-Interest (POI) recommendation helps users discover interesting places. There is a trend to move from the cloud-based model to on-device recommendations for privacy protection and reduced server reliance. Due to the scarcity of local user-item interactions on individual devices, solely relying on local instances is not adequate. Collaborative Learning (CL…

    Submitted 24 January, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

  43. arXiv:2401.11913  [pdf, other]

    cs.CV cs.AI

    Large receptive field strategy and important feature extraction strategy in 3D object detection

    Authors: Leichao Cui, Xiuxian Li, Min Meng, Guangyu Jia

    Abstract: The enhancement of 3D object detection is pivotal for precise environmental perception and improved task execution capabilities in autonomous driving. LiDAR point clouds, offering accurate depth information, serve as crucial information for this purpose. Our study focuses on key challenges in 3D target detection. To tackle the challenge of expanding the receptive field of a 3D convolutional kern…

    Submitted 10 March, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

  44. arXiv:2401.11668  [pdf, ps, other]

    astro-ph.GA

    Constraining annihilating dark matter using the multi-frequency radio flux profiles of the M33 galaxy

    Authors: Man Ho Chan, Chak Man Lee, Lang Cui, Ning Chang, Chun Sing Leung

    Abstract: Radio data can give stringent constraints for annihilating dark matter. In general, radio observations can detect very accurate radio flux density with high resolution and different frequencies for nearby galaxies. We are able to obtain the radio flux density as a function of distance from the galactic center and frequencies $S(r,ν)$. In this article, we demonstrate a comprehensive radio analysis…

    Submitted 21 January, 2024; originally announced January 2024.

    Comments: Accepted for publication in ApJ

    Journal ref: ApJ 962,141 (2024)

  45. arXiv:2401.10768  [pdf, other]

    cs.CL

    Knowledge Verification to Nip Hallucination in the Bud

    Authors: Fanqi Wan, Xinting Huang, Leyang Cui, Xiaojun Quan, Wei Bi, Shuming Shi

    Abstract: While large language models (LLMs) have demonstrated exceptional performance across various tasks following human alignment, they may still generate responses that sound plausible but contradict factual knowledge, a phenomenon known as \emph{hallucination}. In this paper, we demonstrate the feasibility of mitigating hallucinations by verifying and minimizing the inconsistency between external know…

    Submitted 16 April, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: Work in progress

  46. arXiv:2401.09331  [pdf, other]

    cs.CV cs.RO

    Event-Based Visual Odometry on Non-Holonomic Ground Vehicles

    Authors: Wanting Xu, Si'ao Zhang, Li Cui, Xin Peng, Laurent Kneip

    Abstract: Despite the promise of superior performance under challenging conditions, event-based motion estimation remains a hard problem owing to the difficulty of extracting and tracking stable features from event streams. In order to robustify the estimation, it is generally believed that fusion with other sensors is a requirement. In this work, we demonstrate reliable, purely event-based visual odometry…

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: Accepted by 3DV 2024

  47. arXiv:2401.08294  [pdf, other]

    cs.CL

    Inferflow: an Efficient and Highly Configurable Inference Engine for Large Language Models

    Authors: Shuming Shi, Enbo Zhao, Deng Cai, Leyang Cui, Xinting Huang, Huayang Li

    Abstract: We present Inferflow, an efficient and highly configurable inference engine for large language models (LLMs). With Inferflow, users can serve most of the common transformer models by simply modifying some lines in corresponding configuration files, without writing a single line of source code. Compared with most existing inference engines, Inferflow has some key features. First, by implementing a…

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: Technical report of Inferflow

  48. arXiv:2401.04942  [pdf, other]

    cs.CV

    Latency-aware Road Anomaly Segmentation in Videos: A Photorealistic Dataset and New Metrics

    Authors: Beiwen Tian, Huan-ang Gao, Leiyao Cui, Yupeng Zheng, Lan Luo, Baofeng Wang, Rong Zhi, Guyue Zhou, Hao Zhao

    Abstract: In the past several years, road anomaly segmentation has been actively explored in academia and has drawn growing attention in industry. The rationale is straightforward: if the autonomous car can brake before hitting an anomalous object, safety is promoted. However, this rationale naturally calls for a temporally informed setting while existing methods and benchmarks are designed in an unr…

    Submitted 10 January, 2024; originally announced January 2024.

  49. arXiv:2401.04143  [pdf, other]

    cs.CV

    RHOBIN Challenge: Reconstruction of Human Object Interaction

    Authors: Xianghui Xie, Xi Wang, Nikos Athanasiou, Bharat Lal Bhatnagar, Chun-Hao P. Huang, Kaichun Mo, Hao Chen, Xia Jia, Zerui Zhang, Liangxian Cui, Xiao Lin, Bingqiao Qian, Jie Xiao, Wenfei Yang, Hyeongjin Nam, Daniel Sungho Jung, Kihoon Kim, Kyoung Mu Lee, Otmar Hilliges, Gerard Pons-Moll

    Abstract: Modeling the interaction between humans and objects has been an emerging research direction in recent years. Capturing human-object interaction is, however, a very challenging task due to heavy occlusion and complex dynamics, which requires understanding not only 3D human pose and object pose but also the interaction between them. Reconstruction of 3D humans and objects has been two separate resear…

    Submitted 7 January, 2024; originally announced January 2024.

    Comments: 14 pages, 5 tables, 7 figures. Technical report of the CVPR'23 workshop: RHOBIN challenge (https://rhobin-challenge.github.io/)

  50. arXiv:2312.15710  [pdf, other]

    cs.CL cs.AI

    Alleviating Hallucinations of Large Language Models through Induced Hallucinations

    Authors: Yue Zhang, Leyang Cui, Wei Bi, Shuming Shi

    Abstract: Despite their impressive capabilities, large language models (LLMs) have been observed to generate responses that include inaccurate or fabricated information, a phenomenon commonly known as ``hallucination''. In this work, we propose a simple \textit{Induce-then-Contrast} Decoding (ICD) strategy to alleviate hallucinations. We first construct a factually weak LLM by inducing hallucinations from t…

    Submitted 11 March, 2024; v1 submitted 25 December, 2023; originally announced December 2023.

    Comments: Work in progress