
Showing 1–50 of 385 results for author: cui, L

  1. arXiv:2409.03155  [pdf, other]

    cs.CL cs.AI

    Debate on Graph: a Flexible and Reliable Reasoning Framework for Large Language Models

    Authors: Jie Ma, Zhitao Gao, Qi Chai, Wangchun Sun, Pinghui Wang, Hongbin Pei, Jing Tao, Lingyun Song, Jun Liu, Chen Zhang, Lizhen Cui

    Abstract: Large Language Models (LLMs) may suffer from hallucinations in real-world applications due to the lack of relevant knowledge. In contrast, knowledge graphs encompass extensive, multi-relational structures that store a vast array of symbolic facts. Consequently, integrating LLMs with knowledge graphs has been extensively explored, with Knowledge Graph Question Answering (KGQA) serving as a critical…

    Submitted 4 September, 2024; originally announced September 2024.

    Comments: 12 pages

    ACM Class: I.2.4

  2. arXiv:2408.09478  [pdf, other]

    cs.LG cs.CR

    Mitigating Noise Detriment in Differentially Private Federated Learning with Model Pre-training

    Authors: Huitong Jin, Yipeng Zhou, Laizhong Cui, Quan Z. Sheng

    Abstract: Pre-training exploits public datasets to pre-train an advanced machine learning model, so that the model can be easily tuned to adapt to various downstream tasks. Pre-training has been extensively explored to mitigate computation and communication resource consumption. Inspired by these advantages, we are the first to explore how model pre-training can mitigate noise detriment in differentially pr…

    Submitted 18 August, 2024; originally announced August 2024.

  3. arXiv:2408.08642  [pdf, other]

    cs.LG

    The Power of Bias: Optimizing Client Selection in Federated Learning with Heterogeneous Differential Privacy

    Authors: Jiating Ma, Yipeng Zhou, Qi Li, Quan Z. Sheng, Laizhong Cui, Jiangchuan Liu

    Abstract: To preserve the data privacy, the federated learning (FL) paradigm emerges in which clients only expose model gradients rather than original data for conducting model training. To enhance the protection of model gradients in FL, differentially private federated learning (DPFL) is proposed which incorporates differentially private (DP) noises to obfuscate gradients before they are exposed. Yet, an…

    Submitted 16 August, 2024; originally announced August 2024.

  4. arXiv:2408.06576  [pdf, other]

    cs.CL

    CTISum: A New Benchmark Dataset For Cyber Threat Intelligence Summarization

    Authors: Wei Peng, Junmei Ding, Wei Wang, Lei Cui, Wei Cai, Zhiyu Hao, Xiaochun Yun

    Abstract: Cyber Threat Intelligence (CTI) summarization task requires the system to generate concise and accurate highlights from raw intelligence data, which plays an important role in providing decision-makers with crucial information to quickly detect and respond to cyber threats in the cybersecurity domain. However, efficient techniques for summarizing CTI reports, including facts, analytical insights,…

    Submitted 12 August, 2024; originally announced August 2024.

  5. arXiv:2408.03877  [pdf, other]

    cs.LG cs.AI

    Knowledge Probing for Graph Representation Learning

    Authors: Mingyu Zhao, Xingyu Huang, Ziyu Lyu, Yanlin Wang, Lixin Cui, Lu Bai

    Abstract: Graph learning methods have been extensively applied in diverse application areas. However, what kind of inherent graph properties e.g. graph proximity, graph structural information has been encoded into graph representation learning for downstream tasks is still under-explored. In this paper, we propose a novel graph probing framework (GraphProbe) to investigate and interpret whether the family o…

    Submitted 7 August, 2024; originally announced August 2024.

  6. arXiv:2408.02215  [pdf]

    cs.IR

    Exploring Query Understanding for Amazon Product Search

    Authors: Chen Luo, Xianfeng Tang, Hanqing Lu, Yaochen Xie, Hui Liu, Zhenwei Dai, Limeng Cui, Ashutosh Joshi, Sreyashi Nag, Yang Li, Zhen Li, Rahul Goutam, Jiliang Tang, Haiyang Zhang, Qi He

    Abstract: Online shopping platforms, such as Amazon, offer services to billions of people worldwide. Unlike web search or other search engines, product search engines have their unique characteristics, primarily featuring short queries which are mostly a combination of product attributes and structured product search space. The uniqueness of product search underscores the crucial importance of the query und…

    Submitted 4 August, 2024; originally announced August 2024.

  7. arXiv:2407.21523  [pdf, other]

    cs.LG cs.AI cs.DB

    Tabular Data Augmentation for Machine Learning: Progress and Prospects of Embracing Generative AI

    Authors: Lingxi Cui, Huan Li, Ke Chen, Lidan Shou, Gang Chen

    Abstract: Machine learning (ML) on tabular data is ubiquitous, yet obtaining abundant high-quality tabular data for model training remains a significant obstacle. Numerous works have focused on tabular data augmentation (TDA) to enhance the original table with additional data, thereby improving downstream ML tasks. Recently, there has been a growing interest in leveraging the capabilities of generative AI f…

    Submitted 31 July, 2024; originally announced July 2024.

    Comments: repository maintained at https://github.com/SuDIS-ZJU/awesome-tabular-data-augmentation

  8. arXiv:2407.14530  [pdf, other]

    cs.DB cs.AI

    FuncEvalGMN: Evaluating Functional Correctness of SQL via Graph Matching Network

    Authors: Yi Zhan, Yang Sun, Han Weng, Longjie Cui, Guifeng Wang, Jiajun Xie, Yu Tian, Xiaoming Yin, Boyi Liu, Dongchi Huang

    Abstract: In this paper, we propose a novel graph-based methodology to evaluate the functional correctness of SQL generation. Conventional metrics for assessing SQL code generation, such as matching-based and execution-based methods (e.g., exact set match and execution accuracy), are subject to two primary limitations. Firstly, the former fails to effectively assess functional correctness, as different SQL…

    Submitted 8 July, 2024; originally announced July 2024.

  9. arXiv:2407.13605  [pdf, other]

    cs.LG

    Physics-guided Active Sample Reweighting for Urban Flow Prediction

    Authors: Wei Jiang, Tong Chen, Guanhua Ye, Wentao Zhang, Lizhen Cui, Zi Huang, Hongzhi Yin

    Abstract: Urban flow prediction is a spatio-temporal modeling task that estimates the throughput of transportation services like buses, taxis, and ride-sharing, where data-driven models have become the most popular solution in the past decade. Meanwhile, the implicitly learned mapping from historical observations to the prediction targets tends to over-simplify the dynamics of real-world urban flows, lead…

    Submitted 6 August, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: This paper is accepted by Proceedings of the 33rd ACM International Conference on Information and Knowledge Management (CIKM '24)

  10. arXiv:2407.09860  [pdf, other]

    quant-ph cond-mat.stat-mech

    Quantum Vicsek Model for Active Matter

    Authors: Hong Yuan, L. X. Cui, L. T. Chen, C. P. Sun

    Abstract: We propose a quantum analog of the Vicsek model, consisting of an ensemble of overdamped spin$-1/2$ particles with ferromagnetic couplings, driven by a uniformly polarized magnetic field. The spontaneous magnetization of the spin components breaks the $SO(3)$ (or $SO(2)$) symmetry, inducing an ordered phase of flocking. We derive the hydrodynamic equations, similar to those formulated by Toner and…

    Submitted 13 July, 2024; originally announced July 2024.

  11. arXiv:2406.19698  [pdf, other]

    math.CO

    Optimal radio labeling for the Cartesian product of square mesh networks and stars

    Authors: Linlin Cui, Feng Li

    Abstract: As the most critical component in the communication process, channels have a great impact on the communication quality of network. With the continuous expansion of network scale, the limited channel resources lead to the limitation of communication network scale. Therefore, achieving reasonable channel assignment and utilization becomes an extremely challenging problem. In order to solve this issu…

    Submitted 28 June, 2024; originally announced June 2024.

  12. arXiv:2406.18962  [pdf, other]

    cs.IR

    Multi-modal Food Recommendation using Clustering and Self-supervised Learning

    Authors: Yixin Zhang, Xin Zhou, Qianwen Meng, Fanglin Zhu, Yonghui Xu, Zhiqi Shen, Lizhen Cui

    Abstract: Food recommendation systems serve as pivotal components in the realm of digital lifestyle services, designed to assist users in discovering recipes and food items that resonate with their unique dietary predilections. Typically, multi-modal descriptions offer an exhaustive profile for each recipe, thereby ensuring recommendations that are both personalized and accurate. Our preliminary investigati…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: Working paper

  13. arXiv:2406.17335  [pdf, other]

    cs.IR cs.LG

    A Thorough Performance Benchmarking on Lightweight Embedding-based Recommender Systems

    Authors: Hung Vinh Tran, Tong Chen, Quoc Viet Hung Nguyen, Zi Huang, Lizhen Cui, Hongzhi Yin

    Abstract: Since the creation of the Web, recommender systems (RSs) have been an indispensable mechanism in information filtering. State-of-the-art RSs primarily depend on categorical features, which are encoded by embedding vectors, resulting in excessively large embedding tables. To prevent over-parameterized embedding tables from harming scalability, both academia and industry have seen increasing efforts in c…

    Submitted 25 June, 2024; originally announced June 2024.

  14. arXiv:2406.17312  [pdf, other]

    cs.CL

    Not All Preference Pairs Are Created Equal: A Recipe for Annotation-Efficient Iterative Preference Learning

    Authors: Sen Yang, Leyang Cui, Deng Cai, Xinting Huang, Shuming Shi, Wai Lam

    Abstract: Iterative preference learning, though yielding superior performances, requires online annotated preference labels. In this work, we study strategies to select worth-annotating response pairs for cost-efficient annotation while achieving competitive or even better performances compared with the random selection baseline for iterative preference learning. Built on assumptions regarding uncertainty a…

    Submitted 25 June, 2024; originally announced June 2024.

  15. arXiv:2406.16377  [pdf, other]

    cs.CL cs.AI

    On the Transformations across Reward Model, Parameter Update, and In-Context Prompt

    Authors: Deng Cai, Huayang Li, Tingchen Fu, Siheng Li, Weiwen Xu, Shuaiyi Li, Bowen Cao, Zhisong Zhang, Xinting Huang, Leyang Cui, Yan Wang, Lemao Liu, Taro Watanabe, Shuming Shi

    Abstract: Despite the general capabilities of pre-trained large language models (LLMs), they still need further adaptation to better serve practical applications. In this paper, we demonstrate the interchangeability of three popular and distinct adaptation tools: parameter updating, reward modeling, and in-context prompting. This interchangeability establishes a triangular framework with six transformation…

    Submitted 24 June, 2024; originally announced June 2024.

  16. arXiv:2406.07400  [pdf, other]

    cs.LG cs.LO

    Guiding LLM Temporal Logic Generation with Explicit Separation of Data and Control

    Authors: William Murphy, Nikolaus Holzer, Nathan Koenig, Leyi Cui, Raven Rothkopf, Feitong Qiao, Mark Santolucito

    Abstract: Temporal logics are powerful tools that are widely used for the synthesis and verification of reactive systems. The recent progress on Large Language Models (LLMs) has the potential to make the process of writing such specifications more accessible. However, writing specifications in temporal logics remains challenging for all but the most expert users. A key question in using LLMs for temporal lo…

    Submitted 11 June, 2024; originally announced June 2024.

  17. arXiv:2406.02027  [pdf, other]

    cs.LG cs.AI cs.CR cs.CV

    Inference Attacks: A Taxonomy, Survey, and Promising Directions

    Authors: Feng Wu, Lei Cui, Shaowen Yao, Shui Yu

    Abstract: The prosperity of machine learning has also brought people's concerns about data privacy. Among them, inference attacks can implement privacy breaches in various MLaaS scenarios and model training/prediction phases. Specifically, inference attacks can perform privacy inference on undisclosed target training sets based on outputs of the target model, including but not limited to statistics, members…

    Submitted 27 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  18. arXiv:2405.14742  [pdf, other]

    cs.LG cs.AI

    HC-GAE: The Hierarchical Cluster-based Graph Auto-Encoder for Graph Representation Learning

    Authors: Zhuo Xu, Lu Bai, Lixin Cui, Ming Li, Yue Wang, Edwin R. Hancock

    Abstract: Graph Auto-Encoders (GAEs) are powerful tools for graph representation learning. In this paper, we develop a novel Hierarchical Cluster-based GAE (HC-GAE), that can learn effective structural characteristics for graph data analysis. To this end, during the encoding process, we commence by utilizing the hard node assignment to decompose a sample graph into a family of separated subgraphs. We compre…

    Submitted 23 May, 2024; originally announced May 2024.

  19. arXiv:2405.12689  [pdf, other]

    cs.CL cs.AI

    Spotting AI's Touch: Identifying LLM-Paraphrased Spans in Text

    Authors: Yafu Li, Zhilin Wang, Leyang Cui, Wei Bi, Shuming Shi, Yue Zhang

    Abstract: AI-generated text detection has attracted increasing attention as powerful language models approach human-level generation. Limited work is devoted to detecting (partially) AI-paraphrased texts. However, AI paraphrasing is commonly employed in various application scenarios for text refinement and diversity. To this end, we propose a novel detection framework, paraphrased text span detection (PTD),…

    Submitted 29 May, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

    Comments: ACL 2024 Findings

  20. arXiv:2405.10218  [pdf, other]

    cs.LG cs.AI

    ENADPool: The Edge-Node Attention-based Differentiable Pooling for Graph Neural Networks

    Authors: Zhehan Zhao, Lu Bai, Lixin Cui, Ming Li, Yue Wang, Lixiang Xu, Edwin R. Hancock

    Abstract: Graph Neural Networks (GNNs) are powerful tools for graph classification. One important operation for GNNs is the downsampling or pooling that can learn effective embeddings from the node representations. In this paper, we propose a new hierarchical pooling operation, namely the Edge-Node Attention-based Differentiable Pooling (ENADPool), for GNNs to learn effective graph representations. Unlike t…

    Submitted 16 May, 2024; originally announced May 2024.

  21. arXiv:2405.09808  [pdf, other]

    quant-ph physics.optics

    Single-photon phase spectrum recovery from the Hong-Ou-Mandel dip

    Authors: Yuhang Lei, Wen Zhao, Liang Cui, Xiaoying Li

    Abstract: Characterizing the temporal-spectral profile of single photons is essential for quantum information protocol utilizing temporal mode for encoding. Based on the phase retrieval algorithm, we present a method to reconstruct the phase spectrum difference between two wave packets from their Hong-Ou-Mandel dip, and intensity spectra. Our confirmatory experiment with weak coherent wave packets demonstra…

    Submitted 14 August, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

  22. arXiv:2405.08054  [pdf, other]

    cs.GR cs.CV

    Coin3D: Controllable and Interactive 3D Assets Generation with Proxy-Guided Conditioning

    Authors: Wenqi Dong, Bangbang Yang, Lin Ma, Xiao Liu, Liyuan Cui, Hujun Bao, Yuewen Ma, Zhaopeng Cui

    Abstract: As humans, we aspire to create media content that is both freely willed and readily controlled. Thanks to the prominent development of generative techniques, we now can easily utilize 2D diffusion methods to synthesize images controlled by raw sketch or designated human poses, and even progressively edit/regenerate local regions with masked inpainting. However, similar workflows in 3D modeling tas…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: Project webpage: https://zju3dv.github.io/coin3d

  23. arXiv:2405.06426  [pdf, other]

    physics.plasm-ph

    Generation of Ultra-Collimated Polarized Attosecond $γ-$Rays via Beam Instabilities

    Authors: Li-Jie Cui, Ke-Jia Wei, Chong Lv, Feng Wan, Yousef I. Salamin, Lei-Feng Cao, Jian-Xing Li

    Abstract: Polarized attosecond $γ-$rays may offer excitation and hyperfine tracking of reactions relevant to nuclear physics, astrophysics, high-energy physics, etc. However, unfortunately, generation of a feasible and easy-to-deploy source is still a great challenge. Here, we put forward a novel method for producing ultra-collimated high-brilliance polarized attosecond $γ-$rays via the interaction of an un…

    Submitted 10 May, 2024; originally announced May 2024.

  24. arXiv:2405.04270  [pdf, other]

    astro-ph.GA

    Very Long Baseline Array Observations of Parsec-scale Radio Emission in Dual Active Galactic Nuclei

    Authors: Wancheng Xu, Lang Cui, Xiang Liu, Tao An, Hongmin Cao, Pengfei Jiang, Luis C. Ho, Ning Chang, Xiaolong Yang, Yuling Shen, Guiping Tan, Zhenhua Han, Junhui Fan, Ming Zhang

    Abstract: It is believed that dual active galactic nuclei (dual AGN) form during galaxy mergers. Studying dual-AGN emission can provide valuable insights into galaxy merging and evolution. To investigate parsec-scale radio emission properties, we observed eight radio components of four selected dual-AGN systems using the Very Long Baseline Array (VLBA) at 5 GHz in multiple-phase-center mode. Among them…

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 15 pages, 4 figures

  25. arXiv:2405.02008  [pdf, other]

    cs.CV

    DiffMap: Enhancing Map Segmentation with Map Prior Using Diffusion Model

    Authors: Peijin Jia, Tuopu Wen, Ziang Luo, Mengmeng Yang, Kun Jiang, Zhiquan Lei, Xuewei Tang, Ziyuan Liu, Le Cui, Bo Zhang, Long Huang, Diange Yang

    Abstract: Constructing high-definition (HD) maps is a crucial requirement for enabling autonomous driving. In recent years, several map segmentation algorithms have been developed to address this need, leveraging advancements in Bird's-Eye View (BEV) perception. However, existing models still encounter challenges in producing realistic and consistent semantic map layouts. One prominent issue is the limited…

    Submitted 1 September, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

  26. arXiv:2404.16343  [pdf, other]

    astro-ph.GA astro-ph.HE

    Magnetically Driven Relativistic Jet in the High-Redshift Blazar OH~471

    Authors: S. Guo, T. An, Y. Liu, Y. Sotnikova, A. Volvach, T. Mufakharov, L. Chen, L. Cui, A. Wang, Z. Xu, Y. Zhang, W. Xu, Y. A. Kovalev, Y. Y. Kovalev, M. Kharinov, A. Erkenov, T. Semenova, L. Volvach

    Abstract: Context: Understanding the mechanisms that launch and shape powerful relativistic jets from supermassive black holes (SMBHs) in high-redshift active galactic nuclei (AGN) is crucial for probing the co-evolution of SMBHs and galaxies over cosmic time. Aims: We study the high-redshift ($z=3.396$) blazar OH~471 to explore the jet launching mechanism in the early Universe. Methods: Using multi-f…

    Submitted 20 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: 16 pages, 7 figures, 3 tables

    Journal ref: A&A 685, L11 (2024)

  27. arXiv:2404.06020  [pdf, other]

    astro-ph.HE gr-qc

    Tests of the Kerr Hypothesis with MAXI J1803-298 Using Different RELXILL_NK Flavors

    Authors: Jie Liao, M. Ghasemi-Nodehi, Lang Cui, Ashutosh Tripathi, Yong-Feng Huang, Xiang Liu

    Abstract: Iron line spectroscopy has been one of the leading methods not only for measuring the spins of accreting black holes but also for testing fundamental physics. Based on such a method, we present an analysis of a dataset observed simultaneously by NuSTAR and NICER for the black hole binary candidate MAXI J1803-298, which shows prominent relativistic reflection features. Various relxill_nk flavors a…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: Accepted by ApJ

  28. arXiv:2404.03622  [pdf, other]

    cs.CL

    Mind's Eye of LLMs: Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models

    Authors: Wenshan Wu, Shaoguang Mao, Yadong Zhang, Yan Xia, Li Dong, Lei Cui, Furu Wei

    Abstract: Large language models (LLMs) have exhibited impressive performance in language comprehension and various reasoning tasks. However, their abilities in spatial reasoning, a crucial aspect of human cognition, remain relatively unexplored. Humans possess a remarkable ability to create mental images of unseen objects and actions through a process known as the Mind's Eye, enabling the imagination of the…

    Submitted 24 May, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

  29. A Change of Scenery: Transformative Insights from Retrospective VR Embodied Perspective-Taking of Conflict With a Close Other

    Authors: Seraphina Yong, Leo Cui, Evan Suma Rosenberg, Svetlana Yarosh

    Abstract: Close relationships are irreplaceable social resources, yet prone to high-risk conflict. Building on findings from the fields of HCI, virtual reality, and behavioral therapy, we evaluate the unexplored potential of retrospective VR-embodied perspective-taking to fundamentally influence conflict resolution in close others. We develop a biographically-accurate Retrospective Embodied Perspective-Taki…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 18 pages, 5 figures, Proceedings of the 2024 CHI Conference on Human Factors in Computing Systems

  30. arXiv:2403.18479  [pdf, other]

    cs.IR

    Lightweight Embeddings for Graph Collaborative Filtering

    Authors: Xurong Liang, Tong Chen, Lizhen Cui, Yang Wang, Meng Wang, Hongzhi Yin

    Abstract: Graph neural networks (GNNs) are currently one of the most performant collaborative filtering methods. Meanwhile, owing to the use of an embedding table to represent each user/item as a distinct vector, GNN-based recommenders have inherited the long-standing defect of parameter inefficiency. As a common practice for scalable embeddings, parameter sharing enables the use of fewer embedding vectors…

    Submitted 28 March, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: Accepted by SIGIR '24

  31. arXiv:2403.18249  [pdf, other]

    cs.CL cs.SI

    Exploring the Deceptive Power of LLM-Generated Fake News: A Study of Real-World Detection Challenges

    Authors: Yanshen Sun, Jianfeng He, Limeng Cui, Shuo Lei, Chang-Tien Lu

    Abstract: Recent advancements in Large Language Models (LLMs) have enabled the creation of fake news, particularly in complex fields like healthcare. Studies highlight the gap in the deceptive power of LLM-generated fake news with and without human assistance, yet the potential of prompting techniques has not been fully explored. Thus, this work aims to determine whether prompting strategies can effectively…

    Submitted 8 April, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

  32. arXiv:2403.16443  [pdf, other]

    cs.CL cs.AI cs.SE

    CodeS: Natural Language to Code Repository via Multi-Layer Sketch

    Authors: Daoguang Zan, Ailun Yu, Wei Liu, Dong Chen, Bo Shen, Wei Li, Yafen Yao, Yongshun Gong, Xiaolin Chen, Bei Guan, Zhiguang Yang, Yongji Wang, Qianxiang Wang, Lizhen Cui

    Abstract: The impressive performance of large language models (LLMs) on code-related tasks has shown the potential of fully automated software development. In light of this, we introduce a new software engineering task, namely Natural Language to code Repository (NL2Repo). This task aims to generate an entire code repository from its natural language requirements. To address this task, we propose a simple y…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: https://github.com/NL2Code/CodeS

  33. arXiv:2403.16227  [pdf, other]

    cs.CV

    Dual-modal Prior Semantic Guided Infrared and Visible Image Fusion for Intelligent Transportation System

    Authors: Jing Li, Lu Bai, Bin Yang, Chang Li, Lingfei Ma, Lixin Cui, Edwin R. Hancock

    Abstract: Infrared and visible image fusion (IVF) plays an important role in intelligent transportation system (ITS). The early works predominantly focus on boosting the visual appeal of the fused result, and only several recent approaches have tried to combine the high-level vision task with IVF. However, they prioritize the design of cascaded structure to seek unified suitable features and fit different t…

    Submitted 24 March, 2024; originally announced March 2024.

  34. arXiv:2403.16133  [pdf, other]

    cs.AI cs.LG

    SSHPool: The Separated Subgraph-based Hierarchical Pooling

    Authors: Zhuo Xu, Lixin Cui, Ming Li, Yue Wang, Ziyu Lyu, Hangyuan Du, Lu Bai, Philip S. Yu, Edwin R. Hancock

    Abstract: In this paper, we develop a novel local graph pooling method, namely the Separated Subgraph-based Hierarchical Pooling (SSHPool), for graph classification. We commence by assigning the nodes of a sample graph into different clusters, resulting in a family of separated subgraphs. We individually employ the local graph convolution units as the local structure to further compress each subgraph into a…

    Submitted 13 August, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

  35. arXiv:2403.16130  [pdf, other]

    cs.LG cs.AI

    AKBR: Learning Adaptive Kernel-based Representations for Graph Classification

    Authors: Feifei Qian, Lixin Cui, Ming Li, Yue Wang, Hangyuan Du, Lixiang Xu, Lu Bai, Philip S. Yu, Edwin R. Hancock

    Abstract: In this paper, we propose a new model to learn Adaptive Kernel-based Representations (AKBR) for graph classification. Unlike state-of-the-art R-convolution graph kernels that are defined by merely counting any pair of isomorphic substructures between graphs and cannot provide an end-to-end learning mechanism for the classifier, the proposed AKBR approach aims to define an end-to-end representation…

    Submitted 13 August, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

  36. arXiv:2403.06021  [pdf, other]

    cs.IR cs.LG

    Hierarchical Query Classification in E-commerce Search

    Authors: Bing He, Sreyashi Nag, Limeng Cui, Suhang Wang, Zheng Li, Rahul Goutam, Zhen Li, Haiyang Zhang

    Abstract: E-commerce platforms typically store and structure product information and search data in a hierarchy. Efficiently categorizing user search queries into a similar hierarchical structure is paramount in enhancing user experience on e-commerce platforms as well as news curation and academic research. The significance of this task is amplified when dealing with sensitive query categorization or criti…

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: Published at: the ACM Web Conference 2024 in the industry track (WWW'24)

  37. VLBI Astrometry of Radio Stars to Link Radio and Optical Celestial Reference Frames: Observing Strategies

    Authors: Jingdong Zhang, Bo Zhang, Shuangjing Xu, Niu Liu, Wen Chen, Hao Ding, Pengfei Jiang, Yan Sun, Jinqing Wang, Lang Cui, Shiming Wen, Xiaofeng Mai, Jinling Li, Fengchun Shu, Yidan Huang

    Abstract: The Gaia celestial reference frame (Gaia-CRF) will benefit from a close assessment with independent methods, such as Very Long Baseline Interferometry (VLBI) measurements of radio stars at bright magnitudes. However, obtaining full astrometric parameters for each radio star through VLBI measurements demands a significant amount of observation time. This study proposes an efficient observing strate…

    Submitted 26 March, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: 9 pages, 4 figures, accepted for publication in the Monthly Notices of the Royal Astronomical Society (MNRAS)

  38. arXiv:2403.02693  [pdf, other]

    cs.MM eess.IV

    Optimizing Mobile-Friendly Viewport Prediction for Live 360-Degree Video Streaming

    Authors: Lei Zhang, Tao Long, Weizhen Xu, Laizhong Cui, Jiangchuan Liu

    Abstract: Viewport prediction is the crucial task for adaptive 360-degree video streaming, as the bitrate control algorithms usually require the knowledge of the user's viewing portions of the frames. Various methods are studied and adopted for viewport prediction from less accurate statistic tools to highly calibrated deep neural networks. Conventionally, it is difficult to implement sophisticated deep lea…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 14 pages

  39. arXiv:2403.01244  [pdf, other]

    cs.CL cs.AI

    Mitigating Catastrophic Forgetting in Large Language Models with Self-Synthesized Rehearsal

    Authors: Jianheng Huang, Leyang Cui, Ante Wang, Chengyi Yang, Xinting Liao, Linfeng Song, Junfeng Yao, Jinsong Su

    Abstract: Large language models (LLMs) suffer from catastrophic forgetting during continual learning. Conventional rehearsal-based methods rely on previous training data to retain the model's ability, which may not be feasible in real-world applications. When conducting continual learning based on a publicly-released LLM checkpoint, the availability of the original training data may be non-existent. To addr…

    Submitted 25 May, 2024; v1 submitted 2 March, 2024; originally announced March 2024.

    Comments: ACL 2024 main, long paper

  40. arXiv:2402.19255  [pdf, other]

    cs.CL

    GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers

    Authors: Qintong Li, Leyang Cui, Xueliang Zhao, Lingpeng Kong, Wei Bi

    Abstract: Large language models (LLMs) have achieved impressive performance across various mathematical reasoning benchmarks. However, there are increasing debates regarding whether these models truly understand and apply mathematical knowledge or merely rely on shortcuts for mathematical reasoning. One essential and frequently occurring evidence is that when the math questions are slightly changed, LLMs ca…

    Submitted 1 July, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: ACL 2024

  41. arXiv:2402.17532  [pdf, other]

    cs.CL

    Retrieval is Accurate Generation

    Authors: Bowen Cao, Deng Cai, Leyang Cui, Xuxin Cheng, Wei Bi, Yuexian Zou, Shuming Shi

    Abstract: Standard language models generate text by selecting tokens from a fixed, finite, and standalone vocabulary. We introduce a novel method that selects context-aware phrases from a collection of supporting documents. One of the most significant challenges for this paradigm shift is determining the training oracles, because a string of text can be segmented in various ways and each segment can be retr…

    Submitted 16 March, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: ICLR 2024

  42. arXiv:2402.16978  [pdf, other]

    math.OC cs.LG

    An inexact Bregman proximal point method and its acceleration version for unbalanced optimal transport

    Authors: Xiang Chen, Faqiang Wang, Jun Liu, Li Cui

    Abstract: The Unbalanced Optimal Transport (UOT) problem plays increasingly important roles in computational biology, computational imaging and deep learning. Scaling algorithm is widely used to solve UOT due to its convenience and good convergence properties. However, this algorithm has lower accuracy for large regularization parameters, and due to stability issues, small regularization parameters can easi…

    Submitted 26 February, 2024; originally announced February 2024.

  43. arXiv:2402.15865  [pdf, other]

    cs.CV eess.IV

    HIR-Diff: Unsupervised Hyperspectral Image Restoration Via Improved Diffusion Models

    Authors: Li Pang, Xiangyu Rui, Long Cui, Hongzhong Wang, Deyu Meng, Xiangyong Cao

    Abstract: Hyperspectral image (HSI) restoration aims at recovering clean images from degraded observations and plays a vital role in downstream tasks. Existing model-based methods have limitations in accurately modeling the complex image characteristics with handcraft priors, and deep learning-based methods suffer from poor generalization ability. To alleviate these issues, this paper proposes an unsupervis…

    Submitted 24 February, 2024; originally announced February 2024.

  44. arXiv:2402.03815  [pdf, other]

    cs.LG

    Expediting In-Network Federated Learning by Voting-Based Consensus Model Compression

    Authors: Xiaoxin Su, Yipeng Zhou, Laizhong Cui, Song Guo

    Abstract: Recently, federated learning (FL) has gained momentum because of its capability in preserving data privacy. To conduct model training by FL, multiple clients exchange model updates with a parameter server via Internet. To accelerate the communication speed, it has been explored to deploy a programmable switch (PS) in lieu of the parameter server to coordinate clients. The challenge to deploy the P…

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: To appear in 2024 IEEE International Conference on Computer Communications (INFOCOM 2024)

  45. arXiv:2402.03770  [pdf, other]

    cs.LG

    Fed-CVLC: Compressing Federated Learning Communications with Variable-Length Codes

    Authors: Xiaoxin Su, Yipeng Zhou, Laizhong Cui, John C. S. Lui, Jiangchuan Liu

    Abstract: In Federated Learning (FL) paradigm, a parameter server (PS) concurrently communicates with distributed participating clients for model collection, update aggregation, and model distribution over multiple rounds, without touching private data owned by individual clients. FL is appealing in preserving data privacy; yet the communication between the PS and scattered clients can be a severe bottlenec…

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: To appear in 2024 IEEE International Conference on Computer Communications (INFOCOM 2024)

  46. arXiv:2402.02360  [pdf, other]

    astro-ph.HE

    On the Broadening of the Pulse Width of FRB 20121102A due to Propagation and Instrumental Effects

    Authors: Jia-Peng Wei, Yong-Feng Huang, Lang Cui, Xiang Liu, Jin-Jun Geng, Xue-Feng Wu

    Abstract: The pulse widths of fast radio bursts are always broadened due to the scattering of the plasma medium through which the electromagnetic wave passes. The recorded pulse width will be further affected by the radio telescopes since the sampling time and the bandwidth cannot be infinitely small. In this study, we focus on the pulse widths of the 3287 bursts detected from FRB 20121102A as of October 20…

    Submitted 4 February, 2024; originally announced February 2024.

  47. Gravitational Wave Emission from Close-in Strange Quark Planets Around Strange Stars with Magnetic Interactions

    Authors: Xiao-Li Zhang, Ze-Cheng Zou, Yong-Feng Huang, Hao-Xuan Gao, Pei Wang, Lang Cui, Xiang Liu

    Abstract: According to the strange quark matter hypothesis, strange planets may exist, which are planetary mass objects composed of almost equal numbers of up, down and strange quarks. A strange planet can revolve around its host strange star in a very close-in orbit. When it finally merges with the host, strong gravitational wave emissions will be generated. Here the gravitational waveforms are derived for…

    Submitted 7 June, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Journal ref: Monthly Notices of the Royal Astronomical Society (MNRAS), 531:3905, 2024

  48. arXiv:2401.17630  [pdf, other]

    cs.IR

    Towards Personalized Privacy: User-Governed Data Contribution for Federated Recommendation

    Authors: Liang Qu, Wei Yuan, Ruiqi Zheng, Lizhen Cui, Yuhui Shi, Hongzhi Yin

    Abstract: Federated recommender systems (FedRecs) have gained significant attention for their potential to protect user's privacy by keeping user privacy data locally and only communicating model parameters/gradients to the server. Nevertheless, the currently existing architecture of FedRecs assumes that all users have the same 0-privacy budget, i.e., they do not upload any data to the server, thus overlook…

    Submitted 31 January, 2024; originally announced January 2024.

  49. arXiv:2401.13448  [pdf, other]

    cs.IR

    Decentralized Collaborative Learning with Adaptive Reference Data for On-Device POI Recommendation

    Authors: Ruiqi Zheng, Liang Qu, Tong Chen, Lizhen Cui, Yuhui Shi, Hongzhi Yin

    Abstract: In Location-based Social Networks, Point-of-Interest (POI) recommendation helps users discover interesting places. There is a trend to move from the cloud-based model to on-device recommendations for privacy protection and reduced server reliance. Due to the scarcity of local user-item interactions on individual devices, solely relying on local instances is not adequate. Collaborative Learning (CL…

    Submitted 24 January, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

  50. arXiv:2401.11913  [pdf, other]

    cs.CV cs.AI

    Large receptive field strategy and important feature extraction strategy in 3D object detection

    Authors: Leichao Cui, Xiuxian Li, Min Meng, Guangyu Jia

    Abstract: The enhancement of 3D object detection is pivotal for precise environmental perception and improved task execution capabilities in autonomous driving. LiDAR point clouds, offering accurate depth information, serve as a crucial information for this purpose. Our study focuses on key challenges in 3D target detection. To tackle the challenge of expanding the receptive field of a 3D convolutional kern…

    Submitted 10 March, 2024; v1 submitted 22 January, 2024; originally announced January 2024.