Showing 1–50 of 81 results for author: Ye, G

Searching in archive cs.
  1. arXiv:2407.13605  [pdf, other]

    cs.LG

    Physics-guided Active Sample Reweighting for Urban Flow Prediction

    Authors: Wei Jiang, Tong Chen, Guanhua Ye, Wentao Zhang, Lizhen Cui, Zi Huang, Hongzhi Yin

    Abstract: Urban flow prediction is a spatio-temporal modeling task that estimates the throughput of transportation services like buses, taxis, and ride-sharing, where data-driven models have become the most popular solution in the past decade. Meanwhile, the implicitly learned mapping between historical observations to the prediction targets tend to over-simplify the dynamics of real-world urban flows, lead…

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: This paper is accepted by Proceedings of the 33rd ACM International Conference on Information and Knowledge Management (CIKM '24)

  2. arXiv:2406.13200  [pdf, other]

    cs.LG

    RobGC: Towards Robust Graph Condensation

    Authors: Xinyi Gao, Hongzhi Yin, Tong Chen, Guanhua Ye, Wentao Zhang, Bin Cui

    Abstract: Graph neural networks (GNNs) have attracted widespread attention for their impressive capability of graph representation learning. However, the increasing prevalence of large-scale graphs presents a significant challenge for GNN training due to their computational demands, limiting the applicability of GNNs in various scenarios. In response to this challenge, graph condensation (GC) is proposed as…

    Submitted 19 June, 2024; originally announced June 2024.

  3. arXiv:2406.05387  [pdf, other]

    cs.IR

    PTF-FSR: A Parameter Transmission-Free Federated Sequential Recommender System

    Authors: Wei Yuan, Chaoqun Yang, Liang Qu, Quoc Viet Hung Nguyen, Guanhua Ye, Hongzhi Yin

    Abstract: Sequential recommender systems have made significant progress. Recently, due to increasing concerns about user data privacy, some researchers have implemented federated learning for sequential recommendation, a.k.a., Federated Sequential Recommender Systems (FedSeqRecs), in which a public sequential recommender model is shared and frequently transmitted between a central server and clients to achi…

    Submitted 8 June, 2024; originally announced June 2024.

  4. arXiv:2406.01022  [pdf, other]

    cs.CR cs.IR

    Poisoning Attacks and Defenses in Recommender Systems: A Survey

    Authors: Zongwei Wang, Junliang Yu, Min Gao, Wei Yuan, Guanhua Ye, Shazia Sadiq, Hongzhi Yin

    Abstract: Modern recommender systems (RS) have profoundly enhanced user experience across digital platforms, yet they face significant threats from poisoning attacks. These attacks, aimed at manipulating recommendation outputs for unethical gains, exploit vulnerabilities in RS through injecting malicious data or intervening model training. This survey presents a unique perspective by examining these threats…

    Submitted 5 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: 22 pages, 8 figures

  5. arXiv:2405.17894  [pdf, other]

    cs.CV cs.AI

    White-box Multimodal Jailbreaks Against Large Vision-Language Models

    Authors: Ruofan Wang, Xingjun Ma, Hanxu Zhou, Chuanjun Ji, Guangnan Ye, Yu-Gang Jiang

    Abstract: Recent advancements in Large Vision-Language Models (VLMs) have underscored their superiority in various multimodal tasks. However, the adversarial robustness of VLMs has not been fully explored. Existing methods mainly assess robustness through unimodal adversarial attacks that perturb images, while assuming inherent resilience against text-based attacks. Different from existing attacks, in this…

    Submitted 28 May, 2024; originally announced May 2024.

  6. arXiv:2405.16433  [pdf, other]

    cs.CL cs.AI cs.CY

    CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling

    Authors: Chenhao Zhang, Renhao Li, Minghuan Tan, Min Yang, Jingwei Zhu, Di Yang, Jiahao Zhao, Guancheng Ye, Chengming Li, Xiping Hu

    Abstract: Using large language models (LLMs) to assist psychological counseling is a significant but challenging task at present. Attempts have been made on improving empathetic conversations or acting as effective assistants in the treatment with LLMs. However, the existing datasets lack consulting knowledge, resulting in LLMs lacking professional consulting competence. Moreover, how to automatically evalu…

    Submitted 10 June, 2024; v1 submitted 26 May, 2024; originally announced May 2024.

    Comments: Accepted to Findings of ACL 2024

  7. arXiv:2405.13811  [pdf, other]

    cs.IR

    Diffusion-Based Cloud-Edge-Device Collaborative Learning for Next POI Recommendations

    Authors: Jing Long, Guanhua Ye, Tong Chen, Yang Wang, Meng Wang, Hongzhi Yin

    Abstract: The rapid expansion of Location-Based Social Networks (LBSNs) has highlighted the importance of effective next Point-of-Interest (POI) recommendations, which leverage historical check-in data to predict users' next POIs to visit. Traditional centralized deep neural networks (DNNs) offer impressive POI recommendation performance but face challenges due to privacy concerns and limited timeliness. In…

    Submitted 22 May, 2024; originally announced May 2024.

  8. arXiv:2405.13707  [pdf, other]

    cs.LG cs.AI

    Rethinking and Accelerating Graph Condensation: A Training-Free Approach with Class Partition

    Authors: Xinyi Gao, Tong Chen, Wentao Zhang, Junliang Yu, Guanhua Ye, Quoc Viet Hung Nguyen, Hongzhi Yin

    Abstract: The increasing prevalence of large-scale graphs poses a significant challenge for graph neural network training, attributed to their substantial computational requirements. In response, graph condensation (GC) emerges as a promising data-centric solution aiming to substitute the large graph with a small yet informative condensed graph to facilitate data-efficient GNN training. However, existing GC…

    Submitted 22 May, 2024; originally announced May 2024.

  9. arXiv:2405.11811  [pdf, other]

    cs.LG cs.DC

    FedCAda: Adaptive Client-Side Optimization for Accelerated and Stable Federated Learning

    Authors: Liuzhi Zhou, Yu He, Kun Zhai, Xiang Liu, Sen Liu, Xingjun Ma, Guangnan Ye, Yu-Gang Jiang, Hongfeng Chai

    Abstract: Federated learning (FL) has emerged as a prominent approach for collaborative training of machine learning models across distributed clients while preserving data privacy. However, the quest to balance acceleration and stability becomes a significant challenge in FL, especially on the client-side. In this paper, we introduce FedCAda, an innovative federated client adaptive algorithm designed to ta…

    Submitted 20 May, 2024; originally announced May 2024.

  10. arXiv:2405.10212  [pdf, other]

    cs.CL

    CPsyExam: A Chinese Benchmark for Evaluating Psychology using Examinations

    Authors: Jiahao Zhao, Jingwei Zhu, Minghuan Tan, Min Yang, Di Yang, Chenhao Zhang, Guancheng Ye, Chengming Li, Xiping Hu

    Abstract: In this paper, we introduce a novel psychological benchmark, CPsyExam, constructed from questions sourced from Chinese language examinations. CPsyExam is designed to prioritize psychological knowledge and case analysis separately, recognizing the significance of applying psychological knowledge to real-world scenarios. From the pool of 22k questions, we utilize 4k to create the benchmark that offe…

    Submitted 18 May, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

  11. arXiv:2405.08340  [pdf, other]

    cs.CR cs.CV

    Achieving Resolution-Agnostic DNN-based Image Watermarking: A Novel Perspective of Implicit Neural Representation

    Authors: Yuchen Wang, Xingyu Zhu, Guanhui Ye, Shiyao Zhang, Xuetao Wei

    Abstract: DNN-based watermarking methods are rapidly developing and delivering impressive performances. Recent advances achieve resolution-agnostic image watermarking by reducing the variant resolution watermarking problem to a fixed resolution watermarking problem. However, such a reduction process can potentially introduce artifacts and low robustness. To address this issue, we propose the first, to the b…

    Submitted 14 May, 2024; originally announced May 2024.

  12. arXiv:2404.16880  [pdf, other]

    q-bio.QM cs.AI cs.CL

    Atomas: Hierarchical Alignment on Molecule-Text for Unified Molecule Understanding and Generation

    Authors: Yikun Zhang, Geyan Ye, Chaohao Yuan, Bo Han, Long-Kai Huang, Jianhua Yao, Wei Liu, Yu Rong

    Abstract: Molecule-and-text cross-modal representation learning has emerged as a promising direction for enhancing the quality of molecular representation, thereby improving performance in various scientific fields, including drug discovery and materials science. Existing studies adopt a global alignment approach to learn the knowledge from different modalities. These global alignment approaches fail to cap…

    Submitted 23 April, 2024; originally announced April 2024.

  13. arXiv:2404.16866  [pdf, other]

    q-bio.QM cs.AI cs.LG

    Functional Protein Design with Local Domain Alignment

    Authors: Chaohao Yuan, Songyou Li, Geyan Ye, Yikun Zhang, Long-Kai Huang, Wenbing Huang, Wei Liu, Jianhua Yao, Yu Rong

    Abstract: The core challenge of de novo protein design lies in creating proteins with specific functions or properties, guided by certain conditions. Current models explore to generate protein using structural and evolutionary guidance, which only provide indirect conditions concerning functions and properties. However, textual annotations of proteins, especially the annotations for protein domains, which d…

    Submitted 27 May, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

  14. arXiv:2404.11888  [pdf, other]

    cs.LG cs.AI

    The Dog Walking Theory: Rethinking Convergence in Federated Learning

    Authors: Kun Zhai, Yifeng Gao, Xingjun Ma, Difan Zou, Guangnan Ye, Yu-Gang Jiang

    Abstract: Federated learning (FL) is a collaborative learning paradigm that allows different clients to train one powerful global model without sharing their private data. Although FL has demonstrated promising results in various applications, it is known to suffer from convergence issues caused by the data distribution shift across different clients, especially on non-independent and identically distribute…

    Submitted 18 April, 2024; originally announced April 2024.

  15. arXiv:2404.04949  [pdf, other]

    cs.CL cs.CE

    SilverSight: A Multi-Task Chinese Financial Large Language Model Based on Adaptive Semantic Space Learning

    Authors: Yuhang Zhou, Zeping Li, Siyu Tian, Yuchen Ni, Sen Liu, Guangnan Ye, Hongfeng Chai

    Abstract: Large language models (LLMs) are increasingly being applied across various specialized fields, leveraging their extensive knowledge to empower a multitude of scenarios within these domains. However, each field encompasses a variety of specific tasks that require learning, and the diverse, heterogeneous data across these domains can lead to conflicts during model task transfer. In response to this…

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: 17 pages, 17 figures

  16. arXiv:2403.20107  [pdf, other]

    cs.IR

    Robust Federated Contrastive Recommender System against Model Poisoning Attack

    Authors: Wei Yuan, Chaoqun Yang, Liang Qu, Guanhua Ye, Quoc Viet Hung Nguyen, Hongzhi Yin

    Abstract: Federated Recommender Systems (FedRecs) have garnered increasing attention recently, thanks to their privacy-preserving benefits. However, the decentralized and open characteristics of current FedRecs present two dilemmas. First, the performance of FedRecs is compromised due to highly sparse on-device data for each client. Second, the system's robustness is undermined by the vulnerability to model…

    Submitted 29 March, 2024; originally announced March 2024.

  17. arXiv:2403.19146  [pdf, ps, other]

    cs.DS cs.DC math.OC

    Improving the Bit Complexity of Communication for Distributed Convex Optimization

    Authors: Mehrdad Ghadiri, Yin Tat Lee, Swati Padmanabhan, William Swartworth, David Woodruff, Guanghao Ye

    Abstract: We consider the communication complexity of some fundamental convex optimization problems in the point-to-point (coordinator) and blackboard communication models. We strengthen known bounds for approximately solving linear regression, $p$-norm regression (for $1\leq p\leq 2$), linear programming, minimizing the sum of finitely many convex nonsmooth functions with varying supports, and low rank app…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: To appear in STOC '24. Abstract shortened to meet the arXiv limits. Comments welcome!

  18. arXiv:2403.18379  [pdf, other]

    cs.LG cs.AI

    IIP-Mixer: Intra-Inter Patch Mixing Architecture for Battery Remaining Useful Life Prediction

    Authors: Guangzai Ye, Li Feng, Jianlan Guo, Yuqiang Chen

    Abstract: Accurately estimating the Remaining Useful Life (RUL) of lithium-ion batteries is crucial for maintaining the safe and stable operation of rechargeable battery management systems. However, this task is often challenging due to the complex temporal dynamics involved. Recently, attention-based networks, such as Transformers and Informer, have been the popular architecture in time series forecasting.…

    Submitted 27 March, 2024; originally announced March 2024.

  19. arXiv:2402.17472  [pdf, other]

    cs.LG cs.AI

    RAGFormer: Learning Semantic Attributes and Topological Structure for Fraud Detection

    Authors: Haolin Li, Shuyang Jiang, Lifeng Zhang, Siyuan Du, Guangnan Ye, Hongfeng Chai

    Abstract: Fraud detection remains a challenging task due to the complex and deceptive nature of fraudulent activities. Current approaches primarily concentrate on learning only one perspective of the graph: either the topological structure of the graph or the attributes of individual nodes. However, we conduct empirical studies to reveal that these two types of features, while nearly orthogonal, are each in…

    Submitted 18 May, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Preprint. Under review

  20. arXiv:2402.12713  [pdf, ps, other]

    cs.CL

    Are LLMs Rational Investors? A Study on Detecting and Reducing the Financial Bias in LLMs

    Authors: Yuhang Zhou, Yuchen Ni, Yunhui Gan, Zhangyue Yin, Xiang Liu, Jian Zhang, Sen Liu, Xipeng Qiu, Guangnan Ye, Hongfeng Chai

    Abstract: Large Language Models (LLMs) are increasingly adopted in financial analysis for interpreting complex market data and trends. However, their use is challenged by intrinsic biases (e.g., risk-preference bias) and a superficial understanding of market intricacies, necessitating a thorough assessment of their financial insight. To address these issues, we introduce Financial Bias Indicators (FBI), a f…

    Submitted 1 July, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

  21. arXiv:2401.14583  [pdf, other]

    cs.IR

    Physical Trajectory Inference Attack and Defense in Decentralized POI Recommendation

    Authors: Jing Long, Tong Chen, Guanhua Ye, Kai Zheng, Nguyen Quoc Viet Hung, Hongzhi Yin

    Abstract: As an indispensable personalized service within Location-Based Social Networks (LBSNs), the Point-of-Interest (POI) recommendation aims to assist individuals in discovering attractive and engaging places. However, the accurate recommendation capability relies on the powerful server collecting a vast amount of users' historical check-in data, posing significant risks of privacy breaches. Although s…

    Submitted 25 January, 2024; originally announced January 2024.

  22. arXiv:2401.11720  [pdf, other]

    cs.LG cs.AI

    Graph Condensation: A Survey

    Authors: Xinyi Gao, Junliang Yu, Tong Chen, Guanhua Ye, Wentao Zhang, Hongzhi Yin

    Abstract: The rapid growth of graph data poses significant challenges in storage, transmission, and particularly the training of graph neural networks (GNNs). To address these challenges, graph condensation (GC) has emerged as an innovative solution. GC focuses on synthesizing a compact yet highly representative graph, enabling GNNs trained on it to achieve performance comparable to those trained on the ori…

    Submitted 22 July, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

  23. arXiv:2401.10334  [pdf, other]

    q-bio.QM cs.AI cs.CL cs.LG

    DrugAssist: A Large Language Model for Molecule Optimization

    Authors: Geyan Ye, Xibao Cai, Houtim Lai, Xing Wang, Junhong Huang, Longyue Wang, Wei Liu, Xiangxiang Zeng

    Abstract: Recently, the impressive performance of large language models (LLMs) on a wide range of tasks has attracted an increasing number of attempts to apply LLMs in drug discovery. However, molecule optimization, a critical task in the drug discovery pipeline, is currently an area that has seen little involvement from LLMs. Most of existing approaches focus solely on capturing the underlying patterns in…

    Submitted 28 December, 2023; originally announced January 2024.

    Comments: Geyan Ye and Xibao Cai are equal contributors; Longyue Wang is corresponding author

  24. arXiv:2401.07061  [pdf, other]

    cs.CV

    Dual-View Data Hallucination with Semantic Relation Guidance for Few-Shot Image Recognition

    Authors: Hefeng Wu, Guangzhi Ye, Ziyang Zhou, Ling Tian, Qing Wang, Liang Lin

    Abstract: Learning to recognize novel concepts from just a few image samples is very challenging as the learned model is easily overfitted on the few data and results in poor generalizability. One promising but underexplored solution is to compensate the novel classes by generating plausible samples. However, most existing works of this line exploit visual information only, rendering the generated data easy…

    Submitted 13 January, 2024; originally announced January 2024.

    Comments: 13 pages

  25. arXiv:2312.15826  [pdf, other]

    cs.IR

    Adversarial Item Promotion on Visually-Aware Recommender Systems by Guided Diffusion

    Authors: Lijian Chen, Wei Yuan, Tong Chen, Guanhua Ye, Quoc Viet Hung Nguyen, Hongzhi Yin

    Abstract: Visually-aware recommender systems have found widespread application in domains where visual elements significantly contribute to the inference of users' potential preferences. While the incorporation of visual information holds the promise of enhancing recommendation accuracy and alleviating the cold-start problem, it is essential to point out that the inclusion of item images may introduce subst…

    Submitted 22 May, 2024; v1 submitted 25 December, 2023; originally announced December 2023.

    Comments: Accepted by TOIS 2024

  26. arXiv:2312.12436  [pdf, other]

    cs.CV cs.AI cs.CL cs.MM

    A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise

    Authors: Chaoyou Fu, Renrui Zhang, Zihan Wang, Yubo Huang, Zhengye Zhang, Longtian Qiu, Gaoxiang Ye, Yunhang Shen, Mengdan Zhang, Peixian Chen, Sirui Zhao, Shaohui Lin, Deqiang Jiang, Di Yin, Peng Gao, Ke Li, Hongsheng Li, Xing Sun

    Abstract: The surge of interest towards Multi-modal Large Language Models (MLLMs), e.g., GPT-4V(ision) from OpenAI, has marked a significant trend in both academia and industry. They endow Large Language Models (LLMs) with powerful capabilities in visual understanding, enabling them to tackle diverse multi-modal tasks. Very recently, Google released Gemini, its newest and most capable MLLM built from the gr…

    Submitted 20 December, 2023; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: Total 120 pages. See our project at https://github.com/BradyFU/Awesome-Multimodal-Large-Language-Models

  27. arXiv:2312.04815  [pdf, other]

    cs.LG

    Not All Negatives Are Worth Attending to: Meta-Bootstrapping Negative Sampling Framework for Link Prediction

    Authors: Yakun Wang, Binbin Hu, Shuo Yang, Meiqi Zhu, Zhiqiang Zhang, Qiyang Zhang, Jun Zhou, Guo Ye, Huimei He

    Abstract: The rapid development of graph neural networks (GNNs) encourages the rising of link prediction, achieving promising performance with various applications. Unfortunately, through a comprehensive analysis, we surprisingly find that current link predictors with dynamic negative samplers (DNSs) suffer from the migration phenomenon between "easy" and "hard" samples, which goes against the preference of…

    Submitted 11 December, 2023; v1 submitted 7 December, 2023; originally announced December 2023.

  28. arXiv:2311.12059  [pdf, other]

    cs.CV cs.CR

    Towards Function Space Mesh Watermarking: Protecting the Copyright of Signed Distance Fields

    Authors: Xingyu Zhu, Guanhui Ye, Chengdong Dong, Xiapu Luo, Xuetao Wei

    Abstract: The signed distance field (SDF) represents 3D geometries in continuous function space. Due to its continuous nature, explicit 3D models (e.g., meshes) can be extracted from it at arbitrary resolution, which means losing the SDF is equivalent to losing the mesh. Recent research has shown meshes can also be extracted from SDF-enhanced neural radiance fields (NeRF). Such a signal raises an alarm that…

    Submitted 18 November, 2023; originally announced November 2023.

  29. arXiv:2311.11235  [pdf, other]

    cs.LG cs.AI

    Unraveling the "Anomaly" in Time Series Anomaly Detection: A Self-supervised Tri-domain Solution

    Authors: Yuting Sun, Guansong Pang, Guanhua Ye, Tong Chen, Xia Hu, Hongzhi Yin

    Abstract: The ongoing challenges in time series anomaly detection (TSAD), notably the scarcity of anomaly labels and the variability in anomaly lengths and shapes, have led to the need for a more efficient solution. As limited anomaly labels hinder traditional supervised models in TSAD, various SOTA deep learning techniques, such as self-supervised learning, have been introduced to tackle this issue. Howeve…

    Submitted 26 November, 2023; v1 submitted 19 November, 2023; originally announced November 2023.

    Comments: This work is submitted to IEEE International Conference on Data Engineering (ICDE) 2024

  30. arXiv:2311.03612  [pdf, other]

    cs.CR cs.DC

    BlockEmulator: An Emulator Enabling to Test Blockchain Sharding Protocols

    Authors: Huawei Huang, Guang Ye, Qinde Chen, Zhaokang Yin, Xiaofei Luo, Jianru Lin, Taotao Li, Qinglin Yang, Zibin Zheng

    Abstract: Numerous blockchain simulators have been proposed to allow researchers to simulate mainstream blockchains. However, we have not yet found a testbed that enables researchers to develop and evaluate their new consensus algorithms or new protocols for blockchain sharding systems. To fill this gap, we develop BlockEmulator, which is designed as an experimental platform, particularly for emulating bloc…

    Submitted 11 November, 2023; v1 submitted 6 November, 2023; originally announced November 2023.

  31. arXiv:2311.01862  [pdf, other]

    cs.CL cs.DB

    $R^3$-NL2GQL: A Model Coordination and Knowledge Graph Alignment Approach for NL2GQL

    Authors: Yuhang Zhou, Yu He, Siyu Tian, Yuchen Ni, Zhangyue Yin, Xiang Liu, Chuanjun Ji, Sen Liu, Xipeng Qiu, Guangnan Ye, Hongfeng Chai

    Abstract: While current tasks of converting natural language to SQL (NL2SQL) using Foundation Models have shown impressive achievements, adapting these approaches for converting natural language to Graph Query Language (NL2GQL) encounters hurdles due to the distinct nature of GQL compared to SQL, alongside the diverse forms of GQL. Moving away from traditional rule-based and slot-filling methodologies, we i…

    Submitted 1 July, 2024; v1 submitted 3 November, 2023; originally announced November 2023.

  32. arXiv:2310.16351  [pdf, other]

    cs.DS

    Fast Algorithms for Separable Linear Programs

    Authors: Sally Dong, Gramoz Goranci, Lawrence Li, Sushant Sachdeva, Guanghao Ye

    Abstract: In numerical linear algebra, considerable effort has been devoted to obtaining faster algorithms for linear systems whose underlying matrices exhibit structural properties. A prominent success story is the method of generalized nested dissection [Lipton-Rose-Tarjan'79] for separable matrices. On the other hand, the majority of recent developments in the design of efficient linear program (LP) solv…

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 55 pages. To appear at SODA 2024

  33. arXiv:2309.07369  [pdf, other]

    eess.AS cs.CL cs.SD

    Hybrid Attention-based Encoder-decoder Model for Efficient Language Model Adaptation

    Authors: Shaoshi Ling, Guoli Ye, Rui Zhao, Yifan Gong

    Abstract: Attention-based encoder-decoder (AED) speech recognition model has been widely successful in recent years. However, the joint optimization of acoustic model and language model in end-to-end manner has created challenges for text adaptation. In particular, effectively, quickly and inexpensively adapting text has become a primary concern for deploying AED systems in industry. To address this issue,…

    Submitted 13 September, 2023; originally announced September 2023.

  34. arXiv:2308.14727  [pdf, ps, other]

    cs.DS

    Faster Min-Cost Flow and Approximate Tree Decomposition on Bounded Treewidth Graphs

    Authors: Sally Dong, Guanghao Ye

    Abstract: We present an algorithm for min-cost flow in graphs with $n$ vertices and $m$ edges, given a tree decomposition of width $τ$ and size $S$, and polynomially bounded, integral edge capacities and costs, running in $\widetilde{O}(m\sqrt{τ} + S)$ time. This improves upon the previous fastest algorithm in this setting achieved by the bounded-treewidth linear program solver by [Dong-Lee-Ye,21] and [Gu-Son…

    Submitted 30 June, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: 15 pages, to appear at ESA 2024

  35. arXiv:2308.13269  [pdf, other]

    cs.LG

    Heterogeneous Decentralized Machine Unlearning with Seed Model Distillation

    Authors: Guanhua Ye, Tong Chen, Quoc Viet Hung Nguyen, Hongzhi Yin

    Abstract: As some recent information security legislation endowed users with unconditional rights to be forgotten by any trained machine learning model, personalized IoT service providers have to put unlearning functionality into their consideration. The most straightforward method to unlearn users' contribution is to retrain the model from the initial state, which is not realistic in high throughput applic…

    Submitted 28 August, 2023; v1 submitted 25 August, 2023; originally announced August 2023.

  36. arXiv:2308.07622  [pdf, other]

    cs.MM

    EMID: An Emotional Aligned Dataset in Audio-Visual Modality

    Authors: Jialing Zou, Jiahao Mei, Guangze Ye, Tianyu Huai, Qiwei Shen, Daoguo Dong

    Abstract: In this paper, we propose Emotionally paired Music and Image Dataset (EMID), a novel dataset designed for the emotional matching of music and images, to facilitate auditory-visual cross-modal tasks such as generation and retrieval. Unlike existing approaches that primarily focus on semantic correlations or roughly divided emotional relations, EMID emphasizes the significance of emotional consisten…

    Submitted 15 August, 2023; originally announced August 2023.

  37. arXiv:2307.11628  [pdf, other]

    cs.CR

    Rethinking Mesh Watermark: Towards Highly Robust and Adaptable Deep 3D Mesh Watermarking

    Authors: Xingyu Zhu, Guanhui Ye, Xiapu Luo, Xuetao Wei

    Abstract: The goal of 3D mesh watermarking is to embed the message in 3D meshes that can withstand various attacks imperceptibly and reconstruct the message accurately from watermarked meshes. The watermarking algorithm is supposed to withstand multiple attacks, and the complexity should not grow significantly with the mesh size. Unfortunately, previous methods are less robust against attacks and lack of ad…

    Submitted 14 December, 2023; v1 submitted 21 July, 2023; originally announced July 2023.

  38. arXiv:2305.02781  [pdf, other]

    cs.CR

    ItoV: Efficiently Adapting Deep Learning-based Image Watermarking to Video Watermarking

    Authors: Guanhui Ye, Jiashi Gao, Yuchen Wang, Liyan Song, Xuetao Wei

    Abstract: Robust watermarking tries to conceal information within a cover image/video imperceptibly that is resistant to various distortions. Recently, deep learning-based approaches for image watermarking have made significant advancements in robustness and invisibility. However, few studies focused on video watermarking using deep neural networks due to the high complexity and computational costs. Our pap…

    Submitted 4 May, 2023; originally announced May 2023.

  39. arXiv:2304.08851  [pdf, other]

    cs.IR

    PEGA: Personality-Guided Preference Aggregator for Ephemeral Group Recommendation

    Authors: Guangze Ye, Wen Wu, Liye Shi, Wenxin Hu, Xin Chen, Liang He

    Abstract: Recently, making recommendations for ephemeral groups which contain dynamic users and few historic interactions have received an increasing number of attention. The main challenge of ephemeral group recommender is how to aggregate individual preferences to represent the group's overall preference. Score aggregation and preference aggregation are two commonly-used methods that adopt hand-craft pred…

    Submitted 18 April, 2023; originally announced April 2023.

  40. arXiv:2210.13801  [pdf, other]

    cs.CV cs.CR

    Deep Boosting Robustness of DNN-based Image Watermarking via DBMark

    Authors: Guanhui Ye, Jiashi Gao, Wei Xie, Bo Yin, Xuetao Wei

    Abstract: Image watermarking is a technique for hiding information into images that can withstand distortions while requiring the encoded image to be perceptually identical to the original image. Recent work based on deep neural networks (DNN) has achieved impressive progression in digital watermarking. Higher robustness under various distortions is the eternal pursuit of digital image watermarking approach…

    Submitted 16 November, 2022; v1 submitted 25 October, 2022; originally announced October 2022.

  41. arXiv:2210.08665  [pdf, other]

    eess.AS cs.SD

    Acoustic-aware Non-autoregressive Spell Correction with Mask Sample Decoding

    Authors: Ruchao Fan, Guoli Ye, Yashesh Gaur, Jinyu Li

    Abstract: Masked language model (MLM) has been widely used for understanding tasks, e.g. BERT. Recently, MLM has also been used for generation tasks. The most popular one in speech is using Mask-CTC for non-autoregressive speech recognition. In this paper, we take one step further, and explore the possibility of using MLM as a non-autoregressive spell correction (SC) model for transformer-transducer (TT), d…

    Submitted 16 October, 2022; originally announced October 2022.

  42. arXiv:2208.03811  [pdf, ps, other]

    math.OC cs.LG

    Decomposable Non-Smooth Convex Optimization with Nearly-Linear Gradient Oracle Complexity

    Authors: Sally Dong, Haotian Jiang, Yin Tat Lee, Swati Padmanabhan, Guanghao Ye

    Abstract: Many fundamental problems in machine learning can be formulated by the convex program \[ \min_{θ\in R^d}\ \sum_{i=1}^{n}f_{i}(θ), \] where each $f_i$ is a convex, Lipschitz function supported on a subset of $d_i$ coordinates of $θ$. One common approach to this problem, exemplified by stochastic gradient descent, involves sampling one $f_i$ term at every iteration to make progress. This approach cr…

    Submitted 7 August, 2022; originally announced August 2022.

  43. arXiv:2205.13705  [pdf, other]

    cs.DC

    Heterogeneous Collaborative Learning for Personalized Healthcare Analytics via Messenger Distillation

    Authors: Guanhua Ye, Tong Chen, Yawen Li, Lizhen Cui, Quoc Viet Hung Nguyen, Hongzhi Yin

    Abstract: In this paper, we propose a Similarity-Quality-based Messenger Distillation (SQMD) framework for heterogeneous asynchronous on-device healthcare analytics. By introducing a preloaded reference dataset, SQMD enables all participant devices to distill knowledge from peers via messengers (i.e., the soft labels of the reference dataset generated by clients) without assuming the same model architecture…

    Submitted 18 February, 2023; v1 submitted 26 May, 2022; originally announced May 2022.

  44. arXiv:2205.01562  [pdf, ps, other]

    cs.DS

    Nested Dissection Meets IPMs: Planar Min-Cost Flow in Nearly-Linear Time

    Authors: Sally Dong, Yu Gao, Gramoz Goranci, Yin Tat Lee, Richard Peng, Sushant Sachdeva, Guanghao Ye

    Abstract: We present a nearly-linear time algorithm for finding a minimum-cost flow in planar graphs with polynomially bounded integer costs and capacities. The previous fastest algorithm for this problem is based on interior point methods (IPMs) and works for general sparse graphs in $O(n^{1.5}\text{poly}(\log n))$ time [Daitch-Spielman, STOC'08]. Intuitively, $Ω(n^{1.5})$ is a natural runtime barrier for…

    Submitted 3 May, 2022; originally announced May 2022.

    Comments: 93 pages

  45. arXiv:2203.11698  [pdf, other]

    cs.LG cs.CE

    A Machine Learning Generative Method for Automating Antenna Design and Optimization

    Authors: Yang Zhong, Peter Renner, Weiping Dou, Geng Ye, Jiang Zhu, Qing Huo Liu

    Abstract: To facilitate the antenna design with the aid of computer, one of the practices in consumer electronic industry is to model and optimize antenna performances with a simplified antenna geometric scheme. Traditional antenna modeling requires profound prior knowledge of electromagnetics in order to achieve a good design which satisfies the performance specifications from both antenna and product desi…

    Submitted 28 February, 2022; originally announced March 2022.

    Comments: 16 pages, 12 figures

  46. arXiv:2203.07832  [pdf, other]

    cs.LG cs.AI cs.MA

    Learning to Infer Belief Embedded Communication

    Authors: Guo Ye, Han Liu, Biswa Sengupta

    Abstract: In multi-agent collaboration problems with communication, an agent's ability to encode their intention and interpret other agents' strategies is critical for planning their future actions. This paper introduces a novel algorithm called Intention Embedded Communication (IEC) to mimic an agent's language learning ability. IEC contains a perception module for decoding other agents' intentions in resp…

    Submitted 15 March, 2022; originally announced March 2022.

  47. arXiv:2203.00964  [pdf, other]

    cs.AI

    PKGM: A Pre-trained Knowledge Graph Model for E-commerce Application

    Authors: Wen Zhang, Chi-Man Wong, Ganqiang Ye, Bo Wen, Hongting Zhou, Wei Zhang, Huajun Chen

    Abstract: In recent years, knowledge graphs have been widely applied as a uniform way to organize data and have enhanced many tasks requiring knowledge. In online shopping platform Taobao, we built a billion-scale e-commerce product knowledge graph. It organizes data uniformly and provides item knowledge services for various tasks such as item recommendation. Usually, such knowledge services are provided th…

    Submitted 2 March, 2022; originally announced March 2022.

    Comments: This is an extension of work "Billion-scale Pre-trained E-commerce Product Knowledge Graph Model" published at ICDE2021. We test PKGM on two additional tasks, scene detection and sequential recommendation, and add serving with item embeddings as one of the baseline. The extensive experiments show the effectiveness of PKGM, pre-trained knowledge graph model. arXiv admin note: text overlap with arXiv:2105.00388

  48. arXiv:2112.09341  [pdf, other]

    cs.LG cs.DC

    Personalized On-Device E-health Analytics with Decentralized Block Coordinate Descent

    Authors: Guanhua Ye, Hongzhi Yin, Tong Chen, Miao Xu, Quoc Viet Hung Nguyen, Jiangning Song

    Abstract: Actuated by the growing attention to personal healthcare and the pandemic, the popularity of E-health is proliferating. Nowadays, enhancement on medical diagnosis via machine learning models has been highly effective in many aspects of e-health analytics. Nevertheless, in the classic cloud-based/centralized e-health paradigms, all the data will be centrally stored on the server to facilitate model…

    Submitted 17 December, 2021; originally announced December 2021.

  49. arXiv:2112.04087  [pdf, other]

    cs.AI

    Improving Knowledge Graph Representation Learning by Structure Contextual Pre-training

    Authors: Ganqiang Ye, Wen Zhang, Zhen Bi, Chi Man Wong, Chen Hui, Huajun Chen

    Abstract: Representation learning models for Knowledge Graphs (KG) have proven to be effective in encoding structural information and performing reasoning over KGs. In this paper, we propose a novel pre-training-then-fine-tuning framework for knowledge graph representation learning, in which a KG model is firstly pre-trained with triple classification task, followed by discriminative fine-tuning on specific…

    Submitted 7 December, 2021; originally announced December 2021.

    Comments: Accepted to IJCKG 2021

  50. arXiv:2112.00963  [pdf, other]

    cs.LG cs.AI cs.CE cs.IR

    Multi-Domain Transformer-Based Counterfactual Augmentation for Earnings Call Analysis

    Authors: Zixuan Yuan, Yada Zhu, Wei Zhang, Ziming Huang, Guangnan Ye, Hui Xiong

    Abstract: Earnings call (EC), as a periodic teleconference of a publicly-traded company, has been extensively studied as an essential market indicator because of its high analytical value in corporate fundamentals. The recent emergence of deep learning techniques has shown great promise in creating automated pipelines to benefit the EC-supported financial applications. However, these methods presume all inc…

    Submitted 3 December, 2021; v1 submitted 1 December, 2021; originally announced December 2021.