Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 119 results for author: Xiong, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.14841  [pdf, other

    cs.CR cs.DB cs.LG

    TabularMark: Watermarking Tabular Datasets for Machine Learning

    Authors: Yihao Zheng, Haocheng Xia, Junyuan Pang, Jinfei Liu, Kui Ren, Lingyang Chu, Yang Cao, Li Xiong

    Abstract: Watermarking is broadly utilized to protect ownership of shared data while preserving data utility. However, existing watermarking methods for tabular datasets fall short on the desired properties (detectability, non-intrusiveness, and robustness) and only preserve data utility from the perspective of data statistics, ignoring the performance of downstream ML models trained on the datasets. Can we… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2406.02744  [pdf, other

    cs.CR cs.LG

    DPDR: Gradient Decomposition and Reconstruction for Differentially Private Deep Learning

    Authors: Yixuan Liu, Li Xiong, Yuhan Liu, Yujie Gu, Ruixuan Liu, Hong Chen

    Abstract: Differentially Private Stochastic Gradients Descent (DP-SGD) is a prominent paradigm for preserving privacy in deep learning. It ensures privacy by perturbing gradients with random noise calibrated to their entire norm at each training step. However, this perturbation suffers from a sub-optimal performance: it repeatedly wastes privacy budget on the general converging direction shared among gradie… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 14 pages

  3. arXiv:2406.01457  [pdf, other

    cs.LG cs.CL cs.CR

    Differentially Private Tabular Data Synthesis using Large Language Models

    Authors: Toan V. Tran, Li Xiong

    Abstract: Synthetic tabular data generation with differential privacy is a crucial problem to enable data sharing with formal privacy. Despite a rich history of methodological research and development, developing differentially private tabular data generators that can provide realistic synthetic datasets remains challenging. This paper introduces DP-LLMTGen -- a novel framework for differentially private ta… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  4. arXiv:2405.19669  [pdf, other

    cs.CV

    Texture-guided Coding for Deep Features

    Authors: Lei Xiong, Xin Luo, Zihao Wang, Chaofan He, Shuyuan Zhu, Bing Zeng

    Abstract: With the rapid development of machine vision technology in recent years, many researchers have begun to focus on feature compression that is better suited for machine vision tasks. The target of feature compression is deep features, which arise from convolution in the middle layer of a pre-trained convolutional neural network. However, due to the large volume of data and high level of abstraction… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  5. arXiv:2405.08043  [pdf, other

    cs.CR cs.LG

    HRNet: Differentially Private Hierarchical and Multi-Resolution Network for Human Mobility Data Synthesization

    Authors: Shun Takagi, Li Xiong, Fumiyuki Kato, Yang Cao, Masatoshi Yoshikawa

    Abstract: Human mobility data offers valuable insights for many applications such as urban planning and pandemic response, but its use also raises privacy concerns. In this paper, we introduce the Hierarchical and Multi-Resolution Network (HRNet), a novel deep generative model specifically designed to synthesize realistic human mobility data while guaranteeing differential privacy. We first identify the key… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  6. arXiv:2405.00438  [pdf, other

    cs.LG cs.CL

    MetaRM: Shifted Distributions Alignment via Meta-Learning

    Authors: Shihan Dou, Yan Liu, Enyu Zhou, Tianlong Li, Haoxiang Jia, Limao Xiong, Xin Zhao, Junjie Ye, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang

    Abstract: The success of Reinforcement Learning from Human Feedback (RLHF) in language model alignment is critically dependent on the capability of the reward model (RM). However, as the training process progresses, the output distribution of the policy model shifts, leading to the RM's reduced ability to distinguish between responses. This issue is further compounded when the RM, trained on a specific data… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 11 pages, 6 figures. arXiv admin note: text overlap with arXiv:2401.06080

  7. arXiv:2403.15484  [pdf, other

    cs.CL cs.LG

    RakutenAI-7B: Extending Large Language Models for Japanese

    Authors: Rakuten Group, Aaron Levine, Connie Huang, Chenguang Wang, Eduardo Batista, Ewa Szymanska, Hongyi Ding, Hou Wei Chou, Jean-François Pessiot, Johanes Effendi, Justin Chiu, Kai Torben Ohlhus, Karan Chopra, Keiji Shinzato, Koji Murakami, Lee Xiong, Lei Chen, Maki Kubota, Maksim Tkachenko, Miroku Lee, Naoki Takahashi, Prathyusha Jwalapuram, Ryutaro Tatsushima, Saurabh Jain, Sunil Kumar Yadav , et al. (5 additional authors not shown)

    Abstract: We introduce RakutenAI-7B, a suite of Japanese-oriented large language models that achieve the best performance on the Japanese LM Harness benchmarks among the open 7B models. Along with the foundation model, we release instruction- and chat-tuned models, RakutenAI-7B-instruct and RakutenAI-7B-chat respectively, under the Apache 2.0 license.

    Submitted 21 March, 2024; originally announced March 2024.

  8. arXiv:2403.12171  [pdf, other

    cs.CL cs.AI

    EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models

    Authors: Weikang Zhou, Xiao Wang, Limao Xiong, Han Xia, Yingshuang Gu, Mingxu Chai, Fukang Zhu, Caishuang Huang, Shihan Dou, Zhiheng Xi, Rui Zheng, Songyang Gao, Yicheng Zou, Hang Yan, Yifan Le, Ruohui Wang, Lijun Li, Jing Shao, Tao Gui, Qi Zhang, Xuanjing Huang

    Abstract: Jailbreak attacks are crucial for identifying and mitigating the security vulnerabilities of Large Language Models (LLMs). They are designed to bypass safeguards and elicit prohibited outputs. However, due to significant differences among various jailbreak methods, there is no standard implementation framework available for the community, which limits comprehensive security evaluations. This paper… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  9. arXiv:2403.09562  [pdf, other

    cs.CR

    PreCurious: How Innocent Pre-Trained Language Models Turn into Privacy Traps

    Authors: Ruixuan Liu, Tianhao Wang, Yang Cao, Li Xiong

    Abstract: The pre-training and fine-tuning paradigm has demonstrated its effectiveness and has become the standard approach for tailoring language models to various tasks. Currently, community-based platforms offer easy access to various pre-trained models, as anyone can publish without strict validation processes. However, a released pre-trained model can be a privacy trap for fine-tuning datasets if it is… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: 19 pages

  10. arXiv:2402.01391  [pdf, other

    cs.SE cs.CL

    StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback

    Authors: Shihan Dou, Yan Liu, Haoxiang Jia, Limao Xiong, Enyu Zhou, Wei Shen, Junjie Shan, Caishuang Huang, Xiao Wang, Xiaoran Fan, Zhiheng Xi, Yuhao Zhou, Tao Ji, Rui Zheng, Qi Zhang, Xuanjing Huang, Tao Gui

    Abstract: The advancement of large language models (LLMs) has significantly propelled the field of code generation. Previous work integrated reinforcement learning (RL) with compiler feedback for exploring the output space of LLMs to enhance code generation quality. However, the lengthy code generated by LLMs in response to complex human requirements makes RL exploration a challenge. Also, since the unit te… ▽ More

    Submitted 5 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: 13 pages, 5 figures

  11. arXiv:2401.16251  [pdf, other

    cs.CR cs.AI cs.LG

    Cross-silo Federated Learning with Record-level Personalized Differential Privacy

    Authors: Junxu Liu, Jian Lou, Li Xiong, Jinfei Liu, Xiaofeng Meng

    Abstract: Federated learning enhanced by differential privacy has emerged as a popular approach to better safeguard the privacy of client-side data by protecting clients' contributions during the training process. Existing solutions typically assume a uniform privacy budget for all records and provide one-size-fits-all solutions that may not be adequate to meet each record's privacy requirement. In this pap… ▽ More

    Submitted 29 January, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: 12 pages, 7 figures, under review

  12. arXiv:2401.10458  [pdf, other

    cs.LG cs.CR

    Contrastive Unlearning: A Contrastive Approach to Machine Unlearning

    Authors: Hong kyu Lee, Qiuchen Zhang, Carl Yang, Jian Lou, Li Xiong

    Abstract: Machine unlearning aims to eliminate the influence of a subset of training samples (i.e., unlearning samples) from a trained model. Effectively and efficiently removing the unlearning samples without negatively impacting the overall model performance is still challenging. In this paper, we propose a contrastive unlearning framework, leveraging the concept of representation learning for more effect… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

  13. arXiv:2401.03472  [pdf, other

    cs.CL

    PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction

    Authors: Zening Lin, Jiapeng Wang, Teng Li, Wenhui Liao, Dayi Huang, Longfei Xiong, Lianwen Jin

    Abstract: Document pair extraction aims to identify key and value entities as well as their relationships from visually-rich documents. Most existing methods divide it into two separate tasks: semantic entity recognition (SER) and relation extraction (RE). However, simply concatenating SER and RE serially can lead to severe error propagation, and it fails to handle cases like multi-line entities in real sce… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

  14. arXiv:2312.03408  [pdf, other

    cs.CV

    Open-sourced Data Ecosystem in Autonomous Driving: the Present and Future

    Authors: Hongyang Li, Yang Li, Huijie Wang, Jia Zeng, Huilin Xu, Pinlong Cai, Li Chen, Junchi Yan, Feng Xu, Lu Xiong, Jingdong Wang, Futang Zhu, Chunjing Xu, Tiancai Wang, Fei Xia, Beipeng Mu, Zhihui Peng, Dahua Lin, Yu Qiao

    Abstract: With the continuous maturation and application of autonomous driving technology, a systematic examination of open-source autonomous driving datasets becomes instrumental in fostering the robust evolution of the industry ecosystem. Current autonomous driving datasets can broadly be categorized into two generations. The first-generation autonomous driving datasets are characterized by relatively sim… ▽ More

    Submitted 22 March, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: This article is a simplified English translation of corresponding Chinese article. Please refer to Chinese version for the complete content

  15. arXiv:2312.02646  [pdf, other

    cs.LG cs.AI

    SAMSGL: Series-Aligned Multi-Scale Graph Learning for Spatio-Temporal Forecasting

    Authors: Xiaobei Zou, Luolin Xiong, Yang Tang, Jürgen Kurths

    Abstract: Spatio-temporal forecasting in various domains, like traffic prediction and weather forecasting, is a challenging endeavor, primarily due to the difficulties in modeling propagation dynamics and capturing high-dimensional interactions among nodes. Despite the significant strides made by graph-based networks in spatio-temporal forecasting, there remain two pivotal factors closely related to forecas… ▽ More

    Submitted 27 May, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: Accepted by Chaos

  16. arXiv:2311.08430  [pdf, other

    cs.LG cs.AI cs.IR

    Rankitect: Ranking Architecture Search Battling World-class Engineers at Meta Scale

    Authors: Wei Wen, Kuang-Hung Liu, Igor Fedorov, Xin Zhang, Hang Yin, Weiwei Chu, Kaveh Hassani, Mengying Sun, Jiang Liu, Xu Wang, Lin Jiang, Yuxin Chen, Buyun Zhang, Xi Liu, Dehua Cheng, Zhengxing Chen, Guang Zhao, Fangqiu Han, Jiyan Yang, Yuchen Hao, Liang Xiong, Wen-Yen Chen

    Abstract: Neural Architecture Search (NAS) has demonstrated its efficacy in computer vision and potential for ranking systems. However, prior work focused on academic problems, which are evaluated at small scale under well-controlled fixed baselines. In industry system, such as ranking system in Meta, it is unclear whether NAS algorithms from the literature can outperform production baselines because of: (1… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: Wei Wen and Kuang-Hung Liu contribute equally

  17. arXiv:2311.06227  [pdf, other

    cs.CR cs.LG

    Does Differential Privacy Prevent Backdoor Attacks in Practice?

    Authors: Fereshteh Razmi, Jian Lou, Li Xiong

    Abstract: Differential Privacy (DP) was originally developed to protect privacy. However, it has recently been utilized to secure machine learning (ML) models from poisoning attacks, with DP-SGD receiving substantial attention. Nevertheless, a thorough investigation is required to assess the effectiveness of different DP techniques in preventing backdoor attacks in practice. In this paper, we investigate th… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

  18. arXiv:2311.03896  [pdf, other

    cs.CL

    iACOS: Advancing Implicit Sentiment Extraction with Informative and Adaptive Negative Examples

    Authors: Xiancai Xu, Jia-Dong Zhang, Lei Xiong, Zhishang Liu

    Abstract: Aspect-based sentiment analysis (ABSA) have been extensively studied, but little light has been shed on the quadruple extraction consisting of four fundamental elements: aspects, categories, opinions and sentiments, especially with implicit aspects and opinions. In this paper, we propose a new method iACOS for extracting Implicit Aspects with Categories and Opinions with Sentiments. First, iACOS a… ▽ More

    Submitted 22 June, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

    Journal ref: NAACL 2024 (Volume 1: Long Papers)

  19. arXiv:2310.20251  [pdf, other

    cs.MM

    An Implementation of Multimodal Fusion System for Intelligent Digital Human Generation

    Authors: Yingjie Zhou, Yaodong Chen, Kaiyue Bi, Lian Xiong, Hui Liu

    Abstract: With the rapid development of artificial intelligence (AI), digital humans have attracted more and more attention and are expected to achieve a wide range of applications in several industries. Then, most of the existing digital humans still rely on manual modeling by designers, which is a cumbersome process and has a long development cycle. Therefore, facing the rise of digital humans, there is a… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

  20. arXiv:2310.14783  [pdf, other

    cs.LG cs.AI

    Interpretable Deep Reinforcement Learning for Optimizing Heterogeneous Energy Storage Systems

    Authors: Luolin Xiong, Yang Tang, Chensheng Liu, Shuai Mao, Ke Meng, Zhaoyang Dong, Feng Qian

    Abstract: Energy storage systems (ESS) are pivotal component in the energy market, serving as both energy suppliers and consumers. ESS operators can reap benefits from energy arbitrage by optimizing operations of storage equipment. To further enhance ESS flexibility within the energy market and improve renewable energy utilization, a heterogeneous photovoltaic-ESS (PV-ESS) is proposed, which leverages the u… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

  21. arXiv:2310.06502  [pdf, other

    cs.CL

    The Limits of ChatGPT in Extracting Aspect-Category-Opinion-Sentiment Quadruples: A Comparative Analysis

    Authors: Xiancai Xu, Jia-Dong Zhang, Rongchang Xiao, Lei Xiong

    Abstract: Recently, ChatGPT has attracted great attention from both industry and academia due to its surprising abilities in natural language understanding and generation. We are particularly curious about whether it can achieve promising performance on one of the most complex tasks in aspect-based sentiment analysis, i.e., extracting aspect-category-opinion-sentiment quadruples from texts. To this end, in… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

  22. arXiv:2309.07864  [pdf, other

    cs.AI cs.CL

    The Rise and Potential of Large Language Model Based Agents: A Survey

    Authors: Zhiheng Xi, Wenxiang Chen, Xin Guo, Wei He, Yiwen Ding, Boyang Hong, Ming Zhang, Junzhe Wang, Senjie Jin, Enyu Zhou, Rui Zheng, Xiaoran Fan, Xiao Wang, Limao Xiong, Yuhao Zhou, Weiran Wang, Changhao Jiang, Yicheng Zou, Xiangyang Liu, Zhangyue Yin, Shihan Dou, Rongxiang Weng, Wensen Cheng, Qi Zhang, Wenjuan Qin , et al. (4 additional authors not shown)

    Abstract: For a long time, humanity has pursued artificial intelligence (AI) equivalent to or surpassing the human level, with AI agents considered a promising vehicle for this pursuit. AI agents are artificial entities that sense their environment, make decisions, and take actions. Many efforts have been made to develop intelligent agents, but they mainly focus on advancement in algorithms or training stra… ▽ More

    Submitted 19 September, 2023; v1 submitted 14 September, 2023; originally announced September 2023.

    Comments: 86 pages, 12 figures

  23. arXiv:2308.12210  [pdf, other

    cs.LG cs.CR

    ULDP-FL: Federated Learning with Across Silo User-Level Differential Privacy

    Authors: Fumiyuki Kato, Li Xiong, Shun Takagi, Yang Cao, Masatoshi Yoshikawa

    Abstract: Differentially Private Federated Learning (DP-FL) has garnered attention as a collaborative machine learning approach that ensures formal privacy. Most DP-FL approaches ensure DP at the record-level within each silo for cross-silo FL. However, a single user's data may extend across multiple silos, and the desired user-level DP guarantee for such a setting remains unknown. In this study, we present… ▽ More

    Submitted 16 June, 2024; v1 submitted 23 August, 2023; originally announced August 2023.

    Comments: This is the full version of the paper accepted to VLDB 2024

  24. arXiv:2308.06573  [pdf, other

    cs.CV cs.AI

    4DRVO-Net: Deep 4D Radar-Visual Odometry Using Multi-Modal and Multi-Scale Adaptive Fusion

    Authors: Guirong Zhuo, Shouyi Lu, Huanyu Zhou, Lianqing Zheng, Lu Xiong

    Abstract: Four-dimensional (4D) radar--visual odometry (4DRVO) integrates complementary information from 4D radar and cameras, making it an attractive solution for achieving accurate and robust pose estimation. However, 4DRVO may exhibit significant tracking errors owing to three main factors: 1) sparsity of 4D radar point clouds; 2) inaccurate data association and insufficient feature interaction between t… ▽ More

    Submitted 12 August, 2023; originally announced August 2023.

    Comments: 14 pages,12 figures

  25. arXiv:2307.06501  [pdf, other

    cs.AI cs.LG

    Hybrid Control Policy for Artificial Pancreas via Ensemble Deep Reinforcement Learning

    Authors: Wenzhou Lv, Tianyu Wu, Luolin Xiong, Liang Wu, Jian Zhou, Yang Tang, Feng Qian

    Abstract: Objective: The artificial pancreas (AP) has shown promising potential in achieving closed-loop glucose control for individuals with type 1 diabetes mellitus (T1DM). However, designing an effective control policy for the AP remains challenging due to the complex physiological processes, delayed insulin response, and inaccurate glucose measurements. While model predictive control (MPC) offers safety… ▽ More

    Submitted 13 July, 2023; v1 submitted 12 July, 2023; originally announced July 2023.

    Comments: 12 pages

  26. arXiv:2307.05717  [pdf, other

    cs.OH

    Towards Mobility Data Science (Vision Paper)

    Authors: Mohamed Mokbel, Mahmoud Sakr, Li Xiong, Andreas Züfle, Jussara Almeida, Taylor Anderson, Walid Aref, Gennady Andrienko, Natalia Andrienko, Yang Cao, Sanjay Chawla, Reynold Cheng, Panos Chrysanthis, Xiqi Fei, Gabriel Ghinita, Anita Graser, Dimitrios Gunopulos, Christian Jensen, Joon-Seok Kim, Kyoung-Sook Kim, Peer Kröger, John Krumm, Johannes Lauer, Amr Magdy, Mario Nascimento , et al. (23 additional authors not shown)

    Abstract: Mobility data captures the locations of moving objects such as humans, animals, and cars. With the availability of GPS-equipped mobile devices and other inexpensive location-tracking technologies, mobility data is collected ubiquitously. In recent years, the use of mobility data has demonstrated significant impact in various domains including traffic management, urban planning, and health sciences… ▽ More

    Submitted 7 March, 2024; v1 submitted 21 June, 2023; originally announced July 2023.

    Comments: Updated to reflect the major revision for ACM Transactions on Spatial Algorithms and Systems (TSAS). This version reflects the final version accepted by ACM TSAS

  27. arXiv:2307.04964  [pdf, other

    cs.CL cs.AI cs.LG

    Secrets of RLHF in Large Language Models Part I: PPO

    Authors: Rui Zheng, Shihan Dou, Songyang Gao, Yuan Hua, Wei Shen, Binghai Wang, Yan Liu, Senjie Jin, Qin Liu, Yuhao Zhou, Limao Xiong, Lu Chen, Zhiheng Xi, Nuo Xu, Wenbin Lai, Minghao Zhu, Cheng Chang, Zhangyue Yin, Rongxiang Weng, Wensen Cheng, Haoran Huang, Tianxiang Sun, Hang Yan, Tao Gui, Qi Zhang , et al. (2 additional authors not shown)

    Abstract: Large language models (LLMs) have formulated a blueprint for the advancement of artificial general intelligence. Its primary objective is to function as a human-centric (helpful, honest, and harmless) assistant. Alignment with humans assumes paramount significance, and reinforcement learning with human feedback (RLHF) emerges as the pivotal technological paradigm underpinning this pursuit. Current… ▽ More

    Submitted 18 July, 2023; v1 submitted 10 July, 2023; originally announced July 2023.

  28. arXiv:2306.12608  [pdf, other

    cs.CR

    DP-BREM: Differentially-Private and Byzantine-Robust Federated Learning with Client Momentum

    Authors: Xiaolan Gu, Ming Li, Li Xiong

    Abstract: Federated Learning (FL) allows multiple participating clients to train machine learning models collaboratively while keeping their datasets local and only exchanging the gradient or model updates with a coordinating server. Existing FL protocols are vulnerable to attacks that aim to compromise data privacy and/or model robustness. Recently proposed defenses focused on ensuring either privacy or ro… ▽ More

    Submitted 10 May, 2024; v1 submitted 21 June, 2023; originally announced June 2023.

  29. arXiv:2306.07142  [pdf

    eess.SY cs.AI cs.MA cs.RO

    Evolving Testing Scenario Generation Method and Intelligence Evaluation Framework for Automated Vehicles

    Authors: Yining Ma, Wei Jiang, Lingtong Zhang, Junyi Chen, Hong Wang, Chen Lv, Xuesong Wang, Lu Xiong

    Abstract: Interaction between the background vehicles (BVs) and automated vehicles (AVs) in scenario-based testing plays a critical role in evaluating the intelligence of the AVs. Current testing scenarios typically employ predefined or scripted BVs, which inadequately reflect the complexity of human-like social behaviors in real-world driving scenarios, and also lack a systematic metric for evaluating the… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: 18 pages,17 figures

  30. arXiv:2305.12485  [pdf, other

    cs.CL cs.AI

    A Confidence-based Partial Label Learning Model for Crowd-Annotated Named Entity Recognition

    Authors: Limao Xiong, Jie Zhou, Qunxi Zhu, Xiao Wang, Yuanbin Wu, Qi Zhang, Tao Gui, Xuanjing Huang, Jin Ma, Ying Shan

    Abstract: Existing models for named entity recognition (NER) are mainly based on large-scale labeled datasets, which always obtain using crowdsourcing. However, it is hard to obtain a unified and correct label via majority voting from multiple annotators for NER due to the large labeling space and complexity of this task. To address this problem, we aim to utilize the original multi-annotator labels directl… ▽ More

    Submitted 27 July, 2023; v1 submitted 21 May, 2023; originally announced May 2023.

  31. arXiv:2304.06929  [pdf

    cs.CR

    Advancing Differential Privacy: Where We Are Now and Future Directions for Real-World Deployment

    Authors: Rachel Cummings, Damien Desfontaines, David Evans, Roxana Geambasu, Yangsibo Huang, Matthew Jagielski, Peter Kairouz, Gautam Kamath, Sewoong Oh, Olga Ohrimenko, Nicolas Papernot, Ryan Rogers, Milan Shen, Shuang Song, Weijie Su, Andreas Terzis, Abhradeep Thakurta, Sergei Vassilvitskii, Yu-Xiang Wang, Li Xiong, Sergey Yekhanin, Da Yu, Huanyu Zhang, Wanrong Zhang

    Abstract: In this article, we present a detailed review of current practices and state-of-the-art methodologies in the field of differential privacy (DP), with a focus of advancing DP's deployment in real-world applications. Key points and high-level contents of the article were originated from the discussions from "Differential Privacy (DP): Challenges Towards the Next Frontier," a workshop held in July 20… ▽ More

    Submitted 12 March, 2024; v1 submitted 14 April, 2023; originally announced April 2023.

  32. arXiv:2304.05516  [pdf, other

    cs.CR cs.LG

    Echo of Neighbors: Privacy Amplification for Personalized Private Federated Learning with Shuffle Model

    Authors: Yixuan Liu, Suyun Zhao, Li Xiong, Yuhan Liu, Hong Chen

    Abstract: Federated Learning, as a popular paradigm for collaborative training, is vulnerable against privacy attacks. Different privacy levels regarding users' attitudes need to be satisfied locally, while a strict privacy guarantee for the global model is also required centrally. Personalized Local Differential Privacy (PLDP) is suitable for preserving users' varying local privacy, yet only provides a cen… ▽ More

    Submitted 26 May, 2023; v1 submitted 11 April, 2023; originally announced April 2023.

  33. arXiv:2303.12787  [pdf, other

    cs.CV

    EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation

    Authors: Hansheng Chen, Wei Tian, Pichao Wang, Fan Wang, Lu Xiong, Hao Li

    Abstract: Locating 3D objects from a single RGB image via Perspective-n-Point (PnP) is a long-standing problem in computer vision. Driven by end-to-end deep learning, recent studies suggest interpreting PnP as a differentiable layer, allowing for partial learning of 2D-3D point correspondences by backpropagating the gradients of pose loss. Yet, learning the entire correspondences from scratch is highly chal… ▽ More

    Submitted 17 December, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

    Comments: Code available at https://github.com/tjiiv-cprg/EPro-PnP-v2. Revised and fixed typos. arXiv admin note: substantial text overlap with arXiv:2203.13254

  34. arXiv:2303.12357  [pdf, other

    cs.LG cs.AI

    Wasserstein Adversarial Examples on Univariant Time Series Data

    Authors: Wenjie Wang, Li Xiong, Jian Lou

    Abstract: Adversarial examples are crafted by adding indistinguishable perturbations to normal examples in order to fool a well-trained deep learning model to misclassify. In the context of computer vision, this notion of indistinguishability is typically bounded by $L_{\infty}$ or other norms. However, these norms are not appropriate for measuring indistinguishiability for time series data. In this work, w… ▽ More

    Submitted 22 March, 2023; originally announced March 2023.

  35. arXiv:2303.05665  [pdf, other

    cs.RO

    A Systematic Survey of Control Techniques and Applications in Connected and Automated Vehicles

    Authors: Wei Liu, Min Hua, Zhiyun Deng, Zonglin Meng, Yanjun Huang, Chuan Hu, Shunhui Song, Letian Gao, Changsheng Liu, Bin Shuai, Amir Khajepour, Lu Xiong, Xin Xia

    Abstract: Vehicle control is one of the most critical challenges in autonomous vehicles (AVs) and connected and automated vehicles (CAVs), and it is paramount in vehicle safety, passenger comfort, transportation efficiency, and energy saving. This survey attempts to provide a comprehensive and thorough overview of the current state of vehicle control technology, focusing on the evolution from vehicle state… ▽ More

    Submitted 11 April, 2023; v1 submitted 9 March, 2023; originally announced March 2023.

  36. arXiv:2212.14621  [pdf, other

    cs.LG cs.AI

    Label-Efficient Interactive Time-Series Anomaly Detection

    Authors: Hong Guo, Yujing Wang, Jieyu Zhang, Zhengjie Lin, Yunhai Tong, Lei Yang, Luoxing Xiong, Congrui Huang

    Abstract: Time-series anomaly detection is an important task and has been widely applied in the industry. Since manual data annotation is expensive and inefficient, most applications adopt unsupervised anomaly detection methods, but the results are usually sub-optimal and unsatisfactory to end customers. Weak supervision is a promising paradigm for obtaining considerable labels in a low-cost way, which enab… ▽ More

    Submitted 30 December, 2022; originally announced December 2022.

  37. arXiv:2212.03623  [pdf, other

    cs.CV

    An Intuitive and Unconstrained 2D Cube Representation for Simultaneous Head Detection and Pose Estimation

    Authors: Huayi Zhou, Fei Jiang, Lili Xiong, Hongtao Lu

    Abstract: Most recent head pose estimation (HPE) methods are dominated by the Euler angle representation. To avoid its inherent ambiguity problem of rotation labels, alternative quaternion-based and vector-based representations are introduced. However, they both are not visually intuitive, and often derived from equivocal Euler angle labels. In this paper, we present a novel single-stage keypoint-based meth… ▽ More

    Submitted 7 December, 2022; originally announced December 2022.

  38. arXiv:2211.14042  [pdf, other

    cs.LG cs.AI q-bio.BM

    Molecular Joint Representation Learning via Multi-modal Information

    Authors: Tianyu Wu, Yang Tang, Qiyu Sun, Luolin Xiong

    Abstract: In recent years, artificial intelligence has played an important role on accelerating the whole process of drug discovery. Various of molecular representation schemes of different modals (e.g. textual sequence or graph) are developed. By digitally encoding them, different chemical information can be learned through corresponding network structures. Molecular graphs and Simplified Molecular Input L… ▽ More

    Submitted 25 November, 2022; originally announced November 2022.

  39. arXiv:2211.03127  [pdf, other

    cs.HC cs.CV

    StuArt: Individualized Classroom Observation of Students with Automatic Behavior Recognition and Tracking

    Authors: Huayi Zhou, Fei Jiang, Jiaxin Si, Lili Xiong, Hongtao Lu

    Abstract: Each student matters, but it is hardly for instructors to observe all the students during the courses and provide helps to the needed ones immediately. In this paper, we present StuArt, a novel automatic system designed for the individualized classroom observation, which empowers instructors to concern the learning status of each student. StuArt can recognize five representative student behaviors… ▽ More

    Submitted 13 March, 2023; v1 submitted 6 November, 2022; originally announced November 2022.

    Comments: accepted by ICASSP2023. Novel pedagogical approaches in signal processing for K-12 education

  40. arXiv:2211.01628  [pdf, other

    cs.LG cs.CR

    Private Semi-supervised Knowledge Transfer for Deep Learning from Noisy Labels

    Authors: Qiuchen Zhang, Jing Ma, Jian Lou, Li Xiong, Xiaoqian Jiang

    Abstract: Deep learning models trained on large-scale data have achieved encouraging performance in many real-world tasks. Meanwhile, publishing those models trained on sensitive datasets, such as medical records, could pose serious privacy concerns. To counter these issues, one of the current state-of-the-art approaches is the Private Aggregation of Teacher Ensembles, or PATE, which achieved promising resu… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

  41. DPAR: Decoupled Graph Neural Networks with Node-Level Differential Privacy

    Authors: Qiuchen Zhang, Hong kyu Lee, Jing Ma, Jian Lou, Carl Yang, Li Xiong

    Abstract: Graph Neural Networks (GNNs) have achieved great success in learning with graph-structured data. Privacy concerns have also been raised for the trained models which could expose the sensitive information of graphs including both node features and the structure information. In this paper, we aim to achieve node-level differential privacy (DP) for training GNNs so that a node and its edges are prote… ▽ More

    Submitted 14 March, 2024; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: Accepted to The 2024 Web Conference

  42. arXiv:2209.15091  [pdf, other

    cs.CR

    L-SRR: Local Differential Privacy for Location-Based Services with Staircase Randomized Response

    Authors: Han Wang, Hanbin Hong, Li Xiong, Zhan Qin, Yuan Hong

    Abstract: Location-based services (LBS) have been significantly developed and widely deployed in mobile devices. It is also well-known that LBS applications may result in severe privacy concerns by collecting sensitive locations. A strong privacy model ''local differential privacy'' (LDP) has been recently deployed in many different applications (e.g., Google RAPPOR, iOS, and Microsoft Telemetry) but not ef… ▽ More

    Submitted 29 September, 2022; originally announced September 2022.

    Comments: accepted to CCS'22; full version

  43. arXiv:2209.06359  [pdf, other

    cs.LG cs.AI

    Federated Pruning: Improving Neural Network Efficiency with Federated Learning

    Authors: Rongmei Lin, Yonghui Xiao, Tien-Ju Yang, Ding Zhao, Li Xiong, Giovanni Motta, Françoise Beaufays

    Abstract: Automatic Speech Recognition models require large amount of speech data for training, and the collection of such data often leads to privacy concerns. Federated learning has been widely used and is considered to be an effective decentralized technique by collaboratively learning a shared prediction model while keeping the data local on different clients devices. However, the limited computation an… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: To appear in INTERSPEECH 2022

  44. arXiv:2208.00993  [pdf, ps, other

    cs.LG

    MULTIPAR: Supervised Irregular Tensor Factorization with Multi-task Learning

    Authors: Yifei Ren, Jian Lou, Li Xiong, Joyce C Ho, Xiaoqian Jiang, Sivasubramanium Bhavani

    Abstract: Tensor factorization has received increasing interest due to its intrinsic ability to capture latent factors in multi-dimensional data with many applications such as recommender systems and Electronic Health Records (EHR) mining. PARAFAC2 and its variants have been proposed to address irregular tensors where one of the tensor modes is not aligned, e.g., different users in recommender systems or pa… ▽ More

    Submitted 9 August, 2022; v1 submitted 1 August, 2022; originally announced August 2022.

  45. NASRec: Weight Sharing Neural Architecture Search for Recommender Systems

    Authors: Tunhou Zhang, Dehua Cheng, Yuchen He, Zhengxing Chen, Xiaoliang Dai, Liang Xiong, Feng Yan, Hai Li, Yiran Chen, Wei Wen

    Abstract: The rise of deep neural networks offers new opportunities in optimizing recommender systems. However, optimizing recommender systems using deep neural networks requires delicate architecture fabrication. We propose NASRec, a paradigm that trains a single supernet and efficiently produces abundant models/sub-architectures by weight sharing. To overcome the data multi-modality and architecture heter… ▽ More

    Submitted 12 February, 2023; v1 submitted 14 July, 2022; originally announced July 2022.

    Comments: Proceedings of the ACM Web Conference 2023 (WWW'23)

    Journal ref: Proceedings of the ACM Web Conference 2023 (WWW'23)

  46. arXiv:2204.04391  [pdf, other

    cs.CL cs.IR

    MINER: Improving Out-of-Vocabulary Named Entity Recognition from an Information Theoretic Perspective

    Authors: Xiao Wang, Shihan Dou, Limao Xiong, Yicheng Zou, Qi Zhang, Tao Gui, Liang Qiao, Zhanzhan Cheng, Xuanjing Huang

    Abstract: NER model has achieved promising performance on standard NER benchmarks. However, recent studies show that previous approaches may over-rely on entity mention information, resulting in poor performance on out-of-vocabulary (OOV) entity recognition. In this work, we propose MINER, a novel NER learning framework, to remedy this issue from an information-theoretic perspective. The proposed approach c… ▽ More

    Submitted 3 May, 2022; v1 submitted 9 April, 2022; originally announced April 2022.

    Comments: Accepted as a long paper at ACL 2022

  47. Adaptively Enhancing Facial Expression Crucial Regions via Local Non-Local Joint Network

    Authors: Guanghui Shi, Shasha Mao, Shuiping Gou, Dandan Yan, Licheng Jiao, Lin Xiong

    Abstract: Facial expression recognition (FER) is still one challenging research due to the small inter-class discrepancy in the facial expression data. In view of the significance of facial crucial regions for FER, many existing researches utilize the prior information from some annotated crucial points to improve the performance of FER. However, it is complicated and time-consuming to manually annotate fac… ▽ More

    Submitted 27 February, 2024; v1 submitted 26 March, 2022; originally announced March 2022.

    Report number: SN-2731-5398

    Journal ref: Machine Intelligence Research, vol. 21, pp. 331-348, 2024

  48. arXiv:2203.13254  [pdf, other

    cs.CV

    EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation

    Authors: Hansheng Chen, Pichao Wang, Fan Wang, Wei Tian, Lu Xiong, Hao Li

    Abstract: Locating 3D objects from a single RGB image via Perspective-n-Points (PnP) is a long-standing problem in computer vision. Driven by end-to-end deep learning, recent studies suggest interpreting PnP as a differentiable layer, so that 2D-3D point correspondences can be partly learned by backpropagating the gradient w.r.t. object pose. Yet, learning the entire set of unrestricted 2D-3D points from sc… ▽ More

    Submitted 11 August, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

    Comments: CVPR 2022 Best Student Paper, code available at https://github.com/tjiiv-cprg/EPro-PnP

  49. arXiv:2202.04816  [pdf, other

    cs.RO

    Scale Estimation with Dual Quadrics for Monocular Object SLAM

    Authors: Shuangfu Song, Junqiao Zhao, Tiantian Feng, Chen Ye, Lu Xiong

    Abstract: The scale ambiguity problem is inherently unsolvable to monocular SLAM without the metric baseline between moving cameras. In this paper, we present a novel scale estimation approach based on an object-level SLAM system. To obtain the absolute scale of the reconstructed map, we derive a nonlinear optimization method to make the scaled dimensions of objects conforming to the distribution of their s… ▽ More

    Submitted 2 November, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

    Comments: 8 pages, 6 figures, accepted by IROS2022

  50. arXiv:2112.07768  [pdf, other

    cs.LG cs.AI

    Efficient Dynamic Graph Representation Learning at Scale

    Authors: Xinshi Chen, Yan Zhu, Haowen Xu, Mengyang Liu, Liang Xiong, Muhan Zhang, Le Song

    Abstract: Dynamic graphs with ordered sequences of events between nodes are prevalent in real-world industrial applications such as e-commerce and social platforms. However, representation learning for dynamic graphs has posed great computational challenges due to the time and structure dependency and irregular nature of the data, preventing such models from being deployed to real-world applications. To tac… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.