Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 75 results for author: Yao, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.06498  [pdf, other

    cs.HC

    Enhancing spatial auditory attention decoding with neuroscience-inspired prototype training

    Authors: Zelin Qiu, Jianjun Gu, Dingding Yao, Junfeng Li

    Abstract: The spatial auditory attention decoding (Sp-AAD) technology aims to determine the direction of auditory attention in multi-talker scenarios via neural recordings. Despite the success of recent Sp-AAD algorithms, their performance is hindered by trial-specific features in EEG data. This study aims to improve decoding performance against these features. Studies in neuroscience indicate that spatial… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  2. arXiv:2407.05869  [pdf, other

    cs.AI

    PORCA: Root Cause Analysis with Partially Observed Data

    Authors: Chang Gong, Di Yao, Jin Wang, Wenbin Li, Lanting Fang, Yongtao Xie, Kaiyu Feng, Peng Han, Jingping Bi

    Abstract: Root Cause Analysis (RCA) aims at identifying the underlying causes of system faults by uncovering and analyzing the causal structure from complex systems. It has been widely used in many application domains. Reliable diagnostic conclusions are of great importance in mitigating system failures and financial losses. However, previous studies implicitly assume a full observation of the system, which… ▽ More

    Submitted 11 July, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

  3. arXiv:2407.00541  [pdf

    cs.CL cs.AI cs.IR

    Answering real-world clinical questions using large language model based systems

    Authors: Yen Sia Low, Michael L. Jackson, Rebecca J. Hyde, Robert E. Brown, Neil M. Sanghavi, Julian D. Baldwin, C. William Pike, Jananee Muralidharan, Gavin Hui, Natasha Alexander, Hadeel Hassan, Rahul V. Nene, Morgan Pike, Courtney J. Pokrzywa, Shivam Vedak, Adam Paul Yan, Dong-han Yao, Amy R. Zipursky, Christina Dinh, Philip Ballentine, Dan C. Derieg, Vladimir Polony, Rehan N. Chawdry, Jordan Davies, Brigham B. Hyde , et al. (2 additional authors not shown)

    Abstract: Evidence to guide healthcare decisions is often limited by a lack of relevant and trustworthy literature as well as difficulty in contextualizing existing research for a specific patient. Large language models (LLMs) could potentially address both challenges by either summarizing published literature or generating new studies based on real-world data (RWD). We evaluated the ability of five LLM-bas… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: 28 pages (2 figures, 3 tables) inclusive of 8 pages of supplemental materials (4 supplemental figures and 4 supplemental tables)

  4. arXiv:2407.00014  [pdf

    cs.RO eess.SY

    Simplifying Kinematic Parameter Estimation in sEMG Prosthetic Hands: A Two-Point Approach

    Authors: Gang Liu, Zhenxiang Wang, Ziyang He, Shanshan Guo, Rui Zhang, Dezhong Yao

    Abstract: Regression-based sEMG prosthetic hands are widely used for their ability to provide continuous kinematic parameters. However, establishing these models traditionally requires complex kinematic sensor systems to collect corresponding kinematic data in synchronization with EMG, which is cumbersome and user-unfriendly. This paper presents a simplified approach utilizing only two data points to depict… ▽ More

    Submitted 1 May, 2024; originally announced July 2024.

    Comments: 13 pages

  5. arXiv:2406.19065  [pdf, other

    cs.CL

    STBench: Assessing the Ability of Large Language Models in Spatio-Temporal Analysis

    Authors: Wenbin Li, Di Yao, Ruibo Zhao, Wenjie Chen, Zijie Xu, Chengxue Luo, Chang Gong, Quanliang Jing, Haining Tan, Jingping Bi

    Abstract: The rapid evolution of large language models (LLMs) holds promise for reforming the methodology of spatio-temporal data mining. However, current works for evaluating the spatio-temporal understanding capability of LLMs are somewhat limited and biased. These works either fail to incorporate the latest language models or only focus on assessing the memorized spatio-temporal knowledge. To address thi… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  6. CausalMMM: Learning Causal Structure for Marketing Mix Modeling

    Authors: Chang Gong, Di Yao, Lei Zhang, Sheng Chen, Wenbin Li, Yueyang Su, Jingping Bi

    Abstract: In online advertising, marketing mix modeling (MMM) is employed to predict the gross merchandise volume (GMV) of brand shops and help decision-makers to adjust the budget allocation of various advertising channels. Traditional MMM methods leveraging regression techniques can fail in handling the complexity of marketing. Although some efforts try to encode the causal structures for better predictio… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: WSDM 2024, full version

  7. HiFGL: A Hierarchical Framework for Cross-silo Cross-device Federated Graph Learning

    Authors: Zhuoning Guo, Duanyi Yao, Qiang Yang, Hao Liu

    Abstract: Federated Graph Learning (FGL) has emerged as a promising way to learn high-quality representations from distributed graph data with privacy preservation. Despite considerable efforts have been made for FGL under either cross-device or cross-silo paradigm, how to effectively capture graph knowledge in a more complicated cross-silo cross-device environment remains an under-explored problem. However… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: Accepted by SIGKDD 2024

  8. Reliable Object Tracking by Multimodal Hybrid Feature Extraction and Transformer-Based Fusion

    Authors: Hongze Sun, Rui Liu, Wuque Cai, Jun Wang, Yue Wang, Huajin Tang, Yan Cui, Dezhong Yao, Daqing Guo

    Abstract: Visual object tracking, which is primarily based on visible light image sequences, encounters numerous challenges in complicated scenarios, such as low light conditions, high dynamic ranges, and background clutter. To address these challenges, incorporating the advantages of multiple visual modalities is a promising solution for achieving reliable object tracking. However, the existing approaches… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 16 pages, 7 figures, 9 tabes; This work has been submitted for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  9. arXiv:2405.16848  [pdf, other

    cs.CV

    A re-calibration method for object detection with multi-modal alignment bias in autonomous driving

    Authors: Zhihang Song, Lihui Peng, Jianming Hu, Danya Yao, Yi Zhang

    Abstract: Multi-modal object detection in autonomous driving has achieved great breakthroughs due to the usage of fusing complementary information from different sensors. The calibration in fusion between sensors such as LiDAR and camera is always supposed to be precise in previous work. However, in reality, calibration matrices are fixed when the vehicles leave the factory, but vibration, bumps, and data l… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 10 pages, 6 figures

  10. arXiv:2405.14953  [pdf, other

    cs.LG cs.AI stat.ML

    Mallows-DPO: Fine-Tune Your LLM with Preference Dispersions

    Authors: Haoxian Chen, Hanyang Zhao, Henry Lam, David Yao, Wenpin Tang

    Abstract: Direct Preference Optimization (DPO) has recently emerged as a popular approach to improve reinforcement learning with human feedback (RLHF), leading to better techniques to fine-tune large language models (LLM). A weakness of DPO, however, lies in its lack of capability to characterize the diversity of human preferences. Inspired by Mallows' theory of preference ranking, we develop in this paper… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  11. arXiv:2405.14291  [pdf, other

    cs.LG cs.AI cs.DC

    Variational Bayes for Federated Continual Learning

    Authors: Dezhong Yao, Sanmu Li, Yutong Dai, Zhiqiang Xu, Shengshan Hu, Peilin Zhao, Lichao Sun

    Abstract: Federated continual learning (FCL) has received increasing attention due to its potential in handling real-world streaming data, characterized by evolving data distributions and varying client classes over time. The constraints of storage limitations and privacy concerns confine local models to exclusively access the present data within each learning cycle. Consequently, this restriction induces p… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  12. arXiv:2405.13888  [pdf, other

    cs.LG stat.ML

    Marrying Causal Representation Learning with Dynamical Systems for Science

    Authors: Dingling Yao, Caroline Muller, Francesco Locatello

    Abstract: Causal representation learning promises to extend causal models to hidden causal variables from raw entangled measurements. However, most progress has focused on proving identifiability results in different settings, and we are not aware of any successful real-world application. At the same time, the field of dynamical systems benefited from deep learning and scaled to countless applications but d… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 21 pages, 8 figures, 6 tables

  13. arXiv:2405.07626  [pdf, other

    cs.LG cs.AI

    AnomalyLLM: Few-shot Anomaly Edge Detection for Dynamic Graphs using Large Language Models

    Authors: Shuo Liu, Di Yao, Lanting Fang, Zhetao Li, Wenbin Li, Kaiyu Feng, XiaoWen Ji, Jingping Bi

    Abstract: Detecting anomaly edges for dynamic graphs aims to identify edges significantly deviating from the normal pattern and can be applied in various domains, such as cybersecurity, financial transactions and AIOps. With the evolving of time, the types of anomaly edges are emerging and the labeled anomaly samples are few for each type. Current methods are either designed to detect randomly inserted edge… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 13pages

  14. arXiv:2405.00696  [pdf, other

    cs.RO

    Life-long Learning and Testing for Automated Vehicles via Adaptive Scenario Sampling as A Continuous Optimization Process

    Authors: Jingwei Ge, Pengbo Wang, Cheng Chang, Yi Zhang, Danya Yao, Li Li

    Abstract: Sampling critical testing scenarios is an essential step in intelligence testing for Automated Vehicles (AVs). However, due to the lack of prior knowledge on the distribution of critical scenarios in sampling space, we can hardly efficiently find the critical scenarios or accurately evaluate the intelligence of AVs. To solve this problem, we formulate the testing as a continuous optimization proce… ▽ More

    Submitted 28 March, 2024; originally announced May 2024.

  15. arXiv:2404.19582  [pdf, other

    cs.LG cs.CR

    Leveraging Label Information for Stealthy Data Stealing in Vertical Federated Learning

    Authors: Duanyi Yao, Songze Li, Xueluan Gong, Sizai Hou, Gaoning Pan

    Abstract: We develop DMAVFL, a novel attack strategy that evades current detection mechanisms. The key idea is to integrate a discriminator with auxiliary classifier that takes a full advantage of the label information (which was completely ignored in previous attacks): on one hand, label information helps to better characterize embeddings of samples from distinct classes, yielding an improved reconstructio… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

  16. arXiv:2403.10801  [pdf, other

    cs.CV

    Securely Fine-tuning Pre-trained Encoders Against Adversarial Examples

    Authors: Ziqi Zhou, Minghui Li, Wei Liu, Shengshan Hu, Yechao Zhang, Wei Wan, Lulu Xue, Leo Yu Zhang, Dezhong Yao, Hai Jin

    Abstract: With the evolution of self-supervised learning, the pre-training paradigm has emerged as a predominant solution within the deep learning landscape. Model providers furnish pre-trained encoders designed to function as versatile feature extractors, enabling downstream users to harness the benefits of expansive models with minimal effort through fine-tuning. Nevertheless, recent works have exposed a… ▽ More

    Submitted 18 March, 2024; v1 submitted 16 March, 2024; originally announced March 2024.

  17. arXiv:2403.08335  [pdf, other

    cs.LG cs.AI stat.ML

    A Sparsity Principle for Partially Observable Causal Representation Learning

    Authors: Danru Xu, Dingling Yao, Sébastien Lachapelle, Perouz Taslakian, Julius von Kügelgen, Francesco Locatello, Sara Magliacane

    Abstract: Causal representation learning aims at identifying high-level causal variables from perceptual data. Most methods assume that all latent causal variables are captured in the high-dimensional observations. We instead consider a partially observed setting, in which each measurement only provides information about a subset of the underlying causal state. Prior work has studied this setting with multi… ▽ More

    Submitted 15 June, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: 45 pages, 32 figures, 16 tables

  18. arXiv:2403.02975   

    cs.CL cs.AI

    A General and Flexible Multi-concept Parsing Framework for Multilingual Semantic Matching

    Authors: Dong Yao

    Abstract: Sentence semantic matching is a research hotspot in natural language processing, which is considerably significant in various key scenarios, such as community question answering, searching, chatbot, and recommendation. Since most of the advanced models directly model the semantic relevance among words between two sentences while neglecting the \textit{keywords} and \textit{intents} concepts of the… ▽ More

    Submitted 3 April, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: arXiv admin comment: This version has been removed by arXiv administrators as the submitter did not have the rights to agree to the license at the time of submission

  19. arXiv:2402.01348  [pdf, other

    cs.LG cs.AI

    CORE: Mitigating Catastrophic Forgetting in Continual Learning through Cognitive Replay

    Authors: Jianshu Zhang, Yankai Fu, Ziheng Peng, Dongyu Yao, Kun He

    Abstract: This paper introduces a novel perspective to significantly mitigate catastrophic forgetting in continuous learning (CL), which emphasizes models' capacity to preserve existing knowledge and assimilate new information. Current replay-based methods treat every task and data sample equally and thus can not fully exploit the potential of the replay buffer. In response, we propose COgnitive REplay (COR… ▽ More

    Submitted 9 April, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: Accepted by CogSci24 as oral presentation

  20. arXiv:2401.16687  [pdf, other

    cs.CR cs.LG

    Revisiting Gradient Pruning: A Dual Realization for Defending against Gradient Attacks

    Authors: Lulu Xue, Shengshan Hu, Ruizhi Zhao, Leo Yu Zhang, Shengqing Hu, Lichao Sun, Dezhong Yao

    Abstract: Collaborative learning (CL) is a distributed learning framework that aims to protect user privacy by allowing users to jointly train a model by sharing their gradient updates only. However, gradient inversion attacks (GIAs), which recover users' training data from shared gradients, impose severe privacy threats to CL. Existing defense methods adopt different techniques, e.g., differential privacy,… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  21. arXiv:2401.15668  [pdf, other

    cs.CV

    Lips Are Lying: Spotting the Temporal Inconsistency between Audio and Visual in Lip-Syncing DeepFakes

    Authors: Weifeng Liu, Tianyi She, Jiawei Liu, Run Wang, Dongyu Yao, Ziyou Liang

    Abstract: In recent years, DeepFake technology has achieved unprecedented success in high-quality video synthesis, whereas these methods also pose potential and severe security threats to humanity. DeepFake can be bifurcated into entertainment applications like face swapping and illicit uses such as lip-syncing fraud. However, lip-forgery videos, which neither change identity nor have discernible visual art… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

    Comments: The first two authors contributed equally to this work

  22. arXiv:2401.11089  [pdf, other

    cs.CR cs.AI cs.DC cs.IR

    FedRKG: A Privacy-preserving Federated Recommendation Framework via Knowledge Graph Enhancement

    Authors: Dezhong Yao, Tongtong Liu, Qi Cao, Hai Jin

    Abstract: Federated Learning (FL) has emerged as a promising approach for preserving data privacy in recommendation systems by training models locally. Recently, Graph Neural Networks (GNN) have gained popularity in recommendation tasks due to their ability to capture high-order interactions between users and items. However, privacy concerns prevent the global sharing of the entire user-item graph. To addre… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

  23. arXiv:2312.06197  [pdf, other

    cs.SD cs.MM eess.AS

    MART: Learning Hierarchical Music Audio Representations with Part-Whole Transformer

    Authors: Dong Yao, Jieming Zhu, Jiahao Xun, Shengyu Zhang, Zhou Zhao, Liqun Deng, Wenqiao Zhang, Zhenhua Dong, Xin Jiang

    Abstract: Recent research in self-supervised contrastive learning of music representations has demonstrated remarkable results across diverse downstream tasks. However, a prevailing trend in existing methods involves representing equally-sized music clips in either waveform or spectrogram formats, often overlooking the intrinsic part-whole hierarchies within music. In our quest to comprehend the bottom-up s… ▽ More

    Submitted 19 April, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: Short paper accepted by WWW 2024. This is revised and condensed based on the previous version titled "Music-PAW: Learning Music Representations via Hierarchical Part-whole Interaction and Contrast". For more experimental details and discussions, please refer to the original long paper at arXiv:2312.06197v1

  24. arXiv:2311.04056  [pdf, other

    cs.LG cs.AI

    Multi-View Causal Representation Learning with Partial Observability

    Authors: Dingling Yao, Danru Xu, Sébastien Lachapelle, Sara Magliacane, Perouz Taslakian, Georg Martius, Julius von Kügelgen, Francesco Locatello

    Abstract: We present a unified framework for studying the identifiability of representations learned from simultaneously observed views, such as different data modalities. We allow a partially observed setting in which each view constitutes a nonlinear mixture of a subset of underlying latent variables, which can be causally related. We prove that the information shared across all subsets of any number of v… ▽ More

    Submitted 8 March, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: 28 pages, 10 figures, 11 tables

  25. arXiv:2311.04044  [pdf, other

    cs.CL cs.CR

    PrivLM-Bench: A Multi-level Privacy Evaluation Benchmark for Language Models

    Authors: Haoran Li, Dadi Guo, Donghao Li, Wei Fan, Qi Hu, Xin Liu, Chunkit Chan, Duanyi Yao, Yuan Yao, Yangqiu Song

    Abstract: The rapid development of language models (LMs) brings unprecedented accessibility and usage for both models and users. On the one hand, powerful LMs achieve state-of-the-art performance over numerous downstream NLP tasks. On the other hand, more and more attention is paid to unrestricted model accesses that may bring malicious privacy risks of data leakage. To address these issues, many recent wor… ▽ More

    Submitted 1 June, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: To appear at ACL 2024

  26. arXiv:2310.15930  [pdf, other

    cs.SD eess.AS

    CDSD: Chinese Dysarthria Speech Database

    Authors: Mengyi Sun, Ming Gao, Xinchen Kang, Shiru Wang, Jun Du, Dengfeng Yao, Su-Jing Wang

    Abstract: We present the Chinese Dysarthria Speech Database (CDSD) as a valuable resource for dysarthria research. This database comprises speech data from 24 participants with dysarthria. Among these participants, one recorded an additional 10 hours of speech data, while each recorded one hour, resulting in 34 hours of speech material. To accommodate participants with varying cognitive levels, our text poo… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: 9 pages, 3 figures

  27. arXiv:2310.11994  [pdf

    cs.HC eess.SP q-bio.NC

    Spectral homogeneity cross frequencies can be a quality metric for the large-scale resting EEG preprocessing

    Authors: Shiang Hu, Jie Ruan, Nicolas Langer, Jorge Bosch-Bayard, Zhao Lv, Dezhong Yao, Pedro Antonio Valdes-Sosa

    Abstract: The brain projects require the collection of massive electrophysiological data, aiming to the longitudinal, sectional, or populational neuroscience studies. Quality metrics automatically label the data after centralized preprocessing. However, although the waveforms-based metrics are partially useful, they may be unreliable by neglecting the spectral profiles. Here, we detected the phenomenon of p… ▽ More

    Submitted 4 December, 2023; v1 submitted 18 October, 2023; originally announced October 2023.

  28. arXiv:2310.06119  [pdf, other

    cs.LG cs.AI

    Exploring Progress in Multivariate Time Series Forecasting: Comprehensive Benchmarking and Heterogeneity Analysis

    Authors: Zezhi Shao, Fei Wang, Yongjun Xu, Wei Wei, Chengqing Yu, Zhao Zhang, Di Yao, Guangyin Jin, Xin Cao, Gao Cong, Christian S. Jensen, Xueqi Cheng

    Abstract: Multivariate Time Series (MTS) widely exists in real-word complex systems, such as traffic and energy systems, making their forecasting crucial for understanding and influencing these systems. Recently, deep learning-based approaches have gained much popularity for effectively modeling temporal and spatial dependencies in MTS, specifically in Long-term Time Series Forecasting (LTSF) and Spatial-Te… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

  29. FuzzLLM: A Novel and Universal Fuzzing Framework for Proactively Discovering Jailbreak Vulnerabilities in Large Language Models

    Authors: Dongyu Yao, Jianshu Zhang, Ian G. Harris, Marcel Carlsson

    Abstract: Jailbreak vulnerabilities in Large Language Models (LLMs), which exploit meticulously crafted prompts to elicit content that violates service guidelines, have captured the attention of research communities. While model owners can defend against individual jailbreak prompts through safety training strategies, this relatively passive approach struggles to handle the broader category of similar jailb… ▽ More

    Submitted 14 April, 2024; v1 submitted 11 September, 2023; originally announced September 2023.

    Comments: Publish by ICASSP 2024 on 3/18/2024; Extended Arxiv version

  30. arXiv:2308.13881  [pdf, other

    cs.GT q-fin.GN

    Transaction fee mechanism for Proof-of-Stake protocol

    Authors: Wenpin Tang, David D. Yao

    Abstract: We study a mechanism design problem in the blockchain proof-of-stake (PoS) protocol. Our main objective is to extend the transaction fee mechanism (TFM) recently proposed in Chung and Shi (SODA, p.3856-3899, 2023), so as to incorporate a long-run utility model for the miner into the burning second-price auction mechanism $\texttt{BSP}(γ)$ proposed in Chung and Shi (where $γ$ is a key parameter in… ▽ More

    Submitted 29 August, 2023; v1 submitted 26 August, 2023; originally announced August 2023.

    Comments: 18 pages, 3 figures

  31. arXiv:2307.07972  [pdf, other

    cs.CV

    Dual-level Interaction for Domain Adaptive Semantic Segmentation

    Authors: Dongyu Yao, Boheng Li

    Abstract: Self-training approach recently secures its position in domain adaptive semantic segmentation, where a model is trained with target domain pseudo-labels. Current advances have mitigated noisy pseudo-labels resulting from the domain gap. However, they still struggle with erroneous pseudo-labels near the boundaries of the semantic classifier. In this paper, we tackle this issue by proposing a dual-l… ▽ More

    Submitted 10 August, 2023; v1 submitted 16 July, 2023; originally announced July 2023.

    Comments: Accepted to ICCVW on Uncertainty Quantification for Computer Vision (UnCV), 2023

  32. arXiv:2305.18901  [pdf, other

    cs.LG math.OC

    Policy Optimization for Continuous Reinforcement Learning

    Authors: Hanyang Zhao, Wenpin Tang, David D. Yao

    Abstract: We study reinforcement learning (RL) in the setting of continuous time and space, for an infinite horizon with a discounted objective and the underlying dynamics driven by a stochastic differential equation. Built upon recent advances in the continuous approach to RL, we develop a notion of occupation time (specifically for a discounted objective), and show how it can be effectively used to derive… ▽ More

    Submitted 18 October, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

  33. arXiv:2305.01915  [pdf, other

    cs.IR cs.MM

    Denoising Multi-modal Sequential Recommenders with Contrastive Learning

    Authors: Dong Yao, Shengyu Zhang, Zhou Zhao, Jieming Zhu, Wenqiao Zhang, Rui Zhang, Xiaofei He, Fei Wu

    Abstract: There is a rapidly-growing research interest in engaging users with multi-modal data for accurate user modeling on recommender systems. Existing multimedia recommenders have achieved substantial improvements by incorporating various modalities and devising delicate modules. However, when users decide to interact with items, most of them do not fully read the content of all modalities. We refer to… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

  34. arXiv:2304.13407  [pdf, ps, other

    cs.LG cs.CR cs.IT

    FedVS: Straggler-Resilient and Privacy-Preserving Vertical Federated Learning for Split Models

    Authors: Songze Li, Duanyi Yao, Jin Liu

    Abstract: In a vertical federated learning (VFL) system consisting of a central server and many distributed clients, the training data are vertically partitioned such that different features are privately stored on different clients. The problem of split VFL is to train a model split between the server and the clients. This paper aims to address two major challenges in split VFL: 1) performance degradation… ▽ More

    Submitted 6 July, 2023; v1 submitted 26 April, 2023; originally announced April 2023.

    Comments: Accepted to ICML 2023

  35. Synthetic Datasets for Autonomous Driving: A Survey

    Authors: Zhihang Song, Zimin He, Xingyu Li, Qiming Ma, Ruibo Ming, Zhiqi Mao, Huaxin Pei, Lihui Peng, Jianming Hu, Danya Yao, Yi Zhang

    Abstract: Autonomous driving techniques have been flourishing in recent years while thirsting for huge amounts of high-quality data. However, it is difficult for real-world datasets to keep up with the pace of changing requirements due to their expensive and time-consuming experimental and labeling costs. Therefore, more and more researchers are turning to synthetic datasets to easily generate rich and chan… ▽ More

    Submitted 27 February, 2024; v1 submitted 24 April, 2023; originally announced April 2023.

    Comments: 19 pages, 5 figures

    Journal ref: in IEEE Transactions on Intelligent Vehicles, vol. 9, no. 1, pp. 1847-1864, Jan. 2024

  36. arXiv:2304.07735  [pdf, other

    cs.CR

    Permutation Equivariance of Transformers and Its Applications

    Authors: Hengyuan Xu, Liyao Xiang, Hangyu Ye, Dixi Yao, Pengzhi Chu, Baochun Li

    Abstract: Revolutionizing the field of deep learning, Transformer-based models have achieved remarkable performance in many tasks. Recent research has recognized these models are robust to shuffling but are limited to inter-token permutation in the forward propagation. In this work, we propose our definition of permutation equivariance, a broader concept covering both inter- and intra- token permutation in… ▽ More

    Submitted 31 March, 2024; v1 submitted 16 April, 2023; originally announced April 2023.

    Comments: Accepted by CVPR 2024

  37. arXiv:2304.01347  [pdf

    q-bio.NC cs.LG cs.MM

    Temporal Dynamic Synchronous Functional Brain Network for Schizophrenia Diagnosis and Lateralization Analysis

    Authors: Cheng Zhu, Ying Tan, Shuqi Yang, Jiaqing Miao, Jiayi Zhu, Huan Huang, Dezhong Yao, Cheng Luo

    Abstract: The available evidence suggests that dynamic functional connectivity (dFC) can capture time-varying abnormalities in brain activity in resting-state cerebral functional magnetic resonance imaging (rs-fMRI) data and has a natural advantage in uncovering mechanisms of abnormal brain activity in schizophrenia(SZ) patients. Hence, an advanced dynamic brain network analysis model called the temporal br… ▽ More

    Submitted 11 September, 2023; v1 submitted 30 March, 2023; originally announced April 2023.

  38. arXiv:2303.10112  [pdf, other

    cs.LG stat.ME

    Causal Discovery from Temporal Data: An Overview and New Perspectives

    Authors: Chang Gong, Di Yao, Chuzhe Zhang, Wenbin Li, Jingping Bi

    Abstract: Temporal data, representing chronological observations of complex systems, has always been a typical data structure that can be widely generated by many domains, such as industry, medicine and finance. Analyzing this type of data is extremely valuable for various applications. Thus, different temporal data analysis tasks, eg, classification, clustering and prediction, have been proposed in the pas… ▽ More

    Submitted 3 August, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

    Comments: 54 pages, 7 figures

  39. arXiv:2302.11106  [pdf, other

    cs.CV

    Multi-Head Feature Pyramid Networks for Breast Mass Detection

    Authors: Hexiang Zhang, Zhenghua Xu, Dan Yao, Shuo Zhang, Junyang Chen, Thomas Lukasiewicz

    Abstract: Analysis of X-ray images is one of the main tools to diagnose breast cancer. The ability to quickly and accurately detect the location of masses from the huge amount of image data is the key to reducing the morbidity and mortality of breast cancer. Currently, the main factor limiting the accuracy of breast mass detection is the unequal focus on the mass boxes, leading the network to focus too much… ▽ More

    Submitted 21 February, 2023; originally announced February 2023.

    Comments: 7 pages, 3 figures,Received by the ICASSP2023

  40. Semantic Network Model for Sign Language Comprehension

    Authors: Xinchen Kang, Dengfeng Yao, Minghu Jiang, Yunlong Huang, Fanshu Li

    Abstract: In this study, the authors propose a computational cognitive model for sign language (SL) perception and comprehension with detailed algorithmic descriptions based on cognitive functionalities in human language processing. The semantic network model (SNM) that represents semantic relations between concepts, it is used as a form of knowledge representation. The proposed model is applied in the comp… ▽ More

    Submitted 27 January, 2023; originally announced January 2023.

    Comments: 19 pages, 6 figures and 1 table

    Journal ref: Kang, X., Yao, D., Jiang, M., Huang, Y., & Li, F. (2022). Semantic Network Model for Sign Language Comprehension. International Journal of Cognitive Informatics and Natural Intelligence (IJCINI), 16(1), 1-19

  41. arXiv:2301.05599  [pdf, other

    q-bio.NC cs.AI cs.LG eess.SP

    Short-length SSVEP data extension by a novel generative adversarial networks based framework

    Authors: Yudong Pan, Ning Li, Yangsong Zhang, Peng Xu, Dezhong Yao

    Abstract: Steady-state visual evoked potentials (SSVEPs) based brain-computer interface (BCI) has received considerable attention due to its high information transfer rate (ITR) and available quantity of targets. However, the performance of frequency identification methods heavily hinges on the amount of user calibration data and data length, which hinders the deployment in real-world applications. Recently… ▽ More

    Submitted 2 October, 2023; v1 submitted 13 January, 2023; originally announced January 2023.

    Comments: 16 pages, 9 figures, 4 tables

  42. arXiv:2210.03402  [pdf

    eess.SY cs.LG nlin.AO

    Research on Self-adaptive Online Vehicle Velocity Prediction Strategy Considering Traffic Information Fusion

    Authors: Ziyan Zhang, Junhao Shen, Dongwei Yao, Feng Wu

    Abstract: In order to increase the prediction accuracy of the online vehicle velocity prediction (VVP) strategy, a self-adaptive velocity prediction algorithm fused with traffic information was presented for the multiple scenarios. Initially, traffic scenarios were established inside the co-simulation environment. In addition, the algorithm of a general regressive neural network (GRNN) paired with datasets… ▽ More

    Submitted 7 October, 2022; originally announced October 2022.

    Comments: 9 pages, 7 figures

  43. arXiv:2209.10837  [pdf, other

    cs.CV cs.NE q-bio.NC

    A Spatial-channel-temporal-fused Attention for Spiking Neural Networks

    Authors: Wuque Cai, Hongze Sun, Rui Liu, Yan Cui, Jun Wang, Yang Xia, Dezhong Yao, Daqing Guo

    Abstract: Spiking neural networks (SNNs) mimic brain computational strategies, and exhibit substantial capabilities in spatiotemporal information processing. As an essential factor for human perception, visual attention refers to the dynamic process for selecting salient regions in biological vision systems. Although visual attention mechanisms have achieved great success in computer vision applications, th… ▽ More

    Submitted 28 May, 2023; v1 submitted 22 September, 2022; originally announced September 2022.

    Comments: 14 pages, 9 figures, 5 tabes; This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  44. arXiv:2208.08024  [pdf, other

    cs.IR

    CCL4Rec: Contrast over Contrastive Learning for Micro-video Recommendation

    Authors: Shengyu Zhang, Bofang Li, Dong Yao, Fuli Feng, Jieming Zhu, Wenyan Fan, Zhou Zhao, Xiaofei He, Tat-seng Chua, Fei Wu

    Abstract: Micro-video recommender systems suffer from the ubiquitous noises in users' behaviors, which might render the learned user representation indiscriminating, and lead to trivial recommendations (e.g., popular items) or even weird ones that are far beyond users' interests. Contrastive learning is an emergent technique for learning discriminating representations with random data augmentations. However… ▽ More

    Submitted 16 August, 2022; originally announced August 2022.

    Comments: 11 pages, 4 figures

  45. arXiv:2208.08011  [pdf, other

    cs.IR

    Re4: Learning to Re-contrast, Re-attend, Re-construct for Multi-interest Recommendation

    Authors: Shengyu Zhang, Lingxiao Yang, Dong Yao, Yujie Lu, Fuli Feng, Zhou Zhao, Tat-seng Chua, Fei Wu

    Abstract: Effectively representing users lie at the core of modern recommender systems. Since users' interests naturally exhibit multiple aspects, it is of increasing interest to develop multi-interest frameworks for recommendation, rather than represent each user with an overall embedding. Despite their effectiveness, existing methods solely exploit the encoder (the forward flow) to represent multiple aspe… ▽ More

    Submitted 21 February, 2024; v1 submitted 16 August, 2022; originally announced August 2022.

    Comments: 11 pages, 4 figures, accepted by WWW 2022

  46. arXiv:2206.06129  [pdf, other

    cs.NE cs.AI q-bio.NC

    A Synapse-Threshold Synergistic Learning Approach for Spiking Neural Networks

    Authors: Hongze Sun, Wuque Cai, Baoxin Yang, Yan Cui, Yang Xia, Dezhong Yao, Daqing Guo

    Abstract: Spiking neural networks (SNNs) have demonstrated excellent capabilities in various intelligent scenarios. Most existing methods for training SNNs are based on the concept of synaptic plasticity; however, learning in the realistic brain also utilizes intrinsic non-synaptic mechanisms of neurons. The spike threshold of biological neurons is a critical intrinsic neuronal feature that exhibits rich dy… ▽ More

    Submitted 3 April, 2023; v1 submitted 10 June, 2022; originally announced June 2022.

    Comments: 15 pages, 11 figures, 4 Tables, submitted for publication

  47. arXiv:2203.09129  [pdf, other

    cs.SD cs.LG eess.AS

    Contrastive Learning with Positive-Negative Frame Mask for Music Representation

    Authors: Dong Yao, Zhou Zhao, Shengyu Zhang, Jieming Zhu, Yudong Zhu, Rui Zhang, Xiuqiang He

    Abstract: Self-supervised learning, especially contrastive learning, has made an outstanding contribution to the development of many deep learning research fields. Recently, researchers in the acoustic signal processing field noticed its success and leveraged contrastive learning for better music representation. Typically, existing approaches maximize the similarity between two distorted audio segments samp… ▽ More

    Submitted 3 April, 2022; v1 submitted 17 March, 2022; originally announced March 2022.

    Comments: Accepted by WWW2022

  48. arXiv:2202.12713  [pdf, other

    cs.SI cs.AI cs.LG

    HTGN-BTW: Heterogeneous Temporal Graph Network with Bi-Time-Window Training Strategy for Temporal Link Prediction

    Authors: Chongjian Yue, Lun Du, Qiang Fu, Wendong Bi, Hengyu Liu, Yu Gu, Di Yao

    Abstract: With the development of temporal networks such as E-commerce networks and social networks, the issue of temporal link prediction has attracted increasing attention in recent years. The Temporal Link Prediction task of WSDM Cup 2022 expects a single model that can work well on two kinds of temporal graphs simultaneously, which have quite different characteristics and data properties, to predict whe… ▽ More

    Submitted 25 February, 2022; originally announced February 2022.

    Comments: 5 pages, Second Winner Award at Temporal Link Prediction task of WSDM Cup 2022

  49. arXiv:2202.11025  [pdf, other

    cs.RO

    3D ToF LiDAR in Mobile Robotics: A Review

    Authors: Tao Yang, You Li, Cheng Zhao, Dexin Yao, Guanyin Chen, Li Sun, Tomas Krajnik, Zhi Yan

    Abstract: In the past ten years, the use of 3D Time-of-Flight (ToF) LiDARs in mobile robotics has grown rapidly. Based on our accumulation of relevant research, this article systematically reviews and analyzes the use 3D ToF LiDARs in research and industrial applications. The former includes object detection, robot localization, long-term autonomy, LiDAR data processing under adverse weather conditions, and… ▽ More

    Submitted 22 February, 2022; originally announced February 2022.

    Comments: 16 pages, 10 figures, 5 tables, 4 equations

  50. arXiv:2201.00689  [pdf, other

    cs.IR cs.AI

    CausalMTA: Eliminating the User Confounding Bias for Causal Multi-touch Attribution

    Authors: Di Yao, Chang Gong, Lei Zhang, Sheng Chen, Jingping Bi

    Abstract: Multi-touch attribution (MTA), aiming to estimate the contribution of each advertisement touchpoint in conversion journeys, is essential for budget allocation and automatically advertising. Existing methods first train a model to predict the conversion probability of the advertisement journeys with historical data and calculate the attribution of each touchpoint using counterfactual predictions. A… ▽ More

    Submitted 21 July, 2022; v1 submitted 20 December, 2021; originally announced January 2022.

    Comments: 11 pages, 5 figures This paper has been accepted in KDD 2022