
Showing 1–50 of 211 results for author: Gong, X

Searching in archive cs. Search in all archives.
  1. arXiv:2407.10376  [pdf, other]

    q-bio.NC cs.CL

    Large Language Model-based FMRI Encoding of Language Functions for Subjects with Neurocognitive Disorder

    Authors: Yuejiao Wang, Xianmin Gong, Lingwei Meng, Xixin Wu, Helen Meng

    Abstract: Functional magnetic resonance imaging (fMRI) is essential for developing encoding models that identify functional changes in language-related brain areas of individuals with Neurocognitive Disorders (NCD). While large language model (LLM)-based fMRI encoding has shown promise, existing studies predominantly focus on healthy, young adults, overlooking older NCD populations and cognitive level corre…

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: 5 pages, accepted by Interspeech 2024

  2. arXiv:2407.09992  [pdf, other]

    cs.MM

    TOP: A New Target-Audience Oriented Content Paraphrase Task

    Authors: Boda Lin, Jiaxin Shi, Haolong Yan, Binghao Tang, Xiaocheng Gong, Si Li

    Abstract: Recommendation systems usually recommend existing content to different users. However, in comparison to static recommendation methods, a recommendation logic that dynamically adjusts based on user interest preferences may attract a larger user base. Thus, we consider paraphrasing existing content based on the interests of the users to modify the content to better align with the pr…

    Submitted 13 July, 2024; originally announced July 2024.

    Comments: 8 pages

  3. arXiv:2407.08974  [pdf, other]

    q-bio.QM cs.LG math.GN q-bio.BM

    Topology-enhanced machine learning model (Top-ML) for anticancer peptide prediction

    Authors: Joshua Zhi En Tan, JunJie Wee, Xue Gong, Kelin Xia

    Abstract: Recently, therapeutic peptides have demonstrated great promise for cancer treatment. To explore powerful anticancer peptides, artificial intelligence (AI)-based approaches have been developed to systematically screen potential candidates. However, the lack of efficient featurization of peptides has become a bottleneck for these machine-learning models. In this paper, we propose a topology-enhanced…

    Submitted 12 July, 2024; originally announced July 2024.

  4. Self-consistent Deep Geometric Learning for Heterogeneous Multi-source Spatial Point Data Prediction

    Authors: Dazhou Yu, Xiaoyun Gong, Yun Li, Meikang Qiu, Liang Zhao

    Abstract: Multi-source spatial point data prediction is crucial in fields like environmental monitoring and natural resource management, where integrating data from various sensors is the key to achieving a holistic environmental understanding. Existing models in this area often fall short due to their domain-specific nature and lack a strategy for integrating information from various sources in the absence…

    Submitted 30 June, 2024; originally announced July 2024.

  5. Blockchain Based Zero-Knowledge Proof of Location in IoT

    Authors: Wei Wu, Erwu Liu, Xinglin Gong, Rui Wang

    Abstract: With the development of precise positioning technology, a growing number of location-based services (LBSs) facilitate people's lives. Most LBSs require proof of location (PoL) to prove that the user satisfies the service requirement, which exposes the user's privacy. In this paper, we propose a zero-knowledge proof of location (zk-PoL) protocol to better protect the user's privacy. With the zk-PoL…

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Published in ICC 2020 - 2020 IEEE International Conference on Communications (ICC)

  6. arXiv:2406.17184  [pdf, ps, other]

    cs.LG stat.ML

    Minimax Optimality in Contextual Dynamic Pricing with General Valuation Models

    Authors: Xueping Gong, Jiheng Zhang

    Abstract: Dynamic pricing, the practice of adjusting prices based on contextual factors, has gained significant attention due to its impact on revenue maximization. In this paper, we address the contextual dynamic pricing problem, which involves pricing decisions based on observable product features and customer characteristics. We propose a novel algorithm that achieves improved regret bounds while minimiz…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 29 pages

  7. arXiv:2406.15050  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV

    Tri-VQA: Triangular Reasoning Medical Visual Question Answering for Multi-Attribute Analysis

    Authors: Lin Fan, Xun Gong, Cenyang Zheng, Yafei Ou

    Abstract: Medical Visual Question Answering (Med-VQA) is a challenging research topic with advantages including patient engagement and clinical expert involvement for second opinions. However, existing Med-VQA methods based on joint embedding fail to explain whether their provided results are based on correct reasoning or coincidental answers, which undermines the credibility of VQA answ…

    Submitted 21 June, 2024; originally announced June 2024.

    ACM Class: I.2.7; I.2.10; J.3

  8. arXiv:2406.10569  [pdf, other]

    cs.LG cs.CV

    MDA: An Interpretable Multi-Modal Fusion with Missing Modalities and Intrinsic Noise

    Authors: Lin Fan, Yafei Ou, Cenyang Zheng, Pengyu Dai, Tamotsu Kamishima, Masayuki Ikebe, Kenji Suzuki, Xun Gong

    Abstract: Multi-modal fusion is crucial in medical data research, enabling a comprehensive understanding of diseases and improving diagnostic performance by combining diverse modalities. However, multi-modal fusion faces challenges, including capturing interactions between modalities, addressing missing modalities, handling erroneous modal information, and ensuring interpretability. Many existing researcher…

    Submitted 15 June, 2024; originally announced June 2024.

    ACM Class: I.5.2; I.2.7; I.2.10; J.3

  9. arXiv:2405.14545  [pdf, other]

    q-bio.BM cs.LG

    A Cross-Field Fusion Strategy for Drug-Target Interaction Prediction

    Authors: Hongzhi Zhang, Xiuwen Gong, Shirui Pan, Jia Wu, Bo Du, Wenbin Hu

    Abstract: Drug-target interaction (DTI) prediction is a critical component of the drug discovery process. In the drug development engineering field, predicting novel drug-target interactions is crucial. However, although existing methods have achieved high accuracy levels in predicting known drugs and drug targets, they fail to utilize global protein information during DTI prediction. This leads to…

    Submitted 23 May, 2024; originally announced May 2024.

  10. arXiv:2405.14536  [pdf, other]

    q-bio.MN cs.AI cs.LG

    Regressor-free Molecule Generation to Support Drug Response Prediction

    Authors: Kun Li, Xiuwen Gong, Shirui Pan, Jia Wu, Bo Du, Wenbin Hu

    Abstract: Drug response prediction (DRP) is a crucial phase in drug discovery, and the most important metric for its evaluation is the IC50 score. DRP results are heavily dependent on the quality of the generated molecules. Existing molecule generation methods typically employ classifier-based guidance, enabling sampling within the IC50 classification range. However, these methods fail to ensure the samplin…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 22 pages, 7 figures, 9 tables

  11. arXiv:2405.14294  [pdf, other]

    cs.CV

    Tuning-free Universally-Supervised Semantic Segmentation

    Authors: Xiaobo Yang, Xiaojin Gong

    Abstract: This work presents a tuning-free semantic segmentation framework based on classifying SAM masks by CLIP, which is universally applicable to various types of supervision. Initially, we utilize CLIP's zero-shot classification ability to generate pseudo-labels or perform open-vocabulary segmentation. However, the misalignment between mask and CLIP text embeddings leads to suboptimal results. To addre…

    Submitted 23 May, 2024; originally announced May 2024.

  12. arXiv:2405.05130  [pdf, other]

    cs.CV cs.MM

    Multi-scale Bottleneck Transformer for Weakly Supervised Multimodal Violence Detection

    Authors: Shengyang Sun, Xiaojin Gong

    Abstract: Weakly supervised multimodal violence detection aims to learn a violence detection model by leveraging multiple modalities such as RGB, optical flow, and audio, while only video-level annotations are available. In the pursuit of effective multimodal violence detection (MVD), information redundancy, modality imbalance, and modality asynchrony are identified as three key challenges. In this work, we…

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: Accepted by ICME 2024

  13. arXiv:2405.02288  [pdf, other]

    cs.CV cs.AI cs.RO

    Prospective Role of Foundation Models in Advancing Autonomous Vehicles

    Authors: Jianhua Wu, Bingzhao Gao, Jincheng Gao, Jianhao Yu, Hongqing Chu, Qiankun Yu, Xun Gong, Yi Chang, H. Eric Tseng, Hong Chen, Jie Chen

    Abstract: With the development of artificial intelligence and breakthroughs in deep learning, large-scale Foundation Models (FMs), such as GPT and Sora, have achieved remarkable results in many fields including natural language processing and computer vision. The application of FMs in autonomous driving holds considerable promise. For example, they can contribute to enhancing scene understanding and reas…

    Submitted 17 May, 2024; v1 submitted 8 December, 2023; originally announced May 2024.

    Comments: 45 pages, 8 figures

  14. arXiv:2404.19582  [pdf, other]

    cs.LG cs.CR

    Leveraging Label Information for Stealthy Data Stealing in Vertical Federated Learning

    Authors: Duanyi Yao, Songze Li, Xueluan Gong, Sizai Hou, Gaoning Pan

    Abstract: We develop DMAVFL, a novel attack strategy that evades current detection mechanisms. The key idea is to integrate a discriminator with an auxiliary classifier that takes full advantage of the label information (which was completely ignored in previous attacks): on one hand, label information helps to better characterize embeddings of samples from distinct classes, yielding an improved reconstructio…

    Submitted 30 April, 2024; originally announced April 2024.

  15. arXiv:2404.13830  [pdf, other]

    cs.CV

    A Comprehensive Survey and Taxonomy on Point Cloud Registration Based on Deep Learning

    Authors: Yu-Xin Zhang, Jie Gui, Xiaofeng Cong, Xin Gong, Wenbing Tao

    Abstract: Point cloud registration (PCR) involves determining a rigid transformation that aligns one point cloud to another. Despite the plethora of outstanding deep learning (DL)-based registration methods proposed, comprehensive and systematic studies on DL-based PCR techniques are still lacking. In this paper, we present a comprehensive survey and taxonomy of recently proposed PCR methods. Firstly, we co…

    Submitted 4 July, 2024; v1 submitted 21 April, 2024; originally announced April 2024.

    Comments: This paper is accepted by IJCAI 2024

  16. arXiv:2404.11118  [pdf, other]

    cs.CV

    MHLR: Moving Haar Learning Rate Scheduler for Large-scale Face Recognition Training with One GPU

    Authors: Xueyuan Gong, Yain-whar Si, Zheng Zhang, Xiaochen Yuan, Ke Wang, Xinyuan Zhang, Cong Lin, Xiaoxiang Liu

    Abstract: Face recognition (FR) has seen significant advancements due to the utilization of large-scale datasets. Training deep FR models on large-scale datasets with multiple GPUs is now a common practice. In fact, computing power has evolved into a foundational and indispensable resource in the area of deep learning. It is nearly impossible to train a deep FR model without holding adequate hardware resour…

    Submitted 17 April, 2024; originally announced April 2024.

  17. arXiv:2404.02460  [pdf, other]

    cs.CV cs.AI

    TSNet: A Two-stage Network for Image Dehazing with Multi-scale Fusion and Adaptive Learning

    Authors: Xiaolin Gong, Zehan Zheng, Heyuan Du

    Abstract: Image dehazing has been a popular topic of research for a long time. Previous deep learning-based image dehazing methods have failed to achieve satisfactory dehazing effects on both synthetic datasets and real-world datasets, exhibiting poor generalization. Moreover, single-stage networks often result in many regions with artifacts and color distortion in output images. To address these issues, th…

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: 12 pages, 10 figures, 7 tables

  18. Advanced Long-Content Speech Recognition With Factorized Neural Transducer

    Authors: Xun Gong, Yu Wu, Jinyu Li, Shujie Liu, Rui Zhao, Xie Chen, Yanmin Qian

    Abstract: In this paper, we propose two novel approaches, which integrate long-content information into the factorized neural transducer (FNT) based architecture in both non-streaming (referred to as LongFNT) and streaming (referred to as SLongFNT) scenarios. We first investigate whether long-content transcriptions can improve the vanilla conformer transducer (C-T) models. Our experiments indicate that th…

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: Accepted by TASLP 2024

    Journal ref: IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 32, pp. 1803-1815, 2024

  19. arXiv:2403.03217  [pdf, other]

    cs.CV

    Self-supervised 3D Patient Modeling with Multi-modal Attentive Fusion

    Authors: Meng Zheng, Benjamin Planche, Xuan Gong, Fan Yang, Terrence Chen, Ziyan Wu

    Abstract: 3D patient body modeling is critical to the success of automated patient positioning for smart medical scanning and operating rooms. Existing CNN-based end-to-end patient modeling solutions typically require a) customized network designs demanding a large amount of relevant training data, covering extensive realistic clinical scenarios (e.g., patient covered by sheets), which leads to suboptimal gen…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: MICCAI 2022

  20. arXiv:2403.01619  [pdf, other]

    cs.CV cs.GR

    Spectrum AUC Difference (SAUCD): Human-aligned 3D Shape Evaluation

    Authors: Tianyu Luan, Zhong Li, Lele Chen, Xuan Gong, Lichang Chen, Yi Xu, Junsong Yuan

    Abstract: Existing 3D mesh shape evaluation metrics mainly focus on the overall shape but are usually less sensitive to local details. This makes them inconsistent with human evaluation, as human perception cares about both overall and detailed shape. In this paper, we propose an analytic metric named Spectrum Area Under the Curve Difference (SAUCD) that demonstrates better consistency with human evaluation…

    Submitted 3 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024. Project page: https://bit.ly/saucd

  21. arXiv:2402.09251  [pdf]

    physics.comp-ph cond-mat.mtrl-sci cs.AI

    Universal Machine Learning Kohn-Sham Hamiltonian for Materials

    Authors: Yang Zhong, Hongyu Yu, Jihui Yang, Xingyu Guo, Hongjun Xiang, Xingao Gong

    Abstract: While density functional theory (DFT) serves as a prevalent computational approach in electronic structure calculations, its computational demands and scalability limitations persist. Recently, leveraging neural networks to parameterize the Kohn-Sham DFT Hamiltonian has emerged as a promising avenue for accelerating electronic structure computations. Despite advancements, challenges such as the ne…

    Submitted 15 April, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

    Comments: 20 pages, 9 figures

    Journal ref: Chin. Phys. Lett. 41, 077103 (2024)

  22. arXiv:2402.07631  [pdf, other]

    cs.SI cond-mat.dis-nn cond-mat.stat-mech nlin.AO physics.data-an

    Higher-order Connection Laplacians for Directed Simplicial Complexes

    Authors: Xue Gong, Desmond J. Higham, Konstantinos Zygalakis, Ginestra Bianconi

    Abstract: Higher-order networks encode the many-body interactions existing in complex systems, such as the brain, protein complexes, and social interactions. Simplicial complexes are higher-order networks that allow a comprehensive investigation of the interplay between topology and dynamics. However, simplicial complexes have the limitation that they only capture undirected higher-order interactions while…

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: 34 pages, 13 figures

  23. arXiv:2401.15942  [pdf, other]

    cs.CV

    Generating Multi-Center Classifier via Conditional Gaussian Distribution

    Authors: Zhemin Zhang, Xun Gong

    Abstract: The linear classifier is widely used in various image classification tasks. It works by optimizing the distance between a sample and its corresponding class center. However, in real-world data, one class can contain several local clusters, e.g., birds of different poses. To address this complexity, we propose a novel multi-center classifier. Different from the vanilla linear classifier, our propos…

    Submitted 29 January, 2024; originally announced January 2024.

  24. arXiv:2401.09587  [pdf, other]

    cs.LG math.OC

    Bilevel Optimization under Unbounded Smoothness: A New Algorithm and Convergence Analysis

    Authors: Jie Hao, Xiaochuan Gong, Mingrui Liu

    Abstract: Bilevel optimization is an important formulation for many machine learning problems. Current bilevel optimization algorithms assume that the gradient of the upper-level function is Lipschitz. However, recent studies reveal that certain neural networks such as recurrent neural networks (RNNs) and long short-term memory networks (LSTMs) exhibit potential unbounded smoothness, rendering conventional…

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: Accepted by ICLR 2024, Spotlight

  25. arXiv:2401.05899  [pdf, other]

    cs.LG

    Optimistic Model Rollouts for Pessimistic Offline Policy Optimization

    Authors: Yuanzhao Zhai, Yiying Li, Zijian Gao, Xudong Gong, Kele Xu, Dawei Feng, Ding Bo, Huaimin Wang

    Abstract: Model-based offline reinforcement learning (RL) has made remarkable progress, offering a promising avenue for improving generalization with synthetic model rollouts. Existing works primarily focus on incorporating pessimism for policy optimization, usually via constructing a Pessimistic Markov Decision Process (P-MDP). However, the P-MDP discourages the policies from learning in out-of-distributio…

    Submitted 11 January, 2024; originally announced January 2024.

  26. arXiv:2312.14478  [pdf, other]

    cs.LG

    Federated Learning via Input-Output Collaborative Distillation

    Authors: Xuan Gong, Shanglin Li, Yuxiang Bao, Barry Yao, Yawen Huang, Ziyan Wu, Baochang Zhang, Yefeng Zheng, David Doermann

    Abstract: Federated learning (FL) is a machine learning paradigm in which distributed local nodes collaboratively train a central model without sharing individually held private data. Existing FL methods either iteratively share local model parameters or deploy co-distillation. However, the former is highly susceptible to private data leakage, and the latter design relies on the prerequisites of task-releva…

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: Accepted at AAAI 2024

  27. arXiv:2312.05281  [pdf, other]

    cs.CV

    X2-Softmax: Margin Adaptive Loss Function for Face Recognition

    Authors: Jiamu Xu, Xiaoxiang Liu, Xinyuan Zhang, Yain-Whar Si, Xiaofan Li, Zheng Shi, Ke Wang, Xueyuan Gong

    Abstract: Learning the discriminative features of different faces is an important task in face recognition. By extracting face features in neural networks, it becomes easy to measure the similarity of different face images, which makes face recognition possible. To enhance the neural network's face feature separability, incorporating an angular margin during training is common practice. State-of-the-art los…

    Submitted 19 December, 2023; v1 submitted 8 December, 2023; originally announced December 2023.

  28. arXiv:2312.03585  [pdf, other]

    cs.CV cs.AI

    Foundation Model Assisted Weakly Supervised Semantic Segmentation

    Authors: Xiaobo Yang, Xiaojin Gong

    Abstract: This work aims to leverage pre-trained foundation models, such as contrastive language-image pre-training (CLIP) and segment anything model (SAM), to address weakly supervised semantic segmentation (WSSS) using image-level labels. To this end, we propose a coarse-to-fine framework based on CLIP and SAM for generating high-quality segmentation seeds. Specifically, we construct an image classificati…

    Submitted 10 December, 2023; v1 submitted 6 December, 2023; originally announced December 2023.

  29. arXiv:2311.07582  [pdf]

    cs.CL cs.AI

    Evaluating the Potential of Leading Large Language Models in Reasoning Biology Questions

    Authors: Xinyu Gong, Jason Holmes, Yiwei Li, Zhengliang Liu, Qi Gan, Zihao Wu, Jianli Zhang, Yusong Zou, Yuxi Teng, Tian Jiang, Hongtu Zhu, Wei Liu, Tianming Liu, Yajun Yan

    Abstract: Recent advances in Large Language Models (LLMs) have presented new opportunities for integrating Artificial General Intelligence (AGI) into biological research and education. This study evaluated the capabilities of leading LLMs, including GPT-4, GPT-3.5, PaLM2, Claude2, and SenseNova, in answering conceptual biology questions. The models were tested on a 108-question multiple-choice exam covering…

    Submitted 4 November, 2023; originally announced November 2023.

  30. arXiv:2311.05988  [pdf, other]

    cs.CV

    Vision Big Bird: Random Sparsification for Full Attention

    Authors: Zhemin Zhang, Xun Gong

    Abstract: Recently, Transformers have shown promising performance in various vision tasks. However, the high costs of global self-attention remain challenging for Transformers, especially for high-resolution vision tasks. Inspired by Big Bird, one of the most successful transformer-based models for NLP, we propose a novel sparse attention mechanism for Vision Transformers (ViT). Specifically, we separate t…

    Submitted 10 November, 2023; originally announced November 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2304.06250

  31. arXiv:2311.01155  [pdf, other]

    cs.CV

    Learning Intra and Inter-Camera Invariance for Isolated Camera Supervised Person Re-identification

    Authors: Menglin Wang, Xiaojin Gong

    Abstract: Supervised person re-identification assumes that a person has images captured under multiple cameras. However, when cameras are placed far apart, a person rarely appears in more than one camera. This paper thus studies person re-ID under such an isolated camera supervised (ISCS) setting. Instead of trying to generate fake cross-camera features like previous methods, we explore a novel perspective by…

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: ACM MultiMedia 2023

  32. arXiv:2310.17218  [pdf, other]

    cs.CV

    Prototypical Contrastive Learning-based CLIP Fine-tuning for Object Re-identification

    Authors: Jiachen Li, Xiaojin Gong

    Abstract: This work aims to adapt large-scale pre-trained vision-language models, such as contrastive language-image pretraining (CLIP), to enhance the performance of object reidentification (Re-ID) across various supervision settings. Although prompt learning has enabled a recent work named CLIP-ReID to achieve promising performance, the underlying mechanisms and the necessity of prompt learning remain unc…

    Submitted 26 October, 2023; originally announced October 2023.

  33. arXiv:2310.16654  [pdf, other]

    cs.CL

    ChatGPT is a Potential Zero-Shot Dependency Parser

    Authors: Boda Lin, Xinyi Zhou, Binghao Tang, Xiaocheng Gong, Si Li

    Abstract: Pre-trained language models have been widely used in the dependency parsing task and have achieved significant improvements in parser performance. However, it remains an understudied question whether pre-trained language models can spontaneously exhibit the ability of dependency parsing without introducing additional parser structure in the zero-shot scenario. In this paper, we propose to explore the…

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 10 pages

  34. arXiv:2310.14457  [pdf, other]

    cs.RO cs.AI physics.data-an

    A generalized likelihood-weighted optimal sampling algorithm for rare-event probability quantification

    Authors: Xianliang Gong, Yulin Pan

    Abstract: In this work, we introduce a new acquisition function for sequential sampling to efficiently quantify rare-event statistics of an input-to-response (ItR) system with given input probability and expensive function evaluations. Our acquisition is a generalization of the likelihood-weighted (LW) acquisition that was initially designed for the same purpose and then extended to many other applications.…

    Submitted 22 October, 2023; originally announced October 2023.

  35. arXiv:2310.10487  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    Type-aware Decoding via Explicitly Aggregating Event Information for Document-level Event Extraction

    Authors: Gang Zhao, Yidong Shi, Shudong Lu, Xinjie Yang, Guanting Dong, Jian Xu, Xiaocheng Gong, Si Li

    Abstract: Document-level event extraction (DEE) faces two main challenges: arguments-scattering and multi-event. Although previous methods attempt to address these challenges, they overlook the interference of event-unrelated sentences during event detection and neglect the mutual interference of different event roles during argument extraction. Therefore, this paper proposes a novel Schema-based Explicitly…

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Submitted to ICASSP 2024

  36. arXiv:2310.10481  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    DemoSG: Demonstration-enhanced Schema-guided Generation for Low-resource Event Extraction

    Authors: Gang Zhao, Xiaocheng Gong, Xinjie Yang, Guanting Dong, Shudong Lu, Si Li

    Abstract: Most current Event Extraction (EE) methods focus on the high-resource scenario, which requires a large amount of annotated data and can hardly be applied to low-resource domains. To address EE more effectively with limited resources, we propose the Demonstration-enhanced Schema-guided Generation (DemoSG) model, which benefits low-resource EE from two aspects: Firstly, we propose the demonstration-…

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted by Findings of EMNLP2023

  37. arXiv:2310.07182  [pdf, other]

    cs.GR

    Generate Coherent Rays Directly

    Authors: Fengqi Liu, Zaonan Tan, Weilai Xiang, Chenhao Lu, Dan Li, Xu Gong, Yulong Shi, Songnan Shi, Qilong Kou, Bo Hu

    Abstract: The path tracing method generates incoherent rays by randomly sampling directions. This randomness makes it unsuitable for modern processor architectures that rely on coherence to achieve optimal performance. Many efforts have been made to address this issue by reordering rays based on their origin, end, or direction to enhance coherence. However, a drawback of reordering methods is the need to en…

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: 8 pages

  38. arXiv:2310.00029   

    cs.AI cs.GT cs.LG cs.RO

    Adversarial Driving Behavior Generation Incorporating Human Risk Cognition for Autonomous Vehicle Evaluation

    Authors: Zhen Liu, Hang Gao, Hao Ma, Shuo Cai, Yunfeng Hu, Ting Qu, Hong Chen, Xun Gong

    Abstract: Autonomous vehicle (AV) evaluation has been the subject of increased interest in recent years both in industry and in academia. This paper focuses on the development of a novel framework for generating adversarial driving behavior of a background vehicle interfering with the AV to expose effective and rational risky events. Specifically, the adversarial behavior is learned by a reinforcement lear…

    Submitted 14 October, 2023; v1 submitted 29 September, 2023; originally announced October 2023.

    Comments: We found an expression error in Section III.A. A corrected version will be provided.

  39. arXiv:2309.13853  [pdf, other]

    cs.ET

    A Ferroelectric Compute-in-Memory Annealer for Combinatorial Optimization Problems

    Authors: Xunzhao Yin, Yu Qian, Alptekin Vardar, Marcel Gunther, Franz Muller, Nellie Laleni, Zijian Zhao, Zhouhang Jiang, Zhiguo Shi, Yiyu Shi, Xiao Gong, Cheng Zhuo, Thomas Kampfe, Kai Ni

    Abstract: Computationally hard combinatorial optimization problems (COPs) are ubiquitous in many applications, including logistical planning, resource allocation, chip design, drug explorations, and more. Due to their critical significance and the inability of conventional hardware to efficiently handle scaled COPs, there is a growing interest in developing computing hardware tailored specifically for COP…

    Submitted 24 September, 2023; originally announced September 2023.

    Comments: 39 pages, 12 figures

  40. arXiv:2308.15107  [pdf, ps, other]

    cs.LG

    Stochastic Graph Bandit Learning with Side-Observations

    Authors: Xueping Gong, Jiheng Zhang

    Abstract: In this paper, we investigate the stochastic contextual bandit with general function space and graph feedback. We propose an algorithm that addresses this problem by adapting to both the underlying graph structures and reward gaps. To the best of our knowledge, our algorithm is the first to provide a gap-dependent upper bound in this stochastic setting, bridging the research gap left by the work i…

    Submitted 6 January, 2024; v1 submitted 29 August, 2023; originally announced August 2023.

    Comments: contextual bandit, graph feedback

  41. arXiv:2308.03572  [pdf, ps, other]

    cs.LG cs.AI

    Provably Efficient Learning in Partially Observable Contextual Bandit

    Authors: Xueping Gong, Jiheng Zhang

    Abstract: In this paper, we investigate transfer learning in partially observable contextual bandits, where agents have limited knowledge from other agents and partial information about hidden confounders. We first convert the problem to identifying or partially identifying causal effects between actions and rewards through optimization problems. To solve these optimization problems, we discretize the origi…

    Submitted 4 September, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

    Comments: 47 pages

  42. arXiv:2307.05383  [pdf]

    eess.SP cs.HC cs.LG

    Human Emotion Recognition Based On Galvanic Skin Response signal Feature Selection and SVM

    Authors: Di Fan, Mingyang Liu, Xiaohan Zhang, Xiaopeng Gong

    Abstract: A novel human emotion recognition method based on automatically selected Galvanic Skin Response (GSR) signal features and SVM is proposed in this paper. GSR signals were acquired by e-Health Sensor Platform V2.0. Then, the data is de-noised by a wavelet function and normalized to remove individual differences. 30 features are extracted from the normalized data; however, directly using of thes…

    Submitted 3 July, 2023; originally announced July 2023.

  43. arXiv:2306.11970  [pdf, other]

    cs.CV cs.GR cs.GT

    RSMT: Real-time Stylized Motion Transition for Characters

    Authors: Xiangjun Tang, Linjun Wu, He Wang, Bo Hu, Xu Gong, Yuchen Liao, Songnan Li, Qilong Kou, Xiaogang Jin

    Abstract: Styled online in-between motion generation has important application scenarios in computer animation and games. Its core challenge lies in the need to satisfy four critical requirements simultaneously: generation speed, motion quality, style diversity, and synthesis controllability. While the first two challenges demand a delicate balance between simple fast models and learning capacity for genera…

    Submitted 20 June, 2023; originally announced June 2023.

    Journal ref: SIGGRAPH 2023 Conference Proceedings

  44. arXiv:2306.06209  [pdf, other]

    cs.CV cs.CR cs.LG

    Backdoor Attack with Sparse and Invisible Trigger

    Authors: Yinghua Gao, Yiming Li, Xueluan Gong, Zhifeng Li, Shu-Tao Xia, Qian Wang

    Abstract: Deep neural networks (DNNs) are vulnerable to backdoor attacks, where the adversary manipulates a small portion of training data such that the victim model predicts normally on the benign samples but classifies the triggered samples as the target class. The backdoor attack is an emerging yet threatening training-phase threat, leading to serious risks in DNN-based applications. In this paper, we re…

    Submitted 5 June, 2024; v1 submitted 11 May, 2023; originally announced June 2023.

    Comments: This paper was accepted by IEEE Transactions on Information Forensics and Security (TIFS). The first two authors contributed equally to this work. 14 pages

  45. arXiv:2306.05246  [pdf, other]

    cs.CV cs.GR

    A Task-driven Network for Mesh Classification and Semantic Part Segmentation

    Authors: Qiujie Dong, Xiaoran Gong, Rui Xu, Zixiong Wang, Shuangmin Chen, Shiqing Xin, Changhe Tu, Wenping Wang

    Abstract: With the rapid development of geometric deep learning techniques, many mesh-based convolutional operators have been proposed to bridge irregular mesh structures and popular backbone networks. In this paper, we show that while convolutions are helpful, a simple architecture based exclusively on multi-layer perceptrons (MLPs) is competent enough to deal with mesh classification and semantic segmenta…

    Submitted 28 December, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: 10 pages

  46. arXiv:2306.04644   

    cs.CV cs.AI

    Decom--CAM: Tell Me What You See, In Details! Feature-Level Interpretation via Decomposition Class Activation Map

    Authors: Yuguang Yang, Runtang Guo, Sheng Wu, Yimi Wang, Juan Zhang, Xuan Gong, Baochang Zhang

    Abstract: Interpretation of deep learning remains a very challenging problem. Although the Class Activation Map (CAM) is widely used to interpret deep model predictions by highlighting object location, it fails to provide insight into the salient features used by the model to make decisions. Furthermore, existing evaluation protocols often overlook the correlation between interpretability performance and th…

    Submitted 29 May, 2024; v1 submitted 27 May, 2023; originally announced June 2023.

    Comments: This version has not included sufficient evidence for its claims

  47. arXiv:2306.01995  [pdf, ps, other]

    cs.LG stat.ML

    Asymptotically Optimal Pure Exploration for Infinite-Armed Bandits

    Authors: Xiao-Yue Gong, Mark Sellke

    Abstract: We study pure exploration with infinitely many bandit arms generated i.i.d. from an unknown distribution. Our goal is to efficiently select a single high quality arm whose average reward is, with probability $1-δ$, within $\varepsilon$ of being among the top $η$-fraction of arms; this is a natural adaptation of the classical PAC guarantee for infinite action sets. We consider both the fixed confid…

    Submitted 3 June, 2023; originally announced June 2023.

  48. arXiv:2306.01863  [pdf, other]

    cs.ET

    Embedding Security into Ferroelectric FET Array via In-Situ Memory Operation

    Authors: Yixin Xu, Yi Xiao, Zijian Zhao, Franz Müller, Alptekin Vardar, Xiao Gong, Sumitha George, Thomas Kämpfe, Vijaykrishnan Narayanan, Kai Ni

    Abstract: Non-volatile memories (NVMs) have the potential to reshape next-generation memory systems because of their promising properties of near-zero leakage power consumption, high density and non-volatility. However, NVMs also face critical security threats that exploit the non-volatile property. Compared to volatile memory, the capability of retaining data even after power down makes NVM more vulnerable…

    Submitted 2 June, 2023; originally announced June 2023.

  49. arXiv:2306.00012  [pdf, other]

    cs.LG cs.AI

    Graph Neural Network for spatiotemporal data: methods and applications

    Authors: Yun Li, Dazhou Yu, Zhenke Liu, Minxing Zhang, Xiaoyun Gong, Liang Zhao

    Abstract: In the era of big data, there has been a surge in the availability of data containing rich spatial and temporal information, offering valuable insights into dynamic systems and processes for applications such as weather forecasting, natural disaster management, intelligent transport systems, and precision agriculture. Graph neural networks (GNNs) have emerged as a powerful tool for modeling and un…

    Submitted 29 May, 2023; originally announced June 2023.

  50. arXiv:2305.13947  [pdf, ps, other]

    eess.SP cs.AI

    Deep-Learning-Aided Alternating Least Squares for Tensor CP Decomposition and Its Application to Massive MIMO Channel Estimation

    Authors: Xiao Gong, Wei Chen, Bo Ai, Geert Leus

    Abstract: CANDECOMP/PARAFAC (CP) decomposition is the mostly used model to formulate the received tensor signal in a multi-domain massive multiple-input multiple-output (MIMO) system, as the receiver generally sums the components from different paths or users. To achieve accurate and low-latency channel estimation, good and fast CP decomposition algorithms are desired. The CP alternating least squares (CPAL…

    Submitted 23 May, 2023; originally announced May 2023.