Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 174 results for author: Pan, W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.02118  [pdf, other

    cs.CL

    Breaking Language Barriers: Cross-Lingual Continual Pre-Training at Scale

    Authors: Wenzhen Zheng, Wenbo Pan, Xu Xu, Libo Qin, Li Yue, Ming Zhou

    Abstract: In recent years, Large Language Models (LLMs) have made significant strides towards Artificial General Intelligence. However, training these models from scratch requires substantial computational resources and vast amounts of text data. In this paper, we explore an alternative approach to constructing an LLM for a new language by continually pretraining (CPT) from existing pretrained LLMs, instead… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 8 pages

  2. arXiv:2406.19767  [pdf, other

    cs.IT eess.SP

    Subgraph Matching via Partial Optimal Transport

    Authors: Wen-Xin Pan, Isabel Haasler, Pascal Frossard

    Abstract: In this work, we propose a novel approach for subgraph matching, the problem of finding a given query graph in a large source graph, based on the fused Gromov-Wasserstein distance. We formulate the subgraph matching problem as a partial fused Gromov-Wasserstein problem, which allows us to build on existing theory and computational methods in order to solve this challenging problem. We extend our m… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  3. arXiv:2406.19593  [pdf, other

    cs.CL cs.CV

    SK-VQA: Synthetic Knowledge Generation at Scale for Training Context-Augmented Multimodal LLMs

    Authors: Xin Su, Man Luo, Kris W Pan, Tien Pei Chou, Vasudev Lal, Phillip Howard

    Abstract: Synthetic data generation has gained significant attention recently for its utility in training large vision and language models. However, the application of synthetic data to the training of multimodal context-augmented generation systems has been relatively unexplored. This gap in existing work is important because existing vision and language models (VLMs) are not trained specifically for conte… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  4. arXiv:2406.19247  [pdf, other

    cs.CV

    Local Manifold Learning for No-Reference Image Quality Assessment

    Authors: Timin Gao, Wensheng Pan, Yan Zhang, Sicheng Zhao, Shengchuan Zhang, Xiawu Zheng, Ke Li, Liujuan Cao, Rongrong Ji

    Abstract: Contrastive learning has considerably advanced the field of Image Quality Assessment (IQA), emerging as a widely adopted technique. The core mechanism of contrastive learning involves minimizing the distance between quality-similar (positive) examples while maximizing the distance between quality-dissimilar (negative) examples. Despite its successes, current contrastive learning methods often negl… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  5. arXiv:2406.12215  [pdf, other

    math.NA cs.GR math.OC

    Discrete Variable Topology Optimization Using Multi-Cut Formulation and Adaptive Trust Regions

    Authors: Zisheng Ye, Wenxiao Pan

    Abstract: We present a new framework for solving general topology optimization (TO) problems that find an optimal material distribution within a design space to maximize the performance of a structure while satisfying design constraints. These problems involve state variables that nonlinearly depend on the design variables, with objective functions that can be convex or non-convex, and may include multiple… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  6. arXiv:2406.10744   

    cs.CV

    Technique Report of CVPR 2024 PBDL Challenges

    Authors: Ying Fu, Yu Li, Shaodi You, Boxin Shi, Jose Alvarez, Coert van Gemeren, Linwei Chen, Yunhao Zou, Zichun Wang, Yichen Li, Yuze Han, Yingkai Zhang, Jianan Wang, Qinglin Liu, Wei Yu, Xiaoqian Lv, Jianing Li, Shengping Zhang, Xiangyang Ji, Yuanpei Chen, Yuhan Zhang, Weihang Peng, Liwen Zhang, Zhe Xu, Dingyong Gou , et al. (77 additional authors not shown)

    Abstract: The intersection of physics-based vision and deep learning presents an exciting frontier for advancing computer vision technologies. By leveraging the principles of physics to inform and enhance deep learning models, we can develop more robust and accurate vision systems. Physics-based vision aims to invert the processes to recover scene properties such as shape, reflectance, light distribution, a… ▽ More

    Submitted 27 June, 2024; v1 submitted 15 June, 2024; originally announced June 2024.

    Comments: The author list and contents need to be verified by all authors

  7. arXiv:2406.00116  [pdf, other

    cs.HC cs.LG

    A Sim2Real Approach for Identifying Task-Relevant Properties in Interpretable Machine Learning

    Authors: Eura Nofshin, Esther Brown, Brian Lim, Weiwei Pan, Finale Doshi-Velez

    Abstract: Existing user studies suggest that different tasks may require explanations with different properties. However, user studies are expensive. In this paper, we introduce a generalizable, cost-effective method for identifying task-relevant explanation properties in silico, which can guide the design of more expensive user studies. We use our approach to identify relevant proxies for three example tas… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

  8. arXiv:2405.20195  [pdf, other

    cs.HC

    Using Large Language Models for Humanitarian Frontline Negotiation: Opportunities and Considerations

    Authors: Zilin Ma, Susannah, Su, Nathan Zhao, Linn Bieske, Blake Bullwinkel, Yanyi Zhang, Sophia, Yang, Ziqing Luo, Siyao Li, Gekai Liao, Boxiang Wang, Jinglun Gao, Zihan Wen, Claude Bruderlein, Weiwei Pan

    Abstract: Humanitarian negotiations in conflict zones, called \emph{frontline negotiation}, are often highly adversarial, complex, and high-risk. Several best-practices have emerged over the years that help negotiators extract insights from large datasets to navigate nuanced and rapidly evolving scenarios. Recent advances in large language models (LLMs) have sparked interest in the potential for AI to aid d… ▽ More

    Submitted 30 May, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

  9. arXiv:2405.19909  [pdf, other

    cs.LG cs.AI cs.RO

    Adaptive Advantage-Guided Policy Regularization for Offline Reinforcement Learning

    Authors: Tenglong Liu, Yang Li, Yixing Lan, Hao Gao, Wei Pan, Xin Xu

    Abstract: In offline reinforcement learning, the challenge of out-of-distribution (OOD) is pronounced. To address this, existing methods often constrain the learned policy through policy regularization. However, these methods often suffer from the issue of unnecessary conservativeness, hampering policy improvement. This occurs due to the indiscriminate use of all actions from the behavior policy that genera… ▽ More

    Submitted 1 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

    Comments: ICML 2024, 19 pages

  10. arXiv:2405.18194  [pdf, other

    cs.LG cs.CR

    Delving into Differentially Private Transformer

    Authors: Youlong Ding, Xueyang Wu, Yining Meng, Yonggang Luo, Hao Wang, Weike Pan

    Abstract: Deep learning with differential privacy (DP) has garnered significant attention over the past years, leading to the development of numerous methods aimed at enhancing model accuracy and training efficiency. This paper delves into the problem of training Transformer models with differential privacy. Our treatment is modular: the logic is to `reduce' the problem of training DP Transformer to the mor… ▽ More

    Submitted 29 May, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: ICML 2024

  11. arXiv:2405.17250  [pdf, ps, other

    cs.RO eess.SY

    "Pass the butter": A study on desktop-classic multitasking robotic arm based on advanced YOLOv7 and BERT

    Authors: Haohua Que, Wenbin Pan, Jie Xu, Hao Luo, Pei Wang, Li Zhang

    Abstract: In recent years, various intelligent autonomous robots have begun to appear in daily life and production. Desktop-level robots are characterized by their flexible deployment, rapid response, and suitability for light workload environments. In order to meet the current societal demand for service robot technology, this study proposes using a miniaturized desktop-level robot (by ROS) as a carrier, l… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  12. arXiv:2405.16413  [pdf, other

    cs.AI cs.CL cs.LG stat.AP

    Augmented Risk Prediction for the Onset of Alzheimer's Disease from Electronic Health Records with Large Language Models

    Authors: Jiankun Wang, Sumyeong Ahn, Taykhoom Dalal, Xiaodan Zhang, Weishen Pan, Qiannan Zhang, Bin Chen, Hiroko H. Dodge, Fei Wang, Jiayu Zhou

    Abstract: Alzheimer's disease (AD) is the fifth-leading cause of death among Americans aged 65 and older. Screening and early detection of AD and related dementias (ADRD) are critical for timely intervention and for identifying clinical trial participants. The widespread adoption of electronic health records (EHRs) offers an important resource for developing ADRD screening tools such as machine learning bas… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  13. arXiv:2405.11280  [pdf, other

    cs.LG

    Joint Analysis of Single-Cell Data across Cohorts with Missing Modalities

    Authors: Marianne Arriola, Weishen Pan, Manqi Zhou, Qiannan Zhang, Chang Su, Fei Wang

    Abstract: Joint analysis of multi-omic single-cell data across cohorts has significantly enhanced the comprehensive analysis of cellular processes. However, most of the existing approaches for this purpose require access to samples with complete modality availability, which is impractical in many real-world scenarios. In this paper, we propose (Single-Cell Cross-Cohort Cross-Category) integration, a novel f… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

    Comments: 10 pages, 7 figures, 5 tables

  14. arXiv:2404.14949  [pdf, other

    cs.CV

    Multi-Modal Prompt Learning on Blind Image Quality Assessment

    Authors: Wensheng Pan, Timin Gao, Yan Zhang, Runze Hu, Xiawu Zheng, Enwei Zhang, Yuting Gao, Yutao Liu, Yunhang Shen, Ke Li, Shengchuan Zhang, Liujuan Cao, Rongrong Ji

    Abstract: Image Quality Assessment (IQA) models benefit significantly from semantic information, which allows them to treat different types of objects distinctly. Currently, leveraging semantic information to enhance IQA is a crucial research direction. Traditional methods, hindered by a lack of sufficiently annotated data, have employed the CLIP image-text pretraining model as their backbone to gain semant… ▽ More

    Submitted 18 May, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

  15. arXiv:2404.11624  [pdf, ps, other

    math.GM cs.LG

    Token Space: A Category Theory Framework for AI Computations

    Authors: Wuming Pan

    Abstract: This paper introduces the Token Space framework, a novel mathematical construct designed to enhance the interpretability and effectiveness of deep learning models through the application of category theory. By establishing a categorical structure at the Token level, we provide a new lens through which AI computations can be understood, emphasizing the relationships between tokens, such as grouping… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 42 pages,5 tables

    MSC Class: I.2.6

  16. arXiv:2403.08941  [pdf, other

    stat.ML cs.LG

    Towards Model-Agnostic Posterior Approximation for Fast and Accurate Variational Autoencoders

    Authors: Yaniv Yacoby, Weiwei Pan, Finale Doshi-Velez

    Abstract: Inference for Variational Autoencoders (VAEs) consists of learning two models: (1) a generative model, which transforms a simple distribution over a latent space into the distribution over observed data, and (2) an inference model, which approximates the posterior of the latent codes given data. The two components are learned jointly via a lower bound to the generative model's log marginal likelih… ▽ More

    Submitted 12 June, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: Accepted at the Workshop at the 6th Symposium on Advances in Approximate Bayesian Inference (AABI) 2024

  17. arXiv:2403.01977  [pdf, other

    cs.RO cs.AI cs.CV

    TTA-Nav: Test-time Adaptive Reconstruction for Point-Goal Navigation under Visual Corruptions

    Authors: Maytus Piriyajitakonkij, Mingfei Sun, Mengmi Zhang, Wei Pan

    Abstract: Robot navigation under visual corruption presents a formidable challenge. To address this, we propose a Test-time Adaptation (TTA) method, named as TTA-Nav, for point-goal navigation under visual corruptions. Our "plug-and-play" method incorporates a top-down decoder to a pre-trained navigation model. Firstly, the pre-trained navigation model gets a corrupted image and extracts features. Secondly,… ▽ More

    Submitted 14 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: Submitted to IROS2024

  18. arXiv:2402.18784  [pdf, other

    cs.AI q-bio.NC

    Brain-inspired and Self-based Artificial Intelligence

    Authors: Yi Zeng, Feifei Zhao, Yuxuan Zhao, Dongcheng Zhao, Enmeng Lu, Qian Zhang, Yuwei Wang, Hui Feng, Zhuoya Zhao, Jihang Wang, Qingqun Kong, Yinqian Sun, Yang Li, Guobin Shen, Bing Han, Yiting Dong, Wenxuan Pan, Xiang He, Aorigele Bao, Jin Wang

    Abstract: The question "Can machines think?" and the Turing Test to assess whether machines could achieve human-level intelligence is one of the roots of AI. With the philosophical argument "I think, therefore I am", this paper challenge the idea of a "thinking machine" supported by current AIs since there is no sense of self in them. Current artificial intelligence is only seemingly intelligent information… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  19. arXiv:2402.17375  [pdf, other

    eess.SY cs.LG

    Impact of Computation in Integral Reinforcement Learning for Continuous-Time Control

    Authors: Wenhan Cao, Wei Pan

    Abstract: Integral reinforcement learning (IntRL) demands the precise computation of the utility function's integral at its policy evaluation (PEV) stage. This is achieved through quadrature rules, which are weighted sums of utility functions evaluated from state samples obtained in discrete time. Our research reveals a critical yet underexplored phenomenon: the choice of the computational method -- in this… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  20. arXiv:2402.15259  [pdf, other

    cs.MA cs.LG

    Open Ad Hoc Teamwork with Cooperative Game Theory

    Authors: Jianhong Wang, Yang Li, Yuan Zhang, Wei Pan, Samuel Kaski

    Abstract: Ad hoc teamwork poses a challenging problem, requiring the design of an agent to collaborate with teammates without prior coordination or joint training. Open ad hoc teamwork (OAHT) further complicates this challenge by considering environments with a changing number of teammates, referred to as open teams. One promising solution in practice to this problem is leveraging the generalizability of gr… ▽ More

    Submitted 10 June, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: Published at ICML 2024, 29 pages

  21. arXiv:2402.12733  [pdf, other

    cs.IR cs.AI

    BMLP: Behavior-aware MLP for Heterogeneous Sequential Recommendation

    Authors: Weixin Li, Yuhao Wu, Yang Liu, Weike Pan, Zhong Ming

    Abstract: In real recommendation scenarios, users often have different types of behaviors, such as clicking and buying. Existing research methods show that it is possible to capture the heterogeneous interests of users through different types of behaviors. However, most multi-behavior approaches have limitations in learning the relationship between different behaviors. In this paper, we propose a novel mult… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  22. arXiv:2402.12416  [pdf, other

    cs.MA cs.AI

    Aligning Individual and Collective Objectives in Multi-Agent Cooperation

    Authors: Yang Li, Wenhao Zhang, Jianhong Wang, Shao Zhang, Yali Du, Ying Wen, Wei Pan

    Abstract: Among the research topics in multi-agent learning, mixed-motive cooperation is one of the most prominent challenges, primarily due to the mismatch between individual and collective goals. The cutting-edge research is focused on incorporating domain knowledge into rewards and introducing additional mechanisms to incentivize cooperation. However, these approaches often face shortcomings such as the… ▽ More

    Submitted 22 May, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: 19 pages

  23. arXiv:2401.15369  [pdf, other

    cs.IR

    Privacy-Preserving Cross-Domain Sequential Recommendation

    Authors: Zhaohao Lin, Weike Pan, Zhong Ming

    Abstract: Cross-domain sequential recommendation is an important development direction of recommender systems. It combines the characteristics of sequential recommender systems and cross-domain recommender systems, which can capture the dynamic preferences of users and alleviate the problem of cold-start users. However, in recent years, people pay more and more attention to their privacy. They do not want o… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

  24. arXiv:2401.14923  [pdf, other

    cs.AI cs.LG

    Reinforcement Learning Interventions on Boundedly Rational Human Agents in Frictionful Tasks

    Authors: Eura Nofshin, Siddharth Swaroop, Weiwei Pan, Susan Murphy, Finale Doshi-Velez

    Abstract: Many important behavior changes are frictionful; they require individuals to expend effort over a long period with little immediate gratification. Here, an artificial intelligence (AI) agent can provide personalized interventions to help individuals stick to their goals. In these settings, the AI agent must personalize rapidly (before the individual disengages) and interpretably, to help us unders… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: In AAMAS 2024

  25. arXiv:2401.12596  [pdf, other

    cs.CV

    UniHDA: A Unified and Versatile Framework for Multi-Modal Hybrid Domain Adaptation

    Authors: Hengjia Li, Yang Liu, Yuqi Lin, Zhanwei Zhang, Yibo Zhao, weihang Pan, Tu Zheng, Zheng Yang, Yuchun Jiang, Boxi Wu, Deng Cai

    Abstract: Recently, generative domain adaptation has achieved remarkable progress, enabling us to adapt a pre-trained generator to a new target domain. However, existing methods simply adapt the generator to a single target domain and are limited to a single modality, either text-driven or image-driven. Moreover, they cannot maintain well consistency with the source domain, which impedes the inheritance of… ▽ More

    Submitted 15 March, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

  26. arXiv:2401.04971  [pdf, other

    cs.IR

    A Survey on Cross-Domain Sequential Recommendation

    Authors: Shu Chen, Zitao Xu, Weike Pan, Qiang Yang, Zhong Ming

    Abstract: Cross-domain sequential recommendation (CDSR) shifts the modeling of user preferences from flat to stereoscopic by integrating and learning interaction information from multiple domains at different granularities (ranging from inter-sequence to intra-sequence and from single-domain to cross-domain). In this survey, we first define the CDSR problem using a four-dimensional tensor and then analyze i… ▽ More

    Submitted 17 May, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: Accepted to the IJCAI 2024 Survey Track

  27. arXiv:2401.03676  [pdf, other

    cs.SE cs.AI

    Assessing AI Detectors in Identifying AI-Generated Code: Implications for Education

    Authors: Wei Hung Pan, Ming Jie Chok, Jonathan Leong Shan Wong, Yung Xin Shin, Yeong Shian Poon, Zhou Yang, Chun Yong Chong, David Lo, Mei Kuan Lim

    Abstract: Educators are increasingly concerned about the usage of Large Language Models (LLMs) such as ChatGPT in programming education, particularly regarding the potential exploitation of imperfections in Artificial Intelligence Generated Content (AIGC) Detectors for academic misconduct. In this paper, we present an empirical study where the LLM is examined for its attempts to bypass detection by AIGC Det… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: 11 pages, paper accepted at 46th International Conference on Software Engineering, Software Engineering Education and Training Track (ICSE-SEET 2024)

  28. arXiv:2312.15248  [pdf, other

    physics.soc-ph cs.DM math.CO

    Type-II Apollonian Model

    Authors: Fei Ma, Jinzhi Ouyang, Ping Wang, Haobin Shi, Wei Pan

    Abstract: The family of planar graphs is a particularly important family and models many real-world networks. In this paper, we propose a principled framework based on the widely-known Apollonian packing process to generate new planar network, i.e., Type-II Apollonian network $\mathcal{A}_{t}$. The manipulation is different from that of the typical Apollonian network, and is proceeded in terms of the iterat… ▽ More

    Submitted 23 December, 2023; originally announced December 2023.

  29. arXiv:2312.08631  [pdf, other

    cs.CV

    Semi-supervised Semantic Segmentation Meets Masked Modeling:Fine-grained Locality Learning Matters in Consistency Regularization

    Authors: Wentao Pan, Zhe Xu, Jiangpeng Yan, Zihan Wu, Raymond Kai-yu Tong, Xiu Li, Jianhua Yao

    Abstract: Semi-supervised semantic segmentation aims to utilize limited labeled images and abundant unlabeled images to achieve label-efficient learning, wherein the weak-to-strong consistency regularization framework, popularized by FixMatch, is widely used as a benchmark scheme. Despite its effectiveness, we observe that such scheme struggles with satisfactory segmentation for the local regions. This can… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

  30. arXiv:2312.06987   

    cs.CR

    A new lightweight additive homomorphic encryption algorithm

    Authors: Wuqiong Pan, Hongliang Gu

    Abstract: This article describes a lightweight additive homomorphic algorithm with the same encryption and decryption keys. Compared to standard additive homomorphic algorithms like Paillier, this algorithm reduces the computational cost of encryption and decryption from modular exponentiation to modular multiplication, and reduces the computational cost of ciphertext addition from modular multiplication to… ▽ More

    Submitted 1 April, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: This algorithm proposed in this paper has serious security problem. It can be attacked by Orthogonal lattice

  31. arXiv:2312.05643  [pdf, other

    cs.NE cs.LG

    NiSNN-A: Non-iterative Spiking Neural Networks with Attention with Application to Motor Imagery EEG Classification

    Authors: Chuhan Zhang, Wei Pan, Cosimo Della Santina

    Abstract: Motor imagery, an important category in electroencephalogram (EEG) research, often intersects with scenarios demanding low energy consumption, such as portable medical devices and isolated environment operations. Traditional deep learning algorithms, despite their effectiveness, are characterized by significant computational demands accompanied by high energy usage. As an alternative, spiking neur… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

  32. arXiv:2311.09008  [pdf, other

    cs.CL

    End-to-end Task-oriented Dialogue: A Survey of Tasks, Methods, and Future Directions

    Authors: Libo Qin, Wenbo Pan, Qiguang Chen, Lizi Liao, Zhou Yu, Yue Zhang, Wanxiang Che, Min Li

    Abstract: End-to-end task-oriented dialogue (EToD) can directly generate responses in an end-to-end fashion without modular training, which attracts escalating popularity. The advancement of deep neural networks, especially the successful use of large pre-trained models, has further led to significant progress in EToD research in recent years. In this paper, we present a thorough review and provide a unifie… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: Accepted at EMNLP2023

  33. arXiv:2311.08244  [pdf, other

    cs.RO

    Language and Sketching: An LLM-driven Interactive Multimodal Multitask Robot Navigation Framework

    Authors: Weiqin Zu, Wenbin Song, Ruiqing Chen, Ze Guo, Fanglei Sun, Zheng Tian, Wei Pan, Jun Wang

    Abstract: The socially-aware navigation system has evolved to adeptly avoid various obstacles while performing multiple tasks, such as point-to-point navigation, human-following, and -guiding. However, a prominent gap persists: in Human-Robot Interaction (HRI), the procedure of communicating commands to robots demands intricate mathematical formulations. Furthermore, the transition between tasks does not qu… ▽ More

    Submitted 21 March, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

  34. arXiv:2311.03680   

    cs.RO cs.AI

    Deep Bayesian Reinforcement Learning for Spacecraft Proximity Maneuvers and Docking

    Authors: Desong Du, Naiming Qi, Yanfang Liu, Wei Pan

    Abstract: In the pursuit of autonomous spacecraft proximity maneuvers and docking(PMD), we introduce a novel Bayesian actor-critic reinforcement learning algorithm to learn a control policy with the stability guarantee. The PMD task is formulated as a Markov decision process that reflects the relative dynamic model, the docking cone and the cost function. Drawing from the principles of Lyapunov theory, we f… ▽ More

    Submitted 21 May, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: Because of a conflict of interest between me and my author's institution, my author and I do not want this paper to continue publication

  35. arXiv:2310.20424  [pdf, other

    cs.AR cs.LG

    DDC-PIM: Efficient Algorithm/Architecture Co-design for Doubling Data Capacity of SRAM-based Processing-In-Memory

    Authors: Cenlin Duan, Jianlei Yang, Xiaolin He, Yingjie Qi, Yikun Wang, Yiou Wang, Ziyan He, Bonan Yan, Xueyan Wang, Xiaotao Jia, Weitao Pan, Weisheng Zhao

    Abstract: Processing-in-memory (PIM), as a novel computing paradigm, provides significant performance benefits from the aspect of effective data movement reduction. SRAM-based PIM has been demonstrated as one of the most promising candidates due to its endurance and compatibility. However, the integration density of SRAM-based PIM is much lower than other non-volatile memory-based ones, due to its inherent… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: 14 pages, to be published in IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD)

  36. arXiv:2310.17816  [pdf, other

    stat.ML cs.LG stat.ME

    Local Discovery by Partitioning: Polynomial-Time Causal Discovery Around Exposure-Outcome Pairs

    Authors: Jacqueline Maasch, Weishen Pan, Shantanu Gupta, Volodymyr Kuleshov, Kyra Gan, Fei Wang

    Abstract: Causal discovery is crucial for causal inference in observational studies, as it can enable the identification of valid adjustment sets (VAS) for unbiased effect estimation. However, global causal discovery is notoriously hard in the nonparametric setting, with exponential time and sample complexity in the worst case. To address this, we propose local discovery by partitioning (LDP): a local causa… ▽ More

    Submitted 1 June, 2024; v1 submitted 25 October, 2023; originally announced October 2023.

    Journal ref: Proceedings of the Fortieth Conference on Uncertainty in Artificial Intelligence (2024)

  37. arXiv:2310.15699  [pdf, other

    cs.RO

    DACOOP-A: Decentralized Adaptive Cooperative Pursuit via Attention

    Authors: Zheng Zhang, Dengyu Zhang, Qingrui Zhang, Wei Pan, Tianjiang Hu

    Abstract: Integrating rule-based policies into reinforcement learning promises to improve data efficiency and generalization in cooperative pursuit problems. However, most implementations do not properly distinguish the influence of neighboring robots in observation embedding or inter-robot interaction rules, leading to information loss and inefficient cooperation. This paper proposes a cooperative pursuit… ▽ More

    Submitted 28 October, 2023; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: 8 Pages; This manuscript has been accepted by IEEE Robotics and Automation Letters

  38. arXiv:2310.06606  [pdf, other

    cs.RO

    SYNLOCO: Synthesizing Central Pattern Generator and Reinforcement Learning for Quadruped Locomotion

    Authors: Xinyu Zhang, Zhiyuan Xiao, Qingrui Zhang, Wei Pan

    Abstract: The Central Pattern Generator (CPG) is adept at generating rhythmic gait patterns characterized by consistent timing and adequate foot clearance. Yet, its open-loop configuration often compromises the system's control performance in response to environmental variations. On the other hand, Reinforcement Learning (RL), celebrated for its model-free properties, has gained significant traction in robo… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: 7 Pages

  39. arXiv:2309.14295  [pdf, other

    cs.RO

    Unwieldy Object Delivery with Nonholonomic Mobile Base: A Stable Pushing Approach

    Authors: Yujie Tang, Hai Zhu, Susan Potters, Martijn Wisse, Wei Pan

    Abstract: This paper addresses the problem of pushing manipulation with nonholonomic mobile robots. Pushing is a fundamental skill that enables robots to move unwieldy objects that cannot be grasped. We propose a stable pushing method that maintains stiff contact between the robot and the object to avoid consuming repositioning actions. We prove that a line contact, rather than a single point contact, is ne… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: The short version of the paper is accepted by RAL

  40. arXiv:2309.11443  [pdf, other

    cs.CV cs.LG

    Signature Activation: A Sparse Signal View for Holistic Saliency

    Authors: Jose Roberto Tello Ayala, Akl C. Fahed, Weiwei Pan, Eugene V. Pomerantsev, Patrick T. Ellinor, Anthony Philippakis, Finale Doshi-Velez

    Abstract: The adoption of machine learning in healthcare calls for model transparency and explainability. In this work, we introduce Signature Activation, a saliency method that generates holistic and class-agnostic explanations for Convolutional Neural Network (CNN) outputs. Our method exploits the fact that certain kinds of medical images, such as angiograms, have clear foreground and background objects.… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

  41. arXiv:2309.09550  [pdf, other

    cs.NE cs.AI

    Adaptive Reorganization of Neural Pathways for Continual Learning with Spiking Neural Networks

    Authors: Bing Han, Feifei Zhao, Wenxuan Pan, Zhaoya Zhao, Xianqi Li, Qingqun Kong, Yi Zeng

    Abstract: The human brain can self-organize rich and diverse sparse neural pathways to incrementally master hundreds of cognitive tasks. However, most existing continual learning algorithms for deep artificial and spiking neural networks are unable to adequately auto-regulate the limited resources in the network, which leads to performance drop along with energy consumption rise as the increase of tasks. In… ▽ More

    Submitted 8 October, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

  42. arXiv:2309.05263  [pdf, other

    cs.NE cs.AI

    Brain-inspired Evolutionary Architectures for Spiking Neural Networks

    Authors: Wenxuan Pan, Feifei Zhao, Zhuoya Zhao, Yi Zeng

    Abstract: The complex and unique neural network topology of the human brain formed through natural evolution enables it to perform multiple cognitive functions simultaneously. Automated evolutionary mechanisms of biological network structure inspire us to explore efficient architectural optimization for Spiking Neural Networks (SNNs). Instead of manually designed fixed architectures or hierarchical Network… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

  43. arXiv:2309.04156  [pdf, other

    cs.SD cs.CL eess.AS

    Cross-Utterance Conditioned VAE for Speech Generation

    Authors: Yang Li, Cheng Yu, Guangzhi Sun, Weiqin Zu, Zheng Tian, Ying Wen, Wei Pan, Chao Zhang, Jun Wang, Yang Yang, Fanglei Sun

    Abstract: Speech synthesis systems powered by neural networks hold promise for multimedia production, but frequently face issues with producing expressive speech and seamless editing. In response, we present the Cross-Utterance Conditioned Variational Autoencoder speech synthesis (CUC-VAE S2) framework to enhance prosody and ensure natural speech generation. This framework leverages the powerful representat… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: 13 pages;

  44. arXiv:2309.00827  [pdf, other

    cs.CV

    Few shot font generation via transferring similarity guided global style and quantization local style

    Authors: Wei Pan, Anna Zhu, Xinyu Zhou, Brian Kenji Iwana, Shilin Li

    Abstract: Automatic few-shot font generation (AFFG), aiming at generating new fonts with only a few glyph references, reduces the labor cost of manually designing fonts. However, the traditional AFFG paradigm of style-content disentanglement cannot capture the diverse local details of different fonts. So, many component-based approaches are proposed to tackle this problem. The issue with component-based app… ▽ More

    Submitted 14 September, 2023; v1 submitted 2 September, 2023; originally announced September 2023.

    Comments: Accepted by ICCV 2023

  45. arXiv:2309.00254  [pdf, other

    cs.LG cs.CL cs.CR

    Why do universal adversarial attacks work on large language models?: Geometry might be the answer

    Authors: Varshini Subhash, Anna Bialas, Weiwei Pan, Finale Doshi-Velez

    Abstract: Transformer based large language models with emergent capabilities are becoming increasingly ubiquitous in society. However, the task of understanding and interpreting their internal workings, in the context of adversarial attacks, remains largely unsolved. Gradient-based universal adversarial attacks have been shown to be highly effective on large language models and potentially dangerous due to… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

    Comments: 2nd AdvML Frontiers Workshop at 40th International Conference on Machine Learning, Honolulu, Hawaii, USA, 2023

  46. arXiv:2308.15701  [pdf, other

    cs.IR

    A Survey on Multi-Behavior Sequential Recommendation

    Authors: Xiaoqing Chen, Zhitao Li, Weike Pan, Zhong Ming

    Abstract: Recommender systems is set up to address the issue of information overload in traditional information retrieval systems, which is focused on recommending information that is of most interest to users from massive information. Generally, there is a sequential nature and heterogeneity to the behavior of a person interacting with a system, leading to the proposal of multi-behavior sequential recommen… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

  47. arXiv:2308.04749  [pdf, other

    cs.AI

    Enhancing Efficient Continual Learning with Dynamic Structure Development of Spiking Neural Networks

    Authors: Bing Han, Feifei Zhao, Yi Zeng, Wenxuan Pan, Guobin Shen

    Abstract: Children possess the ability to learn multiple cognitive tasks sequentially, which is a major challenge toward the long-term goal of artificial general intelligence. Existing continual learning frameworks are usually applicable to Deep Neural Networks (DNNs) and lack the exploration on more brain-inspired, energy-efficient Spiking Neural Networks (SNNs). Drawing on continual learning mechanisms du… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

    Journal ref: IJCAI2023 Camera ready

  48. arXiv:2308.04719  [pdf, other

    cs.AI

    JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games

    Authors: Yang Li, Kun Xiong, Yingping Zhang, Jiangcheng Zhu, Stephen Mcaleer, Wei Pan, Jun Wang, Zonghong Dai, Yaodong Yang

    Abstract: This paper presents an empirical exploration of non-transitivity in perfect-information games, specifically focusing on Xiangqi, a traditional Chinese board game comparable in game-tree complexity to chess and shogi. By analyzing over 10,000 records of human Xiangqi play, we highlight the existence of both transitive and non-transitive elements within the game's strategic structure. To address non… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: 28 pages, accepted by Transactions on Machine Learning Research (TMLR)

  49. arXiv:2308.01420  [pdf, other

    cs.CL cs.LG

    SAP-sLDA: An Interpretable Interface for Exploring Unstructured Text

    Authors: Charumathi Badrinath, Weiwei Pan, Finale Doshi-Velez

    Abstract: A common way to explore text corpora is through low-dimensional projections of the documents, where one hopes that thematically similar documents will be clustered together in the projected space. However, popular algorithms for dimensionality reduction of text corpora, like Latent Dirichlet Allocation (LDA), often produce projections that do not capture human notions of document similarity. We pr… ▽ More

    Submitted 28 July, 2023; originally announced August 2023.

  50. arXiv:2308.01197  [pdf, other

    cs.IR cs.CR cs.LG

    GNN4FR: A Lossless GNN-based Federated Recommendation Framework

    Authors: Guowei Wu, Weike Pan, Zhong Ming

    Abstract: Graph neural networks (GNNs) have gained wide popularity in recommender systems due to their capability to capture higher-order structure information among the nodes of users and items. However, these methods need to collect personal interaction data between a user and the corresponding items and then model them in a central server, which would break the privacy laws such as GDPR. So far, no exist… ▽ More

    Submitted 25 July, 2023; originally announced August 2023.