Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 186 results for author: Tang, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00487  [pdf, other

    cs.CL

    It's Morphing Time: Unleashing the Potential of Multiple LLMs via Multi-objective Optimization

    Authors: Bingdong Li, Zixiang Di, Yanting Yang, Hong Qian, Peng Yang, Hao Hao, Ke Tang, Aimin Zhou

    Abstract: In this paper, we introduce a novel approach for large language model merging via black-box multi-objective optimization algorithms. The goal of model merging is to combine multiple models, each excelling in different tasks, into a single model that outperforms any of the individual source models. However, model merging faces two significant challenges: First, existing methods rely heavily on huma… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  2. arXiv:2406.04777  [pdf, other

    cs.LG

    TDT Loss Takes It All: Integrating Temporal Dependencies among Targets into Non-Autoregressive Time Series Forecasting

    Authors: Qi Xiong, Kai Tang, Minbo Ma, Jie Xu, Tianrui Li

    Abstract: Learning temporal dependencies among targets (TDT) benefits better time series forecasting, where targets refer to the predicted sequence. Although autoregressive methods model TDT recursively, they suffer from inefficient inference and error accumulation. We argue that integrating TDT learning into non-autoregressive methods is essential for pursuing effective and efficient time series forecastin… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  3. arXiv:2405.18884  [pdf

    cs.NE

    Learning Mixture-of-Experts for General-Purpose Black-Box Discrete Optimization

    Authors: Shengcai Liu, Zhiyuan Wang, Yew-Soon Ong, Xin Yao, Ke Tang

    Abstract: Real-world applications involve various discrete optimization problems. Designing a specialized optimizer for each of these problems is challenging, typically requiring significant domain knowledge and human efforts. Hence, developing general-purpose optimizers as an off-the-shelf tool for a wide range of problems has been a long-standing research target. This article introduces MEGO, a novel gene… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 34 pages, 6 figures

  4. Node Injection Attack Based on Label Propagation Against Graph Neural Network

    Authors: Peican Zhu, Zechen Pan, Keke Tang, Xiaodong Cui, Jinhuan Wang, Qi Xuan

    Abstract: Graph Neural Network (GNN) has achieved remarkable success in various graph learning tasks, such as node classification, link prediction and graph classification. The key to the success of GNN lies in its effective structure information representation through neighboring aggregation. However, the attacker can easily perturb the aggregation process through injecting fake nodes, which reveals that G… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Accepted by TCSS;DOI:10.1109/TCSS.2024.3395794

  5. arXiv:2405.17272  [pdf, other

    cs.LG cs.AI

    DPN: Decoupling Partition and Navigation for Neural Solvers of Min-max Vehicle Routing Problems

    Authors: Zhi Zheng, Shunyu Yao, Zhenkun Wang, Xialiang Tong, Mingxuan Yuan, Ke Tang

    Abstract: The min-max vehicle routing problem (min-max VRP) traverses all given customers by assigning several routes and aims to minimize the length of the longest route. Recently, reinforcement learning (RL)-based sequential planning methods have exhibited advantages in solving efficiency and optimality. However, these methods fail to exploit the problem-specific properties in learning representations, re… ▽ More

    Submitted 6 June, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

  6. arXiv:2405.08674  [pdf, other

    cs.LG cs.AI

    Expensive Multi-Objective Bayesian Optimization Based on Diffusion Models

    Authors: Bingdong Li, Zixiang Di, Yongfan Lu, Hong Qian, Feng Wang, Peng Yang, Ke Tang, Aimin Zhou

    Abstract: Multi-objective Bayesian optimization (MOBO) has shown promising performance on various expensive multi-objective optimization problems (EMOPs). However, effectively modeling complex distributions of the Pareto optimal solutions is difficult with limited function evaluations. Existing Pareto set learning algorithms may exhibit considerable instability in such expensive scenarios, leading to signif… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

  7. arXiv:2405.08604  [pdf, other

    cs.LG cs.AI

    Towards Geometry-Aware Pareto Set Learning for Neural Multi-Objective Combinatorial Optimization

    Authors: Yongfan Lu, Zixiang Di, Bingdong Li, Shengcai Liu, Hong Qian, Peng Yang, Ke Tang, Aimin Zhou

    Abstract: Multi-objective combinatorial optimization (MOCO) problems are prevalent in various real-world applications. Most existing neural MOCO methods rely on problem decomposition to transform an MOCO problem into a series of singe-objective combinatorial optimization (SOCO) problems. However, these methods often approximate partial regions of the Pareto front and spend excessive time on diversity enhanc… ▽ More

    Submitted 23 May, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

  8. arXiv:2405.02897  [pdf, other

    cs.RO

    DexiTac: Soft Dexterous Tactile Gripping

    Authors: Chenghua Lu, Kailuan Tang, Max Yang, Tianqi Yue, Nathan F. Lepora

    Abstract: Grasping object,whether they are flat, round, or narrow and whether they have regular or irregular shapes,introduces difficulties in determining the ideal grasping posture, even for the most state-of-the-art grippers. In this article, we presented a reconfigurable pneumatic gripper with fingers that could be set in various configurations, such as hooking, supporting, closuring, and pinching. Each… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: 11 pages, 12 figures

  9. arXiv:2404.15777  [pdf, other

    cs.CL

    A Comprehensive Survey on Evaluating Large Language Model Applications in the Medical Industry

    Authors: Yining Huang, Keke Tang, Meilian Chen, Boyuan Wang

    Abstract: Since the inception of the Transformer architecture in 2017, Large Language Models (LLMs) such as GPT and BERT have evolved significantly, impacting various industries with their advanced capabilities in language understanding and generation. These models have shown potential to transform the medical field, highlighting the necessity for specialized evaluation frameworks to ensure their effective… ▽ More

    Submitted 29 May, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: 42 pages, 1 figure

  10. arXiv:2404.15744  [pdf, other

    cs.LG cs.AI cs.CR

    A General Black-box Adversarial Attack on Graph-based Fake News Detectors

    Authors: Peican Zhu, Zechen Pan, Yang Liu, Jiwei Tian, Keke Tang, Zhen Wang

    Abstract: Graph Neural Network (GNN)-based fake news detectors apply various methods to construct graphs, aiming to learn distinctive news embeddings for classification. Since the construction details are unknown for attackers in a black-box scenario, it is unrealistic to conduct the classical adversarial attacks that require a specific adjacency matrix. In this paper, we propose the first general black-box… ▽ More

    Submitted 25 April, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: Accepted by IJCAI2024

  11. S4TP: Social-Suitable and Safety-Sensitive Trajectory Planning for Autonomous Vehicles

    Authors: Xiao Wang, Ke Tang, Xingyuan Dai, Jintao Xu, Quancheng Du, Rui Ai, Yuxiao Wang, Weihao Gu

    Abstract: In public roads, autonomous vehicles (AVs) face the challenge of frequent interactions with human-driven vehicles (HDVs), which render uncertain driving behavior due to varying social characteristics among humans. To effectively assess the risks prevailing in the vicinity of AVs in social interactive traffic scenarios and achieve safe autonomous driving, this article proposes a social-suitable and… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 12 pages,4 figures, published to IEEE Transactions on Intelligent Vehicles

  12. arXiv:2404.08892  [pdf, other

    cs.CV cs.AI cs.LG

    ChangeAnywhere: Sample Generation for Remote Sensing Change Detection via Semantic Latent Diffusion Model

    Authors: Kai Tang, Jin Chen

    Abstract: Remote sensing change detection (CD) is a pivotal technique that pinpoints changes on a global scale based on multi-temporal images. With the recent expansion of deep learning, supervised deep learning-based CD models have shown satisfactory performance. However, CD sample labeling is very time-consuming as it is densely labeled and requires expert knowledge. To alleviate this problem, we introduc… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: Concise manuscript version of ChangeAnywhere

  13. arXiv:2404.02934  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    GreedLlama: Performance of Financial Value-Aligned Large Language Models in Moral Reasoning

    Authors: Jeffy Yu, Maximilian Huber, Kevin Tang

    Abstract: This paper investigates the ethical implications of aligning Large Language Models (LLMs) with financial optimization, through the case study of GreedLlama, a model fine-tuned to prioritize economically beneficial outcomes. By comparing GreedLlama's performance in moral reasoning tasks to a base Llama2 model, our results highlight a concerning trend: GreedLlama demonstrates a marked preference for… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 9 pages, 1 figure

  14. arXiv:2404.02491  [pdf, other

    cs.CL cs.AI cs.LG

    Measuring Social Norms of Large Language Models

    Authors: Ye Yuan, Kexin Tang, Jianhao Shen, Ming Zhang, Chenguang Wang

    Abstract: We present a new challenge to examine whether large language models understand social norms. In contrast to existing datasets, our dataset requires a fundamental understanding of social norms to solve. Our dataset features the largest set of social norm skills, consisting of 402 skills and 12,383 questions covering a wide set of social norms ranging from opinions and arguments to culture and laws.… ▽ More

    Submitted 22 May, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

  15. arXiv:2403.16112  [pdf, other

    cs.CV cs.AI cs.LG

    Opportunities and challenges in the application of large artificial intelligence models in radiology

    Authors: Liangrui Pan, Zhenyu Zhao, Ying Lu, Kewei Tang, Liyong Fu, Qingchun Liang, Shaoliang Peng

    Abstract: Influenced by ChatGPT, artificial intelligence (AI) large models have witnessed a global upsurge in large model research and development. As people enjoy the convenience by this AI large model, more and more large models in subdivided fields are gradually being proposed, especially large models in radiology imaging field. This article first introduces the development history of large models, techn… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  16. arXiv:2403.16002  [pdf, other

    cs.CV

    SDSTrack: Self-Distillation Symmetric Adapter Learning for Multi-Modal Visual Object Tracking

    Authors: Xiaojun Hou, Jiazheng Xing, Yijie Qian, Yaowei Guo, Shuo Xin, Junhao Chen, Kai Tang, Mengmeng Wang, Zhengkai Jiang, Liang Liu, Yong Liu

    Abstract: Multimodal Visual Object Tracking (VOT) has recently gained significant attention due to its robustness. Early research focused on fully fine-tuning RGB-based trackers, which was inefficient and lacked generalized representation due to the scarcity of multimodal data. Therefore, recent studies have utilized prompt tuning to transfer pre-trained RGB-based trackers to multimodal data. However, the m… ▽ More

    Submitted 27 March, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR2024

  17. arXiv:2403.00014  [pdf, other

    cs.SI cs.AI cs.LG

    GIN-SD: Source Detection in Graphs with Incomplete Nodes via Positional Encoding and Attentive Fusion

    Authors: Le Cheng, Peican Zhu, Keke Tang, Chao Gao, Zhen Wang

    Abstract: Source detection in graphs has demonstrated robust efficacy in the domain of rumor source identification. Although recent solutions have enhanced performance by leveraging deep neural networks, they often require complete user data. In this paper, we address a more challenging task, rumor source detection with incomplete user data, and propose a novel framework, i.e., Source Detection in Graphs wi… ▽ More

    Submitted 27 February, 2024; originally announced March 2024.

    Comments: The paper is accepted by AAAI24

    Report number: Vol. 38, No. 1, 55-63

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence 2024

  18. Label Informed Contrastive Pretraining for Node Importance Estimation on Knowledge Graphs

    Authors: Tianyu Zhang, Chengbin Hou, Rui Jiang, Xuegong Zhang, Chenghu Zhou, Ke Tang, Hairong Lv

    Abstract: Node Importance Estimation (NIE) is a task of inferring importance scores of the nodes in a graph. Due to the availability of richer data and knowledge, recent research interests of NIE have been dedicating to knowledge graphs for predicting future or missing node importance scores. Existing state-of-the-art NIE methods train the model by available labels, and they consider every interested node e… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: Accepted by IEEE TNNLS

  19. arXiv:2402.17574  [pdf, other

    cs.AI cs.CL

    Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization

    Authors: Wenqi Zhang, Ke Tang, Hai Wu, Mengna Wang, Yongliang Shen, Guiyang Hou, Zeqi Tan, Peng Li, Yueting Zhuang, Weiming Lu

    Abstract: Large Language Models (LLMs) exhibit robust problem-solving capabilities for diverse tasks. However, most LLM-based agents are designed as specific task solvers with sophisticated prompt engineering, rather than agents capable of learning and evolving through interactions. These task solvers necessitate manually crafted prompts to inform task rules and regulate LLM behaviors, inherently incapacita… ▽ More

    Submitted 6 June, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted to ACL-2024 Main, camera-ready version

  20. arXiv:2402.13297  [pdf, other

    q-bio.QM cs.AI

    Integrating Deep Learning and Synthetic Biology: A Co-Design Approach for Enhancing Gene Expression via N-terminal Coding Sequences

    Authors: Zhanglu Yan, Weiran Chu, Yuhua Sheng, Kaiwen Tang, Shida Wang, Yanfeng Liu, Weng-Fai Wong

    Abstract: N-terminal coding sequence (NCS) influences gene expression by impacting the translation initiation rate. The NCS optimization problem is to find an NCS that maximizes gene expression. The problem is important in genetic engineering. However, current methods for NCS optimization such as rational design and statistics-guided approaches are labor-intensive yield only relatively small improvements. T… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  21. arXiv:2402.09282  [pdf, other

    cs.CL

    Leveraging Large Language Models for Enhanced NLP Task Performance through Knowledge Distillation and Optimized Training Strategies

    Authors: Yining Huang, Keke Tang, Meilian Chen

    Abstract: Emerging Large Language Models (LLMs) like GPT-4 have revolutionized Natural Language Processing (NLP), showing potential in traditional tasks such as Named Entity Recognition (NER). Our study explores a three-phase training strategy that harnesses GPT-4's capabilities to enhance the BERT model's performance on NER. Initially, GPT-4 annotates a subset of the CONLL2003 and additional BBC dataset wi… ▽ More

    Submitted 24 March, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

    Comments: 16 pages, 3 figures

  22. arXiv:2401.13246  [pdf, other

    cs.CL

    SEER: Facilitating Structured Reasoning and Explanation via Reinforcement Learning

    Authors: Guoxin Chen, Kexin Tang, Chao Yang, Fuying Ye, Yu Qiao, Yiming Qian

    Abstract: Elucidating the reasoning process with structured explanations from question to answer is crucial, as it significantly enhances the interpretability, traceability, and trustworthiness of question-answering (QA) systems. However, structured explanations demand models to perform intricately structured reasoning, which poses great challenges. Most existing methods focus on single-step reasoning throu… ▽ More

    Submitted 4 June, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: Camera ready version for ACL 2024 Main Conference

  23. arXiv:2401.12983  [pdf

    cs.CL cs.AI physics.ed-ph

    Assessing Large Language Models in Mechanical Engineering Education: A Study on Mechanics-Focused Conceptual Understanding

    Authors: Jie Tian, Jixin Hou, Zihao Wu, Peng Shu, Zhengliang Liu, Yujie Xiang, Beikang Gu, Nicholas Filla, Yiwei Li, Ning Liu, Xianyan Chen, Keke Tang, Tianming Liu, Xianqiao Wang

    Abstract: This study is a pioneering endeavor to investigate the capabilities of Large Language Models (LLMs) in addressing conceptual questions within the domain of mechanical engineering with a focus on mechanics. Our examination involves a manually crafted exam encompassing 126 multiple-choice questions, spanning various aspects of mechanics courses, including Fluid Mechanics, Mechanical Vibration, Engin… ▽ More

    Submitted 13 January, 2024; originally announced January 2024.

    Comments: 30 pages, 7 figures, and 1 table

  24. arXiv:2401.11963  [pdf, other

    cs.NE cs.AI cs.LG

    Bridging Evolutionary Algorithms and Reinforcement Learning: A Comprehensive Survey on Hybrid Algorithms

    Authors: Pengyi Li, Jianye Hao, Hongyao Tang, Xian Fu, Yan Zheng, Ke Tang

    Abstract: Evolutionary Reinforcement Learning (ERL), which integrates Evolutionary Algorithms (EAs) and Reinforcement Learning (RL) for optimization, has demonstrated remarkable performance advancements. By fusing both approaches, ERL has emerged as a promising research direction. This survey offers a comprehensive overview of the diverse research branches in ERL. Specifically, we systematically summarize r… ▽ More

    Submitted 21 June, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

  25. arXiv:2312.06632  [pdf, other

    cs.AI

    Control Risk for Potential Misuse of Artificial Intelligence in Science

    Authors: Jiyan He, Weitao Feng, Yaosen Min, Jingwei Yi, Kunsheng Tang, Shuai Li, Jie Zhang, Kejiang Chen, Wenbo Zhou, Xing Xie, Weiming Zhang, Nenghai Yu, Shuxin Zheng

    Abstract: The expanding application of Artificial Intelligence (AI) in scientific fields presents unprecedented opportunities for discovery and innovation. However, this growth is not without risks. AI models in science, if misused, can amplify risks like creation of harmful substances, or circumvention of established regulations. In this study, we aim to raise awareness of the dangers of AI misuse in scien… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  26. arXiv:2312.02087  [pdf, other

    cs.CV

    VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence

    Authors: Yuchao Gu, Yipin Zhou, Bichen Wu, Licheng Yu, Jia-Wei Liu, Rui Zhao, Jay Zhangjie Wu, David Junhao Zhang, Mike Zheng Shou, Kevin Tang

    Abstract: Current diffusion-based video editing primarily focuses on structure-preserved editing by utilizing various dense correspondences to ensure temporal consistency and motion alignment. However, these approaches are often ineffective when the target edit involves a shape change. To embark on video editing with shape change, we explore customized video subject swapping in this work, where we aim to re… ▽ More

    Submitted 5 December, 2023; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: Project page at https://videoswap.github.io

  27. arXiv:2312.01739  [pdf, other

    cs.LG cs.AI

    Divide-and-Conquer Strategy for Large-Scale Dynamic Bayesian Network Structure Learning

    Authors: Hui Ouyang, Cheng Chen, Ke Tang

    Abstract: Dynamic Bayesian Networks (DBNs), renowned for their interpretability, have become increasingly vital in representing complex stochastic processes in various domains such as gene expression analysis, healthcare, and traffic prediction. Structure learning of DBNs from data is challenging, particularly for datasets with thousands of variables. Most current algorithms for DBN structure learning are a… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

  28. arXiv:2312.01150  [pdf, other

    cs.NE

    Pointer Networks Trained Better via Evolutionary Algorithms

    Authors: Muyao Zhong, Shengcai Liu, Bingdong Li, Haobo Fu, Ke Tang, Peng Yang

    Abstract: Pointer Network (PtrNet) is a specific neural network for solving Combinatorial Optimization Problems (COPs). While PtrNets offer real-time feed-forward inference for complex COPs instances, its quality of the results tends to be less satisfactory. One possible reason is that such issue suffers from the lack of global search ability of the gradient descent, which is frequently employed in traditio… ▽ More

    Submitted 11 March, 2024; v1 submitted 2 December, 2023; originally announced December 2023.

    Comments: None

    MSC Class: 68T07

  29. arXiv:2312.00663  [pdf, other

    cs.CV cs.RO

    Generalized Label-Efficient 3D Scene Parsing via Hierarchical Feature Aligned Pre-Training and Region-Aware Fine-tuning

    Authors: Kangcheng Liu, Yong-Jin Liu, Kai Tang, Ming Liu, Baoquan Chen

    Abstract: Deep neural network models have achieved remarkable progress in 3D scene understanding while trained in the closed-set setting and with full labels. However, the major bottleneck for current 3D recognition approaches is that they do not have the capacity to recognize any unseen novel classes beyond the training categories in diverse kinds of real-world applications. In the meantime, current state-… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: IEEE Transactions on Pattern Analysis and Machine Intelligence, Manuscript Info: 22 Pages, 16 Figures, and 8 Tables

  30. arXiv:2311.15345  [pdf, other

    cs.SI

    A Sample Reuse Strategy for Dynamic Influence Maximization Problem

    Authors: Shaofeng Zhang, Shengcai Liu, Ke Tang

    Abstract: Dynamic influence maximization problem (DIMP) aims to maintain a group of influential users within an evolving social network, so that the influence scope can be maximized at any given moment. A primary category of DIMP algorithms focuses on the renewal of reverse reachable (RR) sets, which is designed for static social network scenarios, to accelerate the estimation of influence spread. And the g… ▽ More

    Submitted 26 November, 2023; originally announced November 2023.

  31. arXiv:2311.12831  [pdf, other

    cs.CV cs.GR cs.LG

    ECNR: Efficient Compressive Neural Representation of Time-Varying Volumetric Datasets

    Authors: Kaiyuan Tang, Chaoli Wang

    Abstract: Due to its conceptual simplicity and generality, compressive neural representation has emerged as a promising alternative to traditional compression methods for managing massive volumetric datasets. The current practice of neural compression utilizes a single large multilayer perceptron (MLP) to encode the global volume, incurring slow training and inference. This paper presents an efficient compr… ▽ More

    Submitted 9 March, 2024; v1 submitted 2 October, 2023; originally announced November 2023.

    Comments: Accepted by IEEE PacificVis 2024 (conference papers track)

  32. arXiv:2311.12320  [pdf, other

    cs.AI

    A Survey on Multimodal Large Language Models for Autonomous Driving

    Authors: Can Cui, Yunsheng Ma, Xu Cao, Wenqian Ye, Yang Zhou, Kaizhao Liang, Jintai Chen, Juanwu Lu, Zichong Yang, Kuei-Da Liao, Tianren Gao, Erlong Li, Kun Tang, Zhipeng Cao, Tong Zhou, Ao Liu, Xinrui Yan, Shuqi Mei, Jianguo Cao, Ziran Wang, Chao Zheng

    Abstract: With the emergence of Large Language Models (LLMs) and Vision Foundation Models (VFMs), multimodal AI systems benefiting from large models have the potential to equally perceive the real world, make decisions, and control tools as humans. In recent months, LLMs have shown widespread attention in autonomous driving and map systems. Despite its immense potential, there is still a lack of a comprehen… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  33. arXiv:2311.09535  [pdf, other

    cs.CR

    FunctionMarker: Watermarking Language Datasets via Knowledge Injection

    Authors: Shuai Li, Kejiang Chen, Kunsheng Tang, Wen Huang, Jie Zhang, Weiming Zhang, Nenghai Yu

    Abstract: Large Language Models (LLMs) have demonstrated superior performance in various natural language processing tasks. Meanwhile, they require extensive training data, raising concerns related to dataset copyright protection. Backdoor-based watermarking is a viable approach to protect the copyright of classification datasets. However, these methods may introduce malicious misclassification behaviors in… ▽ More

    Submitted 17 November, 2023; v1 submitted 15 November, 2023; originally announced November 2023.

  34. arXiv:2310.19046  [pdf, other

    cs.NE

    Large Language Models as Evolutionary Optimizers

    Authors: Shengcai Liu, Caishun Chen, Xinghua Qu, Ke Tang, Yew-Soon Ong

    Abstract: Evolutionary algorithms (EAs) have achieved remarkable success in tackling complex combinatorial optimization problems. However, EAs often demand carefully-designed operators with the aid of domain expertise to achieve satisfactory performance. In this work, we present the first study on large language models (LLMs) as evolutionary combinatorial optimizers. The main advantage is that it requires m… ▽ More

    Submitted 26 April, 2024; v1 submitted 29 October, 2023; originally announced October 2023.

    Comments: Accepted by CEC 2024

  35. arXiv:2310.09874  [pdf, other

    cs.IR

    TF-DCon: Leveraging Large Language Models (LLMs) to Empower Training-Free Dataset Condensation for Content-Based Recommendation

    Authors: Jiahao Wu, Qijiong Liu, Hengchang Hu, Wenqi Fan, Shengcai Liu, Qing Li, Xiao-Ming Wu, Ke Tang

    Abstract: Modern techniques in Content-based Recommendation (CBR) leverage item content information to provide personalized services to users, but suffer from resource-intensive training on large datasets. To address this issue, we explore the dataset condensation for textual CBR in this paper. The goal of dataset condensation is to synthesize a small yet informative dataset, upon which models can achieve p… ▽ More

    Submitted 12 January, 2024; v1 submitted 15 October, 2023; originally announced October 2023.

  36. arXiv:2310.08896  [pdf, other

    cs.NE

    Migrant Resettlement by Evolutionary Multi-objective Optimization

    Authors: Dan-Xuan Liu, Yu-Ran Gu, Chao Qian, Xin Mu, Ke Tang

    Abstract: Migration has been a universal phenomenon, which brings opportunities as well as challenges for global development. As the number of migrants (e.g., refugees) increases rapidly in recent years, a key challenge faced by each country is the problem of migrant resettlement. This problem has attracted scientific research attention, from the perspective of maximizing the employment rate. Previous works… ▽ More

    Submitted 26 October, 2023; v1 submitted 13 October, 2023; originally announced October 2023.

  37. arXiv:2310.08106  [pdf, other

    cs.CV

    Generalized Logit Adjustment: Calibrating Fine-tuned Models by Removing Label Bias in Foundation Models

    Authors: Beier Zhu, Kaihua Tang, Qianru Sun, Hanwang Zhang

    Abstract: Foundation models like CLIP allow zero-shot transfer on various tasks without additional training data. Yet, the zero-shot performance is less competitive than a fully supervised one. Thus, to enhance the performance, fine-tuning and ensembling are also commonly adopted to better fit the downstream tasks. However, we argue that such prior work has overlooked the inherent biases in foundation model… ▽ More

    Submitted 26 March, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: V2 proposed a more effective method for label distribution estimation. V1 fixed a typo in abstract; Accepted by NeurIPS2023

  38. arXiv:2310.06577  [pdf, other

    cs.CV cs.GR

    SketchBodyNet: A Sketch-Driven Multi-faceted Decoder Network for 3D Human Reconstruction

    Authors: Fei Wang, Kongzhang Tang, Hefeng Wu, Baoquan Zhao, Hao Cai, Teng Zhou

    Abstract: Reconstructing 3D human shapes from 2D images has received increasing attention recently due to its fundamental support for many high-level 3D applications. Compared with natural images, freehand sketches are much more flexible to depict various shapes, providing a high potential and valuable way for 3D human reconstruction. However, such a task is highly challenging. The sparse abstract character… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: 9 pages, to appear in Pacific Graphics 2023

  39. arXiv:2310.01038  [pdf, other

    cs.IR

    Dataset Condensation for Recommendation

    Authors: Jiahao Wu, Wenqi Fan, Shengcai Liu, Qijiong Liu, Rui He, Qing Li, Ke Tang

    Abstract: Training recommendation models on large datasets often requires significant time and computational resources. Consequently, an emergent imperative has arisen to construct informative, smaller-scale datasets for efficiently training. Dataset compression techniques explored in other domains show potential possibility to address this problem, via sampling a subset or synthesizing a small dataset. How… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

  40. arXiv:2309.12723  [pdf, other

    cs.IR

    Enhancing Graph Collaborative Filtering via Uniformly Co-Clustered Intent Modeling

    Authors: Jiahao Wu, Wenqi Fan, Shengcai Liu, Qijiong Liu, Qing Li, Ke Tang

    Abstract: Graph-based collaborative filtering has emerged as a powerful paradigm for delivering personalized recommendations. Despite their demonstrated effectiveness, these methods often neglect the underlying intents of users, which constitute a pivotal facet of comprehensive user interests. Consequently, a series of approaches have arisen to tackle this limitation by introducing independent intent repres… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

    Comments: In submission

  41. Coco-LIC: Continuous-Time Tightly-Coupled LiDAR-Inertial-Camera Odometry using Non-Uniform B-spline

    Authors: Xiaolei Lang, Chao Chen, Kai Tang, Yukai Ma, Jiajun Lv, Yong Liu, Xingxing Zuo

    Abstract: In this paper, we propose an efficient continuous-time LiDAR-Inertial-Camera Odometry, utilizing non-uniform B-splines to tightly couple measurements from the LiDAR, IMU, and camera. In contrast to uniform B-spline-based continuous-time methods, our non-uniform B-spline approach offers significant advantages in terms of achieving real-time efficiency and high accuracy. This is accomplished by dyna… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: has been accepted by RAL 2023

  42. arXiv:2308.14012  [pdf, other

    cs.NE cs.SI

    Neural Influence Estimator: Towards Real-time Solutions to Influence Blocking Maximization

    Authors: Wenjie Chen, Shengcai Liu, Yew-Soon Ong, Ke Tang

    Abstract: Real-time solutions to the influence blocking maximization (IBM) problems are crucial for promptly containing the spread of misinformation. However, achieving this goal is non-trivial, mainly because assessing the blocked influence of an IBM problem solution typically requires plenty of expensive Monte Carlo simulations (MCSs). Although several approaches have been proposed to enhance efficiency,… ▽ More

    Submitted 27 August, 2023; originally announced August 2023.

  43. arXiv:2308.12481  [pdf

    eess.SP cs.LG

    Fall Detection using Knowledge Distillation Based Long short-term memory for Offline Embedded and Low Power Devices

    Authors: Hannah Zhou, Allison Chen, Celine Buer, Emily Chen, Kayleen Tang, Lauryn Gong, Zhiqi Liu, Jianbin Tang

    Abstract: This paper presents a cost-effective, low-power approach to unintentional fall detection using knowledge distillation-based LSTM (Long Short-Term Memory) models to significantly improve accuracy. With a primary focus on analyzing time-series data collected from various sensors, the solution offers real-time detection capabilities, ensuring prompt and reliable identification of falls. The authors i… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: 4 pages

  44. arXiv:2308.08222  [pdf, other

    cs.RO cs.LG

    HyperSNN: A new efficient and robust deep learning model for resource constrained control applications

    Authors: Zhanglu Yan, Shida Wang, Kaiwen Tang, Weng-Fai Wong

    Abstract: In light of the increasing adoption of edge computing in areas such as intelligent furniture, robotics, and smart homes, this paper introduces HyperSNN, an innovative method for control tasks that uses spiking neural networks (SNNs) in combination with hyperdimensional computing. HyperSNN substitutes expensive 32-bit floating point multiplications with 8-bit integer additions, resulting in reduced… ▽ More

    Submitted 17 August, 2023; v1 submitted 16 August, 2023; originally announced August 2023.

  45. arXiv:2306.14690  [pdf, other

    cs.NE

    Chance-Constrained Multiple-Choice Knapsack Problem: Model, Algorithms, and Applications

    Authors: Xuanfeng Li, Shengcai Liu, Jin Wang, Xiao Chen, Yew-Soon Ong, Ke Tang

    Abstract: The multiple-choice knapsack problem (MCKP) is a classic NP-hard combinatorial optimization problem. Motivated by several significant real-world applications, this work investigates a novel variant of MCKP called chance-constrained multiple-choice knapsack problem (CCMCKP), where the item weights are random variables. In particular, we focus on the practical scenario of CCMCKP, where the probabili… ▽ More

    Submitted 14 December, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

  46. arXiv:2306.11689  [pdf, other

    econ.EM cs.AI stat.ME stat.ML

    Statistical Tests for Replacing Human Decision Makers with Algorithms

    Authors: Kai Feng, Han Hong, Ke Tang, Jingyuan Wang

    Abstract: This paper proposes a statistical framework with which artificial intelligence can improve human decision making. The performance of each human decision maker is first benchmarked against machine predictions; we then replace the decisions made by a subset of the decision makers with the recommendation from the proposed artificial intelligence algorithm. Using a large nationwide dataset of pregnanc… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: 65 pages, 19 figures

  47. arXiv:2306.10700  [pdf, other

    cs.LG

    Perturbation-Based Two-Stage Multi-Domain Active Learning

    Authors: Rui He, Zeyu Dai, Shan He, Ke Tang

    Abstract: In multi-domain learning (MDL) scenarios, high labeling effort is required due to the complexity of collecting data from various domains. Active Learning (AL) presents an encouraging solution to this issue by annotating a smaller number of highly informative instances, thereby reducing the labeling effort. Previous research has relied on conventional AL strategies for MDL scenarios, which underuti… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

  48. arXiv:2306.06849  [pdf, other

    cs.LG cs.CV

    Mitigating Transformer Overconfidence via Lipschitz Regularization

    Authors: Wenqian Ye, Yunsheng Ma, Xu Cao, Kun Tang

    Abstract: Though Transformers have achieved promising results in many computer vision tasks, they tend to be over-confident in predictions, as the standard Dot Product Self-Attention (DPSA) can barely preserve distance for the unbounded input domain. In this work, we fill this gap by proposing a novel Lipschitz Regularized Transformer (LRFormer). Specifically, we present a new similarity function with the d… ▽ More

    Submitted 18 July, 2023; v1 submitted 11 June, 2023; originally announced June 2023.

    Comments: Accepted by UAI 2023. (https://proceedings.mlr.press/v216/ye23a.html)

  49. arXiv:2305.18702  [pdf, other

    stat.ML cs.LG math.NA

    Adversarial Adaptive Sampling: Unify PINN and Optimal Transport for the Approximation of PDEs

    Authors: Kejun Tang, Jiayu Zhai, Xiaoliang Wan, Chao Yang

    Abstract: Solving partial differential equations (PDEs) is a central task in scientific computing. Recently, neural network approximation of PDEs has received increasing attention due to its flexible meshless discretization and its potential for high-dimensional problems. One fundamental numerical difficulty is that random samples in the training set introduce statistical errors into the discretization of l… ▽ More

    Submitted 14 March, 2024; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: ICLR, 2024

  50. arXiv:2305.17665  [pdf, other

    cs.LG stat.ML

    Acceleration of stochastic gradient descent with momentum by averaging: finite-sample rates and asymptotic normality

    Authors: Kejie Tang, Weidong Liu, Yichen Zhang, Xi Chen

    Abstract: Stochastic gradient descent with momentum (SGDM) has been widely used in many machine learning and statistical applications. Despite the observed empirical benefits of SGDM over traditional SGD, the theoretical understanding of the role of momentum for different learning rates in the optimization process remains widely open. We analyze the finite-sample convergence rate of SGDM under the strongly… ▽ More

    Submitted 1 February, 2024; v1 submitted 28 May, 2023; originally announced May 2023.