Showing 1–50 of 552 results for author: Zhou, C

Searching in archive cs.
  1. STAL3D: Unsupervised Domain Adaptation for 3D Object Detection via Collaborating Self-Training and Adversarial Learning

    Authors: Yanan Zhang, Chao Zhou, Di Huang

    Abstract: Existing 3D object detection suffers from expensive annotation costs and poor transferability to unknown data due to the domain gap, Unsupervised Domain Adaptation (UDA) aims to generalize detection models trained in labeled source domains to perform robustly on unexplored target domains, providing a promising solution for cross-domain 3D object detection. Although Self-Training (ST) based cross-d… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: Accepted by IEEE-TIV

  2. arXiv:2406.18575  [pdf

    cs.CV cs.LG

    Research on Driver Facial Fatigue Detection Based on Yolov8 Model

    Authors: Chang Zhou, Yang Zhao, Shaobo Liu, Yi Zhao, Xingchen Li, Chiyu Cheng

    Abstract: In a society where traffic accidents frequently occur, fatigue driving has emerged as a grave issue. Fatigue driving detection technology, especially those based on the YOLOv8 deep learning model, has seen extensive research and application as an effective preventive measure. This paper discusses in depth the methods and technologies utilized in the YOLOv8 model to detect driver fatigue, elaborate… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted by the 5th International Conference on Information Science, Parallel and Distributed Systems (ISPDS 2024), 2024 IEEE

  3. arXiv:2406.15734  [pdf, other

    cs.CL cs.AI

    RankAdaptor: Hierarchical Dynamic Low-Rank Adaptation for Structural Pruned LLMs

    Authors: Changhai Zhou, Shijie Han, Shiyang Zhang, Shichao Weng, Zekai Liu, Cheng Jin

    Abstract: The efficient compression of large language models (LLMs) is becoming increasingly popular. However, recovering the accuracy of compressed LLMs is still a major challenge. Structural pruning with standard Low-Rank Adaptation (LoRA) is a common technique in current LLM compression. In structural pruning, the model architecture is modified unevenly, resulting in suboptimal performance in various dow… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

  4. arXiv:2406.15501  [pdf

    cs.CR

    Secure Combination of Untrusted Time information Based on Optimized Dempster-Shafer Theory

    Authors: Yang Li, Yujie Luo, Yichen Zhang, Ao Sun, Wei Huang, Shuai Zhang, Tao Zhang, Chuang Zhou, Li Ma, Jie Yang, Mei Wu, Heng Wang, Yan Pan, Yun Shao, Xing Chen, Ziyang Chen, Song Yu, Hong Guo, Bingjie Xu

    Abstract: Secure precision time synchronization is important for applications of Cyber-Physical Systems. However, several attacks, especially the Time Delay Attack (TDA), deteriorates the performance of time synchronization system seriously. Multiple paths scheme is thought as an effective security countermeasure to decrease the influence of TDA. However, the effective secure combination algorithm is still… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  5. arXiv:2406.15034  [pdf, other

    cs.CV

    SVFormer: A Direct Training Spiking Transformer for Efficient Video Action Recognition

    Authors: Liutao Yu, Liwei Huang, Chenlin Zhou, Han Zhang, Zhengyu Ma, Huihui Zhou, Yonghong Tian

    Abstract: Video action recognition (VAR) plays crucial roles in various domains such as surveillance, healthcare, and industrial automation, making it highly significant for the society. Consequently, it has long been a research spot in the computer vision field. As artificial neural networks (ANNs) are flourishing, convolution neural networks (CNNs), including 2D-CNNs and 3D-CNNs, as well as variants of th… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: Accepted by IJCAI 2024 workshop - Human Brain and Artificial Intelligence

  6. arXiv:2406.14024  [pdf, other

    cs.CL

    The Reason behind Good or Bad: Towards a Better Mathematical Verifier with Natural Language Feedback

    Authors: Bofei Gao, Zefan Cai, Runxin Xu, Peiyi Wang, Ce Zheng, Runji Lin, Keming Lu, Junyang Lin, Chang Zhou, Tianyu Liu, Baobao Chang

    Abstract: Mathematical verifier achieves success in mathematical reasoning tasks by validating the correctness of solutions. However, existing verifiers are trained with binary classification labels, which are not informative enough for the model to accurately assess the solutions. To mitigate the aforementioned insufficiency of binary labels, we introduce step-wise natural language feedbacks as rationale la… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 9 pages

  7. arXiv:2406.13941  [pdf, other

    cs.IR cs.AI

    UpDLRM: Accelerating Personalized Recommendation using Real-World PIM Architecture

    Authors: Sitian Chen, Haobin Tan, Amelie Chi Zhou, Yusen Li, Pavan Balaji

    Abstract: Deep Learning Recommendation Models (DLRMs) have gained popularity in recommendation systems due to their effectiveness in handling large-scale recommendation tasks. The embedding layers of DLRMs have become the performance bottleneck due to their intensive needs on memory capacity and memory bandwidth. In this paper, we propose UpDLRM, which utilizes real-world processing-in-memory (PIM) hardware,… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  8. arXiv:2406.13542  [pdf, other

    cs.CL cs.AI cs.LG

    Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models

    Authors: Guanting Dong, Keming Lu, Chengpeng Li, Tingyu Xia, Bowen Yu, Chang Zhou, Jingren Zhou

    Abstract: One core capability of large language models (LLMs) is to follow natural language instructions. However, the issue of automatically constructing high-quality training data to enhance the complex instruction-following abilities of LLMs without manual annotation remains unresolved. In this paper, we introduce AutoIF, the first scalable and reliable method for automatically generating instruction-fol… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  9. arXiv:2406.12809  [pdf, other

    cs.CL

    Can Large Language Models Always Solve Easy Problems if They Can Solve Harder Ones?

    Authors: Zhe Yang, Yichang Zhang, Tianyu Liu, Jian Yang, Junyang Lin, Chang Zhou, Zhifang Sui

    Abstract: Large language models (LLMs) have demonstrated impressive capabilities, but still suffer from inconsistency issues (e.g. LLMs can react differently to disturbances like rephrasing or inconsequential order change). In addition to these inconsistencies, we also observe that LLMs, while capable of solving hard problems, can paradoxically fail at easier ones. To evaluate this hard-to-easy inconsistenc… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 25 pages, 12 figures, 10 tables

  10. arXiv:2406.10678  [pdf, other

    cs.CV

    A Late-Stage Bitemporal Feature Fusion Network for Semantic Change Detection

    Authors: Chenyao Zhou, Haotian Zhang, Han Guo, Zhengxia Zou, Zhenwei Shi

    Abstract: Semantic change detection is an important task in geoscience and earth observation. By producing a semantic change map for each temporal phase, both the land use land cover categories and change information can be interpreted. Recently some multi-task learning based semantic change detection methods have been proposed to decompose the task into semantic segmentation and binary change detection sub… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

  11. arXiv:2406.10239  [pdf

    cs.IR cs.LG

    Predict Click-Through Rates with Deep Interest Network Model in E-commerce Advertising

    Authors: Chang Zhou, Yang Zhao, Yuelin Zou, Jin Cao, Wenhan Fan, Yi Zhao, Chiyu Cheng

    Abstract: This paper proposes new methods to enhance click-through rate (CTR) prediction models using the Deep Interest Network (DIN) model, specifically applied to the advertising system of Alibaba's Taobao platform. Unlike traditional deep learning approaches, this research focuses on localized user behavior activation for tailored ad targeting by leveraging extensive user behavior data. Compared to tradi… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted by the 5th International Conference on Information Science, Parallel and Distributed Systems (ISPDS 2024), 2024 IEEE

  12. arXiv:2406.10228  [pdf, other

    cs.CV cs.AI cs.CL

    VEGA: Learning Interleaved Image-Text Comprehension in Vision-Language Large Models

    Authors: Chenyu Zhou, Mengdan Zhang, Peixian Chen, Chaoyou Fu, Yunhang Shen, Xiawu Zheng, Xing Sun, Rongrong Ji

    Abstract: The swift progress of Multi-modal Large Models (MLLMs) has showcased their impressive ability to tackle tasks blending vision and language. Yet, most current models and benchmarks cater to scenarios with a narrow scope of visual and textual contexts. These models often fall short when faced with complex comprehension tasks, which involve navigating through a plethora of irrelevant and potentially… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Project Page: https://zhourax.github.io/VEGA/

  13. arXiv:2406.04821  [pdf, other

    cs.RO

    Deep Learning Powered Estimate of The Extrinsic Parameters on Unmanned Surface Vehicles

    Authors: Yi Shen, Hao Liu, Chang Zhou, Wentao Wang, Zijun Gao, Qi Wang

    Abstract: Unmanned Surface Vehicles (USVs) are pivotal in marine exploration, but their sensors' accuracy is compromised by the dynamic marine environment. Traditional calibration methods fall short in these conditions. This paper introduces a deep learning architecture that predicts changes in the USV's dynamic metacenter and refines sensors' extrinsic parameters in real time using a Time-Sequence General… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted by The 9th Asia-Pacific Conference on Intelligent Robot Systems (ACIRS 2024)

  14. arXiv:2406.04371  [pdf, other

    cs.CL cs.AI

    Phased Instruction Fine-Tuning for Large Language Models

    Authors: Wei Pang, Chuan Zhou, Xiao-Hua Zhou, Xiaojie Wang

    Abstract: Instruction Fine-Tuning enhances pre-trained language models from basic next-word prediction to complex instruction-following. However, existing One-off Instruction Fine-Tuning (One-off IFT) method, applied on a diverse instruction, may not effectively boost models' adherence to instructions due to the simultaneous handling of varying instruction complexities. To improve this, Phased Instruction F… ▽ More

    Submitted 16 June, 2024; v1 submitted 1 June, 2024; originally announced June 2024.

    Comments: The final version, to be appear at ACL 2024 Findings

  15. arXiv:2406.03647  [pdf, other

    cs.LG cs.AI

    Decision-focused Graph Neural Networks for Combinatorial Optimization

    Authors: Yang Liu, Chuan Zhou, Peng Zhang, Shirui Pan, Zhao Li, Hongyang Chen

    Abstract: In recent years, there has been notable interest in investigating combinatorial optimization (CO) problems by neural-based framework. An emerging strategy to tackle these challenging problems involves the adoption of graph neural networks (GNNs) as an alternative to traditional algorithms, a subject that has attracted considerable attention. Despite the growing popularity of GNNs and traditional a… ▽ More

    Submitted 9 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: 9 pages

  16. arXiv:2406.03511  [pdf, other

    cs.LG cs.AI

    MagiNet: Mask-Aware Graph Imputation Network for Incomplete Traffic Data

    Authors: Jianping Zhou, Bin Lu, Zhanyu Liu, Siyu Pan, Xuejun Feng, Hua Wei, Guanjie Zheng, Xinbing Wang, Chenghu Zhou

    Abstract: Due to detector malfunctions and communication failures, missing data is ubiquitous during the collection of traffic data. Therefore, it is of vital importance to impute the missing values to facilitate data analysis and decision-making for Intelligent Transportation System (ITS). However, existing imputation methods generally perform zero pre-filling techniques to initialize missing values, intro… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 19 pages, 7 figures

  17. arXiv:2406.02872  [pdf, other

    cs.LG cs.AI

    Combinatorial Optimization with Automated Graph Neural Networks

    Authors: Yang Liu, Peng Zhang, Yang Gao, Chuan Zhou, Zhao Li, Hongyang Chen

    Abstract: In recent years, graph neural networks (GNNs) have become increasingly popular for solving NP-hard combinatorial optimization (CO) problems, such as maximum cut and maximum independent set. The core idea behind these methods is to represent a CO problem as a graph and then use GNNs to learn the node/graph embedding with combinatorial information. Although these methods have achieved promising resu… ▽ More

    Submitted 9 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: 9 pages

  18. arXiv:2405.21075  [pdf, other

    cs.CV cs.CL

    Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

    Authors: Chaoyou Fu, Yuhan Dai, Yongdong Luo, Lei Li, Shuhuai Ren, Renrui Zhang, Zihan Wang, Chenyu Zhou, Yunhang Shen, Mengdan Zhang, Peixian Chen, Yanwei Li, Shaohui Lin, Sirui Zhao, Ke Li, Tong Xu, Xiawu Zheng, Enhong Chen, Rongrong Ji, Xing Sun

    Abstract: In the quest for artificial general intelligence, Multi-modal Large Language Models (MLLMs) have emerged as a focal point in recent advancements. However, the predominant focus remains on developing their capabilities in static image understanding. The potential of MLLMs in processing sequential visual data is still insufficiently explored, highlighting the absence of a comprehensive, high-quality… ▽ More

    Submitted 16 June, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

    Comments: Project Page: https://video-mme.github.io

  19. arXiv:2405.17931  [pdf, other

    cs.CL cs.LG

    Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment

    Authors: Keming Lu, Bowen Yu, Fei Huang, Yang Fan, Runji Lin, Chang Zhou

    Abstract: Effectively aligning Large Language Models (LLMs) with human-centric values while preventing the degradation of abilities acquired through Pre-training and Supervised Fine-tuning (SFT) poses a central challenge in Reinforcement Learning from Human Feedback (RLHF). In this paper, we first discover that interpolating RLHF and SFT model parameters can adjust the trade-off between human preference and… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  20. arXiv:2405.17765  [pdf, other

    cs.CV

    PTM-VQA: Efficient Video Quality Assessment Leveraging Diverse PreTrained Models from the Wild

    Authors: Kun Yuan, Hongbo Liu, Mading Li, Muyi Sun, Ming Sun, Jiachao Gong, Jinhua Hao, Chao Zhou, Yansong Tang

    Abstract: Video quality assessment (VQA) is a challenging problem due to the numerous factors that can affect the perceptual quality of a video, e.g., content attractiveness, distortion type, motion pattern, and level. However, annotating the Mean opinion score (MOS) for videos is expensive and time-consuming, which limits the scale of VQA datasets, and poses a significant obstacle for deep learning-based me… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: CVPR 2024, 11 pages, 4 figures, 7 tables

  21. Adaptive Device-Edge Collaboration on DNN Inference in AIoT: A Digital Twin-Assisted Approach

    Authors: Shisheng Hu, Mushu Li, Jie Gao, Conghao Zhou, Xuemin Shen

    Abstract: Device-edge collaboration on deep neural network (DNN) inference is a promising approach to efficiently utilizing network resources for supporting artificial intelligence of things (AIoT) applications. In this paper, we propose a novel digital twin (DT)-assisted approach to device-edge collaboration on DNN inference that determines whether and when to stop local inference at a device and upload th… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Journal ref: IEEE Internet Things J. (Volume: 11, Issue: 7, 01 April 2024)

  22. arXiv:2405.17337  [pdf, other

    cs.CL cs.AI

    Cost-efficient Knowledge-based Question Answering with Large Language Models

    Authors: Junnan Dong, Qinggang Zhang, Chuang Zhou, Hao Chen, Daochen Zha, Xiao Huang

    Abstract: Knowledge-based question answering (KBQA) is widely used in many scenarios that necessitate domain knowledge. Large language models (LLMs) bring opportunities to KBQA, while their costs are significantly higher and absence of domain-specific knowledge during pre-training. We are motivated to combine LLMs and prior small models on knowledge graphs (KGMs) for both inferential accuracy and cost savin… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  23. arXiv:2405.16417  [pdf, other

    cs.CV

    CRoFT: Robust Fine-Tuning with Concurrent Optimization for OOD Generalization and Open-Set OOD Detection

    Authors: Lin Zhu, Yifeng Yang, Qinying Gu, Xinbing Wang, Chenghu Zhou, Nanyang Ye

    Abstract: Recent vision-language pre-trained models (VL-PTMs) have shown remarkable success in open-vocabulary tasks. However, downstream use cases often involve further fine-tuning of VL-PTMs, which may distort their general knowledge and impair their ability to handle distribution shifts. In real-world scenarios, machine learning systems inevitably encounter both covariate shifts (e.g., changes in image s… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: Accepted by ICML2024

  24. arXiv:2405.15460  [pdf

    cs.RO

    TD3 Based Collision Free Motion Planning for Robot Navigation

    Authors: Hao Liu, Yi Shen, Chang Zhou, Yuelin Zou, Zijun Gao, Qi Wang

    Abstract: This paper addresses the challenge of collision-free motion planning in automated navigation within complex environments. Utilizing advancements in Deep Reinforcement Learning (DRL) and sensor technologies like LiDAR, we propose the TD3-DWA algorithm, an innovative fusion of the traditional Dynamic Window Approach (DWA) with the Twin Delayed Deep Deterministic Policy Gradient (TD3). This hybrid al… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  25. arXiv:2405.15316  [pdf, other

    cs.LG cs.CR

    Decaf: Data Distribution Decompose Attack against Federated Learning

    Authors: Zhiyang Dai, Chunyi Zhou, Anmin Fu

    Abstract: In contrast to prevalent Federated Learning (FL) privacy inference techniques such as generative adversarial networks attacks, membership inference attacks, property inference attacks, and model inversion attacks, we devise an innovative privacy threat: the Data Distribution Decompose Attack on FL, termed Decaf. This attack enables an honest-but-curious FL server to meticulously profile the propor… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  26. arXiv:2405.14992  [pdf, other

    cs.CL cs.LG

    Linking In-context Learning in Transformers to Human Episodic Memory

    Authors: Li Ji-An, Corey Y. Zhou, Marcus K. Benna, Marcelo G. Mattar

    Abstract: Understanding the connections between artificial and biological intelligent systems can reveal fundamental principles underlying general intelligence. While many artificial intelligence (AI) models have a neuroscience counterpart, such connections are largely missing in Transformer models and the self-attention mechanism. Here, we examine the relationship between attention heads and human episodic… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  27. arXiv:2405.13934  [pdf, other

    cs.LG

    Text-Free Multi-domain Graph Pre-training: Toward Graph Foundation Models

    Authors: Xingtong Yu, Chang Zhou, Yuan Fang, Xinming Zhang

    Abstract: Given the ubiquity of graph data, it is intriguing to ask: Is it possible to train a graph foundation model on a broad range of graph data across diverse domains? A major hurdle toward this goal lies in the fact that graphs from different domains often exhibit profoundly divergent characteristics. Although there have been some initial efforts in integrating multi-domain graphs for pre-training, th… ▽ More

    Submitted 28 May, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

    Comments: Under review

  28. arXiv:2405.13409  [pdf, other

    cs.GR

    Specular Polynomials

    Authors: Zhimin Fan, Jie Guo, Yiming Wang, Tianyu Xiao, Hao Zhang, Chenxi Zhou, Zhenyu Chen, Pengpei Hong, Yanwen Guo, Ling-Qi Yan

    Abstract: Finding valid light paths that involve specular vertices in Monte Carlo rendering requires solving many non-linear, transcendental equations in high-dimensional space. Existing approaches heavily rely on Newton iterations in path space, which are limited to obtaining at most a single solution each time and easily diverge when initialized with improper seeds. We propose specular polynomials, a Ne… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 13 pages, 13 figures, accepted by SIGGRAPH 2024

    ACM Class: I.3.3

  29. arXiv:2405.13381  [pdf

    cs.LG

    Optimizing Search Advertising Strategies: Integrating Reinforcement Learning with Generalized Second-Price Auctions for Enhanced Ad Ranking and Bidding

    Authors: Chang Zhou, Yang Zhao, Jin Cao, Yi Shen, Xiaoling Cui, Chiyu Cheng

    Abstract: This paper explores the integration of strategic optimization methods in search advertising, focusing on ad ranking and bidding mechanisms within E-commerce platforms. By employing a combination of reinforcement learning and evolutionary strategies, we propose a dynamic model that adjusts to varying user interactions and optimizes the balance between advertiser cost, user relevance, and platform r… ▽ More

    Submitted 29 May, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

    Comments: Accepted by 2024 5th International Conference on Electronic communication and Artificial Intelligence (ICECAI 2024)

  30. arXiv:2405.12786  [pdf, other

    cs.CR

    Rethinking the Vulnerabilities of Face Recognition Systems: From a Practical Perspective

    Authors: Jiahao Chen, Zhiqiang Shen, Yuwen Pu, Chunyi Zhou, Changjiang Li, Jiliang Li, Ting Wang, Shouling Ji

    Abstract: Face Recognition Systems (FRS) have increasingly integrated into critical applications, including surveillance and user authentication, highlighting their pivotal role in modern security systems. Recent studies have revealed vulnerabilities in FRS to adversarial (e.g., adversarial patch attacks) and backdoor attacks (e.g., training data poisoning), raising significant concerns about their reliabil… ▽ More

    Submitted 8 June, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

    Comments: 19 pages, version 3

  31. arXiv:2405.12751  [pdf, other

    cs.CR

    A Stealthy Backdoor Attack for Without-Label-Sharing Split Learning

    Authors: Yuwen Pu, Zhuoyuan Ding, Jiahao Chen, Chunyi Zhou, Qingming Li, Chunqiang Hu, Shouling Ji

    Abstract: As a novel privacy-preserving paradigm aimed at reducing client computational costs and achieving data utility, split learning has garnered extensive attention and proliferated widespread applications across various fields, including smart health and smart transportation, among others. While recent studies have primarily concentrated on addressing privacy leakage concerns in split learning, such a… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 15 pages

  32. arXiv:2405.12719  [pdf, other

    cs.CR

    How to Train a Backdoor-Robust Model on a Poisoned Dataset without Auxiliary Data?

    Authors: Yuwen Pu, Jiahao Chen, Chunyi Zhou, Zhou Feng, Qingming Li, Chunqiang Hu, Shouling Ji

    Abstract: Backdoor attacks have attracted wide attention from academia and industry due to their great security threat to deep neural networks (DNN). Most of the existing methods propose to conduct backdoor attacks by poisoning the training dataset with different strategies, so it's critical to identify the poisoned samples and then train a clean model on the unreliable dataset in the context of defending b… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 13 pages, under review

  33. arXiv:2405.10681  [pdf, other

    cs.IR

    Know in AdVance: Linear-Complexity Forecasting of Ad Campaign Performance with Evolving User Interest

    Authors: XiaoYu Wang, YongHui Guo, Hui Sheng, Peili Lv, Chi Zhou, Wei Huang, ShiQin Ta, Dongbo Huang, XiuJin Yang, Lan Xu, Hao Zhou, Yusheng Ji

    Abstract: Real-time Bidding (RTB) advertisers wish to know in advance the expected cost and yield of ad campaigns to avoid trial-and-error expenses. However, Campaign Performance Forecasting (CPF), a sequence modeling task involving tens of thousands of ad auctions, poses challenges of evolving user interest, auction representation, and long context, making coarse-grained and static-modeling method… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: 12 pages, 4 figures, accepted at ACM SIGKDD 2024

  34. arXiv:2405.10516  [pdf, other

    cs.CL cs.AI

    Language Models can Evaluate Themselves via Probability Discrepancy

    Authors: Tingyu Xia, Bowen Yu, Yuan Wu, Yi Chang, Chang Zhou

    Abstract: In this paper, we initiate our discussion by demonstrating how Large Language Models (LLMs), when tasked with responding to queries, display a more even probability distribution in their answers if they are more adept, as opposed to their less skilled counterparts. Expanding on this foundational insight, we propose a new self-evaluation method ProbDiff for assessing the efficacy of various LLMs. T… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: ACL 2024 Findings

  35. arXiv:2405.10515  [pdf

    cs.LG

    Improved AdaBoost for Virtual Reality Experience Prediction Based on Long Short-Term Memory Network

    Authors: Wenhan Fan, Zhicheng Ding, Ruixin Huang, Chang Zhou, Xuyang Zhang

    Abstract: A classification prediction algorithm based on Long Short-Term Memory Network (LSTM) improved AdaBoost is used to predict virtual reality (VR) user experience. The dataset is randomly divided into training and test sets in the ratio of 7:3. During the training process, the model's loss value decreases from 0.65 to 0.31, which shows that the model gradually reduces the discrepancy between the predic… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  36. arXiv:2405.07233  [pdf, other

    cs.LG cs.AI physics.ao-ph

    OXYGENERATOR: Reconstructing Global Ocean Deoxygenation Over a Century with Deep Learning

    Authors: Bin Lu, Ze Zhao, Luyu Han, Xiaoying Gan, Yuntao Zhou, Lei Zhou, Luoyi Fu, Xinbing Wang, Chenghu Zhou, Jing Zhang

    Abstract: Accurately reconstructing the global ocean deoxygenation over a century is crucial for assessing and protecting marine ecosystem. Existing expert-dominated numerical simulations fail to catch up with the dynamic variation caused by global warming and human activities. Besides, due to the high-cost data collection, the historical observations are severely sparse, leading to big challenge for precis… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: Accepted to ICML 2024

  37. arXiv:2405.04299  [pdf, other

    cs.CV

    ViewFormer: Exploring Spatiotemporal Modeling for Multi-View 3D Occupancy Perception via View-Guided Transformers

    Authors: Jinke Li, Xiao He, Chonghua Zhou, Xiaoqiang Cheng, Yang Wen, Dan Zhang

    Abstract: 3D occupancy, an advanced perception technology for driving scenarios, represents the entire scene without distinguishing between foreground and background by quantifying the physical space into a grid map. The widely adopted projection-first deformable attention, efficient in transforming image features into 3D representations, encounters challenges in aggregating multi-view features due to senso… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  38. arXiv:2405.04289  [pdf, ps, other

    cs.NE

    Direct Training High-Performance Deep Spiking Neural Networks: A Review of Theories and Methods

    Authors: Chenlin Zhou, Han Zhang, Liutao Yu, Yumin Ye, Zhaokun Zhou, Liwei Huang, Zhengyu Ma, Xiaopeng Fan, Huihui Zhou, Yonghong Tian

    Abstract: Spiking neural networks (SNNs) offer a promising energy-efficient alternative to artificial neural networks (ANNs), in virtue of their high biological plausibility, rich spatial-temporal dynamics, and event-driven computation. The direct training algorithms based on the surrogate gradient method provide sufficient flexibility to design novel SNN architectures and explore the spatial-temporal dynam… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 29 pages

  39. arXiv:2405.04032  [pdf, other

    cs.CR cs.AI

    Locally Differentially Private In-Context Learning

    Authors: Chunyan Zheng, Keke Sun, Wenhao Zhao, Haibo Zhou, Lixin Jiang, Shaoyang Song, Chunlai Zhou

    Abstract: Large pretrained language models (LLMs) have shown surprising In-Context Learning (ICL) ability. An important application in deploying large language models is to augment LLMs with a private database for some specific task. The main problem with this promising commercial use is that LLMs have been shown to memorize their training data and their prompt data are vulnerable to membership inference at… ▽ More

    Submitted 8 May, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

    Comments: This paper was published at LREC-Coling 2024

  40. arXiv:2405.03697  [pdf, other

    cs.HC

    GeoViz: A Multi-View Visualization Platform for Spatio-temporal Knowledge Graph

    Authors: Jianping Zhou, Junhao Li, Guanjie Zheng, Yunqiang Zhu, Xinbing Wang, Chenghu Zhou

    Abstract: In this paper, we propose a multi-view visualization technology for spatio-temporal knowledge graph(STKG), which utilizes three distinct perspectives: knowledge tree, knowledge net, and knowledge map, to facilitate a comprehensive analysis of the STKG. The knowledge tree enables the visualization of hierarchical interrelation within the STKG, while the knowledge net elucidates semantic relationshi… ▽ More

    Submitted 29 April, 2024; originally announced May 2024.

    Comments: 4 pages, 2 figures

  41. arXiv:2405.03091  [pdf

    cs.CV cs.LG

    Research on Image Recognition Technology Based on Multimodal Deep Learning

    Authors: Jinyin Wang, Xingchen Li, Yixuan Jin, Yihao Zhong, Keke Zhang, Chang Zhou

    Abstract: This project investigates the human multi-modal behavior identification algorithm utilizing deep neural networks. According to the characteristics of different modal information, different deep neural networks are used to adapt to different modal video information. Through the integration of various deep neural networks, the algorithm successfully identifies behaviors across multiple modalities. I… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

  42. arXiv:2405.02045  [pdf, other

    cs.HC

    Are We in The Zone? Exploring The Features and Method of Detecting Simultaneous Flow Experiences Based on EEG Signals

    Authors: Baiqiao Zhang, Xiangxian Li, Yunfan Zhou, Juan Liu, Weiying Liu, Chao Zhou, Yulong Bian

    Abstract: When executing interdependent personal tasks for the team's purpose, simultaneous individual flow(simultaneous flow) is the antecedent condition of achieving shared team flow. Detecting simultaneous flow helps better understanding the status of team members, which is thus important for optimizing multi-user interaction systems. However, there is currently a lack exploration on objective features a… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  43. arXiv:2405.01906  [pdf, other

    cs.AI cs.LG

    Instance-Conditioned Adaptation for Large-scale Generalization of Neural Combinatorial Optimization

    Authors: Changliang Zhou, Xi Lin, Zhenkun Wang, Xialiang Tong, Mingxuan Yuan, Qingfu Zhang

    Abstract: The neural combinatorial optimization (NCO) approach has shown great potential for solving routing problems without the requirement of expert knowledge. However, existing constructive NCO methods cannot directly solve large-scale instances, which significantly limits their application prospects. To address these crucial shortcomings, this work proposes a novel Instance-Conditioned Adaptation Model…

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: 17 pages, 6 figures

  44. arXiv:2405.00066  [pdf, other

    cs.CR cs.AI

    Research and application of artificial intelligence based webshell detection model: A literature review

    Authors: Mingrui Ma, Lansheng Han, Chunjie Zhou

    Abstract: Webshell, as the "culprit" behind numerous network attacks, is one of the research hotspots in the field of cybersecurity. However, the complexity, stealthiness, and confusing nature of webshells pose significant challenges to the corresponding detection schemes. With the rise of Artificial Intelligence (AI) technology, researchers have started to apply different intelligent algorithms and neural…

    Submitted 28 April, 2024; originally announced May 2024.

    Comments: 21 pages, 6 figures

  45. arXiv:2404.19563  [pdf, other

    cs.CL

    RepEval: Effective Text Evaluation with LLM Representation

    Authors: Shuqian Sheng, Yi Xu, Tianhang Zhang, Zanwei Shen, Luoyi Fu, Jiaxin Ding, Lei Zhou, Xinbing Wang, Chenghu Zhou

    Abstract: Automatic evaluation metrics for generated texts play an important role in the NLG field, especially with the rapid growth of LLMs. However, existing metrics are often limited to specific scenarios, making it challenging to meet the evaluation requirements of expanding LLM applications. Therefore, there is a demand for new, flexible, and effective metrics. In this study, we introduce RepEval, the…

    Submitted 30 April, 2024; originally announced April 2024.

  46. arXiv:2404.17685  [pdf

    cs.RO

    Localization Through Particle Filter Powered Neural Network Estimated Monocular Camera Poses

    Authors: Yi Shen, Hao Liu, Xinxin Liu, Wenjing Zhou, Chang Zhou, Yizhou Chen

    Abstract: The reduced cost and computational and calibration requirements of monocular cameras make them ideal positioning sensors for mobile robots, albeit at the expense of any meaningful depth measurement. Solutions proposed by some scholars to this localization problem involve fusing pose estimates from convolutional neural networks (CNNs) with pose estimates from geometric constraints on motion to gene…

    Submitted 26 April, 2024; originally announced April 2024.

  47. arXiv:2404.17379  [pdf

    cs.RO

    Adaptive speed planning for Unmanned Vehicle Based on Deep Reinforcement Learning

    Authors: Hao Liu, Yi Shen, Wenjing Zhou, Yuelin Zou, Chang Zhou, Shuyao He

    Abstract: In order to solve the problem of frequent deceleration of unmanned vehicles when approaching obstacles, this article uses a Deep Q-Network (DQN) and its extension, the Double Deep Q-Network (DDQN), to develop a local navigation system that adapts to obstacles while maintaining optimal speed planning. By integrating improved reward functions and obstacle angle determination methods, the system demo…

    Submitted 26 April, 2024; originally announced April 2024.

  48. arXiv:2404.17196  [pdf, other

    cs.CR cs.AI

    Human-Imperceptible Retrieval Poisoning Attacks in LLM-Powered Applications

    Authors: Quan Zhang, Binqi Zeng, Chijin Zhou, Gwihwan Go, Heyuan Shi, Yu Jiang

    Abstract: Presently, with the assistance of advanced LLM application development frameworks, more and more LLM-powered applications can effortlessly augment the LLMs' knowledge with external content using the retrieval augmented generation (RAG) technique. However, these frameworks' designs do not have sufficient consideration of the risk of external content, thereby allowing attackers to undermine the appl…

    Submitted 26 April, 2024; originally announced April 2024.

  49. arXiv:2404.17070  [pdf, other

    cs.RO

    Deep Reinforcement Learning for Bipedal Locomotion: A Brief Survey

    Authors: Lingfan Bao, Joseph Humphreys, Tianhu Peng, Chengxu Zhou

    Abstract: Bipedal robots are garnering increasing global attention due to their potential applications and advancements in artificial intelligence, particularly in Deep Reinforcement Learning (DRL). While DRL has driven significant progress in bipedal locomotion, developing a comprehensive and unified framework capable of adeptly performing a wide range of tasks remains a challenge. This survey systematical…

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 14 pages, 4 figures

  50. arXiv:2404.16304  [pdf, other

    cs.CV

    BezierFormer: A Unified Architecture for 2D and 3D Lane Detection

    Authors: Zhiwei Dong, Xi Zhu, Xiya Cao, Ran Ding, Wei Li, Caifa Zhou, Yongliang Wang, Qiangbo Liu

    Abstract: Lane detection has made significant progress in recent years, but there is no unified architecture for its two sub-tasks: 2D lane detection and 3D lane detection. To fill this gap, we introduce BézierFormer, a unified 2D and 3D lane detection architecture based on Bézier curve lane representation. BézierFormer formulates queries as Bézier control points and incorporates a novel Bézier curve atten…

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: ICME 2024, 11 pages, 8 figures