Showing 1–50 of 97 results for author: Du, Q

Searching in archive cs.
  1. arXiv:2408.14158  [pdf, other]

    cs.DC cs.AI

    Fire-Flyer AI-HPC: A Cost-Effective Software-Hardware Co-Design for Deep Learning

    Authors: Wei An, Xiao Bi, Guanting Chen, Shanhuang Chen, Chengqi Deng, Honghui Ding, Kai Dong, Qiushi Du, Wenjun Gao, Kang Guan, Jianzhong Guo, Yongqiang Guo, Zhe Fu, Ying He, Panpan Huang, Jiashi Li, Wenfeng Liang, Xiaodong Liu, Xin Liu, Yiyuan Liu, Yuxuan Liu, Shanghao Lu, Xuan Lu, Xiaotao Nie, Tian Pei, et al. (27 additional authors not shown)

    Abstract: The rapid progress in Deep Learning (DL) and Large Language Models (LLMs) has exponentially increased demands of computational power and bandwidth. This, combined with the high costs of faster computing chips and interconnects, has significantly inflated High Performance Computing (HPC) construction costs. To address these challenges, we introduce the Fire-Flyer AI-HPC architecture, a synergistic…

    Submitted 31 August, 2024; v1 submitted 26 August, 2024; originally announced August 2024.

    Comments: This is the preprint version of the paper accepted for presentation at the 2024 International Conference for High Performance Computing, Networking, Storage, and Analysis (SC'24). © 2024 IEEE. Personal use of this material is permitted. For other uses, permission from IEEE must be obtained. Please refer to IEEE Xplore for the final published version.

  2. arXiv:2408.12232  [pdf, other]

    cs.CV

    BihoT: A Large-Scale Dataset and Benchmark for Hyperspectral Camouflaged Object Tracking

    Authors: Hanzheng Wang, Wei Li, Xiang-Gen Xia, Qian Du

    Abstract: Hyperspectral object tracking (HOT) has exhibited potential in various applications, particularly in scenes where objects are camouflaged. Existing trackers can effectively retrieve objects via band regrouping because of the bias in existing HOT datasets, where most objects tend to have distinguishing visual appearances rather than spectral characteristics. This bias allows the tracker to directly…

    Submitted 22 August, 2024; originally announced August 2024.

  3. arXiv:2408.12109  [pdf, other]

    cs.CV cs.CL

    RoVRM: A Robust Visual Reward Model Optimized via Auxiliary Textual Preference Data

    Authors: Chenglong Wang, Yang Gan, Yifu Huo, Yongyu Mu, Murun Yang, Qiaozhi He, Tong Xiao, Chunliang Zhang, Tongran Liu, Quan Du, Di Yang, Jingbo Zhu

    Abstract: Large vision-language models (LVLMs) often fail to align with human preferences, leading to issues like generating misleading content without proper visual context (also known as hallucination). A promising solution to this problem is using human-preference alignment techniques, such as best-of-n sampling and reinforcement learning. However, these techniques face the difficulty arising from the sc…

    Submitted 21 August, 2024; originally announced August 2024.

  4. arXiv:2408.08152  [pdf, other]

    cs.CL cs.AI cs.LG cs.LO

    DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

    Authors: Huajian Xin, Z. Z. Ren, Junxiao Song, Zhihong Shao, Wanjia Zhao, Haocheng Wang, Bo Liu, Liyue Zhang, Xuan Lu, Qiushi Du, Wenjun Gao, Qihao Zhu, Dejian Yang, Zhibin Gou, Z. F. Wu, Fuli Luo, Chong Ruan

    Abstract: We introduce DeepSeek-Prover-V1.5, an open-source language model designed for theorem proving in Lean 4, which enhances DeepSeek-Prover-V1 by optimizing both training and inference processes. Pre-trained on DeepSeekMath-Base with specialization in formal mathematical languages, the model undergoes supervised fine-tuning using an enhanced formal theorem proving dataset derived from DeepSeek-Prover-…

    Submitted 15 August, 2024; originally announced August 2024.

  5. arXiv:2408.00422  [pdf, ps, other]

    math.FA cs.SI math.CO math.PR

    Ginzburg--Landau Functionals in the Large-Graph Limit

    Authors: Edith Zhang, James Scott, Qiang Du, Mason A. Porter

    Abstract: Ginzburg--Landau (GL) functionals on graphs, which are relaxations of graph-cut functionals on graphs, have yielded a variety of insights in image segmentation and graph clustering. In this paper, we study large-graph limits of GL functionals by taking a functional-analytic view of graphs as nonlocal kernels. For a graph $W_n$ with $n$ nodes, the corresponding graph GL functional $\GL^{W_n}_\ep$ i…

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: 37 pages

  6. arXiv:2406.16087  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Imperative Learning: A Self-supervised Neural-Symbolic Learning Framework for Robot Autonomy

    Authors: Chen Wang, Kaiyi Ji, Junyi Geng, Zhongqiang Ren, Taimeng Fu, Fan Yang, Yifan Guo, Haonan He, Xiangyu Chen, Zitong Zhan, Qiwei Du, Shaoshu Su, Bowen Li, Yuheng Qiu, Yi Du, Qihang Li, Yifan Yang, Xiao Lin, Zhipeng Zhao

    Abstract: Data-driven methods such as reinforcement and imitation learning have achieved remarkable success in robot autonomy. However, their data-centric nature still hinders them from generalizing well to ever-changing environments. Moreover, collecting large datasets for robotic tasks is often impractical and expensive. To overcome these challenges, we introduce a new self-supervised neural-symbolic (NeS…

    Submitted 6 August, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

  7. arXiv:2406.11931  [pdf, other]

    cs.SE cs.AI cs.LG

    DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

    Authors: DeepSeek-AI, Qihao Zhu, Daya Guo, Zhihong Shao, Dejian Yang, Peiyi Wang, Runxin Xu, Y. Wu, Yukun Li, Huazuo Gao, Shirong Ma, Wangding Zeng, Xiao Bi, Zihui Gu, Hanwei Xu, Damai Dai, Kai Dong, Liyue Zhang, Yishi Piao, Zhibin Gou, Zhenda Xie, Zhewen Hao, Bingxuan Wang, Junxiao Song, Deli Chen, et al. (15 additional authors not shown)

    Abstract: We present DeepSeek-Coder-V2, an open-source Mixture-of-Experts (MoE) code language model that achieves performance comparable to GPT4-Turbo in code-specific tasks. Specifically, DeepSeek-Coder-V2 is further pre-trained from an intermediate checkpoint of DeepSeek-V2 with additional 6 trillion tokens. Through this continued pre-training, DeepSeek-Coder-V2 substantially enhances the coding and mathe…

    Submitted 17 June, 2024; originally announced June 2024.

  8. arXiv:2406.10469  [pdf, other]

    eess.IV cs.CV cs.MM

    Object-Attribute-Relation Representation based Video Semantic Communication

    Authors: Qiyuan Du, Yiping Duan, Qianqian Yang, Xiaoming Tao, Mérouane Debbah

    Abstract: With the rapid growth of multimedia data volume, there is an increasing need for efficient video transmission in applications such as virtual reality and future video streaming services. Semantic communication is emerging as a vital technique for ensuring efficient and reliable transmission in low-bandwidth, high-noise settings. However, most current approaches focus on joint source-channel coding…

    Submitted 14 June, 2024; originally announced June 2024.

  9. arXiv:2406.04705  [pdf]

    cs.CR

    EAIA: An Efficient and Anonymous Identity Authentication Scheme in 5G-V2V

    Authors: Qianmin Du, Jianhong Zhou, Maode Ma

    Abstract: Vehicle Ad-hoc Networks (VANETs) have experienced significant development in recent years, playing a crucial role in enhancing the driving experience by enabling safer and more efficient inter-vehicle interactions through information exchange. Vehicle-to-vehicle (V2V) communication is particularly vital as it not only helps to prevent collisions and improve traffic efficiency but also provides ess…

    Submitted 7 June, 2024; originally announced June 2024.

  10. arXiv:2405.13860  [pdf, other]

    cs.CV

    MAGIC: Map-Guided Few-Shot Audio-Visual Acoustics Modeling

    Authors: Diwei Huang, Kunyang Lin, Peihao Chen, Qing Du, Mingkui Tan

    Abstract: Few-shot audio-visual acoustics modeling seeks to synthesize the room impulse response in arbitrary locations with few-shot observations. To sufficiently exploit the provided few-shot data for accurate acoustic modeling, we present a *map-guided* framework by constructing acoustic-related visual semantic feature maps of the scenes. Visual features preserve semantic details related to sound and map…

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 17 pages, 12 pages for main paper, 5 pages for supplementary

  11. arXiv:2405.04434  [pdf, other]

    cs.CL cs.AI

    DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

    Authors: DeepSeek-AI, Aixin Liu, Bei Feng, Bin Wang, Bingxuan Wang, Bo Liu, Chenggang Zhao, Chengqi Deng, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Hanwei Xu, Hao Yang, Haowei Zhang, Honghui Ding, et al. (132 additional authors not shown)

    Abstract: We present DeepSeek-V2, a strong Mixture-of-Experts (MoE) language model characterized by economical training and efficient inference. It comprises 236B total parameters, of which 21B are activated for each token, and supports a context length of 128K tokens. DeepSeek-V2 adopts innovative architectures including Multi-head Latent Attention (MLA) and DeepSeekMoE. MLA guarantees efficient inference…

    Submitted 19 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  12. Social Force Embedded Mixed Graph Convolutional Network for Multi-class Trajectory Prediction

    Authors: Quancheng Du, Xiao Wang, Shouguo Yin, Lingxi Li, Huansheng Ning

    Abstract: Accurate prediction of agent motion trajectories is crucial for autonomous driving, contributing to the reduction of collision risks in human-vehicle interactions and ensuring ample response time for other traffic participants. Current research predominantly focuses on traditional deep learning methods, including convolutional neural networks (CNNs) and recurrent neural networks (RNNs). These meth…

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: 11 pages, 3 figures, published in IEEE Transactions on Intelligent Vehicles

  13. S4TP: Social-Suitable and Safety-Sensitive Trajectory Planning for Autonomous Vehicles

    Authors: Xiao Wang, Ke Tang, Xingyuan Dai, Jintao Xu, Quancheng Du, Rui Ai, Yuxiao Wang, Weihao Gu

    Abstract: In public roads, autonomous vehicles (AVs) face the challenge of frequent interactions with human-driven vehicles (HDVs), which render uncertain driving behavior due to varying social characteristics among humans. To effectively assess the risks prevailing in the vicinity of AVs in social interactive traffic scenarios and achieve safe autonomous driving, this article proposes a social-suitable and…

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 12 pages, 4 figures, published in IEEE Transactions on Intelligent Vehicles

  14. arXiv:2404.11326  [pdf, other]

    cs.CV

    Single-temporal Supervised Remote Change Detection for Domain Generalization

    Authors: Qiangang Du, Jinlong Peng, Xu Chen, Qingdong He, Liren He, Qiang Nie, Wenbing Zhu, Mingmin Chi, Yabiao Wang, Chengjie Wang

    Abstract: Change detection is widely applied in remote sensing image analysis. Existing methods require training models separately for each dataset, which leads to poor domain generalization. Moreover, these methods rely heavily on large amounts of high-quality pair-labelled data for training, which is expensive and impractical. In this paper, we propose a multimodal contrastive learning (ChangeCLIP) based…

    Submitted 23 April, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

  15. arXiv:2404.11318  [pdf, other]

    cs.CV

    Leveraging Fine-Grained Information and Noise Decoupling for Remote Sensing Change Detection

    Authors: Qiangang Du, Jinlong Peng, Changan Wang, Xu Chen, Qingdong He, Wenbing Zhu, Mingmin Chi, Yabiao Wang, Chengjie Wang

    Abstract: Change detection aims to identify remote sense object changes by analyzing data between bitemporal image pairs. Due to the large temporal and spatial span of data collection in change detection image pairs, there are often a significant amount of task-specific and task-agnostic noise. Previous effort has focused excessively on denoising, with this goes a great deal of loss of fine-grained informat…

    Submitted 21 June, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

  16. arXiv:2403.12582  [pdf, other]

    cs.CL

    AlphaFin: Benchmarking Financial Analysis with Retrieval-Augmented Stock-Chain Framework

    Authors: Xiang Li, Zhenyu Li, Chen Shi, Yong Xu, Qing Du, Mingkui Tan, Jun Huang, Wei Lin

    Abstract: The task of financial analysis primarily encompasses two key areas: stock trend prediction and the corresponding financial question answering. Currently, machine learning and deep learning algorithms (ML&DL) have been widely applied for stock trend predictions, leading to significant progress. However, these methods fail to provide reasons for predictions, lacking interpretability and reasoning pr…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: COLING 2024. The first three authors contributed equally. Project website: https://github.com/AlphaFin-proj/AlphaFin

  17. arXiv:2403.11561  [pdf, other]

    cs.CV

    Learning Unified Reference Representation for Unsupervised Multi-class Anomaly Detection

    Authors: Liren He, Zhengkai Jiang, Jinlong Peng, Liang Liu, Qiangang Du, Xiaobin Hu, Wenbing Zhu, Mingmin Chi, Yabiao Wang, Chengjie Wang

    Abstract: In the field of multi-class anomaly detection, reconstruction-based methods derived from single-class anomaly detection face the well-known challenge of "learning shortcuts", wherein the model fails to learn the patterns of normal samples as it should, opting instead for shortcuts such as identity mapping or artificial noise elimination. Consequently, the model becomes unable to reconstruct genuin…

    Submitted 16 July, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted by ECCV 2024

  18. arXiv:2403.10067  [pdf, other]

    eess.IV cs.CV

    Hybrid Convolutional and Attention Network for Hyperspectral Image Denoising

    Authors: Shuai Hu, Feng Gao, Xiaowei Zhou, Junyu Dong, Qian Du

    Abstract: Hyperspectral image (HSI) denoising is critical for the effective analysis and interpretation of hyperspectral data. However, simultaneously modeling global and local features is rarely explored to enhance HSI denoising. In this letter, we propose a hybrid convolution and attention network (HCANet), which leverages both the strengths of convolution neural networks (CNNs) and Transformers. To enhan…

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: IEEE GRSL 2024

  19. arXiv:2403.05852  [pdf, other]

    cs.CV

    SSF-Net: Spatial-Spectral Fusion Network with Spectral Angle Awareness for Hyperspectral Object Tracking

    Authors: Hanzheng Wang, Wei Li, Xiang-Gen Xia, Qian Du, Jing Tian

    Abstract: Hyperspectral video (HSV) offers valuable spatial, spectral, and temporal information simultaneously, making it highly suitable for handling challenges such as background clutter and visual similarity in object tracking. However, existing methods primarily focus on band regrouping and rely on RGB trackers for feature extraction, resulting in limited exploration of spectral information and difficul…

    Submitted 9 March, 2024; originally announced March 2024.

  20. LightSword: A Customized Virtual Reality Exergame for Long-Term Cognitive Inhibition Training in Older Adults

    Authors: Qiuxin Du, Zhen Song, Haiyan Jiang, Xiaoying Wei, Dongdong Weng, Mingming Fan

    Abstract: The decline of cognitive inhibition significantly impacts older adults' quality of life and well-being, making it a vital public health problem in today's aging society. Previous research has demonstrated that Virtual reality (VR) exergames have great potential to enhance cognitive inhibition among older adults. However, existing commercial VR exergames were unsuitable for older adults' long-term…

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 23 pages

    Journal ref: Proceedings of the CHI Conference on Human Factors in Computing Systems 2024 (CHI '24)

  21. arXiv:2402.05123  [pdf, ps, other]

    cs.CL

    A Survey on Data Selection for LLM Instruction Tuning

    Authors: Jiahao Wang, Bolin Zhang, Qianlong Du, Jiajun Zhang, Dianhui Chu

    Abstract: Instruction tuning is a vital step of training large language models (LLM), so how to enhance the effect of instruction tuning has received increased attention. Existing works indicate that the quality of the dataset is more crucial than the quantity during instruction tuning of LLM. Therefore, recently a lot of studies focus on exploring the methods of selecting high-quality subset from instructi…

    Submitted 4 February, 2024; originally announced February 2024.

  22. arXiv:2401.02954  [pdf, other]

    cs.CL cs.AI cs.LG

    DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

    Authors: DeepSeek-AI, Xiao Bi, Deli Chen, Guanting Chen, Shanhuang Chen, Damai Dai, Chengqi Deng, Honghui Ding, Kai Dong, Qiushi Du, Zhe Fu, Huazuo Gao, Kaige Gao, Wenjun Gao, Ruiqi Ge, Kang Guan, Daya Guo, Jianzhong Guo, Guangbo Hao, Zhewen Hao, Ying He, Wenjie Hu, Panpan Huang, Erhang Li, et al. (63 additional authors not shown)

    Abstract: The rapid development of open-source large language models (LLMs) has been truly remarkable. However, the scaling law described in previous literature presents varying conclusions, which casts a dark cloud over scaling LLMs. We delve into the study of scaling laws and present our distinctive findings that facilitate scaling of large scale models in two commonly used open-source configurations, 7B…

    Submitted 5 January, 2024; originally announced January 2024.

  23. arXiv:2312.08926   

    cs.AI cs.CL

    Modeling Complex Mathematical Reasoning via Large Language Model based MathAgent

    Authors: Haoran Liao, Qinyi Du, Shaohua Hu, Hao He, Yanyan Xu, Jidong Tian, Yaohui Jin

    Abstract: Large language models (LLMs) face challenges in solving complex mathematical problems that require comprehensive capacities to parse the statements, associate domain knowledge, perform compound logical reasoning, and integrate the intermediate rationales. Tackling all these problems once could be arduous for LLMs, thus leading to confusion in generation. In this work, we explore the potential of e…

    Submitted 16 December, 2023; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: There are unfair comparisons on miniF2F. This will be fixed in the future

  24. arXiv:2311.15653  [pdf, other]

    cs.CL

    MoDS: Model-oriented Data Selection for Instruction Tuning

    Authors: Qianlong Du, Chengqing Zong, Jiajun Zhang

    Abstract: Instruction tuning has become the de facto method to equip large language models (LLMs) with the ability of following user instructions. Usually, hundreds of thousands or millions of instruction-following pairs are employed to fine-tune the foundation LLMs. Recently, some studies show that a small number of high-quality instruction data is enough. However, how to select appropriate instruction dat…

    Submitted 27 November, 2023; originally announced November 2023.

  25. arXiv:2311.04442  [pdf, other]

    eess.IV cs.CV

    SS-MAE: Spatial-Spectral Masked Auto-Encoder for Multi-Source Remote Sensing Image Classification

    Authors: Junyan Lin, Feng Gao, Xiaocheng Shi, Junyu Dong, Qian Du

    Abstract: Masked image modeling (MIM) is a highly popular and effective self-supervised learning method for image understanding. Existing MIM-based methods mostly focus on spatial feature modeling, neglecting spectral feature modeling. Meanwhile, existing MIM-based methods use Transformer for feature extraction, some local or high-frequency information may get lost. To this end, we propose a spatial-spectra…

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: IEEE TGRS 2023

  26. arXiv:2311.01149  [pdf, other]

    cs.CL

    ChineseWebText: Large-scale High-quality Chinese Web Text Extracted with Effective Evaluation Model

    Authors: Jianghao Chen, Pu Jian, Tengxiao Xi, Dongyi Yi, Qianlong Du, Chenglin Ding, Guibo Zhu, Chengqing Zong, Jinqiao Wang, Jiajun Zhang

    Abstract: During the development of large language models (LLMs), the scale and quality of the pre-training data play a crucial role in shaping LLMs' capabilities. To accelerate the research of LLMs, several large-scale datasets, such as C4 [1], Pile [2], RefinedWeb [3] and WanJuan [4], have been released to the public. However, most of the released corpus focus mainly on English, and there is still lack of…

    Submitted 10 November, 2023; v1 submitted 2 November, 2023; originally announced November 2023.

  27. arXiv:2309.12010  [pdf, other]

    eess.IV cs.CV

    Convolution and Attention Mixer for Synthetic Aperture Radar Image Change Detection

    Authors: Haopeng Zhang, Zijing Lin, Feng Gao, Junyu Dong, Qian Du, Heng-Chao Li

    Abstract: Synthetic aperture radar (SAR) image change detection is a critical task and has received increasing attentions in the remote sensing community. However, existing SAR change detection methods are mainly based on convolutional neural networks (CNNs), with limited consideration of global attention mechanism. In this letter, we explore Transformer-like architecture for SAR change detection to incorpo…

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: Accepted by IEEE GRSL

  28. arXiv:2308.13906  [pdf, other]

    eess.SP cs.LG

    A Two-Dimensional Deep Network for RF-based Drone Detection and Identification Towards Secure Coverage Extension

    Authors: Zixiao Zhao, Qinghe Du, Xiang Yao, Lei Lu, Shijiao Zhang

    Abstract: As drones become increasingly prevalent in human life, they also raises security concerns such as unauthorized access and control, as well as collisions and interference with manned aircraft. Therefore, ensuring the ability to accurately detect and identify between different drones holds significant implications for coverage extension. Assisted by machine learning, radio frequency (RF) detection c…

    Submitted 26 August, 2023; originally announced August 2023.

  29. arXiv:2308.04386  [pdf, other]

    cs.CL

    Learning Evaluation Models from Large Language Models for Sequence Generation

    Authors: Chenglong Wang, Hang Zhou, Kaiyan Chang, Tongran Liu, Chunliang Zhang, Quan Du, Tong Xiao, Jingbo Zhu

    Abstract: Large language models achieve state-of-the-art performance on sequence generation evaluation, but typically have a large number of parameters. This is a computational challenge as presented by applying their evaluation capability at scale. To overcome the challenge, in this paper, we propose ECT, an evaluation capability transfer method, to transfer the evaluati…

    Submitted 8 August, 2023; originally announced August 2023.

  30. arXiv:2305.12649  [pdf, other]

    cs.CV

    Imbalance-Agnostic Source-Free Domain Adaptation via Avatar Prototype Alignment

    Authors: Hongbin Lin, Mingkui Tan, Yifan Zhang, Zhen Qiu, Shuaicheng Niu, Dong Liu, Qing Du, Yanxia Liu

    Abstract: Source-free Unsupervised Domain Adaptation (SF-UDA) aims to adapt a well-trained source model to an unlabeled target domain without access to the source data. One key challenge is the lack of source data during domain adaptation. To handle this, we propose to mine the hidden knowledge of the source model and exploit it to generate source avatar prototypes. To this end, we propose a Contrastive Pro…

    Submitted 21 May, 2023; originally announced May 2023.

    Comments: arXiv admin note: text overlap with arXiv:2106.15326

  31. arXiv:2304.09376  [pdf, other]

    cs.LG cs.CV eess.IV

    Physical Knowledge Enhanced Deep Neural Network for Sea Surface Temperature Prediction

    Authors: Yuxin Meng, Feng Gao, Eric Rigall, Ran Dong, Junyu Dong, Qian Du

    Abstract: Traditionally, numerical models have been deployed in oceanography studies to simulate ocean dynamics by representing physical equations. However, many factors pertaining to ocean dynamics seem to be ill-defined. We argue that transferring physical knowledge from observed data could further improve the accuracy of numerical models when predicting Sea Surface Temperature (SST). Recently, the advanc…

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: IEEE TGRS 2023

  32. arXiv:2304.09373  [pdf, other]

    eess.IV cs.CV

    Multi-scale Adaptive Fusion Network for Hyperspectral Image Denoising

    Authors: Haodong Pan, Feng Gao, Junyu Dong, Qian Du

    Abstract: Removing the noise and improving the visual quality of hyperspectral images (HSIs) is challenging in academia and industry. Great efforts have been made to leverage local, global or spectral context information for HSI denoising. However, existing methods still have limitations in feature interaction exploitation among multiple scales and rich spectral structure preservation. In view of this, we p…

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: IEEE JSTARS 2023, code at: https://github.com/summitgao/MAFNet

  33. arXiv:2303.14736  [pdf, other]

    cs.CV

    Disentangling Writer and Character Styles for Handwriting Generation

    Authors: Gang Dai, Yifan Zhang, Qingfeng Wang, Qing Du, Zhuliang Yu, Zhuoman Liu, Shuangping Huang

    Abstract: Training machines to synthesize diverse handwritings is an intriguing task. Recently, RNN-based methods have been proposed to generate stylized online Chinese characters. However, these methods mainly focus on capturing a person's overall writing style, neglecting subtle style inconsistencies between characters written by the same person. For example, while a person's handwriting typically exhibit…

    Submitted 31 March, 2023; v1 submitted 26 March, 2023; originally announced March 2023.

    Comments: accepted by CVPR 2023. Source code: https://github.com/dailenson/SDT

  34. Statistical Age-of-Information Optimization for Status Update over Multi-State Fading Channels

    Authors: Yuquan Xiao, Qinghe Du

    Abstract: Age of information (AoI) is a powerful metric to evaluate the freshness of information, where minimization of average statistics, such as the average AoI and average peak AoI, currently prevails in guiding freshness optimization for related applications. Although minimizing the statistics does improve the received information's freshness for status update systems in the sense of average, the time-…

    Submitted 27 November, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

    Comments: This paper has been accepted by IEEE Transactions on Vehicular Technology

  35. arXiv:2302.14764  [pdf, other]

    eess.SP cs.NI

    Robust Secrecy via Aerial Reflection and Jamming: Joint Optimization of Deployment and Transmission

    Authors: Xiao Tang, Hongliang He, Limeng Dong, Lixin Li, Qinghe Du, Zhu Han

    Abstract: Reconfigurable intelligent surfaces (RISs) are recognized with great potential to strengthen wireless security, yet the performance gain largely depends on the deployment location of RISs in the network topology. In this paper, we consider the anti-eavesdropping communication established through a RIS at a fixed location, as well as an aerial platform mounting another RIS and a friendly jammer to…

    Submitted 28 February, 2023; originally announced February 2023.

    Comments: 14 pages, 10 figures, accepted at IEEE IoTJ

  36. arXiv:2301.08851  [pdf, other]

    cs.SE

    LWS: A Framework for Log-based Workload Simulation in Session-based SUT

    Authors: Yongqi Han, Qingfeng Du, Jincheng Xu, Shengjie Zhao, Zhekang Chen, Li Cao, Kanglin Yin, Dan Pei

    Abstract: Artificial intelligence for IT Operations (AIOps) plays a critical role in operating and managing cloud-native systems and microservice-based applications but is limited by the lack of high-quality datasets with diverse scenarios. Realistic workloads are the premise and basis of generating such AIOps datasets, with the session-based workload being one of the most typical examples. Due to privacy c…

    Submitted 27 April, 2023; v1 submitted 20 January, 2023; originally announced January 2023.

  37. arXiv:2301.06530  [pdf, other]

    cs.SE

    KEWS: A KPIs-Based Evaluation Framework of Workload Simulation On Microservice System

    Authors: Pengsheng Li, Qingfeng Du, Shengjie Zhao

    Abstract: Simulating the workload is an essential procedure in microservice systems as it helps augment realistic workloads whilst safeguarding user privacy. The efficacy of such simulation depends on its dynamic assessment. The straightforward and most efficient approach to this is comparing the original workload with the simulated one using Key Performance Indicators (KPIs), which capture the state of the…

    Submitted 27 November, 2023; v1 submitted 16 January, 2023; originally announced January 2023.

  38. Nearest Neighbor-Based Contrastive Learning for Hyperspectral and LiDAR Data Classification

    Authors: Meng Wang, Feng Gao, Junyu Dong, Heng-Chao Li, Qian Du

    Abstract: The joint hyperspectral image (HSI) and LiDAR data classification aims to interpret ground objects at more detailed and precise level. Although deep learning methods have shown remarkable success in the multisource data classification task, self-supervised learning has rarely been explored. It is commonly nontrivial to build a robust self-supervised learning model for multisource data classificati…

    Submitted 9 January, 2023; originally announced January 2023.

    Comments: IEEE TGRS 2023

  39. SuperYOLO: Super Resolution Assisted Object Detection in Multimodal Remote Sensing Imagery

    Authors: Jiaqing Zhang, Jie Lei, Weiying Xie, Zhenman Fang, Yunsong Li, Qian Du

    Abstract: Accurately and timely detecting multiscale small objects that contain tens of pixels from remote sensing images (RSI) remains challenging. Most of the existing solutions primarily design complex deep neural networks to learn strong feature representations for objects separated from the background, which often results in a heavy computation burden. In this article, we propose an accurate yet fast o…

    Submitted 8 April, 2023; v1 submitted 27 September, 2022; originally announced September 2022.

    Comments: The article is accepted by IEEE Transactions on Geoscience and Remote Sensing

  40. Single-source Domain Expansion Network for Cross-Scene Hyperspectral Image Classification

    Authors: Yuxiang Zhang, Wei Li, Weidong Sun, Ran Tao, Qian Du

    Abstract: Currently, cross-scene hyperspectral image (HSI) classification has drawn increasing attention. It is necessary to train a model only on source domain (SD) and directly transferring the model to target domain (TD), when TD needs to be processed in real time and cannot be reused for training. Based on the idea of domain generalization, a Single-source Domain Expansion Network (SDEnet) is developed…

    Submitted 4 September, 2022; originally announced September 2022.

  41. arXiv:2208.13726  [pdf, other]

    cs.IT

    Improved Grant-Free Access for URLLC via Multi-Tier-Driven Computing: Network-Load Learning, Prediction, and Resource Allocation

    Authors: Zixiao Zhao, Qinghe Du, George K. Karagiannidis

    Abstract: Grant-Free (GF) access has been recognized as a promising candidate for Ultra-Reliable and Low-Latency Communications (URLLC). However, even with GF access, URLLC still may not effectively achieve high reliability and millisecond-level latency simultaneously. This is because the network load is typically time-varying and not known to the base station (BS), and thus, the resource allocated for GF acce…

    Submitted 22 August, 2022; originally announced August 2022.

  42. Synthetic Aperture Radar Image Change Detection via Layer Attention-Based Noise-Tolerant Network

    Authors: Desen Meng, Feng Gao, Junyu Dong, Qian Du, Heng-Chao Li

    Abstract: Recently, change detection methods for synthetic aperture radar (SAR) images based on convolutional neural networks (CNN) have gained increasing research attention. However, existing CNN-based methods neglect the interactions among multilayer convolutions, and errors involved in the preclassification restrict the network optimization. To this end, we propose a layer attention-based noise-tolerant…

    Submitted 8 August, 2022; originally announced August 2022.

    Comments: Accepted by IEEE Geoscience and Remote Sensing Letters (GRSL) 2022, code is available at https://github.com/summitgao/LANTNet

  43. arXiv:2208.04094  [pdf, other]

    cs.CV cs.AI

    Towards Semantic Communications: Deep Learning-Based Image Semantic Coding

    Authors: Danlan Huang, Feifei Gao, Xiaoming Tao, Qiyuan Du, Jianhua Lu

    Abstract: Semantic communications has received growing interest since it can remarkably reduce the amount of data to be transmitted without missing critical information. Most existing works explore the semantic encoding and transmission for text and apply techniques in Natural Language Processing (NLP) to interpret the meaning of the text. In this paper, we conceive the semantic communications for image dat…

    Submitted 8 August, 2022; originally announced August 2022.

  44. arXiv:2206.05641  [pdf, ps, other]

    cs.CV cs.LG eess.IV

    An Unsupervised Deep-Learning Method for Bone Age Assessment

    Authors: Hao Zhu, Wan-Jing Nie, Yue-Jie Hou, Qi-Meng Du, Si-Jing Li, Chi-Chun Zhou

    Abstract: The bone age, reflecting the degree of development of the bones, can be used to predict the adult height and detect endocrine diseases of children. Both examinations of radiologists and variability of operators have a significant impact on bone age assessment. To decrease human intervention, machine learning algorithms are used to assess the bone age automatically. However, conventional supervise…

    Submitted 11 June, 2022; originally announced June 2022.

  45. arXiv:2205.09933  [pdf, other]

    cs.CV eess.IV

    Hyperspectral Unmixing Based on Nonnegative Matrix Factorization: A Comprehensive Review

    Authors: Xin-Ru Feng, Heng-Chao Li, Rui Wang, Qian Du, Xiuping Jia, Antonio Plaza

    Abstract: Hyperspectral unmixing has been an important technique that estimates a set of endmembers and their corresponding abundances from a hyperspectral image (HSI). Nonnegative matrix factorization (NMF) plays an increasingly significant role in solving this problem. In this article, we present a comprehensive survey of the NMF-based methods proposed for hyperspectral unmixing. Taking the NMF model as a…

    Submitted 19 May, 2022; originally announced May 2022.

  46. arXiv:2204.05823  [pdf, other]

    cs.CV cs.AI

    Adaptive Cross-Attention-Driven Spatial-Spectral Graph Convolutional Network for Hyperspectral Image Classification

    Authors: Jin-Yu Yang, Heng-Chao Li, Wen-Shuai Hu, Lei Pan, Qian Du

    Abstract: Recently, graph convolutional networks (GCNs) have been developed to explore the spatial relationships between pixels, achieving better classification performance on hyperspectral images (HSIs). However, these methods fail to sufficiently leverage the relationships between spectral bands in HSI data. As such, we propose an adaptive cross-attention-driven spatial-spectral graph convolutional network (ACS…

    Submitted 12 April, 2022; originally announced April 2022.

  47. A3CLNN: Spatial, Spectral and Multiscale Attention ConvLSTM Neural Network for Multisource Remote Sensing Data Classification

    Authors: Heng-Chao Li, Wen-Shuai Hu, Wei Li, Jun Li, Qian Du, Antonio Plaza

    Abstract: The problem of effectively exploiting the information from multiple data sources has become a relevant but challenging research topic in remote sensing. In this paper, we propose a new approach to exploit the complementarity of two data sources: hyperspectral images (HSIs) and light detection and ranging (LiDAR) data. Specifically, we develop a new dual-channel spatial, spectral and multiscale attentio…

    Submitted 9 April, 2022; originally announced April 2022.

    Comments: 16 pages, 10 figures

    Journal ref: IEEE Transactions on Neural Networks and Learning Systems, vol. 33, no. 2, pp. 747-761, Feb. 2022

  48. MS-HLMO: Multi-scale Histogram of Local Main Orientation for Remote Sensing Image Registration

    Authors: Chenzhong Gao, Wei Li, Ran Tao, Qian Du

    Abstract: Multi-source image registration is challenging due to intensity, rotation, and scale differences among the images. Considering the characteristics and differences of multi-source remote sensing images, a feature-based registration algorithm named Multi-scale Histogram of Local Main Orientation (MS-HLMO) is proposed. Harris corner detection is first adopted to generate feature points. The HLMO feat…

    Submitted 1 April, 2022; originally announced April 2022.

  49. arXiv:2203.12856  [pdf, other]

    cs.CV cs.LG

    Beyond Fixation: Dynamic Window Visual Transformer

    Authors: Pengzhen Ren, Changlin Li, Guangrun Wang, Yun Xiao, Qing Du, Xiaodan Liang, Xiaojun Chang

    Abstract: Recently, there has been a surge of interest in visual transformers that reduce the computational cost by limiting the calculation of self-attention to a local window. Most current work uses a fixed single-scale window for modeling by default, ignoring the impact of window size on model performance. However, this may limit the modeling potential of these window-based models for multi-scale information. In this…

    Submitted 8 April, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

    Journal ref: CVPR2022

  50. arXiv:2203.09176  [pdf, other]

    cs.CL

    ODE Transformer: An Ordinary Differential Equation-Inspired Model for Sequence Generation

    Authors: Bei Li, Quan Du, Tao Zhou, Yi Jing, Shuhan Zhou, Xin Zeng, Tong Xiao, JingBo Zhu, Xuebo Liu, Min Zhang

    Abstract: Residual networks are an Euler discretization of solutions to Ordinary Differential Equations (ODE). This paper explores a deeper relationship between Transformer and numerical ODE methods. We first show that a residual block of layers in Transformer can be described as a higher-order solution to ODE. Inspired by this, we design a new architecture, ODE Transformer, which is analogous to the…

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: Long paper accepted by ACL2022 main conference. arXiv admin note: substantial text overlap with arXiv:2104.02308