Showing 1–48 of 48 results for author: Qian, D

Searching in archive cs.
  1. arXiv:2407.03177  [pdf, other]

    cs.HC eess.SP

    EDPNet: An Efficient Dual Prototype Network for Motor Imagery EEG Decoding

    Authors: Can Han, Chen Liu, Crystal Cai, Jun Wang, Dahong Qian

    Abstract: Motor imagery electroencephalograph (MI-EEG) decoding plays a crucial role in developing motor imagery brain-computer interfaces (MI-BCIs). However, decoding intentions from MI remains challenging due to the inherent complexity of EEG signals relative to the small-sample size. In this paper, we propose an Efficient Dual Prototype Network (EDPNet) to enable accurate and fast MI decoding. EDPNet emp…

    Submitted 3 July, 2024; originally announced July 2024.

  2. arXiv:2406.07925  [pdf, other]

    cs.DC

    FDLoRA: Personalized Federated Learning of Large Language Model via Dual LoRA Tuning

    Authors: Jiaxing QI, Zhongzhi Luan, Shaohan Huang, Carol Fung, Hailong Yang, Depei Qian

    Abstract: Large language models (LLMs) have emerged as important components across various fields, yet their training requires substantial computation resources and abundant labeled data. It poses a challenge to robustly training LLMs for individual users (clients). To tackle this challenge, the intuitive idea is to introduce federated learning (FL), which can collaboratively train models on distributed pri…

    Submitted 12 June, 2024; originally announced June 2024.

  3. arXiv:2405.18944  [pdf, other]

    cond-mat.mtrl-sci cond-mat.mes-hall cs.LG

    Predicting Many Properties of Crystals by a Single Deep Learning Model

    Authors: Haosheng Xu, Dongheng Qian, Jing Wang

    Abstract: The use of machine learning methods for predicting the properties of crystalline materials encounters significant challenges, primarily related to input encoding, output versatility, and interpretability. Here, we introduce CrystalBERT, an adaptable transformer-based framework with novel structure that integrates space group, elemental, and unit cell information. The method's adaptability lies not…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 7 pages, 4 figures. The codes are available upon reasonable request

  4. arXiv:2404.17837  [pdf, other]

    cs.CV cs.HC

    Hybrid 3D Human Pose Estimation with Monocular Video and Sparse IMUs

    Authors: Yiming Bao, Xu Zhao, Dahong Qian

    Abstract: Temporal 3D human pose estimation from monocular videos is a challenging task in human-centered computer vision due to the depth ambiguity of 2D-to-3D lifting. To improve accuracy and address occlusion issues, inertial sensor has been introduced to provide complementary source of information. However, it remains challenging to integrate heterogeneous sensor data for producing physically rational 3…

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: 10 pages, 5 figures, Under Review

  5. arXiv:2404.10296  [pdf, other]

    cs.LG cs.AI cs.NE

    Engineering software 2.0 by interpolating neural networks: unifying training, solving, and calibration

    Authors: Chanwook Park, Sourav Saha, Jiachen Guo, Xiaoyu Xie, Satyajit Mojumder, Miguel A. Bessa, Dong Qian, Wei Chen, Gregory J. Wagner, Jian Cao, Wing Kam Liu

    Abstract: The evolution of artificial intelligence (AI) and neural network theories has revolutionized the way software is programmed, shifting from a hard-coded series of codes to a vast neural network. However, this transition in engineering software has faced challenges such as data scarcity, multi-modality of data, low model accuracy, and slow inference. Here, we propose a new network based on interpola…

    Submitted 22 April, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: 9 pages, 3 figures

  6. arXiv:2404.03226  [pdf, other]

    cs.DC

    INSPIRIT: Optimizing Heterogeneous Task Scheduling through Adaptive Priority in Task-based Runtime Systems

    Authors: Yiqing Wang, Xiaoyan Liu, Hailong Yang, Xinyu Yang, Pengbo Wang, Yi Liu, Zhongzhi Luan, Depei Qian

    Abstract: As modern HPC computing platforms become increasingly heterogeneous, it is challenging for programmers to fully leverage the computation power of massive parallelism offered by such heterogeneity. Consequently, task-based runtime systems have been proposed as an intermediate layer to hide the complex heterogeneity from the application programmers. The core functionality of these systems is to real…

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: 11 pages

  7. arXiv:2403.19098  [pdf, other]

    cs.CV

    GraphAD: Interaction Scene Graph for End-to-end Autonomous Driving

    Authors: Yunpeng Zhang, Deheng Qian, Ding Li, Yifeng Pan, Yong Chen, Zhenbao Liang, Zhiyao Zhang, Shurui Zhang, Hongxu Li, Maolei Fu, Yun Ye, Zhujin Liang, Yi Shan, Dalong Du

    Abstract: Modeling complicated interactions among the ego-vehicle, road agents, and map elements has been a crucial part for safety-critical autonomous driving. Previous works on end-to-end autonomous driving rely on the attention mechanism for handling heterogeneous interactions, which fails to capture the geometric priors and is also computationally intensive. In this paper, we propose the Interaction Sce…

    Submitted 6 April, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: project page: https://github.com/zhangyp15/GraphAD

  8. arXiv:2402.15678  [pdf, other]

    cs.DC

    Minions: Accelerating Large Language Model Inference with Adaptive and Collective Speculative Decoding

    Authors: Siqi Wang, Hailong Yang, Xuezhu Wang, Tongxuan Liu, Pengbo Wang, Xuning Liang, Kejie Ma, Tianyu Feng, Xin You, Yongjun Bao, Yi Liu, Zhongzhi Luan, Depei Qian

    Abstract: Large language models (LLM) have recently attracted surging interest due to their outstanding capabilities across various domains. However, enabling efficient LLM inference is challenging due to its autoregressive decoding that generates tokens only one at a time. Although research works apply pruning or quantization to speed up LLM inference, they typically require fine-tuning the LLM, incurring…

    Submitted 23 February, 2024; originally announced February 2024.

  9. arXiv:2401.09895  [pdf]

    cs.CV

    Skeleton-Guided Instance Separation for Fine-Grained Segmentation in Microscopy

    Authors: Jun Wang, Chengfeng Zhou, Zhaoyan Ming, Lina Wei, Xudong Jiang, Dahong Qian

    Abstract: One of the fundamental challenges in microscopy (MS) image analysis is instance segmentation (IS), particularly when segmenting cluster regions where multiple objects of varying sizes and shapes may be connected or even overlapped in arbitrary orientations. Existing IS methods usually fail in handling such scenarios, as they rely on coarse instance representations such as keypoints and horizontal…

    Submitted 19 January, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

  10. arXiv:2312.07623  [pdf]

    cs.CV

    Supervised Contrastive Learning for Fine-grained Chromosome Recognition

    Authors: Ruijia Chang, Suncheng Xiang, Chengyu Zhou, Kui Su, Dahong Qian, Jun Wang

    Abstract: Chromosome recognition is an essential task in karyotyping, which plays a vital role in birth defect diagnosis and biomedical research. However, existing classification methods face significant challenges due to the inter-class similarity and intra-class variation of chromosomes. To address this issue, we propose a supervised contrastive learning strategy that is tailored to train model-agnostic d…

    Submitted 12 December, 2023; originally announced December 2023.

  11. arXiv:2312.02535  [pdf, other]

    cs.CV

    Towards Open-set Gesture Recognition via Feature Activation Enhancement and Orthogonal Prototype Learning

    Authors: Chen Liu, Can Han, Chengfeng Zhou, Crystal Cai, Suncheng Xiang, Hualiang Ni, Dahong Qian

    Abstract: Gesture recognition is a foundational task in human-machine interaction (HMI). While there has been significant progress in gesture recognition based on surface electromyography (sEMG), accurate recognition of predefined gestures only within a closed set is still inadequate in practice. It is essential to effectively discern and reject unknown gestures of disinterest in a robust system. Numerous m…

    Submitted 5 December, 2023; originally announced December 2023.

  12. arXiv:2309.01189  [pdf, other]

    cs.LG cs.AI cs.SE

    LogGPT: Exploring ChatGPT for Log-Based Anomaly Detection

    Authors: Jiaxing Qi, Shaohan Huang, Zhongzhi Luan, Carol Fung, Hailong Yang, Depei Qian

    Abstract: The increasing volume of log data produced by software-intensive systems makes it impractical to analyze them manually. Many deep learning-based methods have been proposed for log-based anomaly detection. These methods face several challenges such as high-dimensional and noisy log data, class imbalance, generalization, and model interpretability. Recently, ChatGPT has shown promising results in va…

    Submitted 3 September, 2023; originally announced September 2023.

  13. arXiv:2308.00929  [pdf, other]

    cs.CV

    Towards Discriminative Representation with Meta-learning for Colonoscopic Polyp Re-Identification

    Authors: Suncheng Xiang, Qingzhong Chen, Shilun Cai, Chengfeng Zhou, Crystal Cai, Sijia Du, Zhengjie Zhang, Yunshi Zhong, Dahong Qian

    Abstract: Colonoscopic Polyp Re-Identification aims to match the same polyp from a large gallery with images from different views taken using different cameras and plays an important role in the prevention and treatment of colorectal cancer in computer-aided diagnosis. However, traditional methods for object ReID directly adopting CNN models trained on the ImageNet dataset usually produce unsatisfactory ret…

    Submitted 28 November, 2023; v1 submitted 2 August, 2023; originally announced August 2023.

  14. arXiv:2307.10625  [pdf, other]

    cs.CV

    Learning Discriminative Visual-Text Representation for Polyp Re-Identification

    Authors: Suncheng Xiang, Cang Liu, Sijia Du, Dahong Qian

    Abstract: Colonoscopic Polyp Re-Identification aims to match a specific polyp in a large gallery with different cameras and views, which plays a key role for the prevention and treatment of colorectal cancer in the computer-aided diagnosis. However, traditional methods mainly focus on the visual representation learning, while neglect to explore the potential of semantic features during training, which may e…

    Submitted 20 July, 2023; originally announced July 2023.

  15. arXiv:2305.00194  [pdf, other]

    cs.CV

    Searching from Area to Point: A Hierarchical Framework for Semantic-Geometric Combined Feature Matching

    Authors: Yesheng Zhang, Xu Zhao, Dahong Qian

    Abstract: Feature matching is a crucial technique in computer vision. A unified perspective for this task is to treat it as a searching problem, aiming at an efficient search strategy to narrow the search space to point matches between images. One of the key aspects of search strategy is the search space, which in current approaches is not carefully defined, resulting in limited matching accuracy. This pape…

    Submitted 1 May, 2024; v1 submitted 29 April, 2023; originally announced May 2023.

    Comments: v3

  16. arXiv:2304.09498  [pdf, other]

    cs.CV

    Learning Robust Visual-Semantic Embedding for Generalizable Person Re-identification

    Authors: Suncheng Xiang, Jingsheng Gao, Mengyuan Guan, Jiacheng Ruan, Chengfeng Zhou, Ting Liu, Dahong Qian, Yuzhuo Fu

    Abstract: Generalizable person re-identification (Re-ID) is a very hot research topic in machine learning and computer vision, which plays a significant role in realistic scenarios due to its various applications in public security and video surveillance. However, previous methods mainly focus on the visual representation learning, while neglect to explore the potential of semantic features during training,…

    Submitted 19 April, 2023; originally announced April 2023.

  17. AutoQNN: An End-to-End Framework for Automatically Quantizing Neural Networks

    Authors: Cheng Gong, Ye Lu, Surong Dai, Deng Qian, Chenkun Du, Tao Li

    Abstract: Exploring the expected quantizing scheme with suitable mixed-precision policy is the key point to compress deep neural networks (DNNs) in high efficiency and accuracy. This exploration implies heavy workloads for domain experts, and an automatic compression method is needed. However, the huge search space of the automatic method introduces plenty of computing budgets that make the automatic proces…

    Submitted 7 April, 2023; originally announced April 2023.

    Comments: 22 pages, 9 figures, 7 tables, Journal of Computer Science and Technology

  18. arXiv:2303.15671  [pdf, other]

    cs.CV

    Colo-SCRL: Self-Supervised Contrastive Representation Learning for Colonoscopic Video Retrieval

    Authors: Qingzhong Chen, Shilun Cai, Crystal Cai, Zefang Yu, Dahong Qian, Suncheng Xiang

    Abstract: Colonoscopic video retrieval, which is a critical part of polyp treatment, has great clinical significance for the prevention and treatment of colorectal cancer. However, retrieval models trained on action recognition datasets usually produce unsatisfactory retrieval results on colonoscopic datasets due to the large domain gap between them. To seek a solution to this problem, we construct a large-…

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: Accepted by ICME 2023

  19. arXiv:2301.04799  [pdf, ps, other]

    cs.CV

    Adaptive Context Selection for Polyp Segmentation

    Authors: Ruifei Zhang, Guanbin Li, Zhen Li, Shuguang Cui, Dahong Qian, Yizhou Yu

    Abstract: Accurate polyp segmentation is of great significance for the diagnosis and treatment of colorectal cancer. However, it has always been very challenging due to the diverse shape and size of polyp. In recent years, state-of-the-art methods have achieved significant breakthroughs in this task with the help of deep convolutional neural networks. However, few algorithms explicitly consider the impact o…

    Submitted 11 January, 2023; originally announced January 2023.

    Comments: Accepted by MICCAI2020

  20. arXiv:2211.00933  [pdf, other]

    cs.CV

    Deep Multimodal Fusion for Generalizable Person Re-identification

    Authors: Suncheng Xiang, Hao Chen, Wei Ran, Zefang Yu, Ting Liu, Dahong Qian, Yuzhuo Fu

    Abstract: Person re-identification plays a significant role in realistic scenarios due to its various applications in public security and video surveillance. Recently, leveraging the supervised or semi-unsupervised learning paradigms, which benefits from the large-scale datasets and strong computing performance, has achieved a competitive performance on a specific target domain. However, when Re-ID models a…

    Submitted 29 December, 2022; v1 submitted 2 November, 2022; originally announced November 2022.

  21. arXiv:2209.02478  [pdf, other]

    cs.DC

    Mimose: An Input-Aware Checkpointing Planner for Efficient Training on GPU

    Authors: Jianjin Liao, Mingzhen Li, Qingxiao Sun, Jiwei Hao, Fengwei Yu, Shengdong Chen, Ye Tao, Zicheng Zhang, Hailong Yang, Zhongzhi Luan, Depei Qian

    Abstract: Larger deep learning models usually lead to higher model quality with an ever-increasing GPU memory footprint. Although tensor checkpointing techniques have been proposed to enable training under a restricted GPU memory budget, the input tensor dynamics have been unexploited for optimizing performance while reducing GPU memory footprint. Specifically, due to the diverse datasets and subsequent dat…

    Submitted 6 September, 2022; originally announced September 2022.

  22. arXiv:2208.14228  [pdf, other]

    cs.DC

    EasyScale: Accuracy-consistent Elastic Training for Deep Learning

    Authors: Mingzhen Li, Wencong Xiao, Biao Sun, Hanyu Zhao, Hailong Yang, Shiru Ren, Zhongzhi Luan, Xianyan Jia, Yi Liu, Yong Li, Wei Lin, Depei Qian

    Abstract: Distributed synchronized GPU training is commonly used for deep learning. The resource constraint of using a fixed number of GPUs makes large-scale training jobs suffer from long queuing time for resource allocation, and lowers the cluster utilization. Adapting to resource elasticity can alleviate this but often introduces inconsistent model accuracy, due to lacking of capability to decouple model…

    Submitted 6 November, 2023; v1 submitted 30 August, 2022; originally announced August 2022.

    Comments: To appear at SC'23. Link: https://sc23.supercomputing.org/presentation/?id=pap262&sess=sess168

  23. arXiv:2208.11960  [pdf, other]

    cs.CV

    FusePose: IMU-Vision Sensor Fusion in Kinematic Space for Parametric Human Pose Estimation

    Authors: Yiming Bao, Xu Zhao, Dahong Qian

    Abstract: There exist challenging problems in 3D human pose estimation mission, such as poor performance caused by occlusion and self-occlusion. Recently, IMU-vision sensor fusion is regarded as valuable for solving these problems. However, previous researches on the fusion of IMU and vision data, which is heterogeneous, fail to adequately utilize either IMU raw data or reliable high-level vision features.…

    Submitted 25 August, 2022; originally announced August 2022.

    Comments: 11 pages, 8 figures

  24. arXiv:2208.11483  [pdf, other]

    cs.CV

    SubFace: Learning with Softmax Approximation for Face Recognition

    Authors: Hongwei Xu, Suncheng Xiang, Dahong Qian

    Abstract: The softmax-based loss functions and its variants (e.g., cosface, sphereface, and arcface) significantly improve the face recognition performance in wild unconstrained scenes. A common practice of these algorithms is to perform optimizations on the multiplication between the embedding features and the linear transformation matrix. However in most cases, the dimension of embedding features is given…

    Submitted 24 August, 2022; originally announced August 2022.

  25. arXiv:2207.03366  [pdf, other]

    cs.CV

    A simple normalization technique using window statistics to improve the out-of-distribution generalization on medical images

    Authors: Chengfeng Zhou, Songchang Chen, Chenming Xu, Jun Wang, Feng Liu, Chun Zhang, Juan Ye, Hefeng Huang, Dahong Qian

    Abstract: Since data scarcity and data heterogeneity are prevailing for medical images, well-trained Convolutional Neural Networks (CNNs) using previous normalization methods may perform poorly when deployed to a new site. However, a reliable model for real-world clinical applications should be able to generalize well both on in-distribution (IND) and out-of-distribution (OOD) data (e.g., the new site data)…

    Submitted 13 July, 2022; v1 submitted 7 July, 2022; originally announced July 2022.

  26. Learning-Based Framework for Camera Calibration with Distortion Correction and High Precision Feature Detection

    Authors: Yesheng Zhang, Xu Zhao, Dahong Qian

    Abstract: Camera calibration is a crucial technique which significantly influences the performance of many robotic systems. Robustness and high precision have always been the pursuit of diverse calibration methods. State-of-the-art calibration techniques based on classical Zhang's method, however, still suffer from environmental noise, radial lens distortion and sub-optimal parameter estimation. Therefore,…

    Submitted 29 April, 2023; v1 submitted 31 January, 2022; originally announced February 2022.

    Journal ref: in IEEE Robotics and Automation Letters, vol. 7, no. 4, pp. 10470-10477, Oct. 2022

  27. Real-time automatic polyp detection in colonoscopy using feature enhancement module and spatiotemporal similarity correlation unit

    Authors: Jianwei Xu, Ran Zhao, Yizhou Yu, Qingwei Zhang, Xianzhang Bian, Jun Wang, Zhizheng Ge, Dahong Qian

    Abstract: Automatic detection of polyps is challenging because different polyps vary greatly, while the changes between polyps and their analogues are small. The state-of-the-art methods are based on convolutional neural networks (CNNs). However, they may fail due to lack of training data, resulting in high rates of missed detection and false positives (FPs). In order to solve these problems, our method com…

    Submitted 24 January, 2022; originally announced January 2022.

    Comments: This paper has been accepted by Biomedical Signal Processing and Control. Please cite the paper as Xu, J., Zhao, R., Yu, Y., Zhang, Q., Bian, X., Wang, J., Ge, Z., Qian, D., 2021. Real-time automatic polyp detection in colonoscopy using feature enhancement module and spatiotemporal similarity correlation unit. Biomedical Signal Processing and Control 66, 102503

    Journal ref: Biomedical Signal Processing and Control, vol. 66, p. 102503, Apr. 2021

  28. arXiv:2201.00194  [pdf, other]

    cs.LG cs.DC cs.PL

    FamilySeer: Towards Optimized Tensor Codes by Exploiting Computation Subgraph Similarity

    Authors: Shanjun Zhang, Mingzhen Li, Hailong Yang, Yi Liu, Zhongzhi Luan, Depei Qian

    Abstract: Deploying various deep learning (DL) models efficiently has boosted the research on DL compilers. The difficulty of generating optimized tensor codes drives DL compiler to ask for the auto-tuning approaches, and the increasing demands require increasing auto-tuning efficiency and quality. Currently, the DL compilers partition the input DL models into several subgraphs and leverage the auto-tuning…

    Submitted 1 January, 2022; originally announced January 2022.

  29. arXiv:2110.05074  [pdf, other]

    cs.CV

    Rethinking Person Re-Identification via Semantic-Based Pretraining

    Authors: Suncheng Xiang, Jingsheng Gao, Zirui Zhang, Mengyuan Guan, Binjie Yan, Ting Liu, Dahong Qian, Yuzhuo Fu

    Abstract: Pretraining is a dominant paradigm in computer vision. Generally, supervised ImageNet pretraining is commonly used to initialize the backbones of person re-identification (Re-ID) models. However, recent works show a surprising result that CNN-based pretraining on ImageNet has limited impacts on Re-ID system due to the large domain gap between ImageNet and person Re-ID data. To seek an alternative…

    Submitted 26 December, 2022; v1 submitted 11 October, 2021; originally announced October 2021.

  30. arXiv:2107.02413  [pdf]

    cs.RO eess.SY

    DL-AMP and DBTO: An Automatic Merge Planning and Trajectory Optimization and Its Application in Autonomous Driving

    Authors: Yuncheng Jiang, Qi Lin, Jiwei Zhang, Jun Wang, Danjian Qian, Yuxi Cai

    Abstract: This paper presents an automatic merging algorithm for autonomous driving vehicles, which decouples the specific motion planning problem into a Dual-Layer Automatic Merge Planning (DL_AMP) and a Descent-Based Trajectory Optimization (DBTO). This work leads to great improvements in finding the best merge opportunity, lateral and longitudinal merge planning and control, trajectory postprocessing and…

    Submitted 29 July, 2021; v1 submitted 6 July, 2021; originally announced July 2021.

    Comments: 8 pages, preprint on Feb 2, 2021, accepted by ITSC2021 on Jun 25, 2021

    Report number: its.ITSC21.64.567543b5

  31. arXiv:2105.03068  [pdf, ps, other]

    eess.IV cs.CV

    Self-Adaptive Transfer Learning for Multicenter Glaucoma Classification in Fundus Retina Images

    Authors: Yiming Bao, Jun Wang, Tong Li, Linyan Wang, Jianwei Xu, Juan Ye, Dahong Qian

    Abstract: The early diagnosis and screening of glaucoma are important for patients to receive treatment in time and maintain eyesight. Nowadays, deep learning (DL) based models have been successfully used for computer-aided diagnosis (CAD) of glaucoma from retina fundus images. However, a DL model pre-trained using a dataset from one hospital center may have poor performance on a dataset from another new ho…

    Submitted 8 August, 2021; v1 submitted 7 May, 2021; originally announced May 2021.

    Comments: 10 pages, 2 figures

  32. Accelerating Sparse Approximate Matrix Multiplication on GPUs

    Authors: Xiaoyan Liu, Yi Liu, Ming Dun, Bohong Yin, Hailong Yang, Zhongzhi Luan, Depei Qian

    Abstract: Although the matrix multiplication plays a vital role in computational linear algebra, there are few efficient solutions for matrix multiplication of the near-sparse matrices. The Sparse Approximate Matrix Multiply (SpAMM) is one of the algorithms to fill the performance gap neglected by traditional optimizations for dense/sparse matrix multiplication. However, existing SpAMM algorithms fail to ex…

    Submitted 24 March, 2021; originally announced March 2021.

  33. arXiv:2010.15528  [pdf, other]

    cs.CV

    An End to End Network Architecture for Fundamental Matrix Estimation

    Authors: Yesheng Zhang, Xu Zhao, Dahong Qian

    Abstract: In this paper, we present a novel end-to-end network architecture to estimate fundamental matrix directly from stereo images. To establish a complete working pipeline, different deep neural networks in charge of finding correspondences in images, performing outlier rejection and calculating fundamental matrix, are integrated into an end-to-end network architecture. To well train the network and…

    Submitted 29 October, 2020; originally announced October 2020.

  34. arXiv:2005.13133  [pdf, other]

    cs.CV cs.LG cs.RO

    Robust Trajectory Forecasting for Multiple Intelligent Agents in Dynamic Scene

    Authors: Yanliang Zhu, Dongchun Ren, Mingyu Fan, Deheng Qian, Xin Li, Huaxia Xia

    Abstract: Trajectory forecasting, or trajectory prediction, of multiple interacting agents in dynamic scenes, is an important problem for many applications, such as robotic systems and autonomous driving. The problem is a great challenge because of the complex interactions among the agents and their interactions with the surrounding scenes. In this paper, we present a novel method for the robust trajectory…

    Submitted 26 May, 2020; originally announced May 2020.

  35. arXiv:2003.12009  [pdf, other]

    eess.SP cs.IT cs.LG stat.ML

    Multi-Lead ECG Classification via an Information-Based Attention Convolutional Neural Network

    Authors: Hao Tung, Chao Zheng, Xinsheng Mao, Dahong Qian

    Abstract: Objective: A novel structure based on channel-wise attention mechanism is presented in this paper. Embedding with the proposed structure, an efficient classification model that accepts multi-lead electrocardiogram (ECG) as input is constructed. Methods: One-dimensional convolutional neural networks (CNN) have proven to be effective in pervasive classification tasks, enabling the automatic extract…

    Submitted 24 March, 2020; originally announced March 2020.

  36. The Deep Learning Compiler: A Comprehensive Survey

    Authors: Mingzhen Li, Yi Liu, Xiaoyan Liu, Qingxiao Sun, Xin You, Hailong Yang, Zhongzhi Luan, Lin Gan, Guangwen Yang, Depei Qian

    Abstract: The difficulty of deploying various deep learning (DL) models on diverse DL hardware has boosted the research and development of DL compilers in the community. Several DL compilers have been proposed from both industry and academia such as Tensorflow XLA and TVM. Similarly, the DL compilers take the DL models described in different DL frameworks as input, and then generate optimized codes for dive…

    Submitted 28 August, 2020; v1 submitted 6 February, 2020; originally announced February 2020.

    Journal ref: IEEE Transactions on Parallel & Distributed Systems, vol. 32, no. 03, pp. 708-727, 2021

  37. arXiv:2001.02354  [pdf, other]

    cs.CV cs.LG cs.RO

    VisionNet: A Drivable-space-based Interactive Motion Prediction Network for Autonomous Driving

    Authors: Yanliang Zhu, Deheng Qian, Dongchun Ren, Huaxia Xia

    Abstract: The comprehension of environmental traffic situation largely ensures the driving safety of autonomous vehicles. Recently, the mission has been investigated by plenty of researches, while it is hard to be well addressed due to the limitation of collective influence in complex scenarios. These approaches model the interactions through the spatial relations between the target obstacle and its neighbo…

    Submitted 7 January, 2020; originally announced January 2020.

  38. arXiv:2001.00493  [pdf, other]

    cs.CR cs.LG

    Privacy for Rescue: A New Testimony Why Privacy is Vulnerable In Deep Models

    Authors: Ruiyuan Gao, Ming Dun, Hailong Yang, Zhongzhi Luan, Depei Qian

    Abstract: The huge computation demand of deep learning models and limited computation resources on the edge devices calls for the cooperation between edge device and cloud service by splitting the deep models into two halves. However, transferring the intermediates results from the partial models between edge device and cloud service makes the user privacy vulnerable since the attacker can intercept the int…

    Submitted 31 December, 2019; originally announced January 2020.

  39. arXiv:1910.13346  [pdf, other]

    cs.DC cs.PF cs.PL

    Intelligent-Unrolling: Exploiting Regular Patterns in Irregular Applications

    Authors: Changxi Liu, Hailong Yang, Xu Liu, Zhongzhi Luan, Depei Qian

    Abstract: Modern optimizing compilers are able to exploit memory access or computation patterns to generate vectorization codes. However, such patterns in irregular applications are unknown until runtime due to the input dependence. Thus, either compiler's static optimization or profile-guided optimization based on specific inputs cannot predict the patterns for any common input, which leads to suboptimal c…

    Submitted 24 October, 2019; originally announced October 2019.

  40. arXiv:1907.11678  [pdf, other]

    cs.DC

    Massively Scaling Seismic Processing on Sunway TaihuLight Supercomputer

    Authors: Yongmin Hu, Hailong Yang, Zhongzhi Luan, Depei Qian

    Abstract: Common Midpoint (CMP) and Common Reflection Surface (CRS) are widely used methods for improving the signal-to-noise ratio in the field of seismic processing. These methods are computationally intensive and require high performance computing. This paper optimizes these methods on the Sunway many-core architecture and implements large-scale seismic processing on the Sunway Taihulight supercomputer.…

    Submitted 4 August, 2019; v1 submitted 26 July, 2019; originally announced July 2019.

  41. arXiv:1906.01797  [pdf, other]

    cs.CV

    StarNet: Pedestrian Trajectory Prediction using Deep Neural Network in Star Topology

    Authors: Yanliang Zhu, Deheng Qian, Dongchun Ren, Huaxia Xia

    Abstract: Pedestrian trajectory prediction is crucial for many important applications. This problem is a great challenge because of complicated interactions among pedestrians. Previous methods model only the pairwise interactions between pedestrians, which not only oversimplifies the interactions among pedestrians but also is computationally inefficient. In this paper, we propose a novel model StarNet to de…

    Submitted 12 January, 2020; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: Accepted by the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2019)

  42. arXiv:1905.11669  [pdf, other]

    cs.LG stat.ML

    CompactNet: Platform-Aware Automatic Optimization for Convolutional Neural Networks

    Authors: Weicheng Li, Rui Wang, Zhongzhi Luan, Di Huang, Zidong Du, Yunji Chen, Depei Qian

    Abstract: Convolutional Neural Network (CNN) based Deep Learning (DL) has achieved great progress in many real-life applications. Meanwhile, due to the complex model structures against strict latency and memory restriction, the implementation of CNN models on the resource-limited platforms is becoming more challenging. This work proposes a solution, called CompactNet\footnote{Project URL: \url{https://githu…

    Submitted 28 May, 2019; originally announced May 2019.

  43. arXiv:1904.07404  [pdf, other]

    cs.LG cs.PL stat.ML

    swTVM: Towards Optimized Tensor Code Generation for Deep Learning on Sunway Many-Core Processor

    Authors: Mingzhen Li, Changxi Liu, Jianjin Liao, Xuegui Zheng, Hailong Yang, Rujun Sun, Jun Xu, Lin Gan, Guangwen Yang, Zhongzhi Luan, Depei Qian

    Abstract: The flourish of deep learning frameworks and hardware platforms has been demanding an efficient compiler that can shield the diversity in both software and hardware in order to provide application portability. Among the existing deep learning compilers, TVM is well known for its efficiency in code generation and optimization across diverse hardware devices. In the meanwhile, the Sunway many-core p…

    Submitted 11 July, 2022; v1 submitted 15 April, 2019; originally announced April 2019.

  44. arXiv:1601.05850  [pdf]

    cs.SE

    Regression Testing of Virtual Prototypes Using Symbolic Execution

    Authors: Bin Lin, Dejun Qian

    Abstract: Recently virtual platforms and virtual prototyping techniques have been widely applied for accelerating software development in electronics companies. It has been proved that these techniques can greatly shorten time-to-market and improve product quality. One challenge is how to test and validate a virtual prototype. In this paper, we present how to conduct regression testing of virtual prototypes…

    Submitted 21 January, 2016; originally announced January 2016.

  45. arXiv:1412.4213  [pdf, ps, other]

    cs.DC

    Scalable Hierarchical Scheduling for Malleable Parallel Jobs on Multiprocessor-based Systems

    Authors: Yangjie Cao, Hongyang Sun, Depei Qian, Weiguo Wu

    Abstract: The proliferation of multi-core and multiprocessor-based computer systems has led to explosive development of parallel applications and hence the need for efficient schedulers. In this paper, we study hierarchical scheduling for malleable parallel jobs on multiprocessor-based systems, which appears in many distributed and multilayered computing environments. We propose a hierarchical scheduling al…

    Submitted 13 December, 2014; originally announced December 2014.

  46. arXiv:1203.6122  [pdf, ps, other]

    cs.SI physics.soc-ph

    Diffusion of Real-Time Information in Social-Physical Networks

    Authors: Dajun Qian, Osman Yağan, Lei Yang, Junshan Zhang

    Abstract: We study the diffusion behavior of real-time information. Typically, real-time information is valuable only for a limited time duration, and hence needs to be delivered before its "deadline." Therefore, real-time information is much easier to spread among a group of people with frequent interactions than between isolated individuals. With this insight, we consider a social network which consists o…

    Submitted 31 March, 2012; v1 submitted 27 March, 2012; originally announced March 2012.

    Comments: add one more figure

  47. arXiv:1201.2698  [pdf, ps, other]

    physics.data-an cs.SI physics.soc-ph

    Optimal Allocation of Interconnecting Links in Cyber-Physical Systems: Interdependence, Cascading Failures and Robustness

    Authors: Osman Yagan, Dajun Qian, Junshan Zhang, Douglas Cochran

    Abstract: We consider a cyber-physical system consisting of two interacting networks, i.e., a cyber-network overlaying a physical-network. It is envisioned that these systems are more vulnerable to attacks since node failures in one network may result in (due to the interdependence) failures in the other network, causing a cascade of failures that would potentially lead to the collapse of the entire infrast…

    Submitted 3 April, 2012; v1 submitted 12 January, 2012; originally announced January 2012.

    Comments: 13 pages, 6 figures. To appear in the Special Issue of IEEE Transactions on Parallel and Distributed Systems on Cyber-Physical Systems, 2012

    Journal ref: IEEE Transactions on Parallel and Distributed Systems (TPDS): Special Issue on Cyber-Physical Systems, vol. 23, no. 9, pp. 1708-1720, September 2012

  48. arXiv:1112.4002  [pdf, ps, other]

    cs.SI physics.soc-ph

    Conjoining Speeds up Information Diffusion in Overlaying Social-Physical Networks

    Authors: Osman Yagan, Dajun Qian, Junshan Zhang, Douglas Cochran

    Abstract: We study the diffusion of information in an overlaying social-physical network. Specifically, we consider the following set-up: There is a physical information network where information spreads amongst people through conventional communication media (e.g., face-to-face communication, phone calls), and conjoint to this physical network, there are online social networks where information spreads via…

    Submitted 3 August, 2012; v1 submitted 16 December, 2011; originally announced December 2011.

    Comments: 14 pages, 4 figures

    Journal ref: IEEE Journal on Selected Areas in Communications (JSAC): Special Issue on Network Science, 31(6):1038-1048, June 2013