Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleNovember 2024
Efficient transformer tracking with adaptive attention
AbstractRecently, several trackers utilising Transformer architecture have shown significant performance improvement. However, the high computational cost of multi‐head attention, a core component in the Transformer, has limited real‐time running speed,...
The authors propose a novel adaptive attention for tracking task that enhances features through spatial sparse attention mechanism with less than 1/4 of the computational complexity of multi‐head attention. Based on adaptive attention, the authors build ...
- research-articleOctober 2024
Progressive Local and Non-Local Interactive Networks with Deeply Discriminative Training for Image Deraining
MM '24: Proceedings of the 32nd ACM International Conference on MultimediaPages 10326–10335https://doi.org/10.1145/3664647.3681028In this paper, we develop a progressive local and non-local interactive network with multi-scale cross-content deeply discriminative learning to solve image deraining. The proposed model contains two key techniques: 1) Progressive Local and Non-Local ...
Compiler Support for Sparse Tensor Convolutions
Proceedings of the ACM on Programming Languages (PACMPL), Volume 8, Issue OOPSLA2Article No.: 281, Pages 275–303https://doi.org/10.1145/3689721This paper extends prior work on sparse tensor algebra compilers to generate asymptotically efficient code for tensor expressions with affine subscript expressions. Our technique enables compiler support for a wide range of sparse computations, including ...
- research-articleAugust 2024
Dissecting Convolutional Neural Networks for Runtime and Scalability Prediction
ICPP '24: Proceedings of the 53rd International Conference on Parallel ProcessingPages 168–178https://doi.org/10.1145/3673038.3673107Given the computational complexity of deep neural networks (DNN), accurate prediction of their training and inference time using performance modeling is crucial for efficient infrastructure planning and DNN development. However, existing methods often ...
- short-paperJuly 2024
Property Prediction of Functional Organic Molecular Crystals with Graph Neural Networks
PEARC '24: Practice and Experience in Advanced Research Computing 2024: Human Powered ComputingArticle No.: 50, Pages 1–4https://doi.org/10.1145/3626203.3670584Predicting the properties of molecular crystals is imperative to the field of materials design. In lieu of alternative methods, advances in machine learning have made it possible to predict the properties of materials before synthesis. This is ...
-
- research-articleJuly 2024
Multi‐scale skeleton simplification graph convolutional network for skeleton‐based action recognition
AbstractHuman action recognition based on graph convolutional networks (GCNs) is one of the hotspots in computer vision. However, previous methods generally rely on handcrafted graph, which limits the effectiveness of the model in characterising the ...
The authors propose a multi‐scale skeleton simplification graph convolutional network (M3S‐GCN) for skeleton‐based action recognition. The model leverages skeleton simplification and multi‐scale modelling to effectively capture the intricate connections ...
- research-articleNovember 2024
Toward High-Accuracy, Programmable Extreme-Edge Intelligence for Neuromorphic Vision Sensors utilizing Magnetic Domain Wall Motion-based MTJ
DAC '24: Proceedings of the 61st ACM/IEEE Design Automation ConferenceArticle No.: 81, Pages 1–6https://doi.org/10.1145/3649329.3657359The desire to empower resource-limited edge devices with computer vision (CV) must overcome the high energy consumption of collecting and processing vast sensory data. To address the challenge, this work proposes an energy-efficient non-von-Neumann in-...
- research-articleJune 2024
SLIDEX: A Novel Architecture for Sliding Window Processing
ICS '24: Proceedings of the 38th ACM International Conference on SupercomputingPages 312–323https://doi.org/10.1145/3650200.3656613Efficient image processing is increasingly crucial in constrained embedded and real-time platforms, especially in emerging applications such as Autonomous Driving (AD) or Augmented/Virtual Reality (AR/VR). A commonality among most image processing ...
- research-articleApril 2024
Temporal channel reconfiguration multi‐graph convolution network for skeleton‐based action recognition
AbstractSkeleton‐based action recognition has received much attention and achieved remarkable achievements in the field of human action recognition. In time series action prediction for different scales, existing methods mainly focus on attention ...
The authors introduce a novel Temporal Channel Reconfiguration Multi‐Graph Convolution Network (TRMGCN) that addresses limitations in existing methods for skeleton‐based action recognition by introducing the Temporal Channel Fusion with Guidance (TCFG) ...
- research-articleMay 2024
Rnbformer: A High-Performance Roadside Noise Barriers Recognition Algorithm
CACML '24: Proceedings of the 2024 3rd Asia Conference on Algorithms, Computing and Machine LearningPages 293–297https://doi.org/10.1145/3654823.3654877Traffic noise pollution is a significant source of urban pollution. Roadside noise barriers (RNBs) serve as a primary solution to urban traffic noise pollution, but achieving precise positioning of RNBs requires on-site inspections and manual labeling, ...
- research-articleSeptember 2024
A Framework for Fine-Grained Synchronization of Dependent GPU Kernels
CGO '24: Proceedings of the 2024 IEEE/ACM International Symposium on Code Generation and OptimizationPages 93–105https://doi.org/10.1109/CGO57630.2024.10444873Machine Learning (ML) models execute several parallel computations including Generalized Matrix Multiplication, Convolution, Dropout, etc. These computations are commonly executed on Graphics Processing Units (GPUs), by dividing the computation into ...
- research-articleFebruary 2024
ConvStencil: Transform Stencil Computation to Matrix Multiplication on Tensor Cores
- Yuetao Chen,
- Kun Li,
- Yuhao Wang,
- Donglin Bai,
- Lei Wang,
- Lingxiao Ma,
- Liang Yuan,
- Yunquan Zhang,
- Ting Cao,
- Mao Yang
PPoPP '24: Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel ProgrammingPages 333–347https://doi.org/10.1145/3627535.3638476Tensor Core Unit (TCU) is increasingly integrated into modern high-performance processors to enhance matrix multiplication performance. However, constrained to its over-specification, its potential for improving other critical scientific operations like ...
- research-articleJanuary 2024
MeFiNet: Modeling multi-semantic convolution-based feature interactions for CTR prediction
Extracting more information from feature interactions is essential to improve click-through rate (CTR) prediction accuracy. Although deep learning technology can help capture high-order feature interactions, the combination of features lacks ...
- research-articleJanuary 2024
A novel lightweight blue sheep target real-time detection algorithm
International Journal of Computing Science and Mathematics (IJCSM), Volume 20, Issue 3Pages 208–227https://doi.org/10.1504/ijcsm.2024.142732The utilisation of advanced algorithms in detecting blue sheep populations offers significant real-time statistical data for blue sheep protection. However, numerous classical algorithms employ convolution structures as fundamental units for data ...
- research-articleJanuary 2024
Temporal autoencoder architectures with attention for ECG anomaly detection
International Journal of Business Intelligence and Data Mining (IJBIDM), Volume 24, Issue 2Pages 146–159https://doi.org/10.1504/ijbidm.2024.136430Anomaly detection is a crucial step in any diagnostic procedure. With the advent of continuous monitoring devices, it is inevitable to use technological assistance for the same. Many methods, including autoencoders, have been proposed for anomaly ...
- research-articleDecember 2023
Neural Field Convolutions by Repeated Differentiation
ACM Transactions on Graphics (TOG), Volume 42, Issue 6Article No.: 206, Pages 1–11https://doi.org/10.1145/3618340Neural fields are evolving towards a general-purpose continuous representation for visual computing. Yet, despite their numerous appealing properties, they are hardly amenable to signal processing. As a remedy, we present a method to perform general ...
Optimizing Direct Convolutions on ARM Multi-Cores
SC '23: Proceedings of the International Conference for High Performance Computing, Networking, Storage and AnalysisArticle No.: 70, Pages 1–13https://doi.org/10.1145/3581784.3607107Convolution kernels are widely seen in deep learning workloads and are often responsible for performance bottlenecks. Recent research has demonstrated that a direct convolution approach can outperform the traditional convolution implementation based on ...
- case-studyNovember 2023
CR‐Net: Robot grasping detection method integrating convolutional block attention module and residual module
AbstractGrasping detection, which involves identifying and assessing the grasp ability of objects by robotic systems, has garnered significant attention in recent years due to its pivotal role in the development of robotic systems and automated assembly ...
In order to solve the problem of low detection accuracy in robot grasping detection, a new model is proposed in this paper, which uses the attention module to extract RGB and depth channels. Then, the feature fusion module is used to combine these ...
- research-articleOctober 2023
SAUNet: Spatial-Attention Unfolding Network for Image Compressive Sensing
MM '23: Proceedings of the 31st ACM International Conference on MultimediaPages 5099–5108https://doi.org/10.1145/3581783.3612242Image Compressive Sensing (CS) enables compressed capture of natural images via a spatial multiplexing camera and accurate reconstruction from few measurements via an advanced algorithm. Deep learning, especially deep unfolding, has recently achieved ...
- posterNovember 2024
SLIDEX: Sliding Window Extension for Image Processing
PACT '23: Proceedings of the 32nd International Conference on Parallel Architectures and Compilation TechniquesPages 332–334https://doi.org/10.1109/PACT58117.2023.00039With the rising need for efficient image processing in emerging applications such as Autonomous Driving (AD) and Augmented/Virtual Reality (AR/VR), many existing solutions do not meet their performance and energy efficiency requirements or are domain-...