- Article, December 2024
TAPS: Temporal Attention-Based Pruning and Scaling for Efficient Video Action Recognition
Abstract: Video neural networks are computationally expensive. For real-time applications they require significant compute resources that are lacking on edge devices. Various methods were proposed to reduce the computational load of neural networks. Among ...
- research-article, December 2024
Active Object Segmentation: A New Modality for Egocentric Action Recognition
MMAsia '24: Proceedings of the 6th ACM International Conference on Multimedia in Asia, Article No.: 4, Pages 1–7
https://doi.org/10.1145/3696409.3700164
Egocentric actions typically exhibit Human-Object Interactions (HOIs), involving the transformation of objects (e.g., “cutting” an “onion”) using various tools and utensils (e.g., “knife” and “chopping board”). Recognising these actions requires networks ...
- Article, December 2024
Text-Enhanced Zero-Shot Action Recognition: A Training-Free Approach
Abstract: Vision-language models (VLMs) have demonstrated remarkable performance across various visual tasks, leveraging joint learning of visual and textual representations. While these models excel in zero-shot image tasks, their application to zero-shot ...
- Article, December 2024
Multi-teacher Invariance Distillation for Domain-Generalized Action Recognition
Abstract: In this work, we tackle the problem of domain-generalized action recognition, i.e. we train a model on a source domain and then test the model on other unseen target domains with different data distributions. Generalizing across different domains ...
- Article, November 2024
TeleoWatch: Pose-Transformer-Based Advanced Action Recognition
Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications, Pages 31–45
https://doi.org/10.1007/978-3-031-76607-7_3
Abstract: Recognition of human actions from videos is a valuable application for building management, security systems, accident prevention, accident intervention and several other applications. This study provides a framework for joint-based action ...
- Article, November 2024
Data Collection-Free Masked Video Modeling
Abstract: Pre-training video transformers generally requires a large amount of data, presenting significant challenges in terms of data collection costs and concerns related to privacy, licensing, and inherent biases. Synthesizing data is one of the ...
- Article, October 2024
Multimodal Cross-Domain Few-Shot Learning for Egocentric Action Recognition
Abstract: We address a novel cross-domain few-shot learning task (CD-FSL) with multimodal input and unlabeled target data for egocentric action recognition. This paper simultaneously tackles two critical challenges associated with egocentric action ...
- Article, October 2024
Context-Aware Action Recognition: Introducing a Comprehensive Dataset for Behavior Contrast
Abstract: While datasets on everyday actions, sports, and cooking are abundant, there’s a significant scarcity in datasets focused on industrial domain activities, especially for distinguishing between proper and improper actions. This shortage poses a ...
- research-article, September 2024
Bird Action Recognition in Wetlands using Deep Learning
- Javier Rodriguez-Juan,
- Adrian Berenguer-Agullo,
- Manuel Benavent-Lledo,
- David Mulero-Perez,
- Jose Garcia-Rodriguez,
- Esther Sebastián-González
GoodIT '24: Proceedings of the 2024 International Conference on Information Technology for Social Good, Pages 350–357
https://doi.org/10.1145/3677525.3678681
The current decline in bird species and protected natural areas highlights the importance of providing solutions to improve the understanding of bird biodiversity and its interaction with its environment. This study focuses on the development and ...
- article, September 2024
Research on Artificial Intelligence Technology in Accurate Recognition of Sports Training Actions
International Journal of e-Collaboration (IJEC-IGI), Volume 20, Issue 1, Pages 1–18
https://doi.org/10.4018/IJeC.349210
In sports training, accurate identification of athletes' movements is helpful to judge whether athletes' actions are standard or not, thus providing precise movement data for training and improving athletes' levels. The convolutional neural network VGG ...
- Article, August 2024
SDE-Net: Skeleton Action Recognition Based on Spatio-Temporal Dependence Enhanced Networks
Advanced Intelligent Computing Technology and Applications, Pages 380–392
https://doi.org/10.1007/978-981-97-5588-2_32
Abstract: Graph Convolutional Networks (GCNs) have succeeded remarkably in skeleton-based action recognition tasks. However, the existing GCN-based methods, where the interframe edges of the graph connect only the same joints and ignore the correlations ...
- Article, August 2024
Spatial-Temporal Transformer Network for Continuous Action Recognition in Industrial Assembly
- Jianfeng Huang,
- Xiang Liu,
- Huan Hu,
- Shanghua Tang,
- Chenyang Li,
- Shaoan Zhao,
- Yimin Lin,
- Kai Wang,
- Zhaoxiang Liu,
- Shiguo Lian
Advanced Intelligent Computing Technology and Applications, Pages 114–130
https://doi.org/10.1007/978-981-97-5609-4_9
Abstract: Now, it is still an open issue to automatically detect whether the worker’s manual operations are compliant with the standard in industrial assembly. In this paper, we first present a spatio-temporal Transformer network (STTN) to recognize each ...
- Article, June 2024
Exercise Recognition and Repetition Counting for Automatic Workout Documentation Using Computer Vision
Digital Human Modeling and Applications in Health, Safety, Ergonomics and Risk Management, Pages 298–309
https://doi.org/10.1007/978-3-031-61066-0_18
Abstract: This paper aims to study various approaches using deep learning methods to perform human action recognition (HAR). More specifically, a subset of HAR focused on recognising exercises and counting repetitions using deep learning. The paper ...
- research-article, July 2024
YNU-Dance: A Multimodal Ethnic Dance Action Dataset
CNIOT '24: Proceedings of the 2024 5th International Conference on Computing, Networks and Internet of Things, Pages 273–281
https://doi.org/10.1145/3670105.3670151
This paper proposes a novel dance action dataset – YNU-Dance. To preserve and inherit ethnic dances, we have collected and constructed a dataset of ethnic dance actions. The dataset encompasses unique dances from 10 different ethnic groups, including ...
- research-article, July 2024
Attention-Based AdaptSepCX Network for Effective Student Action Recognition in Online Learning
Procedia Computer Science (PROCS), Volume 233, Issue C, Pages 164–174
https://doi.org/10.1016/j.procs.2024.03.206
Abstract: In the realm of online learning and distance education, the issue of inadequate supervision looms large, posing a significant obstacle. This paper delves into the challenges posed by the lack of supervision in online learning environments and ...
- Article, December 2023
SoccerKDNet: A Knowledge Distillation Framework for Action Recognition in Soccer Videos
Pattern Recognition and Machine Intelligence, Pages 457–464
https://doi.org/10.1007/978-3-031-45170-6_47
Abstract: Classifying player actions from soccer videos is a challenging problem, which has become increasingly important in sports analytics over the years. Most state-of-the-art methods employ highly complex offline networks, which makes it difficult to ...
- Article, December 2023
Primitive Action Recognition Based on Semantic Facts
Abstract: To interact with humans, a robot has to know the actions done by each agent present in the environment, robotic or not. Robots are not omniscient and can’t perceive every action made but, as humans do, we can equip the robot with the ability to ...
- research-article, May 2024
Action and Gesture Recognition using Deep Learning and Computer Vision for Deaf and Dumb People
ICIMMI '23: Proceedings of the 5th International Conference on Information Management & Machine Intelligence, Article No.: 79, Pages 1–8
https://doi.org/10.1145/3647444.3647906
This paper presents a novel approach to gesture-based sign language recognition, utilizing a two-step process involving keypoint detection and Long Short-Term Memory (LSTM) networks. Sign language recognition, a crucial technology for enhancing ...
- Article, November 2023
Temporal Modeling Approach for Video Action Recognition Based on Vision-language Models
Abstract: The usage of large-scale vision-language pre-training models plays an important role in reducing computational consumption and improving the accuracy of the video action recognition task. However, pre-training models trained by image data may ...
- Article, October 2024
Action Recognition and Action Anticipation Tasks in the Trauma THOMPSON Challenge Technical Report
AI for Brain Lesion Detection and Trauma Video Action Recognition, Pages 72–81
https://doi.org/10.1007/978-3-031-71626-3_9
Abstract: This article introduces our methods and experimental results in the submission to Action Recognition and Action Anticipation tasks (Track 1) in the Trauma THOMPSON Challenge. ...