Keyword: Action Recognition : Search

Article

TAPS: Temporal Attention-Based Pruning and Scaling for Efficient Video Action Recognition

Computer Vision – ACCV 2024, Pages 422–438, https://doi.org/10.1007/978-981-96-0908-6_24

Abstract

Video neural networks are computationally expensive. For real-time applications they require significant compute resources that are lacking on edge devices. Various methods were proposed to reduce the computational load of neural networks. Among ...

research-article

Active Object Segmentation: A New Modality for Egocentric Action Recognition

MMAsia '24: Proceedings of the 6th ACM International Conference on Multimedia in Asia, Article No.: 4, Pages 1–7, https://doi.org/10.1145/3696409.3700164

Egocentric actions typically exhibit Human-Object Interactions (HOIs), involving the transformation of objects (e.g., “cutting” an “onion”) using various tools and utensils (e.g., “knife” and “chopping board”). Recognising these actions requires networks ...

Article

Text-Enhanced Zero-Shot Action Recognition: A Training-Free Approach

Pattern Recognition, Pages 327–342, https://doi.org/10.1007/978-3-031-78354-8_21

Abstract

Vision-language models (VLMs) have demonstrated remarkable performance across various visual tasks, leveraging joint learning of visual and textual representations. While these models excel in zero-shot image tasks, their application to zero-shot ...

Article

Multi-teacher Invariance Distillation for Domain-Generalized Action Recognition

Pattern Recognition, Pages 116–132, https://doi.org/10.1007/978-3-031-78110-0_8

Abstract

In this work, we tackle the problem of domain-generalized action recognition, i.e. we train a model on a source domain and then test the model on other unseen target domains with different data distributions. Generalizing across different domains ...

Article

TeleoWatch: Pose-Transformer-Based Advanced Action Recognition

Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications, Pages 31–45, https://doi.org/10.1007/978-3-031-76607-7_3

Abstract

Recognition of human actions from videos is a valuable application for building management, security systems, accident prevention, accident intervention and several other applications. This study provides a framework for joint-based action ...

Article

Data Collection-Free Masked Video Modeling

Computer Vision – ECCV 2024, Pages 37–56, https://doi.org/10.1007/978-3-031-73247-8_3

Abstract

Pre-training video transformers generally requires a large amount of data, presenting significant challenges in terms of data collection costs and concerns related to privacy, licensing, and inherent biases. Synthesizing data is one of the ...

Article

Multimodal Cross-Domain Few-Shot Learning for Egocentric Action Recognition

Computer Vision – ECCV 2024, Pages 182–199, https://doi.org/10.1007/978-3-031-73414-4_11

Abstract

We address a novel cross-domain few-shot learning task (CD-FSL) with multimodal input and unlabeled target data for egocentric action recognition. This paper simultaneously tackles two critical challenges associated with egocentric action ...

Article

Context-Aware Action Recognition: Introducing a Comprehensive Dataset for Behavior Contrast

Computer Vision – ECCV 2024, Pages 254–270, https://doi.org/10.1007/978-3-031-73229-4_15

Abstract

While datasets on everyday actions, sports, and cooking are abundant, there’s a significant scarcity in datasets focused on industrial domain activities, especially for distinguishing between proper and improper actions. This shortage poses a ...

research-article

Open Access

Bird Action Recognition in Wetlands using Deep Learning

GoodIT '24: Proceedings of the 2024 International Conference on Information Technology for Social Good, Pages 350–357, https://doi.org/10.1145/3677525.3678681

The current decline in bird species and protected natural areas highlights the importance of providing solutions to improve the understanding of bird biodiversity and its interaction with its environment. This study focuses on the development and ...

article

Research on Artificial Intelligence Technology in Accurate Recognition of Sports Training Actions

International Journal of e-Collaboration (IJEC-IGI), Volume 20, Issue 1, Pages 1–18, https://doi.org/10.4018/IJeC.349210

In sports training, accurate identification of athletes' movements is helpful to judge whether athletes' actions are standard or not, thus providing precise movement data for training and improving athletes' levels. The convolutional neural network VGG ...

Article

SDE-Net: Skeleton Action Recognition Based on Spatio-Temporal Dependence Enhanced Networks

Advanced Intelligent Computing Technology and Applications, Pages 380–392, https://doi.org/10.1007/978-981-97-5588-2_32

Abstract

Graph Convolutional Networks (GCNs) have succeeded remarkably in skeleton-based action recognition tasks. However, the existing GCN-based methods, where the interframe edges of the graph connect only the same joints and ignore the correlations ...

Article

Spatial-Temporal Transformer Network for Continuous Action Recognition in Industrial Assembly

Advanced Intelligent Computing Technology and Applications, Pages 114–130, https://doi.org/10.1007/978-981-97-5609-4_9

Abstract

Automatically detecting whether a worker’s manual operations comply with the standard in industrial assembly remains an open issue. In this paper, we first present a spatio-temporal Transformer network (STTN) to recognize each ...

Article

Exercise Recognition and Repetition Counting for Automatic Workout Documentation Using Computer Vision

Digital Human Modeling and Applications in Health, Safety, Ergonomics and Risk Management, Pages 298–309, https://doi.org/10.1007/978-3-031-61066-0_18

Abstract

This paper aims to study various approaches using deep learning methods to perform human action recognition (HAR). More specifically, a subset of HAR focused on recognising exercises and counting repetitions using deep learning. The paper ...

research-article

YNU-Dance: A Multimodal Ethnic Dance Action Dataset

CNIOT '24: Proceedings of the 2024 5th International Conference on Computing, Networks and Internet of Things, Pages 273–281, https://doi.org/10.1145/3670105.3670151

This paper proposes a novel dance action dataset – YNU-Dance. To preserve and inherit ethnic dances, we have collected and constructed a dataset of ethnic dance actions. The dataset encompasses unique dances from 10 different ethnic groups, including ...

research-article

Attention-Based AdaptSepCX Network for Effective Student Action Recognition in Online Learning

Procedia Computer Science (PROCS), Volume 233, Issue C, Pages 164–174, https://doi.org/10.1016/j.procs.2024.03.206

Abstract

In the realm of online learning and distance education, the issue of inadequate supervision looms large, posing a significant obstacle. This paper delves into the challenges posed by the lack of supervision in online learning environments and ...

Article

SoccerKDNet: A Knowledge Distillation Framework for Action Recognition in Soccer Videos

Pattern Recognition and Machine Intelligence, Pages 457–464, https://doi.org/10.1007/978-3-031-45170-6_47

Abstract

Classifying player actions from soccer videos is a challenging problem, which has become increasingly important in sports analytics over the years. Most state-of-the-art methods employ highly complex offline networks, which makes it difficult to ...

Article

Primitive Action Recognition Based on Semantic Facts

Social Robotics, Pages 350–362, https://doi.org/10.1007/978-981-99-8715-3_29

Abstract

To interact with humans, a robot has to know the actions done by each agent present in the environment, robotic or not. Robots are not omniscient and can’t perceive every action made but, as humans do, we can equip the robot with the ability to ...

research-article

Action and Gesture Recognition using Deep Learning and Computer Vision for Deaf and Dumb People

ICIMMI '23: Proceedings of the 5th International Conference on Information Management & Machine Intelligence, Article No.: 79, Pages 1–8, https://doi.org/10.1145/3647444.3647906

This paper presents a novel approach to gesture-based sign language recognition, utilizing a two-step process involving keypoint detection and Long Short-Term Memory (LSTM) networks. Sign language recognition, a crucial technology for enhancing ...

Article

Temporal Modeling Approach for Video Action Recognition Based on Vision-language Models

Neural Information Processing, Pages 512–523, https://doi.org/10.1007/978-981-99-8067-3_38

Abstract

The usage of large-scale vision-language pre-training models plays an important role in reducing computational consumption and improving the accuracy of the video action recognition task. However, pre-training models trained by image data may ...

Article

Action Recognition and Action Anticipation Tasks in the Trauma THOMPSON Challenge Technical Report

AI for Brain Lesion Detection and Trauma Video Action Recognition, Pages 72–81, https://doi.org/10.1007/978-3-031-71626-3_9

Abstract

This article introduces our methods and experimental results in the submission to Action Recognition and Action Anticipation tasks (Track 1) in the Trauma THOMPSON Challenge. ...
