CVIU: Vol 247, No C

Volume 247, Issue COct 2024

Volume 247, Issue C

Oct 2024

Publisher:

Elsevier Science Inc.
655 Avenue of the Americas New York, NY
United States

ISSN:1077-3142

Tags:

Bibliometrics

Select All

Export Citations Save to Binder

editorial

Editorial Board

https://doi.org/10.1016/S1077-3142(24)00196-6

Special issue on Advances in Deep Learning for Human-Centric Visual Understanding

research-article

Multi-domain awareness for compressed deepfake videos detection over social networks guided by common mechanisms between artifacts

https://doi.org/10.1016/j.cviu.2024.104072

Abstract

The viral spread of massive deepfake videos over social networks has caused serious security problems. Despite the remarkable advancements achieved by existing deepfake detection algorithms, deepfake videos over social networks are inevitably ...

Highlights

We analyzed the common mechanisms of compression artifacts and deepfake artifacts.
Based on common mechanisms between artifacts, we designed an anti-compression model.
We designed adaptive notch filter to remove the interference of ...

research-article

Modality adaptation via feature difference learning for depth human parsing

https://doi.org/10.1016/j.cviu.2024.104070

Abstract

In the field of human parsing, depth data offers unique advantages over RGB data due to its illumination invariance and geometric detail, which motivates us to explore human parsing with only depth input. However, depth data is challenging to ...

Highlights

An MAFDL pipeline leveraging RGB semantic knowledge to enhance depth human parsing.
DGDA to bridge the RGB-depth modality gap by learning inter-modal feature difference.
FAC as explicit supervision at pixel and batch levels for ...

research-article

SHOWMe: Robust object-agnostic hand-object 3D reconstruction from RGB video

https://doi.org/10.1016/j.cviu.2024.104073

Abstract

In this paper, we tackle the problem of detailed hand-object 3D reconstruction from monocular video with unknown objects, for applications where the required accuracy and level of detail is important, e.g. object hand-over in human–robot ...

Highlights

Object-agnostic hand-object 3D reconstruction from monocular hand-object motion video
Robust rigid-transformation estimation network that leverages large pre-trained model
Two-stage pipeline for 3D hand-object reconsruction
New ...

research-article

Classroom teacher action recognition based on spatio-temporal dual-branch feature fusion

https://doi.org/10.1016/j.cviu.2024.104068

Abstract

The classroom teaching action recognition task refers to recognizing and understanding teacher action through video temporal and spatial information. Due to complex backgrounds and significant occlusions, recognizing teacher action in the ...

Highlights

We propose the teacher action recognition method based on two-branch architecture.
We constructed a classroom teacher action dataset in a real-world setting.
Through experimental validation, our proposed method outperforms other ...

research-article

Enhanced local distribution learning for real image super-resolution

https://doi.org/10.1016/j.cviu.2024.104092

Abstract

Previous work has shown that CNN-based local distribution learning can efficiently reconstruct high-resolution images, but with limited performance improvement against complex degraded images. In this paper, we propose an enhanced local ...

Highlights

CNN-based enhanced local distribution learning method is proposed.
Parallel attention module is proposed to extract effective feature.
Dilated neighborhood sampling strategy is proposed.

research-article

UAHOI: Uncertainty-aware robust interaction learning for HOI detection

https://doi.org/10.1016/j.cviu.2024.104091

Abstract

This paper focuses on Human–Object Interaction (HOI) detection, addressing the challenge of identifying and understanding the interactions between humans and objects within a given image or video frame. Spearheaded by Detection Transformer (DETR),...

Highlights

We introduce an uncertainty-aware framework in HOI Detection.
We refine both detection and interaction predictions through prediction variance.
The proposed method outperforms existing approaches, enhancing both accuracy and ...

Special issue on Eyes on People: Recent Trends on Human Analysis, Perception and Generation

research-article

Lightning fast video anomaly detection via multi-scale adversarial distillation

https://doi.org/10.1016/j.cviu.2024.104074

Abstract

We propose a very fast frame-level model for anomaly detection in video, which learns to detect anomalies by distilling knowledge from multiple highly accurate object-level teacher models. To improve the fidelity of our student, we distill the ...

Graphical abstract

Display Omitted

Highlights

We introduce a novel teacher-student framework for anomaly detection in video.
We learn to detect anomalies by distilling from multiple highly accurate object-level teachers.
We propose adversarial knowledge distillation in the ...

Computer Vision and Image Understanding

Sections

Editorial Board

Multi-label image classification using adaptive graph convolutional networks: From a single domain to multiple domains

EnsCLR: Unsupervised skeleton-based action recognition via ensemble contrastive learning of representation

Low-light image enhancement based on cell vibration energy model and lightness difference

Pseudo initialization based Few-Shot Class Incremental Learning

Implicit and explicit commonsense for multi-sentence video captioning

Enhanced dual contrast representation learning with cell separation and merging for breast cancer diagnosis

Advancing Image Generation with Denoising Diffusion Probabilistic Model and ConvNeXt-V2: A novel approach for enhanced diversity and quality

Object discriminability re-extraction for distractor-aware visual object tracking

Subtle signals: Video-based detection of infant non-nutritive sucking as a neurodevelopmental cue

Vision and Structured-Language Pretraining for Cross-Modal Food Retrieval

Artifact feature purification for cross-domain detection of AI-generated images

Image-to-image translation based face photo de-meshing using GANs

Multi-domain awareness for compressed deepfake videos detection over social networks guided by common mechanisms between artifacts

Modality adaptation via feature difference learning for depth human parsing

SHOWMe: Robust object-agnostic hand-object 3D reconstruction from RGB video

Classroom teacher action recognition based on spatio-temporal dual-branch feature fusion

Enhanced local distribution learning for real image super-resolution

UAHOI: Uncertainty-aware robust interaction learning for HOI detection

Lightning fast video anomaly detection via multi-scale adversarial distillation

Sections

Save to Binder

Comments