Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- ArticleOctober 2022
MFIM: Megapixel Facial Identity Manipulation
AbstractFace swapping is a task that changes a facial identity of a given image to that of another person. In this work, we propose a novel face-swapping framework called Megapixel Facial Identity Manipulation (MFIM). The face-swapping model should ...
- ArticleOctober 2022
Look Both Ways: Self-supervising Driver Gaze Estimation and Road Scene Saliency
AbstractWe present a new on-road driving dataset, called “Look Both Ways”, which contains synchronized video of both driver faces and the forward road scene, along with ground truth gaze data registered from eye tracking glasses worn by the drivers. Our ...
- ArticleOctober 2022
Pre-training Strategies and Datasets for Facial Representation Learning
AbstractWhat is the best way to learn a universal face representation? Recent work on Deep Learning in the area of face analysis has focused on supervised learning for specific tasks of interest (e.g. face recognition, facial landmark localization etc.) ...
- ArticleOctober 2022
BoundaryFace: A Mining Framework with Noise Label Self-correction for Face Recognition
AbstractFace recognition has made tremendous progress in recent years due to the advances in loss functions and the explosive growth in training sets size. A properly designed loss is seen as key to extract discriminative features for classification. ...
- ArticleOctober 2022
Towards Racially Unbiased Skin Tone Estimation via Scene Disambiguation
AbstractVirtual facial avatars will play an increasingly important role in immersive communication, games and the metaverse, and it is therefore critical that they be inclusive. This requires accurate recovery of the albedo, regardless of age, sex, or ...
-
- ArticleOctober 2022
Self-supervised Sparse Representation for Video Anomaly Detection
AbstractVideo anomaly detection (VAD) aims at localizing unexpected actions or activities in a video sequence. Existing mainstream VAD techniques are based on either the one-class formulation, which assumes all training data are normal, or weakly-...
- ArticleOctober 2022
Detecting and Recovering Sequential DeepFake Manipulation
AbstractSince photorealistic faces can be readily generated by facial manipulation technologies nowadays, potential malicious abuse of these technologies has drawn great concerns. Numerous deepfake detection methods are thus proposed. However, existing ...
- ArticleOctober 2022
Image-Based CLIP-Guided Essence Transfer
AbstractWe make the distinction between (i) style transfer, in which a source image is manipulated to match the textures and colors of a target image, and (ii) essence transfer, in which one edits the source image to include high-level semantic attributes ...
- ArticleOctober 2022
GIMO: Gaze-Informed Human Motion Prediction in Context
AbstractPredicting human motion is critical for assistive robots and AR/VR applications, where the interaction with humans needs to be safe and comfortable. Meanwhile, an accurate prediction depends on understanding both the scene context and human ...
- ArticleOctober 2022
- ArticleOctober 2022
Generative Adversarial Network for Future Hand Segmentation from Egocentric Video
AbstractWe introduce the novel problem of anticipating a time series of future hand masks from egocentric video. A key challenge is to model the stochasticity of future head motions, which globally impact the head-worn camera video analysis. To this end, ...
- ArticleOctober 2022
Egocentric Activity Recognition and Localization on a 3D Map
AbstractGiven a video captured from a first person perspective and the environment context of where the video is recorded, can we recognize what the person is doing and identify where the action occurs in the 3D space? We address this challenging problem ...
- ArticleOctober 2022
SOS! Self-supervised Learning over Sets of Handled Objects in Egocentric Action Recognition
AbstractLearning an egocentric action recognition model from video data is challenging due to distractors in the background, e.g., irrelevant objects. Further integrating object information into an action model is hence beneficial. Existing methods often ...
- ArticleOctober 2022
FairStyle: Debiasing StyleGAN2 with Style Channel Manipulations
AbstractRecent advances in generative adversarial networks have shown that it is possible to generate high-resolution and hyperrealistic images. However, the images produced by GANs are only as fair and representative as the datasets on which they are ...
- ArticleOctober 2022
Parameterized Temperature Scaling for Boosting the Expressive Power in Post-Hoc Uncertainty Calibration
AbstractWe address the problem of uncertainty calibration and introduce a novel calibration method, Parametrized Temperature Scaling (PTS). Standard deep neural networks typically yield uncalibrated predictions, which can be transformed into calibrated ...
- ArticleOctober 2022
Latent Space Smoothing for Individually Fair Representations
AbstractFair representation learning transforms user data into a representation that ensures fairness and utility regardless of the downstream application. However, learning individually fair representations, i.e., guaranteeing that similar individuals ...
- ArticleOctober 2022
Privacy-Preserving Action Recognition via Motion Difference Quantization
AbstractThe widespread use of smart computer vision systems in our personal spaces has led to an increased consciousness about the privacy and security risks that these systems pose. On the one hand, we want these systems to assist in our daily lives by ...
- ArticleOctober 2022
Adaptive Transformers for Robust Few-shot Cross-domain Face Anti-spoofing
- Hsin-Ping Huang,
- Deqing Sun,
- Yaojie Liu,
- Wen-Sheng Chu,
- Taihong Xiao,
- Jinwei Yuan,
- Hartwig Adam,
- Ming-Hsuan Yang
AbstractWhile recent face anti-spoofing methods perform well under the intra-domain setups, an effective approach needs to account for much larger appearance variations of images acquired in complex scenes with different sensors for robust performance. In ...
- ArticleOctober 2022
Decouple-and-Sample: Protecting Sensitive Information in Task Agnostic Data Release
AbstractWe propose sanitizer, a framework for secure and task-agnostic data release. While releasing datasets continues to make a big impact in various applications of computer vision, its impact is mostly realized when data sharing is not inhibited by ...