Multi-spectral Class Center Network for Face Manipulation Detection and Localization

Miao, Changtao; Chu, Qi; Tan, Zhentao; Jin, Zhenchao; Zhuang, Wanyi; Wu, Yue; Liu, Bin; Hu, Honggang; Yu, Nenghai

arXiv:2305.10794v2 (cs)

[Submitted on 18 May 2023 (v1), revised 19 Sep 2023 (this version, v2), latest version 13 Jul 2024 (v3)]

Abstract: As Deepfake content continues to proliferate on the internet, advancing face manipulation forensics has become a pressing issue. To combat this emerging threat, previous methods have mainly focused on distinguishing authentic face images from manipulated ones. Although its results are impressive, image-level classification lacks explainability and is limited to specific application scenarios. Existing forgery localization methods, in turn, suffer from imprecise and inconsistent pixel-level annotations. To alleviate these problems, this paper first reconstructs the FaceForensics++ dataset by introducing pixel-level annotations, then builds an extensive benchmark for localizing tampered regions. Next, a novel Multi-Spectral Class Center Network (MSCCNet) is proposed for face manipulation detection and localization. Specifically, inspired by the power of frequency-related forgery traces, we design a Multi-Spectral Class Center (MSCC) module to learn more generalizable and semantic-agnostic features. Based on the features of different frequency bands, the MSCC module collects multi-spectral class centers and computes pixel-to-class relations. Applying multi-spectral class-level representations suppresses the semantic information of visual concepts, which is insensitive to manipulations. Furthermore, we propose a Multi-level Features Aggregation (MFA) module to exploit more low-level forgery artifacts and structural textures. Experimental results indicate, both quantitatively and qualitatively, the effectiveness and superiority of the proposed MSCCNet on comprehensive localization benchmarks. We expect this work to inspire more studies on pixel-level face manipulation localization. The annotations and code are available.
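The abstract's description of the MSCC module (split features into frequency bands, collect class centers per band, relate each pixel to those centers) can be illustrated with a rough NumPy sketch. This is not the authors' implementation: the radial FFT band split, the soft real/fake probability map, the scaled dot-product softmax, and every function name below are assumptions chosen only to make the idea concrete.

```python
import numpy as np

def split_frequency_bands(feat, num_bands=2):
    """Split a (C, H, W) feature map into radial frequency bands.

    Assumption: bands are concentric rings in the centered 2D FFT spectrum.
    """
    C, H, W = feat.shape
    spec = np.fft.fftshift(np.fft.fft2(feat, axes=(-2, -1)), axes=(-2, -1))
    yy, xx = np.meshgrid(np.arange(H) - H // 2, np.arange(W) - W // 2, indexing="ij")
    radius = np.sqrt(yy ** 2 + xx ** 2)
    edges = np.linspace(0.0, radius.max() + 1e-6, num_bands + 1)
    bands = []
    for lo, hi in zip(edges[:-1], edges[1:]):
        mask = (radius >= lo) & (radius < hi)  # ring of frequencies for this band
        shifted_back = np.fft.ifftshift(spec * mask, axes=(-2, -1))
        bands.append(np.fft.ifft2(shifted_back, axes=(-2, -1)).real)
    return bands

def class_centers(band, probs):
    """Probability-weighted mean feature per class: (K, H, W) probs -> (K, C) centers."""
    C, H, W = band.shape
    K = probs.shape[0]
    flat = band.reshape(C, H * W)                      # (C, N)
    p = probs.reshape(K, H * W)                        # (K, N)
    weights = p / (p.sum(axis=1, keepdims=True) + 1e-8)
    return weights @ flat.T                            # (K, C)

def pixel_to_class(band, centers):
    """Softmax similarity of every pixel to every class center, then re-projection."""
    C, H, W = band.shape
    flat = band.reshape(C, H * W).T                    # (N, C)
    logits = flat @ centers.T / np.sqrt(C)             # (N, K)
    logits -= logits.max(axis=1, keepdims=True)        # numerical stability
    attn = np.exp(logits)
    attn /= attn.sum(axis=1, keepdims=True)
    refined = attn @ centers                           # (N, C)
    return attn, refined.T.reshape(C, H, W)

# Toy inputs (hypothetical): random features and a random soft "fake" probability map.
rng = np.random.default_rng(0)
feat = rng.normal(size=(8, 16, 16))
fake_prob = rng.uniform(size=(1, 16, 16))
probs = np.concatenate([1.0 - fake_prob, fake_prob], axis=0)  # classes: real, fake

bands = split_frequency_bands(feat, num_bands=2)
refined = [pixel_to_class(b, class_centers(b, probs))[1] for b in bands]
```

In this sketch each band's refined output is built only from the two class centers, so per-pixel semantic detail is discarded. That is one plausible reading of how class-level representations could suppress manipulation-insensitive semantic content.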

Comments:	Email Address: miaoct@mail.this http URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2305.10794 [cs.CV]
	(or arXiv:2305.10794v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2305.10794

Submission history

From: Changtao Miao
[v1] Thu, 18 May 2023 08:09:20 UTC (601 KB)
[v2] Tue, 19 Sep 2023 09:01:50 UTC (992 KB)
[v3] Sat, 13 Jul 2024 14:29:30 UTC (5,731 KB)
