Training CNNs in Presence of JPEG Compression: Multimedia Forensics vs Computer Vision

Mandelli, Sara; Bonettini, Nicolò; Bestagini, Paolo; Tubaro, Stefano

Computer Science > Computer Vision and Pattern Recognition

arXiv:2009.12088 (cs)

[Submitted on 25 Sep 2020]

Title:Training CNNs in Presence of JPEG Compression: Multimedia Forensics vs Computer Vision

Authors:Sara Mandelli, Nicolò Bonettini, Paolo Bestagini, Stefano Tubaro

View PDF

Abstract:Convolutional Neural Networks (CNNs) have proved very accurate in multiple computer vision image classification tasks that required visual inspection in the past (e.g., object recognition, face detection, etc.). Motivated by these astonishing results, researchers have also started using CNNs to cope with image forensic problems (e.g., camera model identification, tampering detection, etc.). However, in computer vision, image classification methods typically rely on visual cues easily detectable by human eyes. Conversely, forensic solutions rely on almost invisible traces that are often very subtle and lie in the fine details of the image under analysis. For this reason, training a CNN to solve a forensic task requires some special care, as common processing operations (e.g., resampling, compression, etc.) can strongly hinder forensic traces. In this work, we focus on the effect that JPEG has on CNN training considering different computer vision and forensic image classification problems. Specifically, we consider the issues that rise from JPEG compression and misalignment of the JPEG grid. We show that it is necessary to consider these effects when generating a training dataset in order to properly train a forensic detector not losing generalization capability, whereas it is almost possible to ignore these effects for computer vision tasks.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM); Image and Video Processing (eess.IV)
Cite as:	arXiv:2009.12088 [cs.CV]
	(or arXiv:2009.12088v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2009.12088

Submission history

From: Nicolò Bonettini [view email]
[v1] Fri, 25 Sep 2020 08:47:21 UTC (2,635 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Training CNNs in Presence of JPEG Compression: Multimedia Forensics vs Computer Vision

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Training CNNs in Presence of JPEG Compression: Multimedia Forensics vs Computer Vision

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators