
Received 7 February 2024, accepted 23 April 2024, date of publication 26 April 2024, date of current version 8 May 2024.

Digital Object Identifier 10.1109/ACCESS.2024.3394250

Performance Comparison and Visualization of AI-Generated-Image Detection Methods
DAEEOL PARK, HYUNSIK NA, AND DAESEON CHOI (Member, IEEE)
Department of Software, Soongsil University, Seoul 07027, South Korea
Corresponding author: Daeseon Choi (sunchoi@ssu.ac.kr)
This work was supported by the Institute of Information and Communications Technology Planning and Evaluation (IITP) Grant funded by the Korea Government (MSIT) (Robust AI and Distributed Attack Detection for Edge AI Security) under Grant 2021-0-00511.

ABSTRACT Recent advancements in artificial intelligence (AI) have revolutionized the field of image
generation. This has concurrently escalated social problems and concerns related to AI image generation,
underscoring the necessity for an effective AI-generated-image detection method. Therefore, numerous
methods for detecting AI-generated images have been developed, but there remains a need for research
comparing the effectiveness of and visualizing these detection methods. In this study, we classify
AI-generated-image detection methods by the image features they use and compare their generalization
performance in detecting AI-generated images of different types. We selected five AI-generated-image
detection methods for performance evaluation and selected vision transformer as an additional method
for comparison. We use two types of training datasets, i.e., ProGAN and latent diffusion; combine
existing AI-generated-image test datasets into a diverse test dataset; and divide them into three types of
generative models, i.e., generative adversarial network (GAN), diffusion, and transformer, to evaluate the
comprehensive performance of the detection methods. We also analyze their detection performance on
images with data augmentation, considering scenarios that make it difficult to detect AI-generated images.
Grad-CAM and t-SNE are used to visualize the detection area and data distribution of each detection
method. As a result, we determine that artifact-feature-based detection performs well on GAN and real
images, whereas image-encoder-feature-based detection performs well on diffusion and transformer images.
In summary, our research analyzes the comparative detection performance of various AI-generated-image
detection methods, identifies their limitations, and suggests directions for further research.

INDEX TERMS Generative AI, AI-generated-image detection, synthetic-image detection, performance comparison, GAN, diffusion model, transformer, Grad-CAM, t-SNE.

The associate editor coordinating the review of this manuscript and approving it for publication was Yue Zhang.

I. INTRODUCTION
Since the emergence of artificial intelligence (AI) image generative models, numerous advancements have been made in the field. The introduction of the generative adversarial network (GAN) [1] marked the start of AI image generation, with various GAN models having been developed to create images of diverse subjects such as faces, artwork, and landscapes. Subsequently, the diffusion model [2] was introduced, enabling the generation of higher-quality images compared to those generated by the GAN model. The emergence of AI image-generation models utilizing the transformer structure [3] found in language processing models has facilitated the creation of superior-quality images that meet the needs of users. These AI image generative models have progressed over time, heightening the seeming authenticity and naturalness of AI-generated images and rendering them increasingly indiscernible from real pictures. Therefore, as AI images gain ground, a range of social concerns have arisen as a result of the misuse of AI images. When ''Théâtre D'opéra Spatial,'' a painting that was awarded first prize in the digital category of the Colorado State Art Competition 2022, was revealed to be generated by Midjourney [4], an AI image generative model, concerns were raised about the extent to which AI-generated images should be regarded as creative works, if at all, and the rightful ownership of AI-generated-image copyrights [5]. On a more serious note, a social media user posted a fabricated photo of an explosion
near the Pentagon in the United States [6], causing the U.S. stock market to drop within a few minutes. Later, it was revealed that the photo was created by AI, demonstrating the potential for AI-generated images to cause significant real-life problems.

To address ethical concerns and potential damages arising from AI-generated images, there is a need for technology that can distinguish between real and AI-generated images. Consequently, numerous studies have been conducted on AI-generated-image detection. These studies involve training a deep learning model using both real and AI-generated images to identify features unique to AI-generated images. AI-generated-image detection research has resulted in various detection methods and the use of AI-generated features for detection. However, research on the most effective methods and feature detection models to detect AI-generated images has thus far been deficient. It is also unclear which is the optimal method for discerning images generated by different types of generative models. Furthermore, there is a need for analysis of how detection methods distinguish between real images and AI-generated images.

In this study, we evaluated and compared AI-generated-image detection methods based on their generalization performance on various AI-generated images, divided into GAN-generated, diffusion-generated, and transformer-generated images. We selected five detection methods that provide a pretrained model or training code, and vision transformer (ViT) [7] as an additional detection method for comparison. For each method, we trained the model using two distinct training datasets: ProGAN-generated images and latent-diffusion-generated images. We then tested the models on separate GAN-generated, diffusion-generated, and transformer-generated image test datasets to determine which method has the strongest generalization performance on each type of generated image. By combining multiple test datasets from existing AI-generated-image detection research literature, we constructed a rich test dataset that includes a total of 23 AI-generated-image datasets and 3 real-image datasets. To gain further insight, we conducted tests of detection performance on JPEG compression quality and Gaussian blur to analyze the robustness of each method to JPEG-compressed images and Gaussian-blurred images. We also performed visualizations using Grad-CAM [8] and t-SNE [9] to see what regions the detection method detects in the image and how it classifies the image in feature space.

The main contributions of our work are as follows:
• We built various kinds of generative-model test datasets with three different generative model structures to compare existing AI-generated-image detection methods.
• We divided the AI-generated-image detection methods into three groups based on the features they use and analyzed the differences in detection performance for each group.
• We compared the robustness of AI-generated-image detection methods to image augmentation to investigate their performance in scenarios that disrupt AI-generated-image detection.
• Our Grad-CAM analysis revealed that the presence of learned features in the region in which detection is performed is more crucial than the region itself. On the other hand, the t-SNE analysis showed how the generated-image data are distributed and how each model detects images using the artifact method and image-encoder methods.

The remaining parts of the paper are structured as follows: Section II describes the image-generation models used to create the images that constitute the test datasets used to test the detection methods and discusses related research on AI-generated-image detection focusing on performance comparison and data augmentation. Section III introduces the six detection methods used in our study, and Section IV presents the construction of the training and test datasets. Section V describes the training setting and test setting of each method, and Section VI shows the detection performance results. Section VII contains Grad-CAM and t-SNE visualizations of the detection methods. Section VIII discusses the results, and finally, Section IX concludes our study and outlines plans and recommendations for future work.

II. RELATED WORK
A. AI IMAGE GENERATIVE MODEL
An AI image generative model is a model that can generate images using either img-to-img translation, which generates a new image in a different field while maintaining the concept of the original image, or text-to-img translation, which generates a corresponding image from a text prompt that describes the desired image. AI image generative models can be classified into three different categories based on their structural design, specifically: GAN, diffusion, and transformer.

1) GAN-BASED MODEL
GAN [1] is a structured deep learning model consisting of a generator and a discriminator. The generator produces an image that tries to fool the discriminator, whereas the discriminator tries to determine whether the image is real or fake. This adversarial method by which the generator and discriminator learn from each other improves the quality of production from the generator, resulting in a better-quality image.
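The adversarial objective described above can be made concrete with a short sketch. The snippet below is an illustrative minimal training step only, not the code of any of the cited models; the `generator`, `discriminator`, optimizers, and latent dimension are placeholder assumptions.

```python
import torch
import torch.nn as nn

bce = nn.BCEWithLogitsLoss()

def gan_step(generator, discriminator, g_opt, d_opt, real_images, latent_dim=128):
    """One illustrative adversarial update: the discriminator learns to separate
    real from generated images, while the generator learns to fool it."""
    batch = real_images.size(0)
    z = torch.randn(batch, latent_dim)

    # Discriminator update: real images -> label 1, generated images -> label 0.
    fake_images = generator(z).detach()
    d_loss = bce(discriminator(real_images), torch.ones(batch, 1)) + \
             bce(discriminator(fake_images), torch.zeros(batch, 1))
    d_opt.zero_grad(); d_loss.backward(); d_opt.step()

    # Generator update: make the discriminator label generated images as real.
    g_loss = bce(discriminator(generator(z)), torch.ones(batch, 1))
    g_opt.zero_grad(); g_loss.backward(); g_opt.step()
    return d_loss.item(), g_loss.item()
```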
Since its advent, GAN has been used to develop numerous models for generating images. CycleGAN [10] transforms an image into another with a different art style or in a different domain while maintaining the concept of the original image. The model accomplishes this by introducing cycle consistency loss, which allows the conversion to proceed only long enough to turn the converted image back into the original image. Gradually, GAN models were improved to be able to generate high-resolution images. ProGAN [11] has demonstrated that it is more effective to generate high-resolution images by gradually adding layers
FIGURE 1. Sample images from test datasets used for our study.

to create larger images from smaller ones. On the other hand, BigGAN [12] reliably generates high-resolution images by applying orthogonal regularization to the generator with two to four times more parameters than that for the former GAN model. A GAN model capable of generating photorealistic images from a semantic layout has already been introduced: GauGAN [13] leverages spatially adaptive normalization for increased visual fidelity.

Several GAN models for generating face images have also been constructed. StarGAN [14] was proposed with a framework that enables multidomain conversion through a single model incorporating domain classification loss to generate diverse facial styles. With the addition of a mapping network and a style encoder module, StarGANv2 [15] is then able to express various styles under a specific domain, effectively converting not only human faces but also animal faces. Meanwhile, StyleGAN [16] is designed to be a style-based generator with a mapping network that generates facial images with detailed attribution. The subsequent version, StyleGAN2 [17], enhances generator normalization to eliminate artifacts in StyleGAN-produced images and advance the model. StyleGAN3 [18] was then proposed to solve the texture-sticking problem of StyleGAN2 from a signal processing perspective. Some GAN models have advanced beyond 2D image generation and can generate 3D images. For example, Chan et al. proposed EG3D [19], a tri-plane-based 3D GAN framework that enables multi-view face image generation.

2) DIFFUSION-BASED MODEL
A diffusion model generates an image by gradually adding noise to the input image through a diffusion process to create a noisy image and then learning parameters to restore the original image from the noisy image through a reverse process. This model was introduced by Wang et al. based on a denoising diffusion probabilistic model (DDPM) [2], which has been mentioned earlier. Subsequently, the ablated diffusion model (ADM) was introduced, enhancing the structural design of the DDPM and adding classifier guidance to produce class-fidelity images.
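The forward (noising) process just described has a simple closed form, sketched below. This is a generic DDPM illustration; the schedule length and beta range are assumed values, not the settings of any model discussed in this survey.

```python
import torch

# Illustrative DDPM forward process: x_t is a progressively noisier version of x_0.
# Schedule values are assumptions; real models tune beta_start, beta_end, and T.
T = 1000
betas = torch.linspace(1e-4, 0.02, T)
alphas_bar = torch.cumprod(1.0 - betas, dim=0)

def add_noise(x0, t):
    """Sample x_t ~ q(x_t | x_0) = N(sqrt(a_bar_t) * x_0, (1 - a_bar_t) * I)."""
    noise = torch.randn_like(x0)
    a_bar = alphas_bar[t].view(-1, 1, 1, 1)
    xt = a_bar.sqrt() * x0 + (1.0 - a_bar).sqrt() * noise
    # A denoising network is trained to predict `noise` from (xt, t); sampling
    # then reverses the process step by step to recover an image.
    return xt, noise
```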
The aforementioned diffusion models have a disadvantage in that the diffusion process is performed in pixel space, resulting in a long generation time. The latent diffusion model (LDM) [20] solves this problem by performing the diffusion process on a latent space extracted through an autoencoder, resulting in a shorter execution time. The open-source model Stable Diffusion [21], which was based on LDM, was then trained using the LAION dataset [22] and merged with the CLIP:ViT-L/14 [23] encoder to use text prompts as generation conditions. The model has since been fine-tuned and utilized in various ways. Stable Diffusion v2 [24] is an official fine-tuned model trained to produce higher-resolution images and uses CLIP:ViT-H as its encoder. Stable Diffusion v1.5 Realistic Vision v2.0 [25], also a fine-tuned model, produces photorealistic images and is available on the Civitai [26] website, where fine-tuned models of Stable Diffusion are collected. Lexica [27] is another fine-tuned model of Stable Diffusion that generates smooth and glossy art images. The GLIDE [28] model enhances the classifier guidance of ADM by utilizing classifier-free and CLIP guidance to achieve photorealism in conditional images. Subsequently, the IF [29] model, consisting of a frozen T5 [30] encoder and a cascade of one base diffusion module and two super-resolution diffusion modules, was introduced. IF is a state-of-the-art model that produces image quality exceeding those of previous diffusion-based image generative models.

3) TRANSFORMER-BASED MODEL
In the image generative domain, the transformer model converts image pixels into tokens, which it then inputs into the transformer structure. Using an attention mechanism [3] that comprehends the complex connections between the tokens, the model generates new images by predicting and generating pixel tokens. DALL·E [31] is an autoregressive model that learns transformers by processing text and image tokens as a single stream based on their structure. DALL·E outperformed several GAN models in terms of Frechet inception distance [32] and inception score [33], which are used as
metrics to evaluate the generated images. In an attempt to make an open-source version of DALL·E, the model DALL·E Mini [34], trained with fewer parameters and images, was developed. Subsequently, Ramesh et al. proposed DALL·E 2 [35], a two-stage model consisting of a prior and decoder to better understand the representation of the text encoder, producing images that understood text captions better than DALL·E does. Taming Transformers [36] is a model that transforms an image into a codebook, a set of sequences that are passed through a transformer and then processed to create a new image.

4) STRUCTURE-UNKNOWN MODEL
Midjourney [4], which is considered state-of-the-art among image generative models, currently has no publicly known model structure. Midjourney is presently a paid subscription service that provides image generation in a variety of art styles and super-resolution.

B. PERFORMANCE COMPARISON IN AI-GENERATED-IMAGE DETECTION RESEARCH STUDIES
In other AI-generated-image detection studies, the typical approach is to compare the performance of the proposed method to those of existing detection methods. Past comparisons of the performance of the detection methods selected for our research study are as follows. A research study on CNNDetection [37] evaluated its performance on eleven synthesis-model-generated images, including those from six GAN models, and compared it with those of the methods studied by Zhang et al. [38]. A study on the no-down method [39] tested its detection performance on images generated by five GAN models, four diffusion models, and three transformer models and compared it with those of other detection methods [37], [38], [40]. The research study that introduced the patch selection model (PSM) [41] combined multiple generative models by family and evaluated the performance of the proposed model on each family with those of other detection methods [37], [39], [40]. Similarly, the study that developed the deep image fingerprint (DIF) [42] tested its detection performance on images generated by seven GAN models, three diffusion models, two transformer models, and Midjourney and compared it with the performance of two rule-based methods [43], [44] and artifact-based methods [37], [39]. A research study that explored universal fake image detection (UFD) tested its performance against those of other detection methods [37], [38], [40], [45] on images generated by six GAN models, three diffusion models, one transformer model, and five other generative models.

C. DATA AUGMENTATION IN AI-GENERATED-IMAGE DETECTION
The two most prevalent data augmentations employed in AI-generated-image detection research are based on Gaussian blur and JPEG compression. These data augmentation techniques have been utilized in multiple studies to improve the generalized detection performance of AI-generated-image detection models. Xuan et al. [46] found that applying Gaussian blur and Gaussian noise to an image dataset prior to training reduced low-level noise in the images and helped the detection model learn intrinsic features. On the other hand, Wang et al. [37] showed that training on JPEG-compressed images can generalize a model to detect GAN-model-generated images and make it more robust to detect JPEG-compressed images.

III. AI-GENERATED-IMAGE DETECTION METHODS
In this section, we present the detection methods used in our study. We selected five AI-generated-image detection methods that provide training code or pretrained model weights from the official Github repository. Then, we divided the AI-generated-image detection methods into three categories based on the image features that they use.

A. ARTIFACT-FEATURE-BASED DETECTION
Artifact-feature-based detection identifies common artifacts in AI-generated images. To create a model based on this detection method, a deep learning network is trained on both real and AI-generated images. CNNDetection [37] is a widely published study that applied this approach to the detection of AI-generated images. A pretrained ResNet50 model [47] was trained using the ProGAN training dataset [37] into a binary classifier that distinguishes real images from AI-generated images, and its detection performance was evaluated on eleven CNN-based models for image generation, including six GAN models. Furthermore, the incorporation of probabilistic data augmentation techniques such as Gaussian blur and JPEG compression led to enhanced generalization performance in detecting images generated from various generative models. Subsequently, the no-down detection study [39] applied the no-down architecture [48], which does not perform downsampling or use pooling in the first layer of the ResNet50, to achieve higher detection performance. This architecture prevents the suppression of artifacts and allows for further calculation of the noise residual, enabling the model to learn the more comprehensive features of an AI-generated image. The detection model that utilizes both the global and local features of an image by using the patch selection module (PSM) [41] was proposed by Ju et al. This model passes the entire input image through a ResNet50 model to extract a global feature map, resizes the selected local patches using a patch selection module, and passes them through the ResNet50 model again to extract local features. The global and local features are then fused using an attention-based feature fusion module to determine whether the image is real or AI-generated using a binary classifier.

Furthermore, in addition to the convolutional neural network (CNN) model employed thus far, we utilized the ViT model to evaluate the detection performance on images generated by AI models with a transformer structure. ViT is a state-of-the-art model in image classification and applies the
self-attention mechanism used in natural language processing
to images, dividing them into patches and generating
sequences as input to a transformer.
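The artifact-feature recipe described in this subsection amounts to fine-tuning a pretrained backbone as a real/fake classifier. The sketch below is an illustrative reimplementation in that spirit, not the released code of CNNDetection or any other compared method; the single-logit head and optimizer settings are assumptions.

```python
import torch
import torch.nn as nn
from torchvision import models

# Sketch of an artifact-feature detector: an ImageNet-pretrained ResNet50 whose
# final layer is replaced by a single real/fake logit (>0 means AI-generated).
model = models.resnet50(weights=models.ResNet50_Weights.IMAGENET1K_V1)
model.fc = nn.Linear(model.fc.in_features, 1)

criterion = nn.BCEWithLogitsLoss()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)

def train_step(images, labels):
    """images: (B, 3, H, W) tensor; labels: 1.0 for AI-generated, 0.0 for real."""
    logits = model(images).squeeze(1)
    loss = criterion(logits, labels.float())
    optimizer.zero_grad(); loss.backward(); optimizer.step()
    return loss.item()
```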

B. SPECTRUM-FEATURE-BASED DETECTION
Spectrum-feature-based detection is a technique for identi-
fying distinct patterns that appear when a set of generated
images is transformed into spectrum space by a fast Fourier
transform. Sergey et al. introduced the deep image fingerprint
(DIF) [42], which employs the inductive bias of CNN
to extract fingerprints from generated images. DIF is a
rule-based method that obtains an artificial fingerprint by
filtering an image through a U-Net [49] high-pass filter and
classifying it as either real or AI-generated by comparing the
correlation between the fingerprint and the residuals extracted
from the image. The DIF model has exhibited generalized
detection performance even when trained on only a small
number of AI-generated images, as few as 512.
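Spectrum-feature approaches start from the kind of transform sketched below: averaging the log-magnitude Fourier spectrum over a set of images, in which many generators leave periodic, grid-like peaks that real photographs lack. This is a generic illustration of the spectrum-space view only, not the DIF fingerprint pipeline itself.

```python
import numpy as np

def mean_log_spectrum(images):
    """images: iterable of 2-D grayscale arrays (H x W), all the same size.
    Returns the averaged, centered log-magnitude spectrum of the set."""
    acc, count = None, 0
    for img in images:
        f = np.fft.fftshift(np.fft.fft2(img.astype(np.float64)))
        log_mag = np.log1p(np.abs(f))
        acc = log_mag if acc is None else acc + log_mag
        count += 1
    return acc / max(count, 1)
```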

C. IMAGE-ENCODER-FEATURE-BASED DETECTION
Image-encoder-based detection uses the feature space of
an image-to-text encoder such as CLIP [23], which has
been trained on a large number of image–text pair datasets.
Ojha et al. [50] discussed the challenge of classifying
both GAN and diffusion-generated images using models that were simply trained on real and AI-generated images. Therefore, they proposed a universal fake image detector (UFD) that utilizes the feature space of CLIP:ViT-L/14, which has learned from 400 million diverse images, to detect AI-generated images from various domains. The UFD model uses the CLIP model as a backbone and learns only the added linear classification layer. This model generates a feature bank for real and AI-generated images based on training images. Then, it classifies an input image using the feature bank, assessing if the input image is closer to a real or AI-generated image by calculating the cosine distance with the features inputted into the image encoder.
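The feature-bank comparison described above can be sketched as follows. The snippet assumes the OpenAI CLIP package (github.com/openai/CLIP) is installed; the bank construction and nearest-neighbour decision rule are illustrative assumptions rather than the released UFD code.

```python
import torch
import clip  # assumes the OpenAI CLIP package is installed
from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-L/14", device=device)

@torch.no_grad()
def embed(path):
    """Unit-normalized CLIP:ViT-L/14 image embedding, so dot products are cosines."""
    image = preprocess(Image.open(path).convert("RGB")).unsqueeze(0).to(device)
    feat = model.encode_image(image).float()
    return feat / feat.norm(dim=-1, keepdim=True)

@torch.no_grad()
def classify(path, real_bank, fake_bank):
    """real_bank / fake_bank: (N, D) unit-normalized embeddings of training images.
    The image is labelled by whichever bank contains its closest neighbour."""
    q = embed(path)
    sim_real = (q @ real_bank.T).max()
    sim_fake = (q @ fake_bank.T).max()
    return "ai-generated" if sim_fake > sim_real else "real"
```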
IV. DATASETS
A. TRAINING DATASETS
We selected two different training datasets: the ProGAN training dataset provided in the CNNDetection study [37] to test on GAN-generated images, and the LDM training dataset provided by Corvi et al. [53] to test on diffusion-generated, transformer-generated, and Midjourney-generated images. The ProGAN training dataset comprises 364K ProGAN-generated images with LSUN images and 364K real images from the LSUN dataset, all with dimensions of 256 × 256 and in PNG format. From the 364K ProGAN-generated images and 364K real images, 4K each were used as the validation set. The LDM training dataset contains 200K latent-diffusion-generated images in PNG format and 200K real images from the LSUN and COCO datasets in JPEG format. From the 200K LDM-generated images and 200K real images, 20K each were used as the validation set. To match the size of the LDM-generated images, we rescaled and center-cropped the real images down to dimensions of 256 × 256.
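The real-image preprocessing just described can be expressed with standard torchvision transforms; the interpolation mode is an assumption.

```python
from torchvision import transforms

# Rescale the shorter side to 256 and center-crop to 256 x 256, matching the
# LDM-generated image size described above.
real_image_preprocess = transforms.Compose([
    transforms.Resize(256),
    transforms.CenterCrop(256),
])
```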
TABLE 1. Specific information about different types of test datasets used in performance comparison.

B. TEST DATASETS
In our study, the test dataset for a single generative model was not composed of a 50–50 split between real and AI-generated images. Instead, the real-image dataset consisted exclusively of real images, whereas the AI-generated-image datasets consisted exclusively of AI-generated images. To see why, suppose that the Stable Diffusion test dataset had a 1:1 ratio of real and AI-generated images: an overall detection accuracy of 75% could then mean, for example, either 100% accuracy on real images and 50% on Stable-Diffusion-generated images, or 80% on real images and 70% on Stable-Diffusion-generated images. By organizing the test dataset in this manner, we could more accurately identify the detection performance of the detection model on real images and on each of the AI-generated images. A sampling of the test images used in our study is shown in Fig. 1, and specific details of the test datasets are provided in Table 1. The ''+'' sign in the Images column of Table 1 signifies that the different test datasets under the Source column for that row are combined.

Our test dataset consisted of 23 AI-generated-image datasets and 3 real-image datasets. The AI-generated-image datasets included images generated by 10 GAN models, 8 diffusion models, 4 transformer models, and Midjourney. All images in the test dataset were in PNG format, except for the real images and Lexica [27], which were in JPEG format.
FIGURE 2. Framework for performance comparison of AI-generated-image detection methods.

Depending on the structure of the generative model, the test dataset was divided into three parts.

1) GAN-GENERATED-IMAGE TEST DATASET
The GAN-generated-image test dataset consisted of 39.3K images generated by ProGAN [11], BigGAN [12], CycleGAN [10], GauGAN [13], StarGAN [14], StarGAN2 [15], StyleGAN [16], StyleGAN2 [17], StyleGAN3 [18], and EG3D [19]. The dataset was based on the test dataset used in the CNNDetection study [37]. We also used test images generated by StarGAN2, StyleGAN, StyleGAN3, and EG3D from other detection studies [51], [52], [53] for the supplementary GAN-generated images to evaluate the performance of the models against those of the most recent GAN models.

2) DIFFUSION-GENERATED-IMAGE TEST DATASET
The diffusion-generated-image test dataset contained 32K images generated by LDM [20], Stable Diffusion-V1 (SD-V1) [21], Stable Diffusion-V2 (SD-V2) [24], Stable Diffusion-V1 Realistic Vision V2.0 (SD-V1 RV 2.0) [25], Lexica [27], ADM [58], GLIDE [28], and IF [29]. This dataset was based on the test datasets used by Corvi et al. [53] and Lu et al. [52], albeit expanded with images provided in other studies on various detection methods [50], [51] and the DiffusionDB dataset [54] to increase the number of images and vary the image domains. Lexica images were obtained from the official website via crawling [27]. We conducted experiments on several Stable-Diffusion-based fine-tuned model datasets to evaluate whether detection models trained on LDM-generated images can also properly detect images generated by a fine-tuned Stable Diffusion model that has the same structure as that of the LDM.

3) TRANSFORMER-GENERATED-IMAGE TEST DATASET
The test dataset provided by Corvi et al. [53] was also used as the base of the transformer-generated-image test dataset, which consisted of images generated by DALL·E Mini [34], DALL·E 2 [35], and Taming Transformers [36]. We used additional images from DALL·E Mini and DALL·E 2 provided in the DIF research study [42], and images from DALL·E [31] provided in the UFD study [50] test dataset. In addition to transformer-generated images, we also included Midjourney-generated images in the transformer test dataset, for a total of 13K images.

4) REAL-IMAGE TEST DATASET
The real-image test dataset is composed of the CC3M [55] validation dataset, COCO [56] test dataset, and ImageNet [57] test dataset, comprising a total of 39.3K images. The CC3M, COCO, and ImageNet datasets are widely used image datasets consisting of images from a variety of domains. When the real dataset is used with one of the aforementioned AI-generated-image datasets (GAN, diffusion, or transformer), the number of real images is adjusted to match the total number of images in the AI-generated-image dataset, resulting in a 1:1 ratio. Thus, the GAN-generated-image test dataset was paired with a real-image test dataset comprising 12.4K CC3M, 13.3K COCO, and 13.6K ImageNet images. The diffusion-generated-image test dataset was paired with a real-image test dataset comprising 10.7K images from each of the CC3M, COCO, and ImageNet datasets. The transformer-generated-image test dataset was paired with a real-image test dataset comprising 4.3K images from each of the CC3M, COCO, and ImageNet datasets. This is the most effective way to determine the performance of the detection method, because the total accuracy will be 50% even if the detection model classifies all images as real or as AI-generated.

V. COMPARISON SETTINGS
In this section, we describe the comparison-experiment settings for each method. The performance comparison framework of our study is illustrated in Fig. 2. Each detection
method was obtained from the official Github repository. If a pretrained model that was trained on the same dataset we used for training was available in the official repository, we selected it for the test. For CNNDetection and UFD, we applied the ProGAN pretrained model. CNNDetection provided several pretrained ProGAN models: models pretrained on images augmented with Gaussian blur and JPEG compression, each applied with a 50% probability, and a model pretrained on images augmented with a 10% probability. We selected the model pretrained on images with a 10% probability because it performed better in the CNNDetection study [37]. For the no-down detection method, we used both the ProGAN pretrained model and LDM pretrained model. The methods without a pretrained model were trained on the ProGAN training dataset and LDM training dataset, respectively; the model trained on ProGAN-generated images was tested on GAN-generated images, and the model trained on LDM-generated images was tested on diffusion and transformer-generated images. Then, we analyzed the performance against each test dataset, using accuracy as the metric.

A. TRAINING SETTINGS
The goal of our study is to analyze the extent to which each method can achieve its highest generalization performance. Therefore, for optimal training, we set most of the training parameters to their default values in the training code rather than using the same parameters among the different methods. However, we applied the same data augmentation parameter, which significantly affects performance. The CNNDetection, PSM, and UFD models, which accounted for Gaussian blur and JPEG compression as data augmentations, were set to use the same Gaussian blur and JPEG compression probabilities of 10%, Gaussian blur sigma value range of 0 to 3, and JPEG compression quality range of 30 to 100 when training. For CNNDetection [37] and UFD [50], which provided multiple base model architectures, we used the model architectures that were used in their respective introductory studies. We used ResNet50 for CNNDetection, and ViT-L/14 for UFD. For the ViT, we used an ImageNet-pretrained ViT-B/16 model, which has a 16 × 16 patch size, and added the classification head to the fine-tuning configuration as a binary classifier for detecting real and AI-generated images. More detailed training settings for the detection methods are provided in Appendix A.
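The shared augmentation policy described above (a 10% probability each of Gaussian blur with sigma in [0, 3] and JPEG re-compression with quality in [30, 100]) can be sketched with PIL as follows; this is an illustration of the policy, not the exact augmentation code of any of the compared methods.

```python
import io
import random
from PIL import Image, ImageFilter

def train_augment(img: Image.Image, p_blur: float = 0.1, p_jpeg: float = 0.1) -> Image.Image:
    """Illustrative training-time augmentation with the parameter ranges used in
    this study: blur sigma drawn from [0, 3], JPEG quality drawn from [30, 100]."""
    if random.random() < p_blur:
        sigma = random.uniform(0.0, 3.0)
        # Pillow's `radius` argument is the standard deviation of the Gaussian kernel.
        img = img.filter(ImageFilter.GaussianBlur(radius=sigma))
    if random.random() < p_jpeg:
        quality = random.randint(30, 100)
        buf = io.BytesIO()
        img.save(buf, format="JPEG", quality=quality)  # round-trip through JPEG
        buf.seek(0)
        img = Image.open(buf).convert("RGB")
    return img
```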
B. TEST SETTING
When conducting our tests, we used mostly the default values from the existing test codes of the detection methods but applied the same batch size of 1 to all methods. Although UFD was capable of determining the optimal threshold that provides the highest accuracy for a given dataset, we used a threshold of 0.5 to be fair to the other methods. The detailed test settings for all detection methods are provided in Appendix B.

In a real-world scenario, repeated saving and downloading of AI-generated images on social media can cause image quality degradation through JPEG compression. This makes it more difficult for detection models to recognize AI-generated images. Therefore, we conducted additional tests to analyze the robustness of each detection method to JPEG compression. We compressed test dataset images using the Python Imaging Library (PIL) [59] with JPEG qualities of 50, 70, and 90. In addition, a hostile adversary could manipulate an AI-generated image to evade detection. We considered a scenario where an adversary applies Gaussian blur to AI-generated images. Therefore, we applied Gaussian blur using the OpenCV library [60] to the test datasets with sigma values of 1, 2, and 3, respectively, and then evaluated the performance of the detection methods. With regard to the models for testing JPEG-compressed and Gaussian-blurred images, as described earlier, we tested the methods involving JPEG compression and Gaussian blur as augmentation techniques using a model trained with image augmentation and the other methods, ViT and DIF, using a model trained without image augmentation.
from the existing test codes of the detection methods but
applied the same batch size of 1 to all methods. Although VI. COMPARISON RESULTS
UFD was capable of determining the optimal threshold that The comparison results for the detection performance of the
provides the highest accuracy for a given dataset, we used selected detection methods are shown in Table 2, 3, and 4.
a threshold of 0.5 to be fair to the other methods. The The accuracy of a detection model on the AI-generated
detailed test settings for all detection methods are provided image dataset is the percentage of images detected by the
in Appendix B. model as AI-generated, whereas the accuracy of the detection

VOLUME 12, 2024 62615


D. Park et al.: Performance Comparison and Visualization of AI-Generated-Image Detection Methods

TABLE 2. GAN-generated-image and real-image detection accuracies of detection methods trained on ProGAN training dataset.

model on the real-image dataset is the percentage of all images that the model correctly detected as real. The total accuracy is the number of correctly detected images divided by the total image dataset size. In the tables, ''CD'' denotes CNNDetection, ''Nd'' denotes no-down, all accuracies are rounded to the second digit after the decimal point, and the highest values are bolded.

A. RESULTS ON GAN-GENERATED IMAGES
The detection performance of each method trained with the ProGAN dataset on the GAN-generated images is reported in Table 2. First, the accuracies on the ProGAN images, which were generated by the same model used to generate the training images, were greater than 99% for all tested methods except for DIF. This shows that most of the methods are good at detecting images generated by the GAN models that were also used to generate the images on which the detection models were trained. The no-down method performed well on most GAN-generated-image datasets that were not included in the training, exhibiting the best detection performance on the images generated by BigGAN, StarGAN, StarGAN2, StyleGAN, and StyleGAN2, and on the real images. Because all artifact detection methods demonstrated a performance greater than 90% on images generated by StarGAN, we can assume that ProGAN and StarGAN have similar artifacts. PSM performed slightly better than CNNDetection on the GAN-generated-image test dataset, but not as well as the no-down method. Among the six detection methods, ViT demonstrated the lowest detection accuracy on the GAN-generated images, suggesting that, as a detection model, it lacks generalization performance on GAN-generated images. Interestingly, DIF achieved the highest detection accuracy, at 99.30%, for EG3D, but had a weak performance of less than 30% on the COCO and ImageNet real images. The weak detection performance of DIF on real images can be assumed to have been due to the insufficient configuration of the ProGAN training dataset to separate the spectrum features of real and AI-generated images. With the exception of DIF, all five methods correctly classified the real images with accuracies greater than 95%. The images generated by CycleGAN and GauGAN were detected most accurately by the UFD model, achieving accuracies of 98.03% and 99.70%, respectively. However, UFD had a performance of approximately only 50% on images generated using models that are mainly for face images, such as StyleGAN2 and StyleGAN3. StyleGAN3 is the GAN model whose images were the hardest to detect; on these images, CNNDetection achieved the highest detection rate of 72.03%, higher than that of any other model. In total, when trained on the ProGAN training dataset, the no-down method outperformed all other methods on both GAN-generated and real images. Given that all methods have accuracies greater than 65% on the GAN-generated images, it can be concluded that a detection model trained only on ProGAN-generated images can detect other GAN-generated images to some extent and that the images generated by different GAN models share similar features.
TABLE 3. Diffusion-generated-image and real-image detection accuracies of detection methods trained on LDM training dataset.

model, the no-down method exhibited the highest accuracy, which was greater than 99%, on images generated by SD-V1, SD-V2, and SD-V1 RV 2.0, whereas UFD had the highest accuracy, at 84.75%, on images generated by Lexica. DIF demonstrated superior performance on images generated by LDM and SD-V1 but underperformed compared to the other methods on images generated by the fine-tuned models SD-V2, SD-V1 RV 2.0, and Lexica. We notice that the detection performance of the detection methods decreases significantly on images generated by fine-tuned models such as Lexica that have a significantly different image style from that of the original Stable Diffusion model (SD-V1). UFD performed generally well across the different diffusion generative models, with accuracies of 89.43% on the ADM-generated images, 94.73% on the GLIDE-generated images, and 70.63% on the IF-generated images, compared to the other five methods, which all resulted in performance levels worse than 50% on the images generated by ADM, GLIDE, and IF. Overall, UFD achieved the highest accuracy, reaching 86.66%, on the diffusion-generated images. However, its performance on real images, at 80.53%, was slightly lower than those of the other methods, which exceeded 95%, with the no-down method exhibiting the highest accuracy, at 99.9%. The performance of the models in detecting differences between the real and AI-generated images is, to some extent, degraded, because the image-encoder features of the real and AI-generated images become less distinct when the detection model is trained on the LDM training dataset than when trained on the ProGAN training dataset. CNNDetection achieved the highest total accuracy, which was 85.7%. The diffusion-generated-image dataset that resulted in the lowest detection rate was the IF-generated-image dataset, with an average accuracy of 19.98%. A comparison between the detection accuracies of the models on images generated by GAN and diffusion revealed that all artifact methods exhibited a decrease in performance. This implies that images produced by various diffusion models have fewer shared artifact features compared to those generated by GAN models.

C. RESULTS ON TRANSFORMER-GENERATED AND MIDJOURNEY-GENERATED IMAGES
The detection performance of each method trained with the LDM dataset on the transformer-generated and Midjourney-generated images is outlined in Table 4. UFD outperformed all other methods with high detection accuracies greater than 98% on images generated by DALL·E Mini, DALL·E, and Taming Transformers, and 84.20% on images generated by DALL·E 2. The no-down method exhibited a performance of 71.5% and 80.2% on images generated by DALL·E Mini and DALL·E, respectively, but a low detection accuracy of 2.22% on images generated by DALL·E 2. None of the detection methods successfully detected the images generated by Midjourney, which is a state-of-the-art generative model. UFD had the highest success rate, at 56.04%, whereas ViT had the lowest success rate, at 5.43%. DIF exhibited the weakest performance on images generated by transformer and Midjourney. It can be inferred that the spectrum features of diffusion-generated and transformer-generated images are quite dissimilar. The highest accuracy on the transformer-generated and Midjourney-generated images combined was 80.86% for UFD, whereas the highest real accuracy was 99.85% for the no-down method, which was similar to the results on the diffusion-generated images. Of the six methods, UFD had the highest overall accuracy. UFD showed good performance on the images generated by the transformer, which has a completely different structure from the LDM with which the training images were generated; however, its performance on real images was lower than those of the other methods. Other methods based on artifact features have shown limited detection performance on images generated by DALL·E Mini, DALL·E, and Taming Transformers, but not on images generated by DALL·E 2.
TABLE 4. Transformer-generated-image, Midjourney-generated-image, and real-image detection accuracies of detection methods trained on LDM training dataset.

FIGURE 3. Comparison of total detection accuracies on JPEG-compressed images for each test dataset.

D. RESULTS ON JPEG-COMPRESSED IMAGES
Fig. 3 illustrates the detection performance of the different methods on JPEG-compressed images.

1) GAN-GENERATED IMAGES
First, we can see from Fig. 3(a) that, for most of the detection methods working on the GAN-generated images, the detection rate decreased as the JPEG compression quality decreased. In particular, ViT exhibited the largest decrease in accuracy, with a difference of 29.93% between no JPEG compression and 50%-quality JPEG compression. DIF performed better when the JPEG compression quality was 90% than on the original images, but the overall accuracy was in the 50% range, and thus, this detection method can hardly be considered robust to JPEG compression. For the JPEG-compressed GAN-generated images, most of the tested methods exhibited decreased performance.

2) DIFFUSION-GENERATED IMAGES
From Fig. 3(b), we can see that, for the diffusion-generated images, the detection accuracies of CNNDetection, PSM, ViT, DIF, and UFD decreased as the JPEG compression quality decreased. The detection performance of the no-down method on the JPEG-compressed images slightly increased, by 1.44% at 50%-quality JPEG compression, compared to its detection performance on the original images. On the other hand, UFD exhibited a detection accuracy of 74.14% on the images with 50%-quality JPEG compression, corresponding to a 9.46% loss of detection accuracy compared to that on the original images, the largest decrease among the tested methods.

3) TRANSFORMER-GENERATED AND MIDJOURNEY-GENERATED IMAGES
According to Fig. 3(c), which illustrates the detection performance of the tested methods on transformer-generated images, all methods except ViT and UFD performed better when the JPEG compression quality was set to 50% than on the original images. Although the tested methods had limited detection performance on the transformer-generated images, it is feasible to conclude that the JPEG compression decreased the image quality and facilitated the recognition of comprehensive features in the AI-generated images. As a result, the no-down method emerged as the most robust to JPEG compression, with the least degradation
FIGURE 4. Comparison of total detection accuracies on Gaussian-blurred images for each test dataset.

in detection performance on JPEG-compressed diffusion-generated, transformer-generated, and Midjourney-generated images. CNNDetection was the next most robust to JPEG-compressed images. The same JPEG augmentations we applied to CNNDetection and PSM were also applied to UFD via training. However, UFD did not perform as well as the other two methods on the JPEG-compressed images.

E. RESULTS ON GAUSSIAN-BLURRED IMAGES
Fig. 4 illustrates the variation in total accuracy for different values of Gaussian blur sigma.

1) GAN-GENERATED IMAGES
Fig. 4(a), illustrating the result of applying Gaussian blur to the GAN-generated images, shows that the differences in the detection performance on the original and at sigma 3 were 9.48% for PSM and 24.06% for UFD, indicating significant decreases in the detection performance. By contrast, the other methods did not exhibit significant decreases in performance. The performance of CNNDetection, no-down, ViT, and DIF did not degrade much as Gaussian blur intensity was increased.

2) DIFFUSION-GENERATED IMAGES
Fig. 4(b) shows the detection results on the diffusion-generated images after Gaussian blur was applied. DIF exhibited a significant decrease in performance of 17.15% even when only a weak blur with a sigma value of 1 was applied, whereas UFD demonstrated a detection performance that was 20.14% worse at a Gaussian blur sigma value of 1 than on the original image. On the other hand, when the detection performance of the no-down method on the original images was compared with that at a Gaussian blur sigma value of 3, it was found to have increased rather than decreased. At a sigma value of 3, CNNDetection, PSM, and ViT exhibited 2.39%, 9.49%, and 3.03% lower detection performance, respectively, than on the original. For DIF, the detection performance at a sigma value of 3 was 24.84% worse than on the original, and for UFD, it was 32.04% worse.

3) TRANSFORMER-GENERATED AND MIDJOURNEY-GENERATED IMAGES
The results for the transformer-generated and Midjourney-generated images in Fig. 4(c) show an 11.94% decrease in performance for DIF and a 31.47% decrease in performance for UFD at a sigma value of 3 compared to their performance on the original. Similar to what it exhibited on the Gaussian-blurred diffusion-generated images, UFD showed a significant drop in performance on the transformer-generated images with a weak Gaussian blur of sigma value 1. There were no noteworthy performance differences for PSM and ViT across various sigma values. On the other hand, CNNDetection showed a 5.13% better performance at sigma value 3 than on the original, whereas the no-down method showed 5.33% worse performance at sigma value 1 than on the original, but improved at sigma values 2 and 3, resulting in a 6.43% improvement in detection performance at sigma 3. Overall, increases in the Gaussian blur intensity had the least impact on the detection performance of CNNDetection and the no-down method. By contrast, UFD was found to be weak on Gaussian-blurred images.

VII. VISUALIZATION
A. GRAD-CAM
To identify which parts of the image are detected as artifacts by artifact-feature detection, we performed a Grad-CAM [8] visualization of CNNDetection and the no-down method, two of the best-performing artifact-feature methods in this study. Grad-CAM is a visualization method that uses gradients to determine the weight of a layer, revealing the specific areas of an image on which the model has focused. We set the target layer of Grad-CAM to the last layer of ResNet50, which is the structure of both CNNDetection and the no-down method.
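The Grad-CAM setup just described (target layer = the last ResNet50 block) can be sketched with the widely used pytorch-grad-cam package; the package choice, the stand-in ImageNet weights, and the input size are assumptions rather than the authors' exact visualization code.

```python
import numpy as np
from PIL import Image
from torchvision import models, transforms
from pytorch_grad_cam import GradCAM  # assumes pytorch-grad-cam is installed
from pytorch_grad_cam.utils.model_targets import ClassifierOutputTarget
from pytorch_grad_cam.utils.image import show_cam_on_image

# An ImageNet-pretrained ResNet50 stands in for the detector here; in practice the
# fine-tuned detector weights would be loaded instead.
model = models.resnet50(weights=models.ResNet50_Weights.IMAGENET1K_V1).eval()
preprocess = transforms.Compose([transforms.Resize((224, 224)), transforms.ToTensor()])

img = Image.open("example.png").convert("RGB")          # hypothetical input image
rgb = np.array(img.resize((224, 224))) / 255.0           # float RGB for overlay
input_tensor = preprocess(img).unsqueeze(0)

# Target the last convolutional block of ResNet50, as described above.
cam = GradCAM(model=model, target_layers=[model.layer4[-1]])
heatmap = cam(input_tensor=input_tensor, targets=[ClassifierOutputTarget(0)])[0]
overlay = show_cam_on_image(rgb, heatmap, use_rgb=True)  # H x W x 3 visualization
```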
Fig. 5, 6, and 7 display the Grad-CAM visualization
FIGURE 5. Grad-CAM visualization and detection results on GAN test dataset.

results. The first and fourth columns show an original image from each dataset, whereas the second and fifth columns show the Grad-CAM results for CNNDetection, and the third and sixth columns show the Grad-CAM results for the no-down method. Below each Grad-CAM image, we show the visualization results on how the method classified that image, coloring it green if it was classified correctly and red if it was classified incorrectly.

In Fig. 5, which shows the Grad-CAM results for the classification of GAN-generated images by CNNDetection and the no-down method trained on the ProGAN training dataset, we can see that CNNDetection and the no-down method have different areas of focus in the image. CNNDetection concentrated on a few larger areas, whereas the no-down method concentrated on several smaller areas. Given that the no-down method subtracts a few pooling layers from its model structure, it can be assumed that this type of Grad-CAM image appears because it captures larger features than the original ResNet50 structure. In Fig. 6 and 7, the Grad-CAM results for the classification of the
FIGURE 6. Grad-CAM visualization and detection results on diffusion test dataset.

FIGURE 7. Grad-CAM visualization and detection results on transformer test dataset.

diffusion-generated, transformer-generated, and Midjourney-generated images by the two methods trained on the LDM training dataset more clearly show the difference in the focus of the two methods. Generally, CNNDetection focused
on parts of the object in the image and background, First, in Fig. 8(a), which illustrates the t-SNE results for the
whereas the no-down method focused along the edges of GAN-generated images as classified by the CNNDetection
the object. Neither method showed specific difference in and UFD methods, we can see that the cluster of ProGAN
correct or incorrect classification based on the area of images used for training CNNDetection is located far from
focus. On the other hand, although the images generated the cluster of real images in the feature space. We can also see
by the LDM and ADM are both bird images, and both that the unseen GAN-generated image has a boundary that
methods focused on similar regions, the LDM-generated image was correctly classified, whereas the ADM-generated image was incorrectly classified. Thus, it can be concluded that the presence of artifacts in the detection regions of the AI-generated images used for training is more important than the selection of image regions by the detection method. In Figs. 5 and 7, although the original images from CC3M, COCO, and ImageNet are the same, the Grad-CAM results for the two methods are different. Furthermore, CNNDetection trained on ProGAN-generated images detected the COCO real image incorrectly as an AI-generated image, whereas CNNDetection trained on LDM-generated images detected it correctly as a real image. This demonstrates that detection results can vary depending on the training dataset.

B. T-SNE
To see how the artifact-feature method and the image-encoder-feature method divide the generated images differently in feature space, we visualized them using t-SNE [9]. T-SNE is a dimensionality-reduction technique that enables the visualization of data clustering. We chose CNNDetection as the artifact-feature method and UFD as the image-encoder-feature method to visualize using t-SNE, and we extracted feature values from the layer preceding the classification layer. CNNDetection is composed of a ResNet50 structure, with a feature-value size of [2048]. By contrast, UFD is composed of a CLIP:ViT/L-14 structure, with a feature-value size of [768]. The images used for the t-SNE visualization were 600 real images and 600 AI-generated images from each of the three test datasets, i.e., GAN-generated, diffusion-generated, and transformer-generated images, for a total of 1200 images per experiment. The datasets for t-SNE visualization were constructed for each generative-model structure as follows: for the 600 real images, 200 images from each of the three real-image datasets were obtained; for the GAN-generated images, 60 images from each of the 10 GAN-generated-image datasets; for the diffusion-generated images, 75 images from each of the eight diffusion-generated-image datasets; and for the transformer-generated and Midjourney-generated images, 120 images from each of the four transformer-generated-image datasets and the Midjourney-generated-image dataset. The t-SNE results for the GAN-generated images used feature values from the model trained on the ProGAN training dataset, whereas the t-SNE results for the diffusion-generated, transformer-generated, and Midjourney-generated images used feature values from the model trained on the LDM training dataset. The t-SNE results of each dataset for the CNNDetection and UFD methods are shown in Fig. 8.

In Fig. 8(a), CNNDetection clusters the ProGAN-generated images used for training and separates them from the real images to some extent. Fig. 8(a) also demonstrates that, in t-SNE, UFD is able to cluster each GAN model more effectively than CNNDetection; it can be observed that the StarGAN-generated and StarGAN2-generated images are clustered in the upper-right corner, whereas the StyleGAN3-generated and EG3D-generated images are also clustered at the top. Based on the detection performance of UFD on the GAN-generated images, as outlined in Table 2, it can be concluded that clustering quality does not always correlate with detection performance. However, owing to the nature of UFD, which detects AI-generated images by comparing the cosine distance to the feature bank of real images with the cosine distance to the feature bank of AI-generated images, clustering quality tends to affect performance to some extent. In Fig. 8(b), we can notice that CNNDetection trained on the LDM-generated-image training dataset separates the LDM-generated images from the real images, whereas the images generated by other diffusion models are evenly distributed with the real images. On the other hand, the t-SNE result for UFD in Fig. 8(b) shows that the LDM-generated images are more scattered, whereas the Lexica-generated images are distinctly clustered apart from the real images. Fig. 8(c) illustrates that, for both CNNDetection and UFD, the Taming-Transformers-generated images are highly clustered. Furthermore, for CNNDetection, the Midjourney-generated images are mixed with the real images, whereas for UFD, the Midjourney-generated images are clustered mostly separately from the real images. Overall, in Fig. 8, the distribution of real images shows that CNNDetection leads to more clustering to one side than does UFD.

In summary, CNNDetection clustered well on images generated by ProGAN, LDM, and Taming Transformers, whereas UFD clustered well on images from a wider range of generative models. In addition, UFD leads to better clustering for GAN-generated images than for diffusion-generated and transformer-generated images. This is because GAN-generated images are often limited to a specific domain. CNNDetection is a method for detecting GAN-generated images that performs well by identifying images that share similar artifacts with the AI-generated images used for training. This is achieved by learning to widen the gap between real and AI-generated images in the feature space. On the other hand, the UFD image-encoder method uses the CLIP image encoder to represent images in a feature space aligned with text. This enables the detection of generated images that resemble the domain of the AI-generated images used for training. Thus, it appears that images generated by diffusion, transformer, and Midjourney models, which were not included in the training, could be detected with a certain degree of generality.
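As a concrete illustration of this procedure, the following minimal sketch (our own illustration, not the released code of either method) extracts penultimate-layer features with a torchvision ResNet50 standing in for the CNNDetection backbone and projects them to two dimensions with scikit-learn's t-SNE; the file lists real_paths and fake_paths are hypothetical placeholders.

import numpy as np
import torch
import torch.nn as nn
from PIL import Image
from sklearn.manifold import TSNE
from torchvision import models, transforms

# Preprocessing roughly matching a ResNet50 test pipeline (illustrative values).
preprocess = transforms.Compose([
    transforms.Resize(256),
    transforms.CenterCrop(224),
    transforms.ToTensor(),
])

# Drop the classification layer so the model outputs the 2048-D penultimate features.
backbone = models.resnet50(weights=models.ResNet50_Weights.DEFAULT)
backbone.fc = nn.Identity()
backbone.eval()

@torch.no_grad()
def extract_features(image_paths):
    """Return an (N, 2048) array of features from the layer before the classifier."""
    feats = []
    for path in image_paths:
        x = preprocess(Image.open(path).convert("RGB")).unsqueeze(0)
        feats.append(backbone(x).squeeze(0).numpy())
    return np.stack(feats)

# real_paths and fake_paths would list the 600 real and 600 generated images
# sampled as described above (hypothetical file lists).
# feats = extract_features(real_paths + fake_paths)
# emb = TSNE(n_components=2, perplexity=30, init="pca", random_state=0).fit_transform(feats)
# emb can then be scatter-plotted per source dataset to produce plots like Fig. 8.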


FIGURE 8. T-SNE visualization of CNNDetection and UFD methods on different generative-model image datasets.
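For reference, the nearest-neighbor rule attributed to UFD in the preceding analysis can be summarized by the following sketch; this is a simplified reading of the mechanism described above, not the released implementation, and the feature banks are assumed to hold L2-normalized CLIP ViT-L/14 image features.

import numpy as np

def min_cosine_distance(query, bank):
    """Smallest cosine distance between an L2-normalized query vector and the
    rows of an L2-normalized feature bank."""
    return float((1.0 - bank @ query).min())

def is_generated(query_feat, real_bank, fake_bank):
    """Label an image as AI-generated if its nearest neighbor (in cosine
    distance) lies in the bank of generated-image features rather than in the
    bank of real-image features."""
    q = query_feat / np.linalg.norm(query_feat)
    return min_cosine_distance(q, fake_bank) < min_cosine_distance(q, real_bank)

# real_bank and fake_bank would be (N, 768) arrays of normalized CLIP features
# built from reference real and AI-generated images (hypothetical names).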


VIII. DISCUSSION

A. AI-GENERATED-IMAGE DETECTION PERFORMANCE
With regard to performance on GAN-generated images, the detection methods based on artifact features demonstrated mostly good detection performance, among which the no-down method exhibited overwhelmingly superior detection performance with a total accuracy of 94.54%. UFD, which detects based on image-encoder features, showed good performance on images generated by various unseen GAN models, especially CycleGAN and GauGAN, but had weak performance on images generated by StyleGAN2 and StyleGAN3. For images generated by diffusion models, CNNDetection and the no-down method performed well on images generated by latent diffusion and Stable Diffusion models, whereas UFD performed well on images generated by unseen models such as ADM, GLIDE, and IF. Excluding images generated by latent diffusion and Stable Diffusion V1, DIF did not properly detect diffusion-generated images, classifying almost all of them as real images. Compared with GAN models, diffusion models generate images from a wider variety of domains, which are expected to share fewer spectrum features. ViT, which was used as an additional detection method in our study, did not perform as well as the other methods as an AI-generated-image detection model. Because the inductive bias of the CNN structure is not present in the transformer structure [7], we estimate that, for proper performance, the model needs to be trained on a larger number of images than was present in the dataset we used for training the detection models.

In addition, we found that even though the detection methods were trained on LDM-generated images, the detection performance decreases significantly for images generated by fine-tuned Stable Diffusion models such as Lexica, which is essentially the same as an LDM. As more open-source image generative models are fine-tuned, detecting images generated by these models will become progressively more challenging. Therefore, the detection model should be able to capture features that are intrinsic to the generated image and that do not change when the generative model is fine-tuned. Artifact-based methods were mostly good at detecting images from generative models that were not significantly different from latent diffusion, which is the model that generated the images on which they were trained, but were not as good at detecting generated images from state-of-the-art generative models with significantly different structures, such as IF and Midjourney. UFD, an image-encoder-based method, performed reasonably well on the diffusion-generated-image datasets, except on the SD-V2-generated images, where its performance only just exceeded 70%. In particular, UFD showed superior performance on untrained images, exceeding 84% on all transformer-generated images. However, UFD had the limitation of performing worse than the artifact methods on real images. It can be concluded that the artifact method is suitable for detecting AI-generated images that share similar features with the training images, yielding very high accuracies, whereas the image-encoder method exhibited good performance that generalizes to most generated images. For further research on trusted AI-generated-image detection models, it is possible to create an ensemble model that combines the advantages of the artifact and image-encoder methods.

B. ROBUSTNESS TO DATA AUGMENTATION
The change in detection performance under JPEG compression and Gaussian blur varied slightly depending on the detection method and the structure of the generative model that produced the input image. DIF and ViT, which did not include JPEG compression and Gaussian blur as data augmentations during training, generally performed worse than the other methods on augmented images. As the quality of JPEG compression decreased, the detection performance on GAN-generated images mostly decreased; however, the no-down method exhibited the most consistent detection performance on diffusion-generated and transformer-generated images, followed by CNNDetection. When Gaussian blur was applied, CNNDetection and the no-down method showed the most stable performance on images generated by the three different generative-model structures. By contrast, UFD showed a significant decrease in performance when detecting Gaussian-blurred images. To ensure the robustness of a detection method against attacks that attempt to evade detection using JPEG compression and Gaussian blur, it is necessary to use JPEG compression and Gaussian blur as data augmentation during the training process.


C. VISUALIZATION OF DETECTION METHODS
Based on the Grad-CAM results for CNNDetection and the no-down method, we noticed that even though they were trained on the same dataset, the focus areas on the input images differed depending on the model structure. Furthermore, given that the detection results are only weakly related to the focus areas on the input images, it can be inferred that the presence of artifacts in the detection areas of the AI-generated images used to train the detection models matters more than the selection of the focus areas that affect the detection results. With regard to detection performance, we found that the same image can be classified correctly or incorrectly depending on the training dataset of the detection method, showing the importance of how the training dataset of an AI-generated-image detection method is configured.

After that, we used t-SNE visualization to investigate the difference in the feature-space distributions of real and AI-generated images between artifact detection and image-encoder detection. Because the distributions of the ProGAN and LDM images used for training were far from the distributions of the real images, CNNDetection was well able to detect images with similar artifacts among the AI-generated images that were not used for training, i.e., images whose distributions are close to those of the AI-generated images that were used for training. This signifies that the artifact method assumes that the AI-generated image being classified has artifacts similar to those of the images on which it was trained. This can be seen as a limitation when detecting images from new generative models. On the other hand, the t-SNE results for UFD indicate that the CLIP image encoder enables AI-generated images that were not included in the training to cluster by model in the feature space to some extent. Although the degree of clustering was not always proportional to the detection performance, it was possible to demonstrate some degree of generalized detection performance across generative models. A higher-performing detector could probably be created if CLIP can exhibit this level of clustering performance on AI-generated images without being trained on them and if a dataset with a larger number of AI-generated images can be built.

IX. CONCLUSION AND FUTURE WORK
In our study, we compared the detection performance of six AI-generated-image detection methods on a total of 23 AI-generated-image datasets consisting of GAN-generated, diffusion-generated, transformer-generated, and Midjourney-generated images. We categorized the six detection methods into three groups, i.e., artifact, spectrum, and image encoder, based on the image features used for detection, and analyzed the effectiveness of each detection method in detecting AI-generated images according to its category. We then tested how detection performance changed when JPEG compression and Gaussian blur were applied to the images. We also used visualization techniques to show the differences between artifact-feature-based detection methods in terms of their focus regions on an image, and between detection methods using different features in terms of their distribution of images in feature space. Subsequently, we correlated these differences with detection performance. In summary, our analysis focused on the objective evaluation of the detection performance of AI-generated-image detection methods and proposed future directions for them.

Detecting AI-generated images is becoming increasingly challenging owing to the continual development of image generative models that produce better and higher-quality images. Therefore, in our next study, we will collect images from state-of-the-art generative models, such as Stable Diffusion XL [61] and DALL·E 3 [62], and evaluate how well additional AI-generated-image detection methods detect them. After that, we will use the LIME [63] visualization technique to analyze the areas that a detection method focuses on with more precision.

APPENDIX A
TRAINING SETTING
We used a single RTX A5000 GPU as the computing environment for training our selected detection methods. The training parameters for the five detection methods, i.e., excluding the no-down method, are shown in Table 5. For the no-down method, models pretrained on ProGAN-generated images and LDM-generated images were obtained instead. We used the same parameters for training the models on the ProGAN-generated-image training dataset and the LDM-generated-image training dataset. We referred to the post [64] by Hugging Face for the code and parameters for fine-tuning ViT. As described in Section V-A, most of the parameters of each method adopted the default values from the original training code, except for the data-augmentation parameter. Additionally, DIF has a parameter that sets how many real and AI-generated images are used for training; its default value is 512 images. However, given that it may not perform well when trained with fewer images than those used for the other methods, we set this parameter to 360K for the ProGAN-generated-image training dataset and 180K for the LDM-generated-image training dataset and conducted additional training and testing. In these additional tests, the performance was higher with 360K and 180K images than with only 512 images. Therefore, we adjusted the number of training samples for DIF to match the number of training-dataset images used by the other methods instead of simply following the default value.

TABLE 5. Training parameters for detection methods used in our study.

APPENDIX B
TEST SETTING
After training for a set number of epochs per method, the model at the epoch with the best evaluation results on the validation set was used for the test. Given that DIF has no validation set in its training process, we tested all of its models every 5 epochs and then used the model with the best results. When testing the detection models on the images generated by GAN, diffusion, transformer, and Midjourney models, we used the same test parameters for each method, which are as follows (a sketch of the corresponding preprocessing is given after this list):
• CNNDetection: Batch size 1, No resize, No crop, No JPEG compression, No Gaussian blur.
• No-down: Batch size 1, No resize, No crop, No JPEG compression, No Gaussian blur.
• Patch Selection Module: Batch size 1, No resize, No crop, No JPEG compression, No Gaussian blur.
• Vision Transformer: Batch size 1, 224 × 224 resize (Interpolation Mode: Bilinear), No crop.
• Deep Image Fingerprint: Batch size 1, No resize, 256 × 256 center crop.
• Universal Fake Detect: Batch size 1, No resize, 224 × 224 center crop, No JPEG compression, No Gaussian blur.


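The per-method test-time preprocessing listed above can be expressed, for example, as torchvision transforms; the mapping below is our own sketch of those settings, not configuration shipped with any of the methods, and method-specific normalization is omitted.

from torchvision import transforms
from torchvision.transforms import InterpolationMode

# Test-time preprocessing corresponding to the parameter list above
# (batch size 1 is handled by the data loader, not by the transforms).
TEST_TRANSFORMS = {
    "CNNDetection":           transforms.ToTensor(),   # no resize, no crop
    "No-down":                transforms.ToTensor(),
    "Patch Selection Module": transforms.ToTensor(),
    "Vision Transformer":     transforms.Compose([
        transforms.Resize((224, 224), interpolation=InterpolationMode.BILINEAR),
        transforms.ToTensor(),
    ]),
    "Deep Image Fingerprint": transforms.Compose([
        transforms.CenterCrop(256),
        transforms.ToTensor(),
    ]),
    "Universal Fake Detect":  transforms.Compose([
        transforms.CenterCrop(224),
        transforms.ToTensor(),
    ]),
}

# Example usage: x = TEST_TRANSFORMS["Universal Fake Detect"](pil_image).unsqueeze(0)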
REFERENCES
[1] I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, and Y. Bengio, "Generative adversarial nets," in Proc. Adv. Neural Inf. Process. Syst., vol. 27, 2014, pp. 1–9.
[2] J. Ho, A. Jain, and P. Abbeel, "Denoising diffusion probabilistic models," in Proc. NIPS, vol. 33. Vancouver, BC, Canada: Curran Associates, 2020, pp. 6840–6851.
[3] A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł. Kaiser, and I. Polosukhin, "Attention is all you need," in Proc. Adv. Neural Inf. Process. Syst., vol. 30, 2017, pp. 1–11.
[4] (2023). Midjourney. Accessed: Sep. 08, 2023. [Online]. Available: https://www.midjourney.com/
[5] K. Roose. (2022). An AI-Generated Picture Won an Art Prize. Artists Aren't Happy. Accessed: Sep. 22, 2023. [Online]. Available: https://www.nytimes.com/2022/09/02/technology/ai-artificial-intelligence-artists.html
[6] R. Andrew, B. Ross Sorkin, S. Warner, M. Kessler, L. de la Merced, E. Hirsch, and E. Livni. (2023). An AI-Generated Spoof Rattles the Markets. Accessed: Sep. 22, 2023. [Online]. Available: https://www.nytimes.com/2023/05/23/business/ai-picture-stock-market.html
[7] A. Dosovitskiy, L. Beyer, A. Kolesnikov, D. Weissenborn, X. Zhai, T. Unterthiner, M. Dehghani, M. Minderer, G. Heigold, S. Gelly, J. Uszkoreit, and N. Houlsby, "An image is worth 16×16 words: Transformers for image recognition at scale," 2020, arXiv:2010.11929.
[8] R. R. Selvaraju, M. Cogswell, A. Das, R. Vedantam, D. Parikh, and D. Batra, "Grad-CAM: Visual explanations from deep networks via gradient-based localization," in Proc. IEEE Int. Conf. Comput. Vis. (ICCV), Oct. 2017, pp. 618–626.
[9] L. Van der Maaten and G. Hinton, "Visualizing data using t-SNE," J. Mach. Learn. Res., vol. 9, no. 11, pp. 1–22, 2008.
[10] J.-Y. Zhu, T. Park, P. Isola, and A. A. Efros, "Unpaired image-to-image translation using cycle-consistent adversarial networks," in Proc. IEEE Int. Conf. Comput. Vis. (ICCV), Oct. 2017, pp. 2242–2251.
[11] T. Karras, T. Aila, S. Laine, and J. Lehtinen, "Progressive growing of GANs for improved quality, stability, and variation," 2017, arXiv:1710.10196.
[12] A. Brock, J. Donahue, and K. Simonyan, "Large scale GAN training for high fidelity natural image synthesis," 2018, arXiv:1809.11096.
[13] T. Park, M.-Y. Liu, T.-C. Wang, and J.-Y. Zhu, "Semantic image synthesis with spatially-adaptive normalization," in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2019, pp. 2332–2341.
[14] Y. Choi, M. Choi, M. Kim, J.-W. Ha, S. Kim, and J. Choo, "StarGAN: Unified generative adversarial networks for multi-domain image-to-image translation," in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit., Jun. 2018, pp. 8789–8797.
[15] Y. Choi, Y. Uh, J. Yoo, and J.-W. Ha, "StarGAN v2: Diverse image synthesis for multiple domains," in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2020, pp. 8185–8194.
[16] T. Karras, S. Laine, and T. Aila, "A style-based generator architecture for generative adversarial networks," in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2019, pp. 4396–4405.
[17] T. Karras, S. Laine, M. Aittala, J. Hellsten, J. Lehtinen, and T. Aila, "Analyzing and improving the image quality of StyleGAN," in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2020, pp. 8107–8116.
[18] T. Karras, M. Aittala, S. Laine, E. Härkönen, J. Hellsten, J. Lehtinen, and T. Aila, "Alias-free generative adversarial networks," in Proc. Adv. Neural Inf. Process. Syst., vol. 34, 2021, pp. 852–863.
[19] E. R. Chan, C. Z. Lin, M. A. Chan, K. Nagano, B. Pan, S. de Mello, O. Gallo, L. Guibas, J. Tremblay, S. Khamis, T. Karras, and G. Wetzstein, "Efficient geometry-aware 3D generative adversarial networks," in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2022, pp. 16123–16133.
[20] R. Rombach, A. Blattmann, D. Lorenz, P. Esser, and B. Ommer, "High-resolution image synthesis with latent diffusion models," in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2022, pp. 10674–10685.
[21] CompVis. (2023). Stable-Diffusion. Accessed: Aug. 03, 2023. [Online]. Available: https://github.com/CompVis/stable-diffusion
[22] C. Schuhmann, R. Beaumont, R. Vencu, C. Gordon, R. Wightman, M. Cherti, T. Coombes, A. Katta, C. Mullis, M. Wortsman, P. Schramowski, S. Kundurthy, K. Crowson, L. Schmidt, R. Kaczmarczyk, and J. Jitsev, "LAION-5B: An open large-scale dataset for training next generation image-text models," in Proc. Adv. Neural Inf. Process. Syst., vol. 35, 2022, pp. 25278–25294.
[23] A. Radford, J. W. Kim, C. Hallacy, A. Ramesh, G. Goh, S. Agarwal, G. Sastry, A. Askell, P. Mishkin, J. Clark, G. Krueger, and I. Sutskever, "Learning transferable visual models from natural language supervision," in Proc. Int. Conf. Mach. Learn., vol. 139, 2021, pp. 8748–8763.
[24] Stability-AI. (2023). StableDiffusion. Accessed: Aug. 03, 2023. [Online]. Available: https://github.com/Stability-AI/stablediffusion
[25] Realistic Vision V2.0. Accessed: Sep. 29, 2023. [Online]. Available: https://civitai.com/models/4201?modelVersionId=29460
[26] (2023). Civitai. Accessed: Sep. 29, 2023. [Online]. Available: https://civitai.com/models
[27] (2023). Lexica. Accessed: Aug. 17, 2023. [Online]. Available: https://lexica.art/
[28] A. Nichol, P. Dhariwal, A. Ramesh, P. Shyam, P. Mishkin, B. McGrew, I. Sutskever, and M. Chen, "GLIDE: Towards photorealistic image generation and editing with text-guided diffusion models," 2021, arXiv:2112.10741.
[29] (2023). DeepFloyd IF. Accessed: Oct. 22, 2023. [Online]. Available: https://github.com/deep-floyd/IF
[30] C. Raffel, N. Shazeer, A. Roberts, K. Lee, S. Narang, M. Matena, Y. Zhou, W. Li, and P. J. Liu, "Exploring the limits of transfer learning with a unified text-to-text transformer," J. Mach. Learn. Res., vol. 21, no. 1, pp. 5485–5551, 2020.
[31] A. Ramesh, M. Pavlov, G. Goh, S. Gray, C. Voss, A. Radford, M. Chen, and I. Sutskever, "Zero-shot text-to-image generation," in Proc. Int. Conf. Mach. Learn. (ICML), Jul. 2021, pp. 8821–8831.
[32] M. Heusel, H. Ramsauer, T. Unterthiner, B. Nessler, and S. Hochreiter, "GANs trained by a two time-scale update rule converge to a local Nash equilibrium," in Proc. Adv. Neural Inf. Process. Syst., vol. 30, 2017, pp. 1–10.
[33] T. Hinz, M. Fisher, O. Wang, and S. Wermter, "Improved techniques for training single-image GANs," in Proc. IEEE Winter Conf. Appl. Comput. Vis. (WACV), Jan. 2021, pp. 1–10.
[34] B. Dayma, S. Patil, P. Cuenca, K. Saifullah, T. Abraham, P. L. Khac, L. Melas, and R. Ghosh. (2021). DALL·E Mini. [Online]. Available: https://github.com/borisdayma/dalle-mini
[35] A. Ramesh, P. Dhariwal, A. Nichol, C. Chu, and M. Chen, "Hierarchical text-conditional image generation with CLIP latents," 2022, arXiv:2204.06125.
[36] P. Esser, R. Rombach, and B. Ommer, "Taming transformers for high-resolution image synthesis," in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2021, pp. 12868–12878.


[37] S.-Y. Wang, O. Wang, R. Zhang, A. Owens, and A. A. Efros, "CNN-generated images are surprisingly easy to spot... for now," in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2020, pp. 8692–8701.
[38] X. Zhang, S. Karaman, and S.-F. Chang, "Detecting and simulating artifacts in GAN fake images," in Proc. IEEE Int. Workshop Inf. Forensics Secur. (WIFS), Dec. 2019, pp. 1–6.
[39] D. Gragnaniello, D. Cozzolino, F. Marra, G. Poggi, and L. Verdoliva, "Are GAN generated images easy to detect? A critical analysis of the state-of-the-art," in Proc. IEEE Int. Conf. Multimedia Expo (ICME), Jul. 2021, pp. 1–6.
[40] L. Chai, D. Bau, S.-N. Lim, and P. Isola, "What makes fake images detectable? Understanding properties that generalize," in Computer Vision—ECCV. Cham, Switzerland: Springer, 2020, pp. 103–120.
[41] Y. Ju, S. Jia, L. Ke, H. Xue, K. Nagano, and S. Lyu, "Fusing global and local features for generalized AI-synthesized image detection," in Proc. IEEE Int. Conf. Image Process. (ICIP), Oct. 2022, pp. 3465–3469.
[42] S. Sinitsa and O. Fried, "Deep image fingerprint: Towards low budget synthetic image detection and model lineage analysis," 2023, arXiv:2303.10762.
[43] F. Marra, D. Gragnaniello, L. Verdoliva, and G. Poggi, "Do GANs leave artificial fingerprints?" in Proc. IEEE Conf. Multimedia Inf. Process. Retr. (MIPR), Mar. 2019, pp. 506–511.
[44] M. Joslin and S. Hao, "Attributing and detecting fake images generated by known GANs," in Proc. IEEE Secur. Privacy Workshops (SPW), May 2020, pp. 8–14.
[45] L. Nataraj, T. Manhar Mohammed, S. Chandrasekaran, A. Flenner, J. H. Bappy, A. K. Roy-Chowdhury, and B. S. Manjunath, "Detecting GAN generated fake images using co-occurrence matrices," 2019, arXiv:1903.06836.
[46] X. Xuan, B. Peng, W. Wang, and J. Dong, "On the generalization of GAN image forensics," in Biometric Recognition. Cham, Switzerland: Springer, 2019, pp. 134–141.
[47] K. He, X. Zhang, S. Ren, and J. Sun, "Deep residual learning for image recognition," in Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2016, pp. 770–778.
[48] M. Boroumand, M. Chen, and J. Fridrich, "Deep residual network for steganalysis of digital images," IEEE Trans. Inf. Forensics Security, vol. 14, no. 5, pp. 1181–1193, May 2019.
[49] O. Ronneberger, P. Fischer, and T. Brox, "U-Net: Convolutional networks for biomedical image segmentation," in Lecture Notes in Computer Science. Cham, Switzerland: Springer, 2015, pp. 234–241.
[50] U. Ojha, Y. Li, and Y. Jae Lee, "Towards universal fake image detectors that generalize across generative models," in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2023, pp. 24480–24489.
[51] X. Guo, X. Liu, Z. Ren, S. Grosz, I. Masi, and X. Liu, "Hierarchical fine-grained image forgery detection and localization," in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2023, pp. 3155–3165.
[52] Z. Lu, D. Huang, L. Bai, J. Qu, C. Wu, X. Liu, and W. Ouyang, "Seeing is not always believing: Benchmarking human and model perception of AI-generated images," 2023, arXiv:2304.13023.
[53] R. Corvi, D. Cozzolino, G. Zingarini, G. Poggi, K. Nagano, and L. Verdoliva, "On the detection of synthetic images generated by diffusion models," in Proc. IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), Jun. 2023, pp. 1–5.
[54] Z. J. Wang, E. Montoya, D. Munechika, H. Yang, B. Hoover, and D. H. Chau, "DiffusionDB: A large-scale prompt gallery dataset for text-to-image generative models," 2022, arXiv:2210.14896.
[55] P. Sharma, N. Ding, S. Goodman, and R. Soricut, "Conceptual captions: A cleaned, hypernymed, image alt-text dataset for automatic image captioning," in Proc. 56th Annu. Meeting Assoc. Comput. Linguistics, 2018, pp. 2556–2565.
[56] T.-Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Dollár, and C. L. Zitnick, "Microsoft COCO: Common objects in context," in Computer Vision—ECCV. Cham, Switzerland: Springer, 2014, pp. 740–755.
[57] J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei, "ImageNet: A large-scale hierarchical image database," in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., Jun. 2009, pp. 248–255.
[58] P. Dhariwal and A. Nichol, "Diffusion models beat GANs on image synthesis," in Advances in Neural Information Processing Systems, vol. 34. Red Hook, NY, USA: Curran Associates, 2021, pp. 8780–8794.
[59] P. Umesh, "Image processing in Python," CSI Commun., vol. 23, 2012.
[60] G. Bradski, "The OpenCV library," Dr. Dobb's J. Softw. Tools, 2000.
[61] D. Podell, Z. English, K. Lacey, A. Blattmann, T. Dockhorn, J. Müller, J. Penna, and R. Rombach, "SDXL: Improving latent diffusion models for high-resolution image synthesis," 2023, arXiv:2307.01952.
[62] (2023). DALL·E 3. Accessed: Oct. 10, 2023. [Online]. Available: https://openai.com/dall-e-3
[63] M. T. Ribeiro, S. Singh, and C. Guestrin, "'Why should I trust you?' Explaining the predictions of any classifier," in Proc. 22nd ACM SIGKDD Int. Conf. Knowl. Discovery Data Mining, 2016, pp. 1135–1144.
[64] N. Raw. (2023). Fine-Tune ViT for Image Classification With Transformers. Accessed: Jun. 19, 2023. [Online]. Available: https://huggingface.co/blog/fine-tune-vit

DAEEOL PARK received the B.S. degree in software from Soongsil University, South Korea, in 2023, where he is currently pursuing the M.S. degree in software convergence. His research interests include generative models and AI security.

HYUNSIK NA is currently pursuing the bachelor's degree with Soongsil University, South Korea. He is conducting research as an Undergraduate Researcher with the AI Security Laboratory, Department of Software, Soongsil University. His research interests include AI trustworthiness, AI security, AI robustness, edge AI, data privacy, and computer vision.

DAESEON CHOI (Member, IEEE) received the B.S. degree in computer science from Dongguk University, South Korea, in 1995, the M.S. degree in computer science from Pohang Institute of Science and Technology, South Korea, in 1997, and the Ph.D. degree in computer science from Korea Advanced Institute of Science and Technology, South Korea, in 2009. He was a Professor with the Department of Medical Information, Kongju National University, South Korea, from September 2015 to August 2020. He is currently a Professor with the Department of Software, Soongsil University, South Korea. His research interests include identity management and information security.
