GenMix: Effective Data Augmentation with Generative Diffusion Model Image Editing

Islam, Khawar; Zaheer, Muhammad Zaigham; Mahmood, Arif; Nandakumar, Karthik; Akhtar, Naveed

Computer Science > Computer Vision and Pattern Recognition

arXiv:2412.02366 (cs)

[Submitted on 3 Dec 2024 (v1), last revised 6 Dec 2024 (this version, v3)]

Title:GenMix: Effective Data Augmentation with Generative Diffusion Model Image Editing

Authors:Khawar Islam, Muhammad Zaigham Zaheer, Arif Mahmood, Karthik Nandakumar, Naveed Akhtar

View PDF HTML (experimental)

Abstract:Data augmentation is widely used to enhance generalization in visual classification tasks. However, traditional methods struggle when source and target domains differ, as in domain adaptation, due to their inability to address domain gaps. This paper introduces GenMix, a generalizable prompt-guided generative data augmentation approach that enhances both in-domain and cross-domain image classification. Our technique leverages image editing to generate augmented images based on custom conditional prompts, designed specifically for each problem type. By blending portions of the input image with its edited generative counterpart and incorporating fractal patterns, our approach mitigates unrealistic images and label ambiguity, improving the performance and adversarial robustness of the resulting models. Efficacy of our method is established with extensive experiments on eight public datasets for general and fine-grained classification, in both in-domain and cross-domain settings. Additionally, we demonstrate performance improvements for self-supervised learning, learning with data scarcity, and adversarial robustness. As compared to the existing state-of-the-art methods, our technique achieves stronger performance across the board.

Comments:	this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2412.02366 [cs.CV]
	(or arXiv:2412.02366v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2412.02366

Submission history

From: Khawar Islam Mr [view email]
[v1] Tue, 3 Dec 2024 10:45:34 UTC (30,979 KB)
[v2] Wed, 4 Dec 2024 16:38:01 UTC (30,979 KB)
[v3] Fri, 6 Dec 2024 00:42:40 UTC (30,979 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:GenMix: Effective Data Augmentation with Generative Diffusion Model Image Editing

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:GenMix: Effective Data Augmentation with Generative Diffusion Model Image Editing

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators