DreamTeacher: Pretraining Image Backbones with Deep Generative Models

Li, Daiqing; Ling, Huan; Kar, Amlan; Acuna, David; Kim, Seung Wook; Kreis, Karsten; Torralba, Antonio; Fidler, Sanja

Computer Science > Computer Vision and Pattern Recognition

arXiv:2307.07487 (cs)

[Submitted on 14 Jul 2023]

Title:DreamTeacher: Pretraining Image Backbones with Deep Generative Models

Authors:Daiqing Li, Huan Ling, Amlan Kar, David Acuna, Seung Wook Kim, Karsten Kreis, Antonio Torralba, Sanja Fidler

View PDF

Abstract:In this work, we introduce a self-supervised feature representation learning framework DreamTeacher that utilizes generative networks for pre-training downstream image backbones. We propose to distill knowledge from a trained generative model into standard image backbones that have been well engineered for specific perception tasks. We investigate two types of knowledge distillation: 1) distilling learned generative features onto target image backbones as an alternative to pretraining these backbones on large labeled datasets such as ImageNet, and 2) distilling labels obtained from generative networks with task heads onto logits of target backbones. We perform extensive analyses on multiple generative models, dense prediction benchmarks, and several pre-training regimes. We empirically find that our DreamTeacher significantly outperforms existing self-supervised representation learning approaches across the board. Unsupervised ImageNet pre-training with DreamTeacher leads to significant improvements over ImageNet classification pre-training on downstream datasets, showcasing generative models, and diffusion generative models specifically, as a promising approach to representation learning on large, diverse datasets without requiring manual annotation.

Comments:	Project page: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2307.07487 [cs.CV]
	(or arXiv:2307.07487v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2307.07487

Submission history

From: Daiqing Li [view email]
[v1] Fri, 14 Jul 2023 17:17:17 UTC (47,795 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:DreamTeacher: Pretraining Image Backbones with Deep Generative Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:DreamTeacher: Pretraining Image Backbones with Deep Generative Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators