Context Encoders: Feature Learning by Inpainting

Pathak, Deepak; Krahenbuhl, Philipp; Donahue, Jeff; Darrell, Trevor; Efros, Alexei A.

Computer Science > Computer Vision and Pattern Recognition

arXiv:1604.07379 (cs)

[Submitted on 25 Apr 2016 (v1), last revised 21 Nov 2016 (this version, v2)]

Title:Context Encoders: Feature Learning by Inpainting

Authors:Deepak Pathak, Philipp Krahenbuhl, Jeff Donahue, Trevor Darrell, Alexei A. Efros

View PDF

Abstract:We present an unsupervised visual feature learning algorithm driven by context-based pixel prediction. By analogy with auto-encoders, we propose Context Encoders -- a convolutional neural network trained to generate the contents of an arbitrary image region conditioned on its surroundings. In order to succeed at this task, context encoders need to both understand the content of the entire image, as well as produce a plausible hypothesis for the missing part(s). When training context encoders, we have experimented with both a standard pixel-wise reconstruction loss, as well as a reconstruction plus an adversarial loss. The latter produces much sharper results because it can better handle multiple modes in the output. We found that a context encoder learns a representation that captures not just appearance but also the semantics of visual structures. We quantitatively demonstrate the effectiveness of our learned features for CNN pre-training on classification, detection, and segmentation tasks. Furthermore, context encoders can be used for semantic inpainting tasks, either stand-alone or as initialization for non-parametric methods.

Comments:	New results on ImageNet Generation
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG)
Cite as:	arXiv:1604.07379 [cs.CV]
	(or arXiv:1604.07379v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1604.07379
Journal reference:	CVPR 2016

Submission history

From: Deepak Pathak [view email]
[v1] Mon, 25 Apr 2016 19:42:46 UTC (8,753 KB)
[v2] Mon, 21 Nov 2016 20:56:42 UTC (9,190 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Context Encoders: Feature Learning by Inpainting

Submission history

Access Paper:

References & Citations

3 blog links

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Context Encoders: Feature Learning by Inpainting

Submission history

Access Paper:

References & Citations

3 blog links

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators