Emergence of Object Segmentation in Perturbed Generative Models

Bielski, Adam; Favaro, Paolo

arXiv:1905.12663 (cs)

[Submitted on 29 May 2019 (v1), last revised 2 Nov 2019 (this version, v2)]

Abstract: We introduce a novel framework to build a model that can learn how to segment objects from a collection of images without any human annotation. Our method builds on the observation that the location of object segments can be perturbed locally relative to a given background without affecting the realism of a scene. Our approach is to first train a generative model of a layered scene. The layered representation consists of a background image, a foreground image and the mask of the foreground. A composite image is then obtained by overlaying the masked foreground image onto the background. The generative model is trained in an adversarial fashion against a discriminator, which forces the generative model to produce realistic composite images. To force the generator to learn a representation where the foreground layer corresponds to an object, we perturb the output of the generative model by introducing a random shift of both the foreground image and mask relative to the background. Because the generator is unaware of the shift before computing its output, it must produce layered representations that are realistic for any such random perturbation. Finally, we learn to segment an image by defining an autoencoder consisting of an encoder, which we train, and the pre-trained generator as the decoder, which we freeze. The encoder maps an image to a feature vector, which is fed as input to the generator to give a composite image matching the original input image. Because the generator outputs an explicit layered representation of the scene, the encoder learns to detect and segment objects. We demonstrate this framework on real images of several object categories.
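The compositing and perturbation steps described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names (`shift2d`, `composite`, `random_shift`), the zero-filled borders after shifting, and the uniform integer shift range are all assumptions made for illustration, and the generator, discriminator and encoder are not shown.

```python
import numpy as np

def shift2d(x, dy, dx):
    """Translate an (H, W, C) array by (dy, dx) pixels.

    Assumption: pixels exposed by the shift are filled with zeros.
    """
    h, w = x.shape[:2]
    out = np.zeros_like(x)
    if abs(dy) >= h or abs(dx) >= w:
        return out  # shifted entirely out of the frame
    src_rows = slice(max(0, -dy), min(h, h - dy))
    src_cols = slice(max(0, -dx), min(w, w - dx))
    dst_rows = slice(max(0, dy), min(h, h + dy))
    dst_cols = slice(max(0, dx), min(w, w + dx))
    out[dst_rows, dst_cols] = x[src_rows, src_cols]
    return out

def composite(background, foreground, mask, shift=(0, 0)):
    """Overlay the masked foreground onto the background.

    The foreground and its mask are shifted together relative to the
    background, then blended as mask * foreground + (1 - mask) * background.
    `mask` has shape (H, W, 1) with values in [0, 1].
    """
    dy, dx = shift
    fg = shift2d(foreground, dy, dx)
    m = shift2d(mask, dy, dx)
    return m * fg + (1.0 - m) * background

def random_shift(rng, max_shift):
    """Sample an integer (dy, dx) uniformly from [-max_shift, max_shift] (assumed range)."""
    dy, dx = rng.integers(-max_shift, max_shift + 1, size=2)
    return int(dy), int(dx)
```

In a training loop of this kind, the shift would be sampled after the generator has produced its three layers, for example `composite(bg, fg, mask, random_shift(rng, 2))`, so the generator cannot adapt its output to a particular shift.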

Comments:	33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Spotlight presentation
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:1905.12663 [cs.CV]
	(or arXiv:1905.12663v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1905.12663

Submission history

From: Adam Bielski
[v1] Wed, 29 May 2019 18:17:39 UTC (5,506 KB)
[v2] Sat, 2 Nov 2019 17:46:33 UTC (8,538 KB)
