GIRAFFE: Representing Scenes as Compositional Generative Neural Feature Fields

Niemeyer, Michael; Geiger, Andreas

Computer Science > Computer Vision and Pattern Recognition

arXiv:2011.12100 (cs)

[Submitted on 24 Nov 2020 (v1), last revised 29 Apr 2021 (this version, v2)]

Title:GIRAFFE: Representing Scenes as Compositional Generative Neural Feature Fields

Authors:Michael Niemeyer, Andreas Geiger

View PDF

Abstract:Deep generative models allow for photorealistic image synthesis at high resolutions. But for many applications, this is not enough: content creation also needs to be controllable. While several recent works investigate how to disentangle underlying factors of variation in the data, most of them operate in 2D and hence ignore that our world is three-dimensional. Further, only few works consider the compositional nature of scenes. Our key hypothesis is that incorporating a compositional 3D scene representation into the generative model leads to more controllable image synthesis. Representing scenes as compositional generative neural feature fields allows us to disentangle one or multiple objects from the background as well as individual objects' shapes and appearances while learning from unstructured and unposed image collections without any additional supervision. Combining this scene representation with a neural rendering pipeline yields a fast and realistic image synthesis model. As evidenced by our experiments, our model is able to disentangle individual objects and allows for translating and rotating them in the scene as well as changing the camera pose.

Comments:	Accepted to CVPR 2021 (oral). Project page: this http URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2011.12100 [cs.CV]
	(or arXiv:2011.12100v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2011.12100

Submission history

From: Michael Niemeyer [view email]
[v1] Tue, 24 Nov 2020 14:14:15 UTC (2,462 KB)
[v2] Thu, 29 Apr 2021 14:46:36 UTC (2,467 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:GIRAFFE: Representing Scenes as Compositional Generative Neural Feature Fields

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:GIRAFFE: Representing Scenes as Compositional Generative Neural Feature Fields

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators