Object-Centric Representation Learning with Generative Spatial-Temporal Factorization

Nanbo, Li; Raza, Muhammad Ahmed; Wenbin, Hu; Sun, Zhaole; Fisher, Robert B.

arXiv:2111.05393 (cs)

[Submitted on 9 Nov 2021]

Authors:Li Nanbo, Muhammad Ahmed Raza, Hu Wenbin, Zhaole Sun, Robert B. Fisher

Abstract: Learning object-centric scene representations is essential for attaining structural understanding and abstraction of complex scenes. Yet, as current approaches for unsupervised object-centric representation learning are built upon either a stationary observer assumption or a static scene assumption, they often: i) suffer from single-view spatial ambiguities, or ii) infer object representations from dynamic scenes incorrectly or inaccurately. To address this, we propose Dynamics-aware Multi-Object Network (DyMON), a method that broadens the scope of multi-view object-centric representation learning to dynamic scenes. We train DyMON on multi-view-dynamic-scene data and show that DyMON learns, without supervision, to factorize the entangled effects of observer motions and scene object dynamics from a sequence of observations, and constructs scene object spatial representations suitable for rendering at arbitrary times (querying across time) and from arbitrary viewpoints (querying across space). We also show that the factorized scene representations (w.r.t. objects) support querying about a single object by space and time independently.

Comments:	Accepted at NeurIPS 2021
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2111.05393 [cs.LG]
	(or arXiv:2111.05393v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2111.05393

Submission history

From: Nanbo Li
[v1] Tue, 9 Nov 2021 20:04:16 UTC (7,337 KB)
