Revealing Occlusions with 4D Neural Fields

Van Hoorick, Basile; Tendulkar, Purva; Suris, Didac; Park, Dennis; Stent, Simon; Vondrick, Carl

arXiv:2204.10916 (cs)

[Submitted on 22 Apr 2022]

Abstract: For computer vision systems to operate in dynamic situations, they need to be able to represent and reason about object permanence. We introduce a framework for learning to estimate 4D visual representations from monocular RGB-D, which is able to persist objects, even once they become obstructed by occlusions. Unlike traditional video representations, we encode point clouds into a continuous representation, which permits the model to attend across the spatiotemporal context to resolve occlusions. On two large video datasets that we release along with this paper, our experiments show that the representation is able to successfully reveal occlusions for several tasks, without any architectural changes. Visualizations show that the attention mechanism automatically learns to follow occluded objects. Since our approach can be trained end-to-end and is easily adaptable, we believe it will be useful for handling occlusions in many video understanding tasks. Data, code, and models are available at this https URL.
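
As a rough illustration of what querying a continuous representation by attending over spatiotemporal context can look like, here is a minimal sketch. It is not the authors' implementation: the function name `attend`, the `temperature` parameter, and the use of negative squared space-time distance as attention logits (in place of learned query/key projections) are all assumptions made for illustration.

```python
import math

def attend(query, points, features, temperature=1.0):
    """Attend from one continuous (x, y, z, t) query over an input point cloud.

    Hypothetical simplification: logits are negative squared distances in
    space-time, not learned projections as a trained model would use.
    Returns the attended feature vector and the attention weights.
    """
    # One logit per input point: closer in space-time means a larger logit.
    logits = [
        -sum((q - p) ** 2 for q, p in zip(query, pt)) / temperature
        for pt in points
    ]
    # Numerically stable softmax over the logits.
    peak = max(logits)
    exps = [math.exp(l - peak) for l in logits]
    total = sum(exps)
    weights = [e / total for e in exps]
    # Weighted sum of per-point features gives the feature at the query.
    dim = len(features[0])
    out = [sum(w * f[d] for w, f in zip(weights, features)) for d in range(dim)]
    return out, weights

# Three observed points (x, y, z, t) with 2-D features; the query lies at a
# space-time coordinate that was never observed directly.
points = [(0.0, 0.0, 0.0, 0.0), (1.0, 0.0, 0.0, 0.0), (5.0, 5.0, 5.0, 1.0)]
features = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
out, weights = attend((0.9, 0.0, 0.0, 0.5), points, features)
```

Because the query is an arbitrary coordinate rather than a grid cell, the same function can be evaluated at locations and times where no input point exists, which is the property the abstract relies on for occluded regions.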

Comments:	CVPR 2022 (Oral)
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2204.10916 [cs.CV]
	(or arXiv:2204.10916v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2204.10916

Submission history

From: Basile Van Hoorick
[v1] Fri, 22 Apr 2022 20:14:42 UTC (15,430 KB)
