DeepContext: Context-Encoding Neural Pathways for 3D Holistic Scene Understanding

Zhang, Yinda; Bai, Mingru; Kohli, Pushmeet; Izadi, Shahram; Xiao, Jianxiong

Computer Science > Computer Vision and Pattern Recognition

arXiv:1603.04922 (cs)

[Submitted on 16 Mar 2016 (v1), last revised 16 Aug 2017 (this version, v4)]

Title:DeepContext: Context-Encoding Neural Pathways for 3D Holistic Scene Understanding

Authors:Yinda Zhang, Mingru Bai, Pushmeet Kohli, Shahram Izadi, Jianxiong Xiao

View PDF

Abstract:While deep neural networks have led to human-level performance on computer vision tasks, they have yet to demonstrate similar gains for holistic scene understanding. In particular, 3D context has been shown to be an extremely important cue for scene understanding - yet very little research has been done on integrating context information with deep models. This paper presents an approach to embed 3D context into the topology of a neural network trained to perform holistic scene understanding. Given a depth image depicting a 3D scene, our network aligns the observed scene with a predefined 3D scene template, and then reasons about the existence and location of each object within the scene template. In doing so, our model recognizes multiple objects in a single forward pass of a 3D convolutional neural network, capturing both global scene and local object information simultaneously. To create training data for this 3D network, we generate partly hallucinated depth images which are rendered by replacing real objects with a repository of CAD models of the same object category. Extensive experiments demonstrate the effectiveness of our algorithm compared to the state-of-the-arts. Source code and data are available at this http URL.

Comments:	Accepted by ICCV2017
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1603.04922 [cs.CV]
	(or arXiv:1603.04922v4 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1603.04922

Submission history

From: Yinda Zhang [view email]
[v1] Wed, 16 Mar 2016 00:09:41 UTC (85,350 KB)
[v2] Mon, 21 Mar 2016 06:46:42 UTC (10,956 KB)
[v3] Thu, 24 Nov 2016 15:34:03 UTC (7,584 KB)
[v4] Wed, 16 Aug 2017 04:12:50 UTC (8,801 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:DeepContext: Context-Encoding Neural Pathways for 3D Holistic Scene Understanding

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:DeepContext: Context-Encoding Neural Pathways for 3D Holistic Scene Understanding

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators