SceneNet RGB-D: 5M Photorealistic Images of Synthetic Indoor Trajectories with Ground Truth

McCormac, John; Handa, Ankur; Leutenegger, Stefan; Davison, Andrew J.

Computer Science > Computer Vision and Pattern Recognition

arXiv:1612.05079v1 (cs)

[Submitted on 15 Dec 2016 (this version), latest version 30 Jan 2017 (v3)]

Title:SceneNet RGB-D: 5M Photorealistic Images of Synthetic Indoor Trajectories with Ground Truth

Authors:John McCormac, Ankur Handa, Stefan Leutenegger, Andrew J. Davison

View PDF

Abstract:We introduce SceneNet RGB-D, expanding the previous work of SceneNet to enable large scale photorealistic rendering of indoor scene trajectories. It provides pixel-perfect ground truth for scene understanding problems such as semantic segmentation, instance segmentation, and object detection, and also for geometric computer vision problems such as optical flow, depth estimation, camera pose estimation, and 3D reconstruction. Random sampling permits virtually unlimited scene configurations, and here we provide a set of 5M rendered RGB-D images from over 15K trajectories in synthetic layouts with random but physically simulated object poses. Each layout also has random lighting, camera trajectories, and textures. The scale of this dataset is well suited for pre-training data-driven computer vision techniques from scratch with RGB-D inputs, which previously has been limited by relatively small labelled datasets in NYUv2 and SUN RGB-D. It also provides a basis for investigating 3D scene labelling tasks by providing perfect camera poses and depth data as proxy for a SLAM system. We host the dataset at this http URL.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1612.05079 [cs.CV]
	(or arXiv:1612.05079v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1612.05079

Submission history

From: Ankur Handa [view email]
[v1] Thu, 15 Dec 2016 14:22:38 UTC (4,313 KB)
[v2] Fri, 16 Dec 2016 01:37:54 UTC (4,313 KB)
[v3] Mon, 30 Jan 2017 11:06:14 UTC (4,311 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:SceneNet RGB-D: 5M Photorealistic Images of Synthetic Indoor Trajectories with Ground Truth

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:SceneNet RGB-D: 5M Photorealistic Images of Synthetic Indoor Trajectories with Ground Truth

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators