Implicit 3D Orientation Learning for 6D Object Detection from RGB Images

Sundermeyer, Martin; Marton, Zoltan-Csaba; Durner, Maximilian; Brucker, Manuel; Triebel, Rudolph

Computer Science > Computer Vision and Pattern Recognition

arXiv:1902.01275v2 (cs)

[Submitted on 4 Feb 2019 (v1), last revised 17 Jul 2019 (this version, v2)]

Title:Implicit 3D Orientation Learning for 6D Object Detection from RGB Images

Authors:Martin Sundermeyer, Zoltan-Csaba Marton, Maximilian Durner, Manuel Brucker, Rudolph Triebel

View PDF

Abstract:We propose a real-time RGB-based pipeline for object detection and 6D pose estimation. Our novel 3D orientation estimation is based on a variant of the Denoising Autoencoder that is trained on simulated views of a 3D model using Domain Randomization. This so-called Augmented Autoencoder has several advantages over existing methods: It does not require real, pose-annotated training data, generalizes to various test sensors and inherently handles object and view symmetries. Instead of learning an explicit mapping from input images to object poses, it provides an implicit representation of object orientations defined by samples in a latent space. Our pipeline achieves state-of-the-art performance on the T-LESS dataset both in the RGB and RGB-D domain. We also evaluate on the LineMOD dataset where we can compete with other synthetically trained approaches. We further increase performance by correcting 3D orientation estimates to account for perspective errors when the object deviates from the image center and show extended results.

Comments:	Code available at: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1902.01275 [cs.CV]
	(or arXiv:1902.01275v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1902.01275

Submission history

From: Martin Sundermeyer [view email]
[v1] Mon, 4 Feb 2019 16:03:57 UTC (7,888 KB)
[v2] Wed, 17 Jul 2019 14:12:26 UTC (6,600 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Implicit 3D Orientation Learning for 6D Object Detection from RGB Images

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Implicit 3D Orientation Learning for 6D Object Detection from RGB Images

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators