Digging Into Self-Supervised Monocular Depth Estimation

Godard, Clément; Mac Aodha, Oisin; Firman, Michael; Brostow, Gabriel

Computer Science > Computer Vision and Pattern Recognition

arXiv:1806.01260 (cs)

[Submitted on 4 Jun 2018 (v1), last revised 17 Aug 2019 (this version, v4)]

Title:Digging Into Self-Supervised Monocular Depth Estimation

Authors:Clément Godard, Oisin Mac Aodha, Michael Firman, Gabriel Brostow

View PDF

Abstract:Per-pixel ground-truth depth data is challenging to acquire at scale. To overcome this limitation, self-supervised learning has emerged as a promising alternative for training models to perform monocular depth estimation. In this paper, we propose a set of improvements, which together result in both quantitatively and qualitatively improved depth maps compared to competing self-supervised methods.
Research on self-supervised monocular training usually explores increasingly complex architectures, loss functions, and image formation models, all of which have recently helped to close the gap with fully-supervised methods. We show that a surprisingly simple model, and associated design choices, lead to superior predictions. In particular, we propose (i) a minimum reprojection loss, designed to robustly handle occlusions, (ii) a full-resolution multi-scale sampling method that reduces visual artifacts, and (iii) an auto-masking loss to ignore training pixels that violate camera motion assumptions. We demonstrate the effectiveness of each component in isolation, and show high quality, state-of-the-art results on the KITTI benchmark.

Comments:	ICCV 19
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
Cite as:	arXiv:1806.01260 [cs.CV]
	(or arXiv:1806.01260v4 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1806.01260

Submission history

From: Clément Godard [view email]
[v1] Mon, 4 Jun 2018 17:58:05 UTC (8,875 KB)
[v2] Tue, 5 Jun 2018 19:06:28 UTC (8,879 KB)
[v3] Fri, 3 May 2019 01:27:58 UTC (9,137 KB)
[v4] Sat, 17 Aug 2019 22:57:30 UTC (7,793 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Digging Into Self-Supervised Monocular Depth Estimation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Digging Into Self-Supervised Monocular Depth Estimation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators