Depth from Videos in the Wild: Unsupervised Monocular Depth Learning from Unknown Cameras

Gordon, Ariel; Li, Hanhan; Jonschkowski, Rico; Angelova, Anelia

Computer Science > Computer Vision and Pattern Recognition

arXiv:1904.04998 (cs)

[Submitted on 10 Apr 2019]

Title:Depth from Videos in the Wild: Unsupervised Monocular Depth Learning from Unknown Cameras

Authors:Ariel Gordon, Hanhan Li, Rico Jonschkowski, Anelia Angelova

View PDF

Abstract:We present a novel method for simultaneous learning of depth, egomotion, object motion, and camera intrinsics from monocular videos, using only consistency across neighboring video frames as supervision signal. Similarly to prior work, our method learns by applying differentiable warping to frames and comparing the result to adjacent ones, but it provides several improvements: We address occlusions geometrically and differentiably, directly using the depth maps as predicted during training. We introduce randomized layer normalization, a novel powerful regularizer, and we account for object motion relative to the scene. To the best of our knowledge, our work is the first to learn the camera intrinsic parameters, including lens distortion, from video in an unsupervised manner, thereby allowing us to extract accurate depth and motion from arbitrary videos of unknown origin at scale. We evaluate our results on the Cityscapes, KITTI and EuRoC datasets, establishing new state of the art on depth prediction and odometry, and demonstrate qualitatively that depth prediction can be learned from a collection of YouTube videos.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Robotics (cs.RO)
Cite as:	arXiv:1904.04998 [cs.CV]
	(or arXiv:1904.04998v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1904.04998
Journal reference:	The IEEE International Conference on Computer Vision (ICCV), 2019, pp. 8977-8986

Submission history

From: Ariel Gordon [view email]
[v1] Wed, 10 Apr 2019 04:16:30 UTC (7,812 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Depth from Videos in the Wild: Unsupervised Monocular Depth Learning from Unknown Cameras

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Depth from Videos in the Wild: Unsupervised Monocular Depth Learning from Unknown Cameras

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators