Computer vision is a research field that teaches computers to perform visual tasks such as understanding of a robot's surroundings or the ability to recognize objects. These are done through the use of sensors and algorithms.
Monocular depth estimation (MDE), which is the task of using a single image to predict scene depths, has gained considerable interest, in large part owing to the popularity of applying deep learning methods to solve “computer vision ...
In this work we propose consistent depth estimation for viewpoint reconstruction in data-driven simulation, combining aspects of learning-based monocular depth prediction and structure-from-motion to increase temporal video depth accuracy.