DeepLiDAR: Deep surface normal guided depth prediction for outdoor scene from sparse LiDAR data and single color image

J Qiu, Z Cui, Y Zhang, X Zhang, S Liu… - Proceedings of the …, 2019 - openaccess.thecvf.com
Proceedings of the IEEE/CVF conference on computer vision and …, 2019
Abstract
In this paper, we propose a deep learning architecture that produces accurate dense depth for outdoor scenes from a single color image and sparse depth. Inspired by indoor depth completion, our network estimates surface normals as an intermediate representation to produce dense depth, and can be trained end-to-end. With a modified encoder-decoder structure, our network effectively fuses the dense color image and the sparse LiDAR depth. To address outdoor-specific challenges, our network predicts a confidence mask to handle mixed LiDAR signals near foreground boundaries caused by occlusion, and combines estimates from the color image and surface normals using learned attention maps to improve depth accuracy, especially for distant areas. Extensive experiments demonstrate that our model improves upon the state-of-the-art performance on the KITTI depth completion benchmark. An ablation study shows the positive impact of each model component on the final performance, and comprehensive analysis shows that our model generalizes well to inputs with higher sparsity or from indoor scenes.
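The attention-based combination of the two depth estimates mentioned in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation: the function names `fuse_depth` and `mask_sparse_depth`, the per-pixel softmax over two attention maps, and the convention that zero marks a missing LiDAR sample are all assumptions made for illustration.

```python
import numpy as np


def fuse_depth(depth_color, depth_normal, attn_color, attn_normal):
    """Blend two dense depth estimates with per-pixel attention weights.

    All inputs are (H, W) arrays. The attention maps are treated as
    unnormalized scores and turned into weights with a two-way softmax
    (an assumption; the abstract only says "learned attention maps").
    """
    scores = np.stack([attn_color, attn_normal], axis=0)
    # Subtract the per-pixel maximum for numerical stability.
    scores = scores - scores.max(axis=0, keepdims=True)
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=0, keepdims=True)
    return weights[0] * depth_color + weights[1] * depth_normal


def mask_sparse_depth(sparse_depth, confidence):
    """Down-weight sparse LiDAR samples by a confidence map in [0, 1].

    Pixels without a measurement are assumed to be 0 and stay 0.
    """
    return sparse_depth * np.clip(confidence, 0.0, 1.0)
```

With equal attention scores the fused depth is the plain average of the two estimates; a much larger score on one branch makes the output follow that branch almost exactly.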