MonoDiffusion: Self-Supervised Monocular Depth Estimation Using Diffusion Model

Shao, Shuwei; Pei, Zhongcai; Chen, Weihai; Sun, Dingchi; Chen, Peter C. Y.; Li, Zhengguo

Computer Science > Computer Vision and Pattern Recognition

arXiv:2311.07198 (cs)

[Submitted on 13 Nov 2023]

Abstract: Over the past few years, self-supervised monocular depth estimation, which does not depend on ground truth during training, has received widespread attention. Most efforts focus on designing different types of network architectures and loss functions or on handling edge cases, e.g., occlusion and dynamic objects. In this work, we introduce a novel self-supervised depth estimation framework, dubbed MonoDiffusion, by formulating it as an iterative denoising process. Because the depth ground truth is unavailable in the training phase, we develop a pseudo ground-truth diffusion process to assist the diffusion in MonoDiffusion. The pseudo ground-truth diffusion gradually adds noise to the depth map generated by a pre-trained teacher model. Moreover, the teacher model allows applying a distillation loss to guide the denoised depth. Further, we develop a masked visual condition mechanism to enhance the denoising ability of the model. Extensive experiments are conducted on the KITTI and Make3D datasets, and the proposed MonoDiffusion outperforms prior state-of-the-art competitors. The source code will be available at this https URL.
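
As a rough illustration of what "gradually adds noise to the depth map generated by a pre-trained teacher model" can look like, the sketch below applies the standard DDPM-style forward process to a teacher-predicted depth map. It is not the authors' implementation: the linear schedule, the step count of 1000, the function names, and the sample depth values are all assumptions made for this example, since the abstract does not specify them.

```python
import math
import random


def linear_beta_schedule(num_steps, beta_start=1e-4, beta_end=0.02):
    """Assumed linear noise schedule; the abstract does not name one."""
    if num_steps == 1:
        return [beta_start]
    step = (beta_end - beta_start) / (num_steps - 1)
    return [beta_start + i * step for i in range(num_steps)]


def cumulative_alphas(betas):
    """Running product of (1 - beta_t), commonly written alpha-bar_t."""
    out, prod = [], 1.0
    for beta in betas:
        prod *= 1.0 - beta
        out.append(prod)
    return out


def add_noise(pseudo_gt_depth, t, alpha_bars, rng):
    """Sample a noisy depth map at step t from a pseudo ground-truth depth map.

    pseudo_gt_depth is a flat list of depth values standing in for the
    output of a (hypothetical) pre-trained teacher model.
    """
    signal = math.sqrt(alpha_bars[t])
    noise = math.sqrt(1.0 - alpha_bars[t])
    return [signal * d + noise * rng.gauss(0.0, 1.0) for d in pseudo_gt_depth]


alpha_bars = cumulative_alphas(linear_beta_schedule(1000))
rng = random.Random(0)
teacher_depth = [0.2, 0.5, 0.8]  # made-up normalized depths, for illustration only
slightly_noisy = add_noise(teacher_depth, 10, alpha_bars, rng)
mostly_noise = add_noise(teacher_depth, 999, alpha_bars, rng)
```

At small t the sample stays close to the teacher depth, while at the final step the signal weight is near zero and the sample is almost pure Gaussian noise. A denoising network would then be trained to reverse these steps, conditioned on the input image.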

Comments:	10 pages, 8 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2311.07198 [cs.CV]
	(or arXiv:2311.07198v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2311.07198

Submission history

From: Shuwei Shao
[v1] Mon, 13 Nov 2023 09:38:30 UTC (4,962 KB)
