Randomized 3D Scene Generation for Generalizable Self-supervised Pre-training

Li, Lanxiao; Heizmann, Michael

Computer Science > Computer Vision and Pattern Recognition

arXiv:2306.04237v1 (cs)

[Submitted on 7 Jun 2023 (this version), latest version 6 Aug 2023 (v2)]

Title:Randomized 3D Scene Generation for Generalizable Self-supervised Pre-training

Authors:Lanxiao Li, Michael Heizmann

View PDF

Abstract:Capturing and labeling real-world 3D data is laborious and time-consuming, which makes it costly to train strong 3D models. To address this issue, previous works generate randomized 3D scenes and pre-train models on generated data. Although the pre-trained models gain promising performance boosts, previous works have two major shortcomings. First, they focus on only one downstream task (i.e., object detection). Second, a fair comparison of generated data is still lacking. In this work, we systematically compare data generation methods using a unified setup. To clarify the generalization of the pre-trained models, we evaluate their performance in multiple tasks (e.g., object detection and semantic segmentation) and with different pre-training methods (e.g., masked autoencoder and contrastive learning). Moreover, we propose a new method to generate 3D scenes with spherical harmonics. It surpasses the previous formula-driven method with a clear margin and achieves on-par results with methods using real-world scans and CAD models.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2306.04237 [cs.CV]
	(or arXiv:2306.04237v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2306.04237

Submission history

From: Lanxiao Li [view email]
[v1] Wed, 7 Jun 2023 08:28:38 UTC (4,753 KB)
[v2] Sun, 6 Aug 2023 13:32:46 UTC (4,096 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Randomized 3D Scene Generation for Generalizable Self-supervised Pre-training

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Randomized 3D Scene Generation for Generalizable Self-supervised Pre-training

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators