LRM-Zero: Training Large Reconstruction Models with Synthesized Data

Xie, Desai; Bi, Sai; Shu, Zhixin; Zhang, Kai; Xu, Zexiang; Zhou, Yi; Pirk, Sören; Kaufman, Arie; Sun, Xin; Tan, Hao

arXiv:2406.09371 (cs)

[Submitted on 13 Jun 2024]

Abstract: We present LRM-Zero, a Large Reconstruction Model (LRM) trained entirely on synthesized 3D data, achieving high-quality sparse-view 3D reconstruction. The core of LRM-Zero is our procedural 3D dataset, Zeroverse, which is automatically synthesized from simple primitive shapes with random texturing and augmentations (e.g., height fields, boolean differences, and wireframes). Unlike previous 3D datasets (e.g., Objaverse), which are often captured or crafted by humans to approximate real 3D data, Zeroverse completely ignores realistic global semantics but is rich in complex geometric and texture details that are locally similar to or even more intricate than real objects. We demonstrate that LRM-Zero, trained on our fully synthesized Zeroverse, can achieve high visual quality in the reconstruction of real-world objects, competitive with models trained on Objaverse. We also analyze several critical design choices of Zeroverse that contribute to LRM-Zero's capability and training stability. Our work demonstrates that 3D reconstruction, one of the core tasks in 3D vision, can potentially be addressed without the semantics of real-world objects. Zeroverse's procedural synthesis code and interactive visualization are available at: this https URL.
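
To make the idea of composing simple primitives with boolean differences concrete, the following is a minimal illustrative sketch and not the Zeroverse code. It assumes shapes are represented as signed distance functions (negative inside, positive outside), which is one common way to express unions and differences; the function names (`sphere`, `box`, `random_shape`, `occupancy_fraction`) and all parameter ranges are hypothetical choices made for this example. Texturing, height fields, and wireframes are not covered.

```python
import math
import random
from typing import Callable, Tuple

Point = Tuple[float, float, float]
SDF = Callable[[Point], float]  # negative inside the shape, positive outside


def sphere(center: Point, radius: float) -> SDF:
    """Signed distance to a sphere."""
    def sdf(p: Point) -> float:
        return math.dist(p, center) - radius
    return sdf


def box(center: Point, half: Point) -> SDF:
    """Signed distance to an axis-aligned box with half-extents `half`."""
    def sdf(p: Point) -> float:
        q = [abs(p[i] - center[i]) - half[i] for i in range(3)]
        outside = math.sqrt(sum(max(v, 0.0) ** 2 for v in q))
        inside = min(max(q), 0.0)
        return outside + inside
    return sdf


def union(a: SDF, b: SDF) -> SDF:
    """Points inside either shape."""
    return lambda p: min(a(p), b(p))


def difference(a: SDF, b: SDF) -> SDF:
    """Points inside `a` but not inside `b` (a distance bound, not exact)."""
    return lambda p: max(a(p), -b(p))


def random_primitive(rng: random.Random) -> SDF:
    """Draw a sphere or a box with random placement and size (assumed ranges)."""
    center = tuple(rng.uniform(-0.5, 0.5) for _ in range(3))
    if rng.random() < 0.5:
        return sphere(center, rng.uniform(0.2, 0.5))
    half = tuple(rng.uniform(0.1, 0.4) for _ in range(3))
    return box(center, half)


def random_shape(seed: int, n_primitives: int = 4, n_cuts: int = 1) -> SDF:
    """Union several random primitives, then carve some away."""
    rng = random.Random(seed)
    shape = random_primitive(rng)
    for _ in range(n_primitives - 1):
        shape = union(shape, random_primitive(rng))
    for _ in range(n_cuts):
        shape = difference(shape, random_primitive(rng))
    return shape


def occupancy_fraction(shape: SDF, resolution: int = 16) -> float:
    """Fraction of cell centers in the cube [-1, 1]^3 that lie inside the shape."""
    inside = 0
    for i in range(resolution):
        for j in range(resolution):
            for k in range(resolution):
                p = tuple(-1.0 + 2.0 * (t + 0.5) / resolution for t in (i, j, k))
                if shape(p) < 0.0:
                    inside += 1
    return inside / resolution ** 3
```

Calling `random_shape(seed)` with different seeds yields different semantics-free composite shapes, and `occupancy_fraction` gives a quick sanity check that a sampled shape is neither empty nor fills the whole volume.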

Comments:	23 pages, 8 figures. Our code and interactive visualization are available at: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2406.09371 [cs.CV]
	(or arXiv:2406.09371v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2406.09371

Submission history

From: Desai Xie
[v1] Thu, 13 Jun 2024 17:51:00 UTC (10,038 KB)
