Street Gaussians without 3D Object Tracker

Zhang, Ruida; Li, Chengxi; Zhang, Chenyangguang; Liu, Xingyu; Yuan, Haili; Li, Yanyan; Ji, Xiangyang; Lee, Gim Hee

Computer Science > Computer Vision and Pattern Recognition

arXiv:2412.05548 (cs)

[Submitted on 7 Dec 2024]

Title:Street Gaussians without 3D Object Tracker

Authors:Ruida Zhang, Chengxi Li, Chenyangguang Zhang, Xingyu Liu, Haili Yuan, Yanyan Li, Xiangyang Ji, Gim Hee Lee

View PDF HTML (experimental)

Abstract:Realistic scene reconstruction in driving scenarios poses significant challenges due to fast-moving objects. Most existing methods rely on labor-intensive manual labeling of object poses to reconstruct dynamic objects in canonical space and move them based on these poses during rendering. While some approaches attempt to use 3D object trackers to replace manual annotations, the limited generalization of 3D trackers -- caused by the scarcity of large-scale 3D datasets -- results in inferior reconstructions in real-world settings. In contrast, 2D foundation models demonstrate strong generalization capabilities. To eliminate the reliance on 3D trackers and enhance robustness across diverse environments, we propose a stable object tracking module by leveraging associations from 2D deep trackers within a 3D object fusion strategy. We address inevitable tracking errors by further introducing a motion learning strategy in an implicit feature space that autonomously corrects trajectory errors and recovers missed detections. Experimental results on Waymo-NOTR datasets show we achieve state-of-the-art performance. Our code will be made publicly available.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2412.05548 [cs.CV]
	(or arXiv:2412.05548v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2412.05548

Submission history

From: Ruida Zhang [view email]
[v1] Sat, 7 Dec 2024 05:49:42 UTC (2,742 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Street Gaussians without 3D Object Tracker

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Street Gaussians without 3D Object Tracker

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators