LightStereo: Channel Boost Is All Your Need for Efficient 2D Cost Aggregation

Guo, Xianda; Zhang, Chenming; Zhang, Youmin; Zheng, Wenzhao; Nie, Dujun; Poggi, Matteo; Chen, Long

Computer Science > Computer Vision and Pattern Recognition

arXiv:2406.19833 (cs)

[Submitted on 28 Jun 2024 (v1), last revised 16 Nov 2024 (this version, v2)]

Title:LightStereo: Channel Boost Is All Your Need for Efficient 2D Cost Aggregation

Authors:Xianda Guo, Chenming Zhang, Youmin Zhang, Wenzhao Zheng, Dujun Nie, Matteo Poggi, Long Chen

View PDF HTML (experimental)

Abstract:We present LightStereo, a cutting-edge stereo-matching network crafted to accelerate the matching process. Departing from conventional methodologies that rely on aggregating computationally intensive 4D costs, LightStereo adopts the 3D cost volume as a lightweight alternative. While similar approaches have been explored previously, our breakthrough lies in enhancing performance through a dedicated focus on the channel dimension of the 3D cost volume, where the distribution of matching costs is encapsulated. Our exhaustive exploration has yielded plenty of strategies to amplify the capacity of the pivotal dimension, ensuring both precision and efficiency. We compare the proposed LightStereo with existing state-of-the-art methods across various benchmarks, which demonstrate its superior performance in speed, accuracy, and resource utilization. LightStereo achieves a competitive EPE metric in the SceneFlow datasets while demanding a minimum of only 22 GFLOPs and 17 ms of runtime, and ranks 1st on KITTI 2015 among real-time models. Our comprehensive analysis reveals the effect of 2D cost aggregation for stereo matching, paving the way for real-world applications of efficient stereo systems. Code will be available at \url{this https URL}.

Comments:	Code will be available at \url{this https URL}
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2406.19833 [cs.CV]
	(or arXiv:2406.19833v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2406.19833

Submission history

From: Xianda Guo [view email]
[v1] Fri, 28 Jun 2024 11:11:24 UTC (760 KB)
[v2] Sat, 16 Nov 2024 03:11:30 UTC (1,598 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:LightStereo: Channel Boost Is All Your Need for Efficient 2D Cost Aggregation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:LightStereo: Channel Boost Is All Your Need for Efficient 2D Cost Aggregation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators