3D Multi-object Detection and Tracking with Sparse Stationary LiDAR

Zhang, Meng; Pan, Zhiyu; Feng, Jianjiang; Zhou, Jie

doi:10.1007/978-3-030-88004-0_2

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 13019)

Included in the following conference series:

Chinese Conference on Pattern Recognition and Computer Vision (PRCV)

Abstract

The advent of low-cost LiDAR in recent years has made it feasible to use LiDAR in visual surveillance applications such as detecting and tracking players in a football game. However, the extreme sparsity of the point clouds acquired by such LiDAR makes object detection and tracking in large-scale scenes challenging. To alleviate this problem, we propose a method for multi-object detection and tracking from sparse point clouds that comprises a short-term tracklet regression stage and a 3D D-IoU data association stage. In the former stage, the proposed temporal fusion module aggregates temporal information to predict short-term tracklets, each formed by three bounding boxes. In the latter stage, Distance-IoU scores between current tracklets and historical trajectories are computed, and the data are associated using the Hungarian matching algorithm. To reduce the cost of manual annotation, we build a simulated point cloud dataset for training using Google Research Football. A real test dataset of a football game is acquired with a Livox Mid-100 LiDAR. Experimental results on both datasets show that fusing multiple frames improves detection and tracking performance on sparse point clouds. Our 3D D-IoU tracking method also achieves promising performance on the nuScenes autonomous driving dataset.
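
The data association stage described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and it rests on several assumptions: boxes are axis-aligned and given as (cx, cy, cz, w, l, h), each tracklet and trajectory is reduced to a single box, and the names `diou_3d`, `associate`, and the `min_score` threshold are hypothetical. The score follows the Distance-IoU definition (IoU minus the squared center distance divided by the squared diagonal of the smallest enclosing box). To stay dependency-free, the assignment is found by exhaustive search over permutations, which stands in for the Hungarian algorithm; it returns the same optimal assignment but is only practical for a handful of objects.

```python
from itertools import permutations


def diou_3d(a, b):
    """Distance-IoU of two axis-aligned 3D boxes (cx, cy, cz, w, l, h).

    Assumption: no rotation. The score lies in (-1, 1].
    """
    inter = 1.0           # intersection volume, built axis by axis
    center_dist_sq = 0.0  # squared distance between box centers
    diag_sq = 0.0         # squared diagonal of the smallest enclosing box
    for i in range(3):
        a_lo, a_hi = a[i] - a[i + 3] / 2, a[i] + a[i + 3] / 2
        b_lo, b_hi = b[i] - b[i + 3] / 2, b[i] + b[i + 3] / 2
        inter *= max(0.0, min(a_hi, b_hi) - max(a_lo, b_lo))
        center_dist_sq += (a[i] - b[i]) ** 2
        diag_sq += (max(a_hi, b_hi) - min(a_lo, b_lo)) ** 2
    vol_a = a[3] * a[4] * a[5]
    vol_b = b[3] * b[4] * b[5]
    iou = inter / (vol_a + vol_b - inter)
    return iou - center_dist_sq / diag_sq


def associate(tracks, detections, min_score=-0.5):
    """Match tracks to detections by maximizing the total D-IoU score.

    Exhaustive search stands in for the Hungarian algorithm here.
    `min_score` is a hypothetical gate that discards weak matches.
    Returns a list of (track_index, detection_index) pairs.
    """
    n_t, n_d = len(tracks), len(detections)
    scores = [[diou_3d(t, d) for d in detections] for t in tracks]
    best_total, best_pairs = float("-inf"), []
    if n_t <= n_d:
        # Every track picks a distinct detection.
        candidates = (list(enumerate(p)) for p in permutations(range(n_d), n_t))
    else:
        # Every detection picks a distinct track.
        candidates = ([(t, d) for d, t in enumerate(p)]
                      for p in permutations(range(n_t), n_d))
    for pairs in candidates:
        total = sum(scores[t][d] for t, d in pairs)
        if total > best_total:
            best_total, best_pairs = total, pairs
    return [(t, d) for t, d in best_pairs if scores[t][d] >= min_score]
```

For realistic numbers of objects, the exhaustive search would typically be replaced by a polynomial-time solver such as `scipy.optimize.linear_sum_assignment` applied to the negated score matrix.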

Notes

1. We plan to make both datasets publicly available.

Acknowledgments

The work was supported by the National Key Research and Development Program of China under Grant 2018AAA0102803.

Author information

Authors and Affiliations

Department of Automation, Beijing National Research Center for Information Science and Technology, Tsinghua University, Beijing, 100084, China
Meng Zhang, Zhiyu Pan, Jianjiang Feng & Jie Zhou

Corresponding author

Correspondence to Jianjiang Feng.

Editor information

Editors and Affiliations

University of Science and Technology Beijing, Beijing, China
Huimin Ma
Chinese Academy of Sciences, Beijing, China
Liang Wang
Tsinghua University, Beijing, China
Changshui Zhang
Zhejiang University, Hangzhou, China
Fei Wu
Chinese Academy of Sciences, Beijing, China
Tieniu Tan
Hunan University, Changsha, China
Yaonan Wang
Sun Yat-Sen University, Guangzhou, Guangdong, China
Jianhuang Lai
Beijing Jiaotong University, Beijing, China
Yao Zhao

About this paper

Cite this paper

Zhang, M., Pan, Z., Feng, J., Zhou, J. (2021). 3D Multi-object Detection and Tracking with Sparse Stationary LiDAR. In: Ma, H., et al. Pattern Recognition and Computer Vision. PRCV 2021. Lecture Notes in Computer Science, vol 13019. Springer, Cham. https://doi.org/10.1007/978-3-030-88004-0_2

DOI: https://doi.org/10.1007/978-3-030-88004-0_2
Published: 22 October 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-88003-3
Online ISBN: 978-3-030-88004-0
eBook Packages: Computer Science, Computer Science (R0)
