research-article

Online Multiple Object Tracking using Physical Location Prediction

Authors:

Jia Li

ICMLSC '22: Proceedings of the 2022 6th International Conference on Machine Learning and Soft Computing

Pages 145 - 150

https://doi.org/10.1145/3523150.3523173

Published: 13 April 2022

Abstract

Tracking-by-detection is a commonly used paradigm for multiple-object tracking. This paper presents a method that incorporates the prediction of people's physical locations into that paradigm. The method predicts each person's physical location on an estimated ground plane and applies a learning-based framework to extract appearance features of people across the frames of a video stream. It then combines the predicted physical locations with the appearance features to achieve online pedestrian tracking. Experimental results show that the proposed method improves multi-object tracking in terms of the number of identity switches (IDSW) and the number of track fragmentations (Frag).
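
As a rough illustration of how a predicted ground-plane location and an appearance feature might be combined when associating detections with existing tracks, consider the minimal Python sketch below. It is not the paper's implementation. The homography `H`, the constant-velocity motion model, the weight `alpha`, the gating distance `max_dist`, the dictionary layout of tracks and detections, and the brute-force matching (a stand-in for an assignment solver such as the Hungarian method) are all assumptions made for illustration.

```python
import itertools
import math


def to_ground(H, u, v):
    """Map an image point (u, v) to ground-plane coordinates with a 3x3 homography H."""
    x = H[0][0] * u + H[0][1] * v + H[0][2]
    y = H[1][0] * u + H[1][1] * v + H[1][2]
    w = H[2][0] * u + H[2][1] * v + H[2][2]
    return (x / w, y / w)


def predict(position, velocity, dt=1.0):
    """Constant-velocity guess of the next ground-plane position (an assumed motion model)."""
    return (position[0] + velocity[0] * dt, position[1] + velocity[1] * dt)


def cosine_distance(a, b):
    """1 - cosine similarity of two non-zero feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def association_cost(track, detection, H, alpha=0.5, max_dist=2.0):
    """Weighted sum of a location cost and an appearance cost (alpha is hypothetical)."""
    px, py = predict(track["pos"], track["vel"])
    dx, dy = to_ground(H, *detection["foot"])
    # Ground-plane distance, clipped and scaled to [0, 1] by the assumed gate max_dist.
    location_cost = min(math.hypot(px - dx, py - dy) / max_dist, 1.0)
    appearance_cost = cosine_distance(track["feat"], detection["feat"])
    return alpha * location_cost + (1.0 - alpha) * appearance_cost


def associate(tracks, detections, H):
    """Brute-force minimum-cost one-to-one matching; only practical for a few objects.

    Returns a list whose i-th entry is the detection index assigned to track i.
    Assumes there are at least as many detections as tracks.
    """
    cost = [[association_cost(t, d, H) for d in detections] for t in tracks]
    candidates = itertools.permutations(range(len(detections)), len(tracks))
    best = min(candidates, key=lambda p: sum(cost[i][j] for i, j in enumerate(p)))
    return list(best)


# Toy example: a homography that simply scales pixels to metres (hypothetical values).
H = [[0.01, 0.0, 0.0], [0.0, 0.01, 0.0], [0.0, 0.0, 1.0]]
tracks = [
    {"pos": (1.0, 1.0), "vel": (0.1, 0.0), "feat": (1.0, 0.0)},
    {"pos": (3.0, 1.0), "vel": (-0.1, 0.0), "feat": (0.0, 1.0)},
]
detections = [
    {"foot": (290, 100), "feat": (0.1, 0.9)},
    {"foot": (110, 100), "feat": (0.9, 0.1)},
]
matches = associate(tracks, detections, H)
print(matches)  # [1, 0]
```

In this toy case both cues agree, so track 0 is matched to detection 1 and track 1 to detection 0. In a real tracker the exhaustive search would normally be replaced by a polynomial-time assignment algorithm, and unmatched tracks and detections would need separate handling.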

Published In

ICMLSC '22: Proceedings of the 2022 6th International Conference on Machine Learning and Soft Computing

January 2022

185 pages

ISBN:9781450387477

DOI:10.1145/3523150

Copyright © 2022 ACM.

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article
Research
Refereed limited

Conference

ICMLSC 2022

ICMLSC 2022: 2022 The 6th International Conference on Machine Learning and Soft Computing

January 15 - 17, 2022

Haikou, China
