Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3523150.3523173acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmlscConference Proceedingsconference-collections
research-article

Online Multiple Object Tracking using Physical Location Prediction

Published: 13 April 2022 Publication History

Abstract

Tracking-by-detection is a commonly used paradigm for multiple-object tracking. This paper presents a method that incorporates the prediction of physical locations of people into the tracking-by-detection paradigm. The proposed method predicts the physical locations of people on an estimated ground plane and applies a learning-based framework to extract the appearance features of people across frames in a video stream. The method combines the prediction of physical locations with appearance features to realize online pedestrian tracking. Experimental results show that the proposed method improves multi-object tracking in terms of the Number of Identity Switches (IDSW) and the fragmentations (Frag).

References

[1]
David A Ross, Jongwoo Lim, Ruei-Sung Lin, and Ming-Hsuan Yang. 2008. Incremental learning for robust visual tracking. International Journal of Computer Vision 77, 1-3 (May 2008), 125–141.
[2]
Avidan, S. 2007. Ensemble tracking. IEEE Transactions on Pattern Analysis and Machine Intelligence 29, 2(Jan. 2007), 261–271.
[3]
Caglayan Dicle, Mario Sznaier, and Octavia Camps. 2013. The Way They Move: Tracking Multiple Targets with Similar Appearance. In Proceedings of the 2013 IEEE International Conference on Computer Vision (ICCV’13). IEEE, Sydney, NSW, Australia, 2304-2311.
[4]
Ju Hong Yoon, Ming-Hsuan Yang, Jongwoo Lim, and Kuk-Jin Yoon. 2015. Bayesian multi-object tracking using motion context from multiple objects. In Proceedings of the 2015 IEEE Winter Conference on Applications of Computer Vision (WACV’15). IEEE, Waikoloa, HI, USA, 33–40.
[5]
Alex Bewley, Zongyuan Ge, Lionel Ott, Fabio Ramos, and Ben Upcroft. 2016. Simple online and real-time tracking. In Proceedings of the IEEE International Conference on Image Processing (ICIP’16). IEEE, Phoenix, AZ, USA, 3464–3468.
[6]
Florian Schroff, Dmitry Kalenichenko, and James Philbin. 2015 a. Facenet: A unified embedding for face recognition and clustering Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’15). IEEE, Boston, MA, USA. 815–823.
[7]
Y. C. Yoon, D. Y. Kim, K. Yoon, Y. M. Song, and M. Jeon. 2021. Online multiple pedestrian tracking using deep temporal appearance matching association. Information Sciences. 561(Jan. 2021),326-351.
[8]
Anton Milan, Seyed Hamid Rezatofighi, Anthony R. Dick, Ian D. Reid, and Konrad Schindler. 2017. Online multi-target tracking using recurrent neural networks. In Proceedings of the 31st AAAI Conference on Artificial Intelligence (AAAI’17). San Francisco, CA, USA, 4225–4232.
[9]
Harold W. Kuhn. 1955. The hungarian method for the assignment problem. Nav. Res. Logist. Quart. 2, 1–2 (Mar. 1955), 83–97.
[10]
Amir Sadeghian, Alexandre Alahi, and Silvio Savarese. 2017. Tracking the untrackable: Learning to track multiple cues with long-term dependencies. In Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV’17). IEEE, Honolulu, HI, USA, 300–311.
[11]
Seung-Hwan Bae and Kuk-Jin Yoon. 2018. Confidence-based data association and discriminative deep appearance learning for robust online multi-object tracking. IEEE Transactions on Pattern Analysis and Machine Intelligence 40, 3 (Mar. 2018), 595–610.
[12]
Chanho Kim, Fuxin Li, and James M Rehg. 2018. Multi-object Tracking with Neural Gating Using Bilinear LSTM. In Proceedings of the European Conference on Computer Vision (ECCV’18). Munich, Germany, 200–215.
[13]
Ergys Ristani and Carlo Tomasi. 2018. Features for multi-target multi-camera tracking and re-identification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’18). IEEE, Salt Lake City, UT, USA, 6036–6046.
[14]
Xiangwei Lin, Huanqiang Zeng, Jinhui Hou, Jianqing Zhu, Jing Chen, Kai-Kuang Ma. 2020. Vehicle Re-identification Using Joint Pyramid Feature Representation Network. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, Vol. 316. Springer, Cham. https://doi.org/10.1007/978-3-030-44751-9_44
[15]
Richard Hartley, Andrew Zisserman, A. 2004. Multiple View Geometry in Computer Vision second edited. Cambridge University Press.
[16]
Rudolph Emil Kalman. 1960. A new approach to linear filtering and prediction problems. Journal of Basic Engineering 82, 1 (Mar. 1960), 35.
[17]
Z Liang Zheng, Liyue Shen, Lu Tian, Shengjin Wang, Jingdong Wang, and Qi Tian. 2015. Scalable person re-identification: A benchmark. In Proceedings of the International Conference on Computer Vision (ICCV’15). IEEE, Santiago, Chile, 1116–1124.
[18]
Ergys Ristani, Francesco Solera, Roger S. Zou, Rita Cucchiara, and Carlo Tomasi. 2016. Performance measures and a data set for multi-target, multi-camera tracking. In Proceedings of the European Conference on Computer Vision (ECCV’16). Amsterdam, Netherlands, 17–35.
[19]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep Residual Learning for Image Recognition. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR’16). IEEE, Las Vegas, NV, USA, 770-778.
[20]
Bochkovskiy Alexey, Wang Chien-Yao, and Liao Hong-Yuan Mark. 2020. YOLOv4: Optimal Speed and Accuracy of Object Detection. arXiv preprint. arXiv:cs.CV/2004.10934.
[21]
Alexander Hermans, Lucas Beyer, and Bastian Leibe. 2017. In defense of the triplet loss for person re-identification. arXiv preprint. arXiv abs/1703.07737.
[22]
Laura Leal-Taixé, Anton Milan, Ian Reid, Stefan Roth, and Konrad Schindler. 2015. MOTChallenge 2015: Towards a benchmark for multi-target tracking. arXiv preprint. arXiv:1504.01942.
[23]
Anton Milan, Laura Leal-Taixé, Ian Reid, Stefan Roth, and Konrad Schindler. 2016. MOT16: A benchmark for multi-object tracking. arXiv preprint. arXiv:1603.00831.
[24]
Yuanlu Xu, Xiaobai Liu, Yang Liu, and Song-Chun Zhu. 2016. Multi-view people tracking via hierarchical trajectory composition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’16). IEEE, Las Vegas, NV, USA, 4256–4265.
[25]
Yuanlu Xu, Xiaobai Liu, Lei Qin, and Song-Chun Zhu. 2017. Cross-view people tracking by scene-centered spatio-temporal parsing. In Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence (AAAI’17). San Francisco, CA, USA, 4299–4305.
[26]
Jieming Yang, Hongwei Ge, Jinlong Yang, Yubing Tong, Shuzhi Su. 2021. Online multi-object tracking using multi-function integration and tracking simulation training. Applied Intelligence. Springer, 19 (May 2021)1–21. https://doi.org/10.1007/s10489-021-02457-5
[27]
SLA_public. Spatial-Attention Location-Aware Multi-Object Tracking. Retrieved December 11, 2020 from https://motchallenge.net/
[28]
Philipp Bergmann, Tim Meinhardt, and Laura Leal-Taixé. 2019. Tracking without bells and whistles. In Proceedings of the IEEE International Conference on Computer Vision (ICCV’19). IEEE, Seoul, Korea, 941–951.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICMLSC '22: Proceedings of the 2022 6th International Conference on Machine Learning and Soft Computing
January 2022
185 pages
ISBN:9781450387477
DOI:10.1145/3523150
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 April 2022

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. data association
  2. prediction of physical locations
  3. tracking-by-detection
  4. ultiple-object tracking

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

Conference

ICMLSC 2022

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 52
    Total Downloads
  • Downloads (Last 12 months)13
  • Downloads (Last 6 weeks)1
Reflects downloads up to 22 Dec 2024

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media