Abstract
Reidentifying multiple objects in a camera network are a difficult problem, especially when determining whether the same object appears in a different place at a different time, captured by another camera. In this paper, we propose a novel tracklet-based approach for reidentifying objects despite the illumination condition differences that occur at various times of day. A similarity search is performed by comparing part-level object feature descriptions. Tracking in each camera is made by a recurrent neural network, and the matching between cameras is done by using a similarity neural network to obtain an output in the form of a similarity score. Our approach consists of two main phases. In the first phase, preprocessing is performed through the transfer of tracklets from several cameras. This process generates more samples from each camera, which is beneficial for training. In the second phase, the object definition is applied, which considers appearance information, temporal information and the similarity calculation, hence making object reidentification easier. We have analyzed the proposed strategy when applied to pedestrian reidentification databases in comparison with state-of-the-art work to prove its robustness.
Similar content being viewed by others
References
Song W, Zheng J, Wu Y, Chen C, Liu F (2020) Video-based person re-identification using a novel feature extraction and fusion technique, Multimed Tools Appl, 1–21
Wang Q, Yuan Y et al (2019) Metric learning by simultaneously learning linear transformation matrix and weight matrix for person re-identification. IET Comput Vision 13(4):428–434
Zhao R, Ouyang W, Wang X (2014) Learning mid-level filters for person re-identification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 144–151
Zhu C, Miao D(2019) Semi-supervised one-pass multi-view learning with variable features and views, Neural Process Lett, 1– 38
Paisitkriangkrai S, Shen C, van den Hengel A (2015) Learning to rank in person re-identification with metric ensembles, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 1846–1855
Zhou Z, Huang Y, Wang W, Wang L,Tan T (2017) See the forest for the trees: Joint spatial and temporal recurrent neural networks for video-based person re-identification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 4747–4756
Milan A, Leal-Taixé L, Schindler K, Reid I (2015) Joint tracking and segmentation of multiple targets. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 5397–5406
Armenta C, Laurain T, Estrada-Manzo V, Bernal M (2019) A novel identification-based convex control scheme via recurrent high-order neural networks: An application to the internal combustion engine, Neural Process Lett, pp 1–22, 2019
Xu S, Cheng Y, Gu K, Yang Y, Chang S, Zhou P (2017) Jointly attentive spatial-temporal pooling networks for video-based person re-identification. In: Proceedings of the IEEE international conference on computer vision, pp 4733–4742
Varior RR, Shuai B, Lu J, Xu D, Wang G (2016) A siamese long short-term memory architecture for human re-identification. In: European Conference on Computer Vision.Springer, pp 135–153
Sun Y, Zheng L, Yang Y, Tian Q, Wang S (2018) Beyond part models: Person retrieval with refined part pooling (and a strong convolutional baseline). In: Proceedings of the European Conference on Computer Vision (ECCV), pp 480–496
Cao Z, Simon T, Wei S-E, Sheikh Y (2017) Realtime multi-person 2d pose estimation using part affinity fields. CVPR 1(2):7
Yao H, Zhang S, Hong R, Zhang Y, Xu C, Tian Q (2019) Deep representation learning with part loss for person re-identification. IEEE Trans Image Process 28(6):2860–2871
Gu J, Jiang W, Luo H, Yu H (2020) An efficient global representation constrained by angular triplet loss for vehicle re-identification. Pattern Anal Appl 24(1):367–379
Luo H, Jiang W, Fan X, Zhang C (2020) Stnreid: Deep convolutional networks with pairwise spatial transformer networks for partial person re-identification. IEEE Trans Multimed 22(11):2905–2913
Zhou S, Ke M, Luo P (2019) Multi-camera transfer gan for person re-identification. J Visual Commun Image Represent 59:393–400
Dai J, Zhang P, Wang D, Lu H, Wang H (2018) Video person re-identification by temporal residual learning. IEEE Trans Image Process 28(3):1366–1377
Yi D, Lei Z, Liao S, Li SZ (2014) Deep metric learning for person re-identification. In: Pattern Recognition (ICPR), 2014 22nd International Conference on. IEEE, pp 34–39
Wu L, Shen C, Hengel Avd (2016) Personnet: Person re-identification with deep convolutional neural networks. arXiv preprint arXiv:1601.07255
Wang Q, Gao J, Li X (2019) Weakly supervised adversarial domain adaptation for semantic segmentation in urban scenes. IEEE Trans Image Process 28(9):4376–4386
Zheng L, Yang Y, Hauptmann AG (2016) Person re-identification: Past, present and future. arXiv preprint arXiv:1610.02984
Zhu F, Kong X, Fu H, Tian Q (2017) Pseudo-positive regularization for deep person re-identification. Multimed Syst 24(4):477–489
Huang L, Yang Q, Wu J, Huang Y, Wu Q, Xu J (2020) Generated data with sparse regularized multi-pseudo label for person re-identification. IEEE Signal Process Lett 27:391–395
McLaughlin N, Del Rincon JM, Miller P(2015) Data-augmentation for reducing dataset bias in person re-identification. In: Advanced Video and Signal Based Surveillance (AVSS), 2015 12th IEEE International Conference on. IEEE, pp 1–6
Zhong Z, Zheng L, Kang G, Li S, Yang Y(2017) Random erasing data augmentation, arXiv preprint arXiv:1708.04896
Goodfellow I, Pouget-Abadie J, Mirza M,Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y (2014) Generative adversarial nets. In: Advances in neural information processing systems, pp 2672–2680
Kaneko T, Kameoka H, Cyclegan-vc (2018) Non-parallel voice conversion using cycle-consistent adversarial networks, In: 26th European Signal Processing Conference (EUSIPCO). IEEE 2018: 2100–2104
Gray D, Tao H (2008) Viewpoint invariant pedestrian recognition with an ensemble of localized features. In: European conference on computer vision. Springer, pp 262–275
Gheissari N, Sebastian TB, Hartley R(2006) Person reidentification using spatiotemporal appearance. In: Computer Vision and Pattern Recognition, 2006 IEEE Computer Society Conference on, vol. 2. IEEE, pp 1528–1535
Farenzena M, Bazzani L, Perina A, Murino V, Cristani M (2010) Person re-identification by symmetry-driven accumulation of local features. In: Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on. IEEE, pp 2360–2367
Ye M, Shen J, Lin G, Xiang T, Shao L, Hoi SC (2020) Deep learning for person re-identification: A survey and outlook. arXiv preprint arXiv:2001.04193
Xu Y, Zhou X, Liu P, Xu H (2019) Rapid pedestrian detection based on deep omega-shape features with partial occlusion handing. Neural Process Lett 49(3):923–937
Zhao L, Li X, Zhuang Y, Wang J (2017) Deeply-learned part-aligned representations for person re-identification. In: Proceedings of the IEEE international conference on computer vision, pp 3219–3228
McLaughlin N, Martinez del Rincon J, Miller P(2016) Recurrent convolutional network for video-based person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1325–1334
Yan Y, Ni B, Song Z, Ma, C Yan Y, Yang X (2016) Person re-identification via recurrent feature aggregation. In: European Conference on Computer Vision. Springer, pp 701–716
Liu K, Ma B, Zhang W, Huang R (2015) A spatio-temporal appearance representation for viceo-based pedestrian re-identification. In: Proceedings of the IEEE International Conference on Computer Vision, pp 3810–3818
Wang T, Gong S, Zhu X, Wang S (2014) Person re-identification by video ranking. In: European Conference on Computer Vision. Springer, pp 688–703
Li Y, Wu Z, Karanam S, Radke RJ (2015) Multi-shot human re-identification using adaptive fisher discriminant analysis. BMVC 1: 2
Koestinger M, Hirzer M, Wohlhart P, Roth MM, Bischof H (2012) Large scale metric learning from equivalence constraints. In: Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on. IEEE, pp 2288–2295
Zhao R, Ouyang W, Wang X (2013) Person re-identification by salience matching. In: Proceedings of the IEEE International Conference on Computer Vision, pp 2528–2535
Mignon A, Jurie F (2012) Pcca: A new approach for distance learning from sparse pairwise constraints. In: Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on IEEE, pp 2666–2672
Chen Y-C, Zhu X, Zheng W-S, Lai J-H (2018) Person re-identification by camera correlation aware feature augmentation. IEEE Trans Pattern Anal Mach Intell 40(2):392–408
Sun Y, Zheng L, Deng W, Wang S (2017) Svdnet for pedestrian retrieval. arXiv preprint, 1(6)
Hermans A, Beyer L, Leibe B (2017) In defense of the triplet loss for person re-identification. arXiv preprint arXiv:1703.07737
Ustinova E, Ganin Y, Lempitsky V, (2017) Multi-region bilinear convolutional neural networks for person re-identification. In: Advanced Video and Signal Based Surveillance (AVSS), 2017 14th IEEE International Conference on.IEEE, pp 1–6
Liu X, Zhao H, Tian M, Sheng L, Shao J, Yi S, Yan J, Wang X (2017) Hydraplus-net: Attentive deep features for pedestrian analysis. arXiv preprint arXiv:1709.09930
Su C, Li J, Zhang S, Xing J, Gao W, Tian Q (2017) Pose-driven deep convolutional model for person re-identification. In: Computer Vision (ICCV), 2017 IEEE International Conference on IEEE, pp 3980–3989
Zhu J-Y, Park T, Isola P, Efros AA (2017) Unpaired image-to-image translation using cycle-consistent adversarial networks. In: Computer Vision (ICCV), IEEE International Conference on 2017
Dorai Y, Chausse F, Gazzah S, Essoukri Ben Amara N(2017) Multi target tracking by linking tracklets with a convolutional neural network. In: VISIGRAPP (6: VISAPP), pp 492–498
Johnson J, Alahi A, Fei-Fei L Perceptual losses for real-time style transfer and super-resolution. In: European Conference on Computer Vision
Hirzer M, Beleznai C, Roth PM, Bischof H (2011) Person re-identification by descriptive and discriminative classification. In: Scandinavian conference on Image analysis. Springer, pp 91–102
Zheng L, Shen L, Tian L, Wang S, Wang J, Tian Q (2015) Scalable person re-identification: A benchmark. In: Proceedings of the IEEE International Conference on Computer Vision, pp 1116–1124
Gray D, Brennan S, Tao H (2007) Evaluating appearance models for recognition, reacquisition, and tracking. In: Proceedings of IEEE International Workshop on Performance Evaluation for Tracking and Surveillance (PETS), vol. 3, no. 5. Citeseer, pp 1–7
Zheng W, Gong S, Xiang T (2011) Person re-identification by probabilistic relative distance comparison. CVPR 2011:649–656
Zhu J-Y, Park T, Isola P, Efros AA (2017) Unpaired image-to-image translation using cycle-consistent adversarial networks. In: Proceedings of the IEEE international conference on computer vision, pp 2223–2232
Han H, Zhou M, Shang X, Cao W, Abusorrah A (2020) Kiss+ for rapid and accurate pedestrian re-identification. IEEE Trans Intell Transp Syst 22(1):394–403
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Dorai, Y., Gazzah, S., Chausse, F. et al. Tracklet style transfer and part-level feature description for person reidentification in a camera network. Pattern Anal Applic 24, 875–886 (2021). https://doi.org/10.1007/s10044-021-00990-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10044-021-00990-0