Abstract
Person search is to detect all persons and identify the query persons from detected persons in the image without proposals and bounding boxes, which is different from person re-identification. In this paper, we propose a fusing multi-task convolutional neural network(FMT-CNN) to tackle the correlation and heterogeneity of detection and re-identification with a single convolutional neural network. We focus on how the interplay of person detection and person re-identification affects the overall performance. We employ person labels in region proposal network to produce features for person re-identification and person detection network, which can improve the accuracy of detection and re-identification simultaneously. We also use a multiple loss to train our re-identification network. Experiment results on CUHK-SYSU Person Search dataset show that the performance of our proposed method is superior to state-of-the-art approaches in both mAP and top-1.



Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Cheng D, Gong Y, Zhou S, et al. (2016) Person re-identification by multi-channel parts-based cnn with improved triplet loss function. IEEE computer vision and pattern recognition, pp 1335-1344
Cheng D, Gong Y, Shi W, et al. (2018) Person re-identification by the asymmetric triplet and identification loss function. Multimed Tools Appl 77(3):3533–3550
Ding S, Lin L, Wang G, et al. (2015) Deep feature learning with relative distance comparison for person re-identification. Pattern Recogn 48(10):2993–3003
Dollar P, Belongie S, Belongie S, et al. (2014) Fast feature pyramids for object detection. IEEE Trans Pattern Anal Mach Intell 36(8):1532–45
Engel C, Baumgartner P, Holzmann M, et al. (2010) Person Re-Identification by support vector ranking. British Machine Vision Conference (BMVC) 42:1–11
Felzenszwalb PF, Girshick RB, Mcallester D, et al. (2010) Object detection with discriminatively trained Part-Based models. IEEE Trans Pattern Anal Mach Intell 32(9):1627–1645
Gao H, Yu L, Huang Y, et al. (2017) Multi-task learning for person re-identification. In: international conference on intelligent science and big data engineering, pp 259–268
Girshick R (2015) Fast R-CNN. In: IEEE international conference on computer vision, computer science
Girshick R, Donahue J, Darrell T, et al. (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: IEEE conference on computer vision and pattern recognition, IEEE Computer Society, pp 580–587
Hamdoun O, Moutarde F, Stanciulescu B, et al. (2008) Person re-identification in multi-camera system by signature based on interest point descriptors collected on short video sequences. In: ACM/IEEE International Conference on Distributed Smart Cameras, pp 1–6
He K, Zhang X, Ren S, et al. (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
Jia Y, Shelhamer E, Donahue J, et al. (2014) Caffe: Convolutional architecture for fast feature embedding. In: 22nd ACM international conference on multimedia, pp 675–678
Koestinger M, Hirzer M, Wohlhart P, et al. (2012) Large scale metric learning from equivalence constraints. In: IEEE conference on IEEE computer vision and pattern recognition, pp 2288–2295
Leng Q, Hu R, Liang C, et al. (2015) Person re-identification with content and context re-ranking. Multimed Tools Appl, pp 6989–7014
Li S, Liu X, Liu W, et al. (2016) A discriminative null space based deep learning approach for person re-identification. In: 2016 4th international conference on cloud computing and intelligence systems (CCIS),. IEEE, pp 480–484
Liao S, Li SZ (2015) Efficient PSD constrained asymmetric metric learning for person re-identification. In: IEEE international conference on computer vision, pp 3685–3693
Liao S, Hu Y, Zhu X, et al. (2015) Person re-identification by local maximal occurrence representation and metric learning. In: IEEE conference on computer vision and pattern recognition, pp 2197–2206
Liu H, Feng J, Qi M, et al. (2017) End-to-end comparative attention networks for person re-identification. IEEE Trans Image Process 26(7):3492–3506
McLaughlin N, del Rincon JM, Miller PC (2017) Person reidentification using deep convnets with multitask learning. IEEE Trans Circuits Syst Video Techn, pp 525–539
Nino-Castaneda J, Frías-Velázquez A, Bo NB, Slembrouck M, Guan J, Debard G, Vanrumste B, Tuytelaars T, Philips W (2016) Scalable semi-automatic annotation for multi-camera person tracking. IEEE Trans Image Process 25(5):2259–2274
Ospici M, Cecchi A (2018) Person re-identification across different datasets with multi-task learning, arXiv preprint, pp 1807–09666
Paisitkriangkrai S, Shen C, Van Den Hengel A (2015) Learning to rank in person re-identification with metric ensembles. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 1846–1855
Ren S, He K, Girshick R, et al. (2017) Faster r-CNN: Towards Real-Time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell 39(6):1137–1149
Shen Y, Lin W, Yan J, et al. (2015) Person re-identification with correspondence structure learning. In: IEEE international conference on computer vision. IEEE computer society, pp 3200-3208
Sun Y, Zheng L, Deng W, et al. (2017) SVDNet for pedestrian retrieval. In: IEEE international conference on computer vision. IEEE Computer Society, pp 3820–3828
Xiao T, Li H, Ouyang W, et al. (2016) Learning deep feature representations with domain guided dropout for person re-identification, pp 1249-1258
Xiao T, Li S, Wang B, et al. (2017) Joint detection and identification feature learning for person search. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 3376–3385
Yang B, Yan J, Lei Z, et al. (2015) Convolutional channel features. In: IEEE international conference on computer vision, pp 82–90
Yuan C, Xu C, Wang T, et al. (2017) Deep multi-instance learning for end-to-end person re-identification. Multimed Tools Appl, (4):1–31
Zhang S, Benenson R, Schiele B (2015) Filtered channel features for pedestrian detection. IEEE computer vision and pattern recognition, pp 1751–1760
Zhao R, Ouyang W, Wang X (2013) Unsupervised salience learning for person re-identification. IEEE Computer Vision and Pattern Recognition 9:3586–3593
Zheng WS, Gong S, Xiang T (2011) Person re-identification by probabilistic relative distance comparison. Computer Vision and Pattern Recognition 42:649–656
Zheng L, Shen L, Tian L, et al. (2015) Scalable person re-identification: a benchmark. In: IEEE international conference on computer vision, pp 1116–1124
Acknowledgments
This work was supported in part by the National Natural Science Foundation of China under Grant 61872005, in part by the Natural Science Research Project of Anhui universities of China under Grant KJ2019A0005, KJ2019A0032, and in part supported by open project of Anhui University KF2019A03.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Zhai, S., Liu, S., Wang, X. et al. FMT: fusing multi-task convolutional neural network for person search. Multimed Tools Appl 78, 31605–31616 (2019). https://doi.org/10.1007/s11042-019-07939-w
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-019-07939-w