research-article

Deep feature learning with relative distance comparison for person re-identification

Authors:

Shengyong Ding,

Hongyang ChaoAuthors Info & Claims

Pattern Recognition, Volume 48, Issue 10

Pages 2993 - 3003

https://doi.org/10.1016/j.patcog.2015.04.005

Published: 01 October 2015 Publication History

Abstract

Identifying the same individual across different scenes is an important yet difficult task in intelligent video surveillance. Its main difficulty lies in how to preserve similarity of the same person against large appearance and structure variation while discriminating different individuals. In this paper, we present a scalable distance driven feature learning framework based on the deep neural network for person re-identification, and demonstrate its effectiveness to handle the existing challenges. Specifically, given the training images with the class labels (person IDs), we first produce a large number of triplet units, each of which contains three images, i.e. one person with a matched reference and a mismatched reference. Treating the units as the input, we build the convolutional neural network to generate the layered representations, and follow with the L 2 distance metric. By means of parameter optimization, our framework tends to maximize the relative distance between the matched pair and the mismatched pair for each triplet unit. Moreover, a nontrivial issue arising with the framework is that the triplet organization cubically enlarges the number of training triplets, as one image can be involved into several triplet units. To overcome this problem, we develop an effective triplet generation scheme and an optimized gradient descent algorithm, making the computational load mainly depend on the number of original images instead of the number of triplets. On several challenging databases, our approach achieves very promising results and outperforms other state-of-the-art approaches. HighlightsWe present a novel feature learning framework for person re-identification.Our framework is based on the maximum relative distance comparison.The learning algorithm is scalable to process large amount of data.We demonstrate superior performances over other state-of-the-arts.

References

[1]

L. Lin, T. Wu, J. Porway, Z. Xu, A stochastic graph grammar for compositional object representation and recognition, Pattern Recognit., 42 (2009) 1297-1307.

Digital Library

[2]

D. Gray, H. Tao, Viewpoint invariant pedestrian recognition with an ensemble of localized features, in: ECCV, Springer, 2008, pp. 262-275.

[3]

X. Wang, G. Doretto, T. Sebastian, J. Rittscher, P. Tu, Shape and appearance context modeling, in: ICCV, IEEE, 2007, pp. 1-8.

[4]

R. Layne, T.M. Hospedales, S. Gong, Towards person identification and re-identification with attributes, in: ECCV, Springer, 2012, pp. 402-412.

[5]

M. Farenzena, L. Bazzani, A. Perina, V. Murino, M. Cristani, Person re-identification by symmetry-driven accumulation of local features, in: CVPR, IEEE, 2010, pp. 2360-2367.

[6]

L. Lin, R. Zhang, X. Duan, Adaptive scene category discovery with generative learning and compositional sampling, IEEE Trans. Circuits Syst. Video Technol., 25 (2015) 251-260.

[7]

W. Li, R. Zhao, X. Wang, Human re-identification with transferred metric learning, in: ACCV, Springer, 2013, pp. 31-44.

[8]

Z. Li, S. Chang, F. Liang, T.S. Huang, L. Cao, J.R. Smith, Learning locally-adaptive decision functions for person verification, in: CVPR, IEEE, 2013, pp. 3610-3617.

[9]

W.-S. Zheng, S. Gong, T. Xiang, Person re-identification by probabilistic relative distance comparison, in: CVPR, IEEE, 2011, pp. 649-656.

[10]

L. Lin, P. Luo, X. Chen, K. Zeng, Representing and recognizing objects with massive local image patches, Pattern Recognit., 45 (2012) 231-240.

Digital Library

[11]

L. Lin, X. Wang, W. Yang, J. Lai, Discriminatively trained and-or graph models for object shape detection, IEEE Trans. Pattern Anal. Mach. Intell., 37 (2015) 959-972.

[12]

Y. Xu, L. Lin, W.-S. Zheng, X. Liu, Human re-identification by matching compositional template with cluster sampling, in: IEEE International Conference on Computer Vision (ICCV), IEEE, 2013, pp. 3152-3159.

Digital Library

[13]

R. Zhao, W. Ouyang, X. Wang, Learning mid-level filters for person re-identification, in: CVPR, 2013, pp. 144-151.

[14]

A. Mignon, F. Jurie, Pcca: A new approach for distance learning from sparse pairwise constraints, in: CVPR, IEEE, 2012, pp. 2666-2672.

[15]

E.P. Xing, M.I. Jordan, S. Russell, A.Y. Ng, Distance metric learning with application to clustering with side-information, in: NIPS, 2002, pp. 505-512.

[16]

K.Q. Weinberger, J. Blitzer, L.K. Saul, Distance metric learning for large margin nearest neighbor classification, in: NIPS, 2005, pp. 1473-1480.

[17]

J.V. Davis, B. Kulis, P. Jain, S. Sra, I.S. Dhillon, Information-theoretic metric learning, in: ICML, ACM, 2007, pp. 209-216.

[18]

S. Xiang, F. Nie, C. Zhang, Learning a mahalanobis distance metric for data clustering and classification, Pattern Recognit., 41 (2008) 3600-3612.

Digital Library

[19]

D. Yi, Z. Lei, S.Z. Li, Deep metric learning for practical person re-identification, CoRR abs/1407.4979.

[20]

H. Liu, B. Ma, L. Qin, J. Pang, C. Zhang, Q. Huang, Set-label modeling and deep metric learning on person re-identification, Neurocomputing, 151 (2015) 1283-1292.

[21]

L. Lin, X. Liu, S.-C. Zhu, Layered graph matching with composite cluster sampling, IEEE Trans. Pattern Anal. Mach. Intell., 32 (2010) 1426-1442.

Digital Library

[22]

W. Li, R. Zhao, T. Xiao, X. Wang, Deepreid: Deep filter pairing neural network for person re-identification, in: CVPR, 2014, pp. 152-159.

[23]

J. Wang, T. Leung, C. Rosenberg, J. Wang, J. Philbin, B. Chen, Y. Wu, et al., Learning fine-grained image similarity with deep ranking, in: CVPR, 2014.

Digital Library

[24]

A. Krizhevsky, I. Sutskever, G.E. Hinton, Imagenet classification with deep convolutional neural networks, in: NIPS, 2012, pp. 1097-1105.

Digital Library

[25]

Y. Jia, Caffe: An Open Source Convolutional Architecture for Fast Feature Embedding, {http://caffe.berkeleyvision.org/}, 2013.

[26]

T. Wang, S. Gong, X. Zhu, S. Wang, Person re-identification by video ranking, in: European Conference on Computer Vision, IEEE, 2014.

[27]

D. Gray, S. Brennan, H. Tao, Evaluating appearance models for recognition, reacquisition, and tracking, in: PETS, 2007.

[28]

W.R. Schwartz, L.S. Davis, Learning discriminative appearance-based models using partial least squares, in: 2009 XXII Brazilian Symposium on Computer Graphics and Image Processing (SIBGRAPI), IEEE, 2009, pp. 322-329.

Digital Library

[29]

A. Globerson, S.T. Roweis, Metric learning by collapsing classes, in: NIPS, 2005, pp. 451-458.

[30]

L. Ma, X. Yang, D. Tao, Person re-identification over camera networks using multi-task distance metric learning, IEEE Trans. Image Process., 23 (2014) 3656-3670.

[31]

B. Ma, Y. Su, F. Jurie, et al., Bicov: a novel image representation for person re-identification and face verification, in: BMVC, 2012.

[32]

R. Zhao, W. Ouyang, X. Wang, Unsupervised salience learning for person re-identification, in: CVPR, IEEE, 2013, pp. 3586-3593.

[33]

C. Liu, S. Gong, C.C. Loy, X. Lin, Person re-identification: what features are important? in: ECCV, Springer, 2012, pp. 391-401.

[34]

M. Kostinger, M. Hirzer, P. Wohlhart, P.M. Roth, H. Bischof, Large scale metric learning from equivalence constraints, in: CVPR, IEEE, 2012, pp. 2288-2295.

[35]

S. Pedagadi, J. Orwell, S. Velastin, B. Boghossian, Local fisher discriminant analysis for pedestrian re-identification, in: CVPR, IEEE, 2013, pp. 3318-3325.

[36]

R. Zhao, W. Ouyang, X. Wang, Person re-identification by salience matching, in: ICCV, IEEE, 2013, pp. 2528-2535.

Cited By

Li CZhao RWang YJia PZhu WMa YLi M(2024)Disturbance Propagation Model of Pedestrian Fall Behavior in a Pedestrian Crowd and Elimination Mechanism AnalysisIEEE Transactions on Intelligent Transportation Systems10.1109/TITS.2023.331407225:2(1519-1529)Online publication date: 1-Feb-2024
https://dl.acm.org/doi/10.1109/TITS.2023.3314072
Zhang GZhang YZhang HChen YZheng Y(2024)Learning dual attention enhancement feature for visible–infrared person re-identificationJournal of Visual Communication and Image Representation10.1016/j.jvcir.2024.10407699:COnline publication date: 1-Mar-2024
https://dl.acm.org/doi/10.1016/j.jvcir.2024.104076
Tagore NMedi PChattopadhyay P(2024)Deep pixel regeneration for occlusion reconstruction in person re-identificationMultimedia Tools and Applications10.1007/s11042-023-15322-z83:2(4443-4463)Online publication date: 1-Jan-2024
https://dl.acm.org/doi/10.1007/s11042-023-15322-z
Show More Cited By

Recommendations

Deep multi-instance learning for end-to-end person re-identification

In this paper, we introduce a deep multi-instance learning framework to boost the instance-level person re-identification performance. Motivated by the observation of considerably dramatic and complex varieties of visual appearances in many current ...
Part-based Feature Extraction for Person Re-identification
ICMLC '18: Proceedings of the 2018 10th International Conference on Machine Learning and Computing

In this paper, we propose a new part-based CNN feature extraction method for end-to-end person re-identification. In our method, the input images are first divided into two different non-overlapping parts, and then two different CNN models are trained ...
A loss combination based deep model for person re-identification

The Convolutional Neural Network (CNN) has significantly improved the state-of-the-art in person re-identification (re-ID). In the existing available identification CNN model, the softmax loss function is employed as the supervision signal to train the ...

Comments

Information & Contributors

Information

Published In

cover image Pattern Recognition

Pattern Recognition Volume 48, Issue 10

October 2015

321 pages

ISSN:0031-3203

Issue’s Table of Contents

Copyright © Elsevier Ltd.

Publisher

Elsevier Science Inc.

United States

Publication History

Published: 01 October 2015

Author Tags

Qualifiers

Research-article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

192
Total Citations
View Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 11 Aug 2024

Other Metrics

View Author Metrics

Citations

Cited By

Li CZhao RWang YJia PZhu WMa YLi M(2024)Disturbance Propagation Model of Pedestrian Fall Behavior in a Pedestrian Crowd and Elimination Mechanism AnalysisIEEE Transactions on Intelligent Transportation Systems10.1109/TITS.2023.331407225:2(1519-1529)Online publication date: 1-Feb-2024
https://dl.acm.org/doi/10.1109/TITS.2023.3314072
Zhang GZhang YZhang HChen YZheng Y(2024)Learning dual attention enhancement feature for visible–infrared person re-identificationJournal of Visual Communication and Image Representation10.1016/j.jvcir.2024.10407699:COnline publication date: 1-Mar-2024
https://dl.acm.org/doi/10.1016/j.jvcir.2024.104076
Tagore NMedi PChattopadhyay P(2024)Deep pixel regeneration for occlusion reconstruction in person re-identificationMultimedia Tools and Applications10.1007/s11042-023-15322-z83:2(4443-4463)Online publication date: 1-Jan-2024
https://dl.acm.org/doi/10.1007/s11042-023-15322-z
Xu SLuo LHong HHu JYang BHu S(2024)Multi-granularity attention in attention for person re-identification in aerial imagesThe Visual Computer: International Journal of Computer Graphics10.1007/s00371-023-03074-840:6(4149-4166)Online publication date: 1-Jun-2024
https://dl.acm.org/doi/10.1007/s00371-023-03074-8
Chen GXiao SZhang CZhao H(2023)A Theory-Driven Deep Learning Method for Voice Chat–Based Customer Response PredictionInformation Systems Research10.1287/isre.2022.119634:4(1513-1532)Online publication date: 1-Dec-2023
https://dl.acm.org/doi/10.1287/isre.2022.1196
Wang CMa BChang HShan SChen X(2023)Person Search by a Bi-Directional Task-Consistent Learning ModelIEEE Transactions on Multimedia10.1109/TMM.2021.314002525(1190-1203)Online publication date: 1-Jan-2023
https://dl.acm.org/doi/10.1109/TMM.2021.3140025
Peng QYang LXie XLai J(2023)Learning Weak Semantics by Feature Graph for Attribute-Based Person SearchIEEE Transactions on Image Processing10.1109/TIP.2023.327074132(2580-2592)Online publication date: 1-Jan-2023
https://dl.acm.org/doi/10.1109/TIP.2023.3270741
Khatun ADenman SSridharan SFookes C(2023)Pose-driven attention-guided image generation for person re-IdentificationPattern Recognition10.1016/j.patcog.2022.109246137:COnline publication date: 1-May-2023
https://dl.acm.org/doi/10.1016/j.patcog.2022.109246
Xiang JHuang ZJiang XHou J(2023)Similarity learning with deep CRF for person re-identificationPattern Recognition10.1016/j.patcog.2022.109151135:COnline publication date: 1-Mar-2023
https://dl.acm.org/doi/10.1016/j.patcog.2022.109151
Ge PRen CXu XYan H(2023)Unsupervised Domain Adaptation via Deep Conditional Adaptation NetworkPattern Recognition10.1016/j.patcog.2022.109088134:COnline publication date: 1-Feb-2023
https://dl.acm.org/doi/10.1016/j.patcog.2022.109088
Show More Cited By

View Options

View options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Media

Figures

Other

Tables

View Issue’s Table of Contents