Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Deep feature learning with relative distance comparison for person re-identification

Published: 01 October 2015 Publication History
  • Get Citation Alerts
  • Abstract

    Identifying the same individual across different scenes is an important yet difficult task in intelligent video surveillance. Its main difficulty lies in how to preserve similarity of the same person against large appearance and structure variation while discriminating different individuals. In this paper, we present a scalable distance driven feature learning framework based on the deep neural network for person re-identification, and demonstrate its effectiveness to handle the existing challenges. Specifically, given the training images with the class labels (person IDs), we first produce a large number of triplet units, each of which contains three images, i.e. one person with a matched reference and a mismatched reference. Treating the units as the input, we build the convolutional neural network to generate the layered representations, and follow with the L 2 distance metric. By means of parameter optimization, our framework tends to maximize the relative distance between the matched pair and the mismatched pair for each triplet unit. Moreover, a nontrivial issue arising with the framework is that the triplet organization cubically enlarges the number of training triplets, as one image can be involved into several triplet units. To overcome this problem, we develop an effective triplet generation scheme and an optimized gradient descent algorithm, making the computational load mainly depend on the number of original images instead of the number of triplets. On several challenging databases, our approach achieves very promising results and outperforms other state-of-the-art approaches. HighlightsWe present a novel feature learning framework for person re-identification.Our framework is based on the maximum relative distance comparison.The learning algorithm is scalable to process large amount of data.We demonstrate superior performances over other state-of-the-arts.

    References

    [1]
    L. Lin, T. Wu, J. Porway, Z. Xu, A stochastic graph grammar for compositional object representation and recognition, Pattern Recognit., 42 (2009) 1297-1307.
    [2]
    D. Gray, H. Tao, Viewpoint invariant pedestrian recognition with an ensemble of localized features, in: ECCV, Springer, 2008, pp. 262-275.
    [3]
    X. Wang, G. Doretto, T. Sebastian, J. Rittscher, P. Tu, Shape and appearance context modeling, in: ICCV, IEEE, 2007, pp. 1-8.
    [4]
    R. Layne, T.M. Hospedales, S. Gong, Towards person identification and re-identification with attributes, in: ECCV, Springer, 2012, pp. 402-412.
    [5]
    M. Farenzena, L. Bazzani, A. Perina, V. Murino, M. Cristani, Person re-identification by symmetry-driven accumulation of local features, in: CVPR, IEEE, 2010, pp. 2360-2367.
    [6]
    L. Lin, R. Zhang, X. Duan, Adaptive scene category discovery with generative learning and compositional sampling, IEEE Trans. Circuits Syst. Video Technol., 25 (2015) 251-260.
    [7]
    W. Li, R. Zhao, X. Wang, Human re-identification with transferred metric learning, in: ACCV, Springer, 2013, pp. 31-44.
    [8]
    Z. Li, S. Chang, F. Liang, T.S. Huang, L. Cao, J.R. Smith, Learning locally-adaptive decision functions for person verification, in: CVPR, IEEE, 2013, pp. 3610-3617.
    [9]
    W.-S. Zheng, S. Gong, T. Xiang, Person re-identification by probabilistic relative distance comparison, in: CVPR, IEEE, 2011, pp. 649-656.
    [10]
    L. Lin, P. Luo, X. Chen, K. Zeng, Representing and recognizing objects with massive local image patches, Pattern Recognit., 45 (2012) 231-240.
    [11]
    L. Lin, X. Wang, W. Yang, J. Lai, Discriminatively trained and-or graph models for object shape detection, IEEE Trans. Pattern Anal. Mach. Intell., 37 (2015) 959-972.
    [12]
    Y. Xu, L. Lin, W.-S. Zheng, X. Liu, Human re-identification by matching compositional template with cluster sampling, in: IEEE International Conference on Computer Vision (ICCV), IEEE, 2013, pp. 3152-3159.
    [13]
    R. Zhao, W. Ouyang, X. Wang, Learning mid-level filters for person re-identification, in: CVPR, 2013, pp. 144-151.
    [14]
    A. Mignon, F. Jurie, Pcca: A new approach for distance learning from sparse pairwise constraints, in: CVPR, IEEE, 2012, pp. 2666-2672.
    [15]
    E.P. Xing, M.I. Jordan, S. Russell, A.Y. Ng, Distance metric learning with application to clustering with side-information, in: NIPS, 2002, pp. 505-512.
    [16]
    K.Q. Weinberger, J. Blitzer, L.K. Saul, Distance metric learning for large margin nearest neighbor classification, in: NIPS, 2005, pp. 1473-1480.
    [17]
    J.V. Davis, B. Kulis, P. Jain, S. Sra, I.S. Dhillon, Information-theoretic metric learning, in: ICML, ACM, 2007, pp. 209-216.
    [18]
    S. Xiang, F. Nie, C. Zhang, Learning a mahalanobis distance metric for data clustering and classification, Pattern Recognit., 41 (2008) 3600-3612.
    [19]
    D. Yi, Z. Lei, S.Z. Li, Deep metric learning for practical person re-identification, CoRR abs/1407.4979.
    [20]
    H. Liu, B. Ma, L. Qin, J. Pang, C. Zhang, Q. Huang, Set-label modeling and deep metric learning on person re-identification, Neurocomputing, 151 (2015) 1283-1292.
    [21]
    L. Lin, X. Liu, S.-C. Zhu, Layered graph matching with composite cluster sampling, IEEE Trans. Pattern Anal. Mach. Intell., 32 (2010) 1426-1442.
    [22]
    W. Li, R. Zhao, T. Xiao, X. Wang, Deepreid: Deep filter pairing neural network for person re-identification, in: CVPR, 2014, pp. 152-159.
    [23]
    J. Wang, T. Leung, C. Rosenberg, J. Wang, J. Philbin, B. Chen, Y. Wu, et al., Learning fine-grained image similarity with deep ranking, in: CVPR, 2014.
    [24]
    A. Krizhevsky, I. Sutskever, G.E. Hinton, Imagenet classification with deep convolutional neural networks, in: NIPS, 2012, pp. 1097-1105.
    [25]
    Y. Jia, Caffe: An Open Source Convolutional Architecture for Fast Feature Embedding, {http://caffe.berkeleyvision.org/}, 2013.
    [26]
    T. Wang, S. Gong, X. Zhu, S. Wang, Person re-identification by video ranking, in: European Conference on Computer Vision, IEEE, 2014.
    [27]
    D. Gray, S. Brennan, H. Tao, Evaluating appearance models for recognition, reacquisition, and tracking, in: PETS, 2007.
    [28]
    W.R. Schwartz, L.S. Davis, Learning discriminative appearance-based models using partial least squares, in: 2009 XXII Brazilian Symposium on Computer Graphics and Image Processing (SIBGRAPI), IEEE, 2009, pp. 322-329.
    [29]
    A. Globerson, S.T. Roweis, Metric learning by collapsing classes, in: NIPS, 2005, pp. 451-458.
    [30]
    L. Ma, X. Yang, D. Tao, Person re-identification over camera networks using multi-task distance metric learning, IEEE Trans. Image Process., 23 (2014) 3656-3670.
    [31]
    B. Ma, Y. Su, F. Jurie, et al., Bicov: a novel image representation for person re-identification and face verification, in: BMVC, 2012.
    [32]
    R. Zhao, W. Ouyang, X. Wang, Unsupervised salience learning for person re-identification, in: CVPR, IEEE, 2013, pp. 3586-3593.
    [33]
    C. Liu, S. Gong, C.C. Loy, X. Lin, Person re-identification: what features are important? in: ECCV, Springer, 2012, pp. 391-401.
    [34]
    M. Kostinger, M. Hirzer, P. Wohlhart, P.M. Roth, H. Bischof, Large scale metric learning from equivalence constraints, in: CVPR, IEEE, 2012, pp. 2288-2295.
    [35]
    S. Pedagadi, J. Orwell, S. Velastin, B. Boghossian, Local fisher discriminant analysis for pedestrian re-identification, in: CVPR, IEEE, 2013, pp. 3318-3325.
    [36]
    R. Zhao, W. Ouyang, X. Wang, Person re-identification by salience matching, in: ICCV, IEEE, 2013, pp. 2528-2535.

    Cited By

    View all
    • (2024)Disturbance Propagation Model of Pedestrian Fall Behavior in a Pedestrian Crowd and Elimination Mechanism AnalysisIEEE Transactions on Intelligent Transportation Systems10.1109/TITS.2023.331407225:2(1519-1529)Online publication date: 1-Feb-2024
    • (2024)Learning dual attention enhancement feature for visible–infrared person re-identificationJournal of Visual Communication and Image Representation10.1016/j.jvcir.2024.10407699:COnline publication date: 1-Mar-2024
    • (2024)Deep pixel regeneration for occlusion reconstruction in person re-identificationMultimedia Tools and Applications10.1007/s11042-023-15322-z83:2(4443-4463)Online publication date: 1-Jan-2024
    • Show More Cited By

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Pattern Recognition
    Pattern Recognition  Volume 48, Issue 10
    October 2015
    321 pages

    Publisher

    Elsevier Science Inc.

    United States

    Publication History

    Published: 01 October 2015

    Author Tags

    1. Deep learning
    2. Distance comparison
    3. Person re-identification

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 11 Aug 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Disturbance Propagation Model of Pedestrian Fall Behavior in a Pedestrian Crowd and Elimination Mechanism AnalysisIEEE Transactions on Intelligent Transportation Systems10.1109/TITS.2023.331407225:2(1519-1529)Online publication date: 1-Feb-2024
    • (2024)Learning dual attention enhancement feature for visible–infrared person re-identificationJournal of Visual Communication and Image Representation10.1016/j.jvcir.2024.10407699:COnline publication date: 1-Mar-2024
    • (2024)Deep pixel regeneration for occlusion reconstruction in person re-identificationMultimedia Tools and Applications10.1007/s11042-023-15322-z83:2(4443-4463)Online publication date: 1-Jan-2024
    • (2024)Multi-granularity attention in attention for person re-identification in aerial imagesThe Visual Computer: International Journal of Computer Graphics10.1007/s00371-023-03074-840:6(4149-4166)Online publication date: 1-Jun-2024
    • (2023)A Theory-Driven Deep Learning Method for Voice Chat–Based Customer Response PredictionInformation Systems Research10.1287/isre.2022.119634:4(1513-1532)Online publication date: 1-Dec-2023
    • (2023)Person Search by a Bi-Directional Task-Consistent Learning ModelIEEE Transactions on Multimedia10.1109/TMM.2021.314002525(1190-1203)Online publication date: 1-Jan-2023
    • (2023)Learning Weak Semantics by Feature Graph for Attribute-Based Person SearchIEEE Transactions on Image Processing10.1109/TIP.2023.327074132(2580-2592)Online publication date: 1-Jan-2023
    • (2023)Pose-driven attention-guided image generation for person re-IdentificationPattern Recognition10.1016/j.patcog.2022.109246137:COnline publication date: 1-May-2023
    • (2023)Similarity learning with deep CRF for person re-identificationPattern Recognition10.1016/j.patcog.2022.109151135:COnline publication date: 1-Mar-2023
    • (2023)Unsupervised Domain Adaptation via Deep Conditional Adaptation NetworkPattern Recognition10.1016/j.patcog.2022.109088134:COnline publication date: 1-Feb-2023
    • Show More Cited By

    View Options

    View options

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media