Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3123266.3123279acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

GLAD: Global-Local-Alignment Descriptor for Pedestrian Retrieval

Published: 19 October 2017 Publication History

Abstract

The huge variance of human pose and the misalignment of detected human images significantly increase the difficulty of person Re-Identification (Re-ID). Moreover, efficient Re-ID systems are required to cope with the massive visual data being produced by video surveillance systems. Targeting to solve these problems, this work proposes a Global-Local-Alignment Descriptor (GLAD) and an efficient indexing and retrieval framework, respectively. GLAD explicitly leverages the local and global cues in human body to generate a discriminative and robust representation. It consists of part extraction and descriptor learning modules, where several part regions are first detected and then deep neural networks are designed for representation learning on both the local and global regions. A hierarchical indexing and retrieval framework is designed to eliminate the huge redundancy in the gallery set, and accelerate the online Re-ID procedure. Extensive experimental results show GLAD achieves competitive accuracy compared to the state-of-the-art methods. Our retrieval framework significantly accelerates the online Re-ID procedure without loss of accuracy. Therefore, this work has potential to work better on person Re-ID tasks in real scenarios.

References

[1]
Ejaz Ahmed, Michael Jones, and Tim K Marks. 2015. An improved deep learning architecture for person re-identification CVPR.
[2]
Mykhaylo Andriluka, Leonid Pishchulin, Peter Gehler, and Bernt Schiele. 2014. 2D Human Pose Estimation: New Benchmark and State of the Art Analysis CVPR.
[3]
Apurva Bedagkar-Gala and Shishir K Shah. 2012. Part-based spatio-temporal model for multi-person re-identification. Pattern Recognition Letters Vol. 33, 14 (October. 2012), 1908--1915.
[4]
Dapeng Chen, Zejian Yuan, Badong Chen, and Nanning Zheng. 2016. Similarity learning with spatial constraints for person re-identification CVPR.
[5]
Ying-Cong Chen, Wei-Shi Zheng, and Jianhuang Lai. 2015. Mirror Representation for Modeling View-Specific Transform in Person Re-Identification. IJCAI.
[6]
De Cheng, Yihong Gong, Sanping Zhou, Jinjun Wang, and Nanning Zheng. 2016. Person re-identification by multi-channel parts-based cnn with improved triplet loss function. In CVPR.
[7]
Dong Seon Cheng and Marco Cristani. 2014. Person re-identification by articulated appearance matching. Person Re-Identification. Springer, 139--160.
[8]
Michela Farenzena, Loris Bazzani, Alessandro Perina, Vittorio Murino, and Marco Cristani. 2010. Person re-identification by symmetry-driven accumulation of local features CVPR.
[9]
Pedro Felzenszwalb, David McAllester, and Deva Ramanan. 2008. A discriminatively trained, multiscale, deformable part model CVPR.
[10]
Douglas Gray, Shane Brennan, and Hai Tao. 2007. Evaluating appearance models for recognition, reacquisition, and tracking PETSW.
[11]
John A Hartigan and Manchek A Wong. 1979. Algorithm AS 136: A k-means clustering algorithm. Journal of the Royal Statistical Society. Series C (Applied Statistics), Vol. 28, 1 (1979), 100--108.
[12]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In CVPR.
[13]
Eldar Insafutdinov, Leonid Pishchulin, Bjoern Andres, Mykhaylo Andriluka, and Bernt Schiele. 2016. DeeperCut: A deeper, stronger, and faster multi-person pose estimation model ECCV.
[14]
Yangqing Jia, Evan Shelhamer, Jeff Donahue, Sergey Karayev, Jonathan Long, Ross Girshick, Sergio Guadarrama, and Trevor Darrell. 2014. Caffe: Convolutional Architecture for Fast Feature Embedding. arXiv:1408.5093 (2014).
[15]
Cijo Jose and Franccois Fleuret. 2016. Scalable metric learning via weighted approximate rank component analysis ECCV.
[16]
A. Krizhevsky, I. Sutskever, and G. E. Hinton. 2012. Imagenet classification with deep convolutional neural networks NIPS.
[17]
Wei Li, Rui Zhao, Tong Xiao, and Xiaogang Wang. 2014. Deepreid: Deep filter pairing neural network for person re-identification CVPR.
[18]
Shengcai Liao, Yang Hu, Xiangyu Zhu, and Stan Z Li. 2015. Person re-identification by local maximal occurrence representation and metric learning CVPR.
[19]
Shengcai Liao and Stan Z Li. 2015. Efficient psd constrained asymmetric metric learning for person re-identification ICCV.
[20]
Hao Liu, Jiashi Feng, Meibin Qi, Jianguo Jiang, and Shuicheng Yan. 2016. End-to-end comparative attention networks for person re-identification. arXiv:1606.04404 (2016).
[21]
Alejandro Newell, Kaiyu Yang, and Jia Deng. 2016. Stacked hourglass networks for human pose estimation ECCV.
[22]
Leonid Pishchulin, Eldar Insafutdinov, Siyu Tang, Bjoern Andres, Mykhaylo Andriluka, Peter V Gehler, and Bernt Schiele. 2016. Deepcut: Joint subset partition and labeling for multi person pose estimation CVPR.
[23]
Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556 (2014).
[24]
Chi Su, Fan Yang, Shiliang Zhang, Qi Tian, Larry S. Davis, and Wen Gao. 2017. Multi-Task Learning with Low Rank Attribute Embedding for Multi-Camera Person Re-identification. IEEE Transactions on Pattern Analysis and Machine Intelligence (2017).
[25]
Chi Su, Shiliang Zhang, Junliang Xing, Wen Gao, and Qi Tian. 2016. Deep attributes driven multi-camera person re-identification ECCV.
[26]
Chi Su, Shiliang Zhang, Junliang Xing, Qi Tian, and Wen Gao. 2017 a. Multi-Type Attributes Driven Multi-Camera Person Re-identification. Pattern Recognition (2017).
[27]
Chi Su, Shiliang Zhang, Fan Yang, Guangxiao Zhang, Qi Tian, Wen Gao, and Larry S. Davis. 2017 b. Attributes driven tracklet-to-tracklet person re-identification using Latent Prototypes Space Mapping. Pattern Recognition Vol. 66 (June. 2017), 4--15.
[28]
C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich. 2015. Going deeper with convolutions. In CVPR.
[29]
Jonathan J Tompson, Arjun Jain, Yann LeCun, and Christoph Bregler. 2014. Joint training of a convolutional network and a graphical model for human pose estimation NIPS.
[30]
Alexander Toshev and Christian Szegedy. 2014. Deeppose: Human pose estimation via deep neural networks CVPR.
[31]
Rahul Rama Varior, Mrinal Haloi, and Gang Wang. 2016 a. Gated siamese convolutional neural network architecture for human re-identification ECCV.
[32]
Rahul Rama Varior, Bing Shuai, Jiwen Lu, Dong Xu, and Gang Wang. 2016 b. A siamese long short-term memory architecture for human re-identification ECCV.
[33]
Faqiang Wang, Wangmeng Zuo, Liang Lin, David Zhang, and Lei Zhang. 2016. Joint learning of single-image and cross-image representations for person re-identification CVPR.
[34]
Ruiping Wang, Shiguang Shan, Xilin Chen, Qionghai Dai, and Wen Gao. 2012. Manifold--manifold distance and its application to face recognition with image sets. IEEE Transactions on Image Processing Vol. 21, 10 (October. 2012), 4466--4479.
[35]
Shih-En Wei, Varun Ramakrishna, Takeo Kanade, and Yaser Sheikh. 2016. Convolutional pose machines. In CVPR.
[36]
Lin Wu, Chunhua Shen, and Anton van den Hengel. 2016 b. Personnet: person re-identification with deep convolutional neural networks. arXiv:1601.07255 (2016).
[37]
Shangxuan Wu, Ying-Cong Chen, Xiang Li, An-Cong Wu, Jin-Jie You, and Wei-Shi Zheng. 2016 a. An enhanced deep feature representation for person re-identification WACV.
[38]
Tong Xiao, Hongsheng Li, Wanli Ouyang, and Xiaogang Wang. 2016. Learning deep feature representations with domain guided dropout for person re-identification CVPR.
[39]
Hantao Yao, Shiliang Zhang, Dongming Zhang, Yongdong Zhang, Jintao Li, Yu Wang, and Qi Tian. 2017. Large-scale Person Re-Identification as Retrieval. ICME.
[40]
Dong Yi, Zhen Lei, Shengcai Liao, and Stan Z Li. 2014. Deep metric learning for person re-identification. ICPR.
[41]
Li Zhang, Tao Xiang, and Shaogang Gong. 2016. Learning a discriminative null space for person re-identification CVPR.
[42]
Shiliang Zhang, Xiaoyu Wang, Yuanqing Lin, and Qi Tian. 2015. Cross Indexing With Grouplets. IEEE Transactions on Multimedia Vol. 17, 11 (2015), 1969--1979.
[43]
Liang Zheng, Yujia Huang, Huchuan Lu, and Yi Yang. 2017. Pose Invariant Embedding for Deep Person Re-identification. arXiv:1701.07732 (2017).
[44]
Liang Zheng, Liyue Shen, Lu Tian, Shengjin Wang, Jingdong Wang, and Qi Tian. 2015. Scalable person re-identification: A benchmark. In ICCV.
[45]
Zhedong Zheng, Liang Zheng, and Yi Yang. 2016. A Discriminatively Learned CNN Embedding for Person Re-identification. arXiv:1611.05666 (2016).

Cited By

View all
  • (2024)Cross-Video Pedestrian Tracking Algorithm with a Coordinate ConstraintSensors10.3390/s2403077924:3(779)Online publication date: 25-Jan-2024
  • (2024)Cross-Domain Person Re-Identification Based on Feature Fusion InvarianceApplied Sciences10.3390/app1411464414:11(4644)Online publication date: 28-May-2024
  • (2024)Multiple-local feature and attention fused person re-identification methodIntelligent Data Analysis10.3233/IDA-23039228:6(1679-1695)Online publication date: 15-Nov-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '17: Proceedings of the 25th ACM international conference on Multimedia
October 2017
2028 pages
ISBN:9781450349062
DOI:10.1145/3123266
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 October 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. global-local-alignment descriptor
  2. person re-identification
  3. retrieval framework

Qualifiers

  • Research-article

Funding Sources

  • National Science Foundation of China

Conference

MM '17
Sponsor:
MM '17: ACM Multimedia Conference
October 23 - 27, 2017
California, Mountain View, USA

Acceptance Rates

MM '17 Paper Acceptance Rate 189 of 684 submissions, 28%;
Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)41
  • Downloads (Last 6 weeks)7
Reflects downloads up to 25 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Cross-Video Pedestrian Tracking Algorithm with a Coordinate ConstraintSensors10.3390/s2403077924:3(779)Online publication date: 25-Jan-2024
  • (2024)Cross-Domain Person Re-Identification Based on Feature Fusion InvarianceApplied Sciences10.3390/app1411464414:11(4644)Online publication date: 28-May-2024
  • (2024)Multiple-local feature and attention fused person re-identification methodIntelligent Data Analysis10.3233/IDA-23039228:6(1679-1695)Online publication date: 15-Nov-2024
  • (2024)HashReID: Dynamic Network with Binary Codes for Efficient Person Re-identification2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)10.1109/WACV57701.2024.00594(6034-6043)Online publication date: 3-Jan-2024
  • (2024)A Two-Stage Noise-Tolerant Paradigm for Label Corrupted Person Re-IdentificationIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2024.336149146:7(4944-4956)Online publication date: Jul-2024
  • (2024)SSRR: Structural Semantic Representation Reconstruction for Visible-Infrared Person Re-IdentificationIEEE Transactions on Multimedia10.1109/TMM.2023.334785526(6273-6284)Online publication date: 2024
  • (2024)Transformer-Based Person Re-Identification: A Comprehensive ReviewIEEE Transactions on Intelligent Vehicles10.1109/TIV.2024.33506699:7(5222-5239)Online publication date: Jul-2024
  • (2024)Neighbor Consistency and Global-Local Interaction: A Novel Pseudo-Label Refinement Approach for Unsupervised Person Re-IdentificationIEEE Transactions on Information Forensics and Security10.1109/TIFS.2024.346503719(9070-9084)Online publication date: 2024
  • (2024)Contrastive Pedestrian Attentive and Correlation Learning Network for Occluded Person Re-IdentificationIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2024.337957734:9(8862-8880)Online publication date: Sep-2024
  • (2024)Equity in Unsupervised Domain Adaptation by Nuclear Norm MaximizationIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2023.334644434:7(5533-5545)Online publication date: Jul-2024
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media