Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Fully Unsupervised Person Re-Identification via Selective Contrastive Learning

Published: 16 February 2022 Publication History

Abstract

Person re-identification (ReID) aims at searching the same identity person among images captured by various cameras. Existing fully supervised person ReID methods usually suffer from poor generalization capability caused by domain gaps. Unsupervised person ReID has attracted a lot of attention recently, because it works without intensive manual annotation and thus shows great potential in adapting to new conditions. Representation learning plays a critical role in unsupervised person ReID. In this work, we propose a novel selective contrastive learning framework for fully unsupervised feature learning. Specifically, different from traditional contrastive learning strategies, we propose to use multiple positives and adaptively selected negatives for defining the contrastive loss, enabling to learn a feature embedding model with stronger identity discriminative representation. Moreover, we propose to jointly leverage global and local features to construct three dynamic memory banks, among which the global and local ones are used for pairwise similarity computation and the mixture memory bank are used for contrastive loss definition. Experimental results demonstrate the superiority of our method in unsupervised person ReID compared with the state of the art. Our code is available at https://github.com/pangbo1997/Unsup_ReID.git.

References

[1]
Mang Ye, Jianbing Shen, Gaojie Lin, Tao Xiang, Ling Shao, and Steven C. H. Hoi. 2021. Deep learning for person re-identification: A survey and outlook. IEEE Transactions on Pattern Analysis and Machine Intelligence. Early access, January 26, 2021.
[2]
Jingya Wang, Xiatian Zhu, Shaogang Gong, and Wei Li. 2018. Transferable joint attribute-identity deep learning for unsupervised person re-identification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2275–2284.
[3]
Weijian Deng, Liang Zheng, Qixiang Ye, Guoliang Kang, Yi Yang, and Jianbin Jiao. 2018. Image-image domain adaptation with preserved self-similarity and domain-dissimilarity for person re-identification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 994–1003.
[4]
Jiawei Liu, Zheng-Jun Zha, Di Chen, Richang Hong, and Meng Wang. 2019. Adaptive transfer network for cross-domain person re-identification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 7202–7211.
[5]
Yixiao Ge, Feng Zhu, Dapeng Chen, Rui Zhao, and Hongsheng Li. 2020. Self-paced contrastive learning with hybrid memory for domain adaptive object re-ID. In Advances in Neural Information Processing Systems.
[6]
Yutian Lin, Xuanyi Dong, Liang Zheng, Yan Yan, and Yi Yang. 2019. A bottom-up clustering approach to unsupervised person re-identification. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 8738–8745.
[7]
Dongkai Wang and Shiliang Zhang. 2020. Unsupervised person re-identification via multi-label classification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 10981–10990.
[8]
Yutian Lin, Lingxi Xie, Yu Wu, Chenggang Yan, and Qi Tian. 2020. Unsupervised person re-identification via softened similarity learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 3390–3399.
[9]
Ting Chen, Simon Kornblith, Mohammad Norouzi, and Geoffrey Hinton. 2020. A simple framework for contrastive learning of visual representations. arXiv preprint arXiv:2002.05709.
[10]
Kaiming He, Haoqi Fan, Yuxin Wu, Saining Xie, and Ross Girshick. 2020. Momentum contrast for unsupervised visual representation learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 9729–9738.
[11]
Yifan Sun, Liang Zheng, Yi Yang, Qi Tian, and Shengjin Wang. 2018. Beyond part models: Person retrieval with refined part pooling (and a strong convolutional baseline). In Proceedings of the European Conference on Computer Vision (ECCV’18). 480–496.
[12]
J. Long, E. Shelhamer, and T. Darrell. 2015. Fully convolutional networks for semantic segmentation. In Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR’15). 3431–3440.
[13]
Liming Zhao, Xi Li, Yueting Zhuang, and Jingdong Wang. 2017. Deeply-learned part-aligned representations for person re-identification. In Proceedings of the IEEE International Conference on Computer Vision (ICCV’17). 3219–3228.
[14]
X. Jin, C. Lan, W. Zeng, Z. Chen, and L. Zhang. 2020. Style normalization and restitution for generalizable person re-identification. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’20). 3140–3149.
[15]
Guile Wu, Xiatian Zhu, and Shaogang Gong. 2020. Tracklet self-supervised learning for unsupervised person re-identification. Proceedings of the AAAI Conference on Artificial Intelligence 34, 7 (2020), 12362–12369.
[16]
Zhun Zhong, Liang Zheng, Shaozi Li, and Yi Yang. 2018. Generalizing a person retrieval model hetero-and homogeneously. In Proceedings of the European Conference on Computer Vision (ECCV’18). 172–188.
[17]
Yu Wu, Yutian Lin, Xuanyi Dong, Yan Yan, and Yi Yang. 2018. Exploit the unknown gradually: One-shot video-based person re-identification by stepwise learning. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’18).
[18]
Yu Wu, Yutian Lin, Xuanyi Dong, Yan Yan, Wei Bian, and Yi Yang. 2019. Progressive learning for person re-identification with one example. IEEE Transactions on Image Processing. Early access, January 10, 2019.
[19]
Mang Ye, Xiangyuan Lan, and Pong C. Yuen. 2018. Robust anchor embedding for unsupervised video person re-identification in the wild. In Proceedings of the European Conference on Computer Vision (ECCV’18). 170–186.
[20]
Shengcai Liao, Yang Hu, Xiangyu Zhu, and Stan Z. Li. 2015. Person re-identification by local maximal occurrence representation and metric learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2197–2206.
[21]
Liang Zheng, Liyue Shen, Lu Tian, Shengjin Wang, Jingdong Wang, and Qi Tian. 2015. Scalable person re-identification: A benchmark. In Proceedings of the IEEE International Conference on Computer Vision. 1116–1124.
[22]
Spyros Gidaris, Praveer Singh, and Nikos Komodakis. 2018. Unsupervised representation learning by predicting image rotations. In Proceedings of the International Conference on Learning Representations.
[23]
Zhirong Wu, Yuanjun Xiong, Stella Yu, and Dahua Lin. 2018. Unsupervised feature learning via non-parametric instance-level discrimination. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
[24]
L. Zheng, L. Shen, L. Tian, S. Wang, J. Wang, and Q. Tian. 2015. Scalable person re-identification: A benchmark. In Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV’15). 1116–1124.
[25]
Ergys Ristani, Francesco Solera, Roger Zou, Rita Cucchiara, and Carlo Tomasi. 2016. Performance measures and a data set for multi-target, multi-camera tracking. In Proceedings of the European Computer Vision Workshop on Benchmarking Multi-Target Tracking.
[26]
Longhui Wei, Shiliang Zhang, Wen Gao, and Qi Tian. 2018. Person transfer GAN to bridge domain gap for person re-identification. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[27]
Y. Wu, Y. Lin, X. Dong, Y. Yan, W. Ouyang, and Y. Yang. 2018. Exploit the unknown gradually: One-shot video-based person re-identification by stepwise learning. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. 5177–5186.
[28]
Liang Zheng, Zhi Bie, Yifan Sun, Jingdong Wang, Chi Su, Shengjin Wang, and Qi Tian. 2016. MARS: A video benchmark for large-scale person re-identification. In Computer Vision—ECCV 2016, Bastian Leibe, Jiri Matas, Nicu Sebe, and Max Welling (Eds.). Springer International, Cham, Switzerland, 868–884.
[29]
Guodong Ding, Salman H. Khan, and Zhenmin Tang. 2019. Dispersion based clustering for unsupervised person re-identification. In Proceedings of the British Machine Vision Conference (BMVC’19). 264.
[30]
Zhun Zhong, Liang Zheng, Zhiming Luo, Shaozi Li, and Yi Yang. 2020. Invariance matters: Exemplar memory for domain adaptive person re-identification. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’20).
[31]
Yang Fu, Yunchao Wei, Guanshuo Wang, Yuqian Zhou, Honghui Shi, and Thomas Huang. 2019. Self-similarity grouping: A simple unsupervised cross domain adaptation approach for person re-identification. In Proceedings of the 2019 International Conference on Computer Vision (ICCV’19).
[32]
Dongkai Wang and Shiliang Zhang. 2020. Unsupervised person re-identification via multi-label classification. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR’20). 10981–10990.
[33]
Yanbei Chen, Xiatian Zhu, and Shaogang Gong. 2018. Deep association learning for unsupervised video person re-identification. arXiv preprint arXiv:1808.07301.
[34]
Yu Wu, Yutian Lin, Xuanyi Dong, Yan Yan, Wanli Ouyang, and Yi Yang. 2018. Exploit the unknown gradually: One-shot video-based person re-identification by stepwise learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 5177–5186.

Cited By

View all
  • (2024)UnA-Mix: Rethinking Image Mixtures for Unsupervised Person Re-IdentificationProcesses10.3390/pr1201016812:1(168)Online publication date: 10-Jan-2024
  • (2024)Inter-camera Identity Discrimination for Unsupervised Person Re-identificationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/365285820:8(1-18)Online publication date: 13-Jun-2024
  • (2024)Instance-level Adversarial Source-free Domain Adaptive Person Re-identificationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/364990020:7(1-22)Online publication date: 25-Apr-2024
  • Show More Cited By

Index Terms

  1. Fully Unsupervised Person Re-Identification via Selective Contrastive Learning

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Multimedia Computing, Communications, and Applications
    ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 18, Issue 2
    May 2022
    494 pages
    ISSN:1551-6857
    EISSN:1551-6865
    DOI:10.1145/3505207
    Issue’s Table of Contents

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 16 February 2022
    Accepted: 01 September 2021
    Revised: 01 August 2021
    Received: 01 July 2021
    Published in TOMM Volume 18, Issue 2

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Person re-identification
    2. unsupervised learning
    3. contrastive learning

    Qualifiers

    • Research-article
    • Refereed

    Funding Sources

    • National Key Research and Development Project
    • National Science Foundation of China

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)95
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 10 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)UnA-Mix: Rethinking Image Mixtures for Unsupervised Person Re-IdentificationProcesses10.3390/pr1201016812:1(168)Online publication date: 10-Jan-2024
    • (2024)Inter-camera Identity Discrimination for Unsupervised Person Re-identificationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/365285820:8(1-18)Online publication date: 13-Jun-2024
    • (2024)Instance-level Adversarial Source-free Domain Adaptive Person Re-identificationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/364990020:7(1-22)Online publication date: 25-Apr-2024
    • (2024)RAST: Restorable Arbitrary Style TransferACM Transactions on Multimedia Computing, Communications, and Applications10.1145/363877020:5(1-21)Online publication date: 22-Jan-2024
    • (2024)Viewpoint Disentangling and Generation for Unsupervised Object Re-IDACM Transactions on Multimedia Computing, Communications, and Applications10.1145/363295920:5(1-23)Online publication date: 22-Jan-2024
    • (2024)Anchor Association Learning for Unsupervised Video Person Re-IdentificationIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2022.317913335:1(1013-1024)Online publication date: Jan-2024
    • (2024)Asymmetric double networks mutual teaching for unsupervised person Re-identificationNeural Networks10.1016/j.neunet.2023.11.001169:C(744-755)Online publication date: 4-Mar-2024
    • (2023)Identity Feature Disentanglement for Visible-Infrared Person Re-IdentificationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/359518319:6(1-20)Online publication date: 12-Jul-2023
    • (2023)Instance-Based Continual Learning: A Real-World Dataset and Baseline for Fresh RecognitionACM Transactions on Multimedia Computing, Communications, and Applications10.1145/359120920:1(1-23)Online publication date: 25-Apr-2023
    • (2023)Context Sensing Attention Network for Video-based Person Re-identificationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/357320319:4(1-20)Online publication date: 27-Feb-2023
    • Show More Cited By

    View Options

    Get Access

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    Full Text

    HTML Format

    View this article in HTML Format.

    HTML Format

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media