research-article

Fully Unsupervised Person Re-Identification via Selective Contrastive Learning

Authors:

Xianming Liu

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 18, Issue 2

Article No.: 64, Pages 1 - 15

https://doi.org/10.1145/3485061

Published: 16 February 2022

Abstract

Person re-identification (ReID) aims to retrieve images of the same person from among images captured by different cameras. Existing fully supervised person ReID methods usually generalize poorly because of domain gaps. Unsupervised person ReID has recently attracted considerable attention because it works without intensive manual annotation and thus shows great potential for adapting to new conditions. Representation learning plays a critical role in unsupervised person ReID. In this work, we propose a novel selective contrastive learning framework for fully unsupervised feature learning. Specifically, unlike traditional contrastive learning strategies, we define the contrastive loss using multiple positives and adaptively selected negatives, which enables learning a feature embedding model with a more identity-discriminative representation. Moreover, we jointly leverage global and local features to construct three dynamic memory banks: the global and local memory banks are used for pairwise similarity computation, and the mixture memory bank is used to define the contrastive loss. Experimental results demonstrate the superiority of our method over the state of the art in unsupervised person ReID. Our code is available at https://github.com/pangbo1997/Unsup_ReID.git.
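To make the idea of a contrastive loss with multiple positives and a selected subset of negatives more concrete, the following is a minimal sketch and not the authors' implementation. The function names, the cosine similarity measure, the temperature value, and in particular the selection rule (the most similar memory entries are treated as positives and the next most similar ones as negatives) are all assumptions made for illustration; the abstract does not specify how positives and negatives are chosen.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def selective_contrastive_loss(query, memory, num_pos=2, num_neg=2,
                               temperature=0.1):
    """InfoNCE-style loss with several positives and selected negatives.

    Hypothetical selection rule (an assumption, not taken from the paper):
    the num_pos memory entries most similar to the query act as positives,
    the next num_neg most similar entries act as negatives, and all other
    entries are ignored.
    """
    sims = [cosine(query, entry) for entry in memory]
    # Rank memory indices from most to least similar to the query.
    order = sorted(range(len(memory)), key=lambda i: sims[i], reverse=True)
    pos = order[:num_pos]
    neg = order[num_pos:num_pos + num_neg]

    # Temperature-scaled logits over the selected entries only.
    logits = {i: sims[i] / temperature for i in pos + neg}

    # Numerically stable log-sum-exp for the softmax denominator.
    max_logit = max(logits.values())
    log_denom = max_logit + math.log(
        sum(math.exp(value - max_logit) for value in logits.values())
    )

    # Average negative log-probability over all positives.
    loss = -sum(logits[i] - log_denom for i in pos) / len(pos)
    return loss, pos, neg
```

In this sketch, `memory` plays the role of a single memory bank of stored feature vectors. A fuller version would keep separate global, local, and mixture banks, as the abstract describes, and update them during training.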

Index Terms

Fully Unsupervised Person Re-Identification via Selective Contrastive Learning
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision

Information & Contributors

Information

Published In

ACM Transactions on Multimedia Computing, Communications, and Applications Volume 18, Issue 2

May 2022

494 pages

ISSN:1551-6857

EISSN:1551-6865

DOI:10.1145/3505207

Editor:
Alberto Del Bimbo
University of Firenze, Italy

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 16 February 2022

Accepted: 01 September 2021

Revised: 01 August 2021

Received: 01 July 2021

Published in TOMM Volume 18, Issue 2

Qualifiers

Research-article
Refereed

Funding Sources

National Key Research and Development Project
National Science Foundation of China
