Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3394171.3413578acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Beyond the Parts: Learning Multi-view Cross-part Correlation for Vehicle Re-identification

Published: 12 October 2020 Publication History
  • Get Citation Alerts
  • Abstract

    Vehicle re-identification (Re-Id) is a challenging task due to the inter-class similarity, the intra-class difference, and the cross-view misalignment of vehicle parts. Although recent methods achieve great improvement by learning detailed features from keypoints or bounding boxes of parts, vehicle Re-Id is still far from being solved. Different from existing methods, we propose a Parsing-guided Cross-part Reasoning Network, named as PCRNet, for vehicle Re-Id. The PCRNet explores vehicle parsing to learn discriminative part-level features, model the correlation among vehicle parts, and achieve precise part alignment for vehicle Re-Id. To accurately segment vehicle parts, we first build a large-scale Multi-grained Vehicle Parsing (MVP) dataset from surveillance images. With the parsed parts, we extract regional features for each part and build a part-neighboring graph to explicitly model the correlation among parts. Then, the graph convolutional networks (GCNs) are adopted to propagate local information among parts, which can discover the most effective local features of varied viewpoints. Moreover, we propose a self-supervised part prediction loss to make the GCNs generate features of invisible parts from visible parts under different viewpoints. By this means, the same vehicle from different viewpoints can be matched with the well-aligned and robust feature representations. Through extensive experiments, our PCRNet significantly outperforms the state-of-the-art methods on three large-scale vehicle Re-Id datasets.

    Supplementary Material

    ZIP File (mmfp1038aux.zip)
    The supplemental material contains one file, i.e., "supplement_mmfp1038.pdf". This file contains the appendix for the paper mmfp1038. It provides more details on the MVP dataset proposed in the paper. The Adobe PDF Reader can be used to view this file.
    MP4 File (3394171.3413578.mp4)
    Vehicle re-identification (Re-Id) is a challenging task. Although recent methods achieve great improvement by learning detailed features from keypoints or parts, vehicle Re-Id is still far from being solved. Different from existing methods, we propose a Parsing- guided Cross-part Reasoning Network for vehicle Re-Id. We explore vehicle parsing to learn part-level features, model the correlation among parts, and achieve precise part alignment for vehicle Re-Id. To segment vehicle parts, we build a Multi-grained Vehicle Parsing dataset. With the parsed parts, we extract regional features and build a part-neighboring graph to explicitly model the correlation among parts. Then, the graph convolutional networks are adopted to propagate local information among parts, which can discover effective local features of varied viewpoints. By this means, the same vehicle from different viewpoints can be matched with the well-aligned and robust feature representations.

    References

    [1]
    Saghir Ahmed Saghir Alfasly, Yongjian Hu, Haoliang Li, Tiancai Liang, Xiaofeng Jin, Beibei Liu, and Qingli Zhao. 2019. Multi-Label-Based Similarity Learning for Vehicle Re-Identification. IEEE Access, Vol. 7 (2019), 162605--162616.
    [2]
    Liang-Chieh Chen, Yukun Zhu, George Papandreou, Florian Schroff, and Hartwig Adam. 2018. Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation. In European Conference on Computer Vision. 833--851.
    [3]
    Xianjie Chen, Roozbeh Mottaghi, Xiaobai Liu, Sanja Fidler, Raquel Urtasun, and Alan L. Yuille. 2014. Detect What You Can: Detecting and Representing Objects Using Holistic Models and Body Parts. In IEEE Conference on Computer Vision and Pattern Recognition. 1979--1986.
    [4]
    Chuang Gan, Hang Zhao, Peihao Chen, David D. Cox, and Antonio Torralba. 2019. Self-Supervised Moving Vehicle Tracking With Stereo Sound. In IEEE International Conference on Computer Vision. 7052--7061.
    [5]
    Qichuan Geng, Feixiang Lu, Xinyu Huang, Sen Wang, Xinjing Cheng, Zhong Zhou, and Ruigang Yang. 2018. Part-level Car Parsing and Reconstruction from Single Street View. CoRR, Vol. abs/1811.10837 (2018).
    [6]
    Ke Gong, Xiaodan Liang, Dongyu Zhang, Xiaohui Shen, and Liang Lin. 2017. Look into Person: Self-Supervised Structure-Sensitive Learning and a New Benchmark for Human Parsing. In IEEE Conference on Computer Vision and Pattern Recognition. 6757--6765.
    [7]
    Haiyun Guo, Chaoyang Zhao, Zhiwei Liu, Jinqiao Wang, and Hanqing Lu. 2018. Learning Coarse-to-Fine Structured Feature Embedding for Vehicle Re-Identification. In AAAI Conference on Artificial Intelligence. 6853--6860.
    [8]
    Bing He, Jia Li, Yifan Zhao, and Yonghong Tian. 2019 a. Part-Regularized Near-Duplicate Vehicle Re-Identification. In IEEE Conference on Computer Vision and Pattern Recognition. 3997--4005.
    [9]
    Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep Residual Learning for Image Recognition. In IEEE Conference on Computer Vision and Pattern Recognition. 770--778.
    [10]
    Lingxiao He, Yinggang Wang, Wu Liu, He Zhao, Zhenan Sun, and Jiashi Feng. 2019 b. Foreground-Aware Pyramid Reconstruction for Alignment-Free Occluded Person Re-Identification. In IEEE International Conference on Computer Vision. 8449--8458.
    [11]
    Pirazh Khorramshahi, Amit Kumar, Neehar Peri, Sai Saketh Rambhatla, Jun-Cheng Chen, and Rama Chellappa. 2019. A Dual-Path Model With Adaptive Attention for Vehicle Re-Identification. In IEEE International Conference on Computer Vision. 6131--6140.
    [12]
    Diederik P. Kingma and Jimmy Ba. 2015. Adam: A Method for Stochastic Optimization. In International Conference on Learning Representations.
    [13]
    Thomas N. Kipf and Max Welling. 2017. Semi-Supervised Classification with Graph Convolutional Networks. In International Conference on Learning Representations.
    [14]
    Jianshu Li, Jian Zhao, Yunpeng Chen, Sujoy Roy, Shuicheng Yan, Jiashi Feng, and Terence Sim. 2018. Multi-Human Parsing Machines. In ACM International Conference on Multimedia. 45--53.
    [15]
    Shuangqun Li, Xinchen Liu, Wu Liu, Huadong Ma, and Haitao Zhang. 2016. A discriminative null space based deep learning approach for person re-identification. In International Conference on Cloud Computing and Intelligence Systems. 480--484.
    [16]
    Xiaodan Liang, Ke Gong, Xiaohui Shen, and Liang Lin. 2019. Look into Person: Joint Body Parsing & Pose Estimation Network and a New Benchmark. IEEE Trans. Pattern Anal. Mach. Intell., Vol. 41, 4 (2019), 871--885.
    [17]
    Shengcai Liao, Yang Hu, Xiangyu Zhu, and Stan Z. Li. 2015. Person re-identification by Local Maximal Occurrence representation and metric learning. In IEEE Conference on Computer Vision and Pattern Recognition. 2197--2206.
    [18]
    Hongye Liu, Yonghong Tian, Yaowei Wang, Lu Pang, and Tiejun Huang. 2016b. Deep Relative Distance Learning: Tell the Difference between Similar Vehicles. In IEEE Conference on Computer Vision and Pattern Recognition. 2167--2175.
    [19]
    Xinchen Liu, Wu Liu, Tao Mei, and Huadong Ma. 2016a. A Deep Learning-Based Approach to Progressive Vehicle Re-identification for Urban Surveillance. In European Conference on Computer Vision. 869--884.
    [20]
    Xinchen Liu, Wu Liu, Tao Mei, and Huadong Ma. 2018a. PROVID: Progressive and Multimodal Vehicle Reidentification for Large-Scale Urban Surveillance. IEEE Trans. Multimedia, Vol. 20, 3 (2018), 645--658.
    [21]
    Xiaobin Liu, Shiliang Zhang, Qingming Huang, and Wen Gao. 2018b. RAM: A Region-Aware Deep Model for Vehicle Re-Identification. In IEEE International Conference on Multimedia and Expo. 1--6.
    [22]
    Yinglu Liu, Hailin Shi, Yue Si, Hao Shen, Xiaobo Wang, and Tao Mei. 2019. A High-Efficiency Framework for Constructing Large-Scale Face Parsing Benchmark. CoRR, Vol. abs/1905.04830 (2019).
    [23]
    Yihang Lou, Yan Bai, Jun Liu, Shiqi Wang, and Ling-Yu Duan. 2019 a. Embedding Adversarial Learning for Vehicle Re-Identification. IEEE Trans. Image Processing, Vol. 28, 8 (2019), 3794--3807.
    [24]
    Yihang Lou, Yan Bai, Jun Liu, Shiqi Wang, and Lingyu Duan. 2019 b. VERI-Wild: A Large Dataset and a New Method for Vehicle Re-Identification in the Wild. In IEEE Conference on Computer Vision and Pattern Recognition. 3235--3243.
    [25]
    Hao Luo, Youzhi Gu, Xingyu Liao, Shenqi Lai, and Wei Jiang. 2019. Bag of Tricks and a Strong Baseline for Deep Person Re-Identification. In IEEE Conference on Computer Vision and Pattern Recognition Workshops. 1487--1495.
    [26]
    Silvio Savarese and Fei-Fei Li. 2007. 3D generic object categorization, localization and pose estimation. In IEEE International Conference on Computer Vision. 1--8.
    [27]
    Yantao Shen, Tong Xiao, Hongsheng Li, Shuai Yi, and Xiaogang Wang. 2017. Learning Deep Neural Networks for Vehicle Re-ID with Visual-spatio-Temporal Path Proposals. In IEEE International Conference on Computer Vision, ICCV 2017, Venice, Italy, October 22--29, 2017. 1918--1927.
    [28]
    Brandon M. Smith, Li Zhang, Jonathan Brandt, Zhe Lin, and Jianchao Yang. 2013. Exemplar-Based Face Parsing. In IEEE Conference on Computer Vision and Pattern Recognition. 3484--3491.
    [29]
    Ke Sun, Bin Xiao, Dong Liu, and Jingdong Wang. 2019. Deep High-Resolution Representation Learning for Human Pose Estimation. In IEEE Conference on Computer Vision and Pattern Recognition. 5693--5703.
    [30]
    Zheng Tang, Milind Naphade, Stan Birchfield, Jonathan Tremblay, William Hodge, Ratnesh Kumar, Shuo Wang, and Xiaodong Yang. 2019 a. PAMTRI: Pose-Aware Multi-Task Learning for Vehicle Re-Identification Using Highly Randomized Synthetic Data. In IEEE International Conference on Computer Vision. 211--220.
    [31]
    Zheng Tang, Milind Naphade, Ming-Yu Liu, Xiaodong Yang, Stan Birchfield, Shuo Wang, Ratnesh Kumar, David C. Anastasiu, and Jenq-Neng Hwang. 2019 b. CityFlow: A City-Scale Benchmark for Multi-Target Multi-Camera Vehicle Tracking and Re-Identification. In IEEE Conference on Computer Vision and Pattern Recognition. 8797--8806.
    [32]
    Zhongdao Wang, Luming Tang, Xihui Liu, Zhuliang Yao, Shuai Yi, Jing Shao, Junjie Yan, Shengjin Wang, Hongsheng Li, and Xiaogang Wang. 2017. Orientation Invariant Feature Embedding and Spatial Temporal Regularization for Vehicle Re-identification. In IEEE International Conference on Computer Vision. 379--387.
    [33]
    Hongtao Xie, Zhendong Mao, Yongdong Zhang, Han Deng, Chenggang Yan, and Zhineng Chen. 2019. Double-Bit Quantization and Index Hashing for Nearest Neighbor Search. IEEE Trans. Multimedia, Vol. 21, 5 (2019), 1248--1260.
    [34]
    Kota Yamaguchi, M. Hadi Kiapour, and Tamara L. Berg. 2013. Paper Doll Parsing: Retrieving Similar Styles to Parse Clothing Items. In IEEE International Conference on Computer Vision. 3519--3526.
    [35]
    Kota Yamaguchi, M. Hadi Kiapour, Luis E. Ortiz, and Tamara L. Berg. 2012. Parsing clothing in fashion photographs. In IEEE Conference on Computer Vision and Pattern Recognition. 3570--3577.
    [36]
    Chenggang Yan, Liang Li, Chunjie Zhang, Bingtao Liu, Yongdong Zhang, and Qionghai Dai. 2019. Cross-Modality Bridging and Knowledge Transferring for Image Understanding. IEEE Trans. Multimedia, Vol. 21, 10 (2019), 2675--2685.
    [37]
    Linjie Yang, Ping Luo, Chen Change Loy, and Xiaoou Tang. 2015. A large-scale car dataset for fine-grained categorization and verification. In IEEE Conference on Computer Vision and Pattern Recognition. 3973--3981.
    [38]
    Runhao Zeng, Wenbing Huang, Chuang Gan, Mingkui Tan, Yu Rong, Peilin Zhao, and Junzhou Huang. 2019. Graph Convolutional Networks for Temporal Action Localization. In IEEE International Conference on Computer Vision. 7093--7102.
    [39]
    Hengshuang Zhao, Jianping Shi, Xiaojuan Qi, Xiaogang Wang, and Jiaya Jia. 2017. Pyramid Scene Parsing Network. In IEEE Conference on Computer Vision and Pattern Recognition. 6230--6239.
    [40]
    Jian Zhao, Jianshu Li, Yu Cheng, Terence Sim, Shuicheng Yan, and Jiashi Feng. 2018. Understanding Humans in Crowded Scenes: Deep Nested Adversarial Learning and A New Benchmark for Multi-Human Parsing. In ACM International Conference on Multimedia. 792--800.
    [41]
    Liang Zheng, Liyue Shen, Lu Tian, Shengjin Wang, Jingdong Wang, and Qi Tian. 2015. Scalable Person Re-identification: A Benchmark. In IEEE International Conference on Computer Vision. 1116--1124.
    [42]
    Liang Zheng, Hengheng Zhang, Shaoyan Sun, Manmohan Chandraker, Yi Yang, and Qi Tian. 2017. Person Re-identification in the Wild. In IEEE Conference on Computer Vision and Pattern Recognition. 3346--3355.
    [43]
    Zhun Zhong, Liang Zheng, Guoliang Kang, Shaozi Li, and Yi Yang. 2017. Random Erasing Data Augmentation. CoRR, Vol. abs/1708.04896 (2017).
    [44]
    Yi Zhou and Ling Shao. 2018. Viewpoint-Aware Attentive Multi-View Inference for Vehicle Re-Identification. In IEEE Conference on Computer Vision and Pattern Recognition. 6489--6498.

    Cited By

    View all
    • (2024)Dual-Level Viewpoint-Learning for Cross-Domain Vehicle Re-IdentificationElectronics10.3390/electronics1310182313:10(1823)Online publication date: 8-May-2024
    • (2024)Stripe-Assisted Global Transformer and Spatial–Temporal Enhancement for Vehicle Re-IdentificationApplied Sciences10.3390/app1410396814:10(3968)Online publication date: 7-May-2024
    • (2024)Camera Topology Graph Guided Vehicle Re-IdentificationIEEE Transactions on Multimedia10.1109/TMM.2023.328305426(1565-1577)Online publication date: 2024
    • Show More Cited By

    Index Terms

    1. Beyond the Parts: Learning Multi-view Cross-part Correlation for Vehicle Re-identification

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Conferences
        MM '20: Proceedings of the 28th ACM International Conference on Multimedia
        October 2020
        4889 pages
        ISBN:9781450379885
        DOI:10.1145/3394171
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Sponsors

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 12 October 2020

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. graph convolutional network
        2. image segmentation
        3. self-supervised learning
        4. vehicle parsing
        5. vehicle re-identification

        Qualifiers

        • Research-article

        Funding Sources

        • Beijing Academy of Artificial Intelligence (BAAI)
        • Zhejiang Province Nature Science Foundation of China
        • National Nature Science Foundation of China

        Conference

        MM '20
        Sponsor:

        Acceptance Rates

        Overall Acceptance Rate 995 of 4,171 submissions, 24%

        Upcoming Conference

        MM '24
        The 32nd ACM International Conference on Multimedia
        October 28 - November 1, 2024
        Melbourne , VIC , Australia

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)67
        • Downloads (Last 6 weeks)9

        Other Metrics

        Citations

        Cited By

        View all
        • (2024)Dual-Level Viewpoint-Learning for Cross-Domain Vehicle Re-IdentificationElectronics10.3390/electronics1310182313:10(1823)Online publication date: 8-May-2024
        • (2024)Stripe-Assisted Global Transformer and Spatial–Temporal Enhancement for Vehicle Re-IdentificationApplied Sciences10.3390/app1410396814:10(3968)Online publication date: 7-May-2024
        • (2024)Camera Topology Graph Guided Vehicle Re-IdentificationIEEE Transactions on Multimedia10.1109/TMM.2023.328305426(1565-1577)Online publication date: 2024
        • (2024)A Blockchain-Enabled Distributed System for Trustworthy and Collaborative Intelligent Vehicle Re-IdentificationIEEE Transactions on Intelligent Vehicles10.1109/TIV.2023.33472679:2(3271-3282)Online publication date: Feb-2024
        • (2024)Semantic-Oriented Feature Coupling Transformer for Vehicle Re-Identification in Intelligent Transportation SystemIEEE Transactions on Intelligent Transportation Systems10.1109/TITS.2023.325787325:3(2803-2813)Online publication date: Mar-2024
        • (2024)Distributed Collaborative Object Retrieval With Blockchain-Based Edge ComputingIEEE Transactions on Industrial Informatics10.1109/TII.2024.337199920:6(8729-8738)Online publication date: Jun-2024
        • (2024)Discriminative Feature Learning With Co-Occurrence Attention Network for Vehicle ReIDIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2023.332637534:5(3510-3522)Online publication date: May-2024
        • (2024)Multi-view cognition with path search for one-shot part labelingComputer Vision and Image Understanding10.1016/j.cviu.2024.104015244(104015)Online publication date: Jul-2024
        • (2023)Identity-Guided Spatial Attention for Vehicle Re-IdentificationSensors10.3390/s2311515223:11(5152)Online publication date: 28-May-2023
        • (2023)Multi-UAV Collaborative Absolute Vision Positioning and Navigation: A Survey and DiscussionDrones10.3390/drones70402617:4(261)Online publication date: 11-Apr-2023
        • Show More Cited By

        View Options

        Get Access

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media