Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3394171.3413861acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Gait Recognition with Multiple-Temporal-Scale 3D Convolutional Neural Network

Published: 12 October 2020 Publication History
  • Get Citation Alerts
  • Abstract

    Gait recognition which is one of the most important and effective biometric technologies has a significant advantage in long-distance recognition systems. For existing gait recognition methods, the template-based approaches may lose temporal information, while the sequence-based methods cannot fully exploit the temporal relations among the sequence. To address the above issues, we propose a novel multiple-temporal-scale gait recognition framework which integrates the temporal information in multiple temporal scales, making use of both the frame and interval fusion information. Moreover, the interval-level representation is realized by a local transformation module. Concretely, 3D convolution neural network (3D CNN) is applied in both the small and the large temporal scales to extract the spatial-temporal information. Moreover, a frame pooling method is developed to address the mismatch of the input of 3D network and video frames, and a novel 3D basic network block is designed to improve efficiency. Experiments demonstrate that the multiple-temporal-scale 3D CNN based gait recognition method can achieve better performance than most recent state-of-the-art methods in CASIA-B dataset. The proposed method obtains the rank-1 accuracy with 96.7% under normal condition, and outperforms other methods on average accuracy by at least 5.8% and 11.1%, respectively, in complex scenarios.

    Supplementary Material

    MP4 File (3394171.3413861.mp4)
    This video is used to describe the paper, which is titled ?Gait Recognition with Multiple-Temporal-Scale 3D Convolutional Network?. In this video, we first introduce the concept of gait recognition and recent gait recognition methods. Then, we overview the framework of the proposed method, MT3D. Next, we describe the key components of the proposed method, including two-branch structure and BasicBlock3D. Finally, we show the performance of MT3D on two popular datasets.

    References

    [1]
    Gunawan Ariyanto and Mark S Nixon. 2011. Model-based 3D gait biometrics. In 2011 International Joint Conference on Biometrics (IJCB). IEEE, 1--7.
    [2]
    Hanqing Chao, Yiwei He, Junping Zhang, and Jianfeng Feng. 2019. Gaitset: Regarding gait as a set for cross-view gait recognition. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 8126--8133.
    [3]
    Patrick Connor and Arun Ross. 2018. Biometric recognition by gait: A survey of modalities and features. Computer Vision and Image Understanding, Vol. 167 (2018), 1--27.
    [4]
    Yang Fu, Yunchao Wei, Yuqian Zhou, Honghui Shi, Gao Huang, Xinchao Wang, Zhiqiang Yao, and Thomas Huang. 2019. Horizontal pyramid matching for person re-identification. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 8295--8302.
    [5]
    Michela Goffredo, Imed Bouchrika, John N Carter, and Mark S Nixon. 2009. Self-calibrating view-invariant gait biometrics. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), Vol. 40, 4 (2009), 997--1008.
    [6]
    Ju Han, Bir Bhanu, and Amit K Roy-Chowdhury. 2005. A study on view-insensitive gait recognition. In IEEE International Conference on Image Processing 2005, Vol. 3. IEEE, III--297.
    [7]
    Kensho Hara, Hirokatsu Kataoka, and Yutaka Satoh. 2018. Can spatiotemporal 3d cnns retrace the history of 2d cnns and imagenet?. In Proceedings of the IEEE conference on Computer Vision and Pattern Recognition. 6546--6555.
    [8]
    Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition. 770--778.
    [9]
    Yiwei He, Junping Zhang, Hongming Shan, and Liang Wang. 2018. Multi-task gans for view-specific feature learning in gait recognition. IEEE Transactions on Information Forensics and Security, Vol. 14, 1 (2018), 102--113.
    [10]
    Alexander Hermans, Lucas Beyer, and Bastian Leibe. 2017. In defense of the triplet loss for person re-identification. arXiv preprint arXiv:1703.07737 (2017).
    [11]
    Maodi Hu, Yunhong Wang, Zhaoxiang Zhang, James J Little, and Di Huang. 2013. View-invariant discriminative projection for multi-view gait-based human identification. IEEE Transactions on Information Forensics and Security, Vol. 8, 12 (2013), 2034--2045.
    [12]
    Gao Huang, Zhuang Liu, Laurens Van Der Maaten, and Kilian Q Weinberger. 2017. Densely connected convolutional networks. In Proceedings of the IEEE conference on computer vision and pattern recognition. 4700--4708.
    [13]
    Haruyuki Iwama, Mayu Okumura, Yasushi Makihara, and Yasushi Yagi. 2012. The ou-isir gait database comprising the large population dataset and performance evaluation of gait recognition. IEEE Transactions on Information Forensics and Security, Vol. 7, 5 (2012), 1511--1521.
    [14]
    Frédéric Jean, Robert Bergevin, and Alexandra Branzan Albu. 2009. Computing and evaluating view-normalized body part trajectories. Image and Vision Computing, Vol. 27, 9 (2009), 1272--1284.
    [15]
    Shuiwang Ji, Wei Xu, Ming Yang, and Kai Yu. 2012. 3D convolutional neural networks for human action recognition. IEEE transactions on pattern analysis and machine intelligence, Vol. 35, 1 (2012), 221--231.
    [16]
    Worapan Kusakunniran, Qiang Wu, Hongdong Li, and Jian Zhang. 2009. Multiple views gait recognition using view transformation model based on optimized gait energy image. In 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops. IEEE, 1058--1064.
    [17]
    Worapan Kusakunniran, Qiang Wu, Jian Zhang, Hongdong Li, and Liang Wang. 2013a. Recognizing gaits across views through correlated motion co-clustering. IEEE Transactions on Image Processing, Vol. 23, 2 (2013), 696--709.
    [18]
    Worapan Kusakunniran, Qiang Wu, Jian Zhang, Yi Ma, and Hongdong Li. 2013b. A new view-invariant feature for cross-view gait recognition. IEEE Transactions on Information Forensics and Security, Vol. 8, 10 (2013), 1642--1653.
    [19]
    Nini Liu, Jiwen Lu, and Yap-Peng Tan. 2011. Joint subspace learning for view-invariant gait recognition. IEEE Signal Processing Letters, Vol. 18, 7 (2011), 431--434.
    [20]
    Yasushi Makihara, Ryusuke Sagawa, Yasuhiro Mukaigawa, Tomio Echigo, and Yasushi Yagi. 2006. Gait recognition using a view transformation model in the frequency domain. In European Conference on Computer Vision. Springer, 151--163.
    [21]
    Daigo Muramatsu, Akira Shiraishi, Yasushi Makihara, Md Zasim Uddin, and Yasushi Yagi. 2014. Gait-based person recognition using arbitrary view transformation model. IEEE Transactions on Image Processing, Vol. 24, 1 (2014), 140--154.
    [22]
    Zhaofan Qiu, Ting Yao, and Tao Mei. 2017. Learning spatio-temporal representation with pseudo-3d residual networks. In proceedings of the IEEE International Conference on Computer Vision. 5533--5541.
    [23]
    Kohei Shiraga, Yasushi Makihara, Daigo Muramatsu, Tomio Echigo, and Yasushi Yagi. 2016. Geinet: View-invariant gait recognition using a convolutional neural network. In 2016 international conference on biometrics (ICB). IEEE, 1--8.
    [24]
    Du Tran, Lubomir Bourdev, Rob Fergus, Lorenzo Torresani, and Manohar Paluri. 2015. Learning spatiotemporal features with 3d convolutional networks. In Proceedings of the IEEE international conference on computer vision. 4489--4497.
    [25]
    Du Tran, Heng Wang, Lorenzo Torresani, Jamie Ray, Yann LeCun, and Manohar Paluri. 2018. A closer look at spatiotemporal convolutions for action recognition. In Proceedings of the IEEE conference on Computer Vision and Pattern Recognition. 6450--6459.
    [26]
    Thomas Wolf, Mohammadreza Babaee, and Gerhard Rigoll. 2016. Multi-view gait recognition using 3D convolutional neural networks. In 2016 IEEE International Conference on Image Processing (ICIP). IEEE, 4165--4169.
    [27]
    Zifeng Wu, Yongzhen Huang, Liang Wang, Xiaogang Wang, and Tieniu Tan. 2016. A comprehensive study on cross-view gait based human identification with deep cnns. IEEE transactions on pattern analysis and machine intelligence, Vol. 39, 2 (2016), 209--226.
    [28]
    TzeWei Yeoh, Hernán E Aguirre, and Kiyoshi Tanaka. 2016. Clothing-invariant gait recognition using convolutional neural network. In 2016 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS). IEEE, 1--5.
    [29]
    Shiqi Yu, Haifeng Chen, Qing Wang, Linlin Shen, and Yongzhen Huang. 2017. Invariant feature extraction for gait recognition using only one uniform model. Neurocomputing, Vol. 239 (2017), 81--93.
    [30]
    Shiqi Yu, Daoliang Tan, and Tieniu Tan. 2006. A framework for evaluating the effect of view angle, clothing and carrying condition on gait recognition. In 18th International Conference on Pattern Recognition (ICPR'06), Vol. 4. IEEE, 441--444.
    [31]
    Yuqi Zhang, Yongzhen Huang, Shiqi Yu, and Liang Wang. 2019. Cross-View Gait Recognition by Discriminative Feature Learning. IEEE Transactions on Image Processing, Vol. 29 (2019), 1001--1015.
    [32]
    Guoying Zhao, Guoyi Liu, Hua Li, and Matti Pietikainen. 2006. 3D gait recognition using multiple cameras. In 7th International Conference on Automatic Face and Gesture Recognition (FGR06). IEEE, 529--534.

    Index Terms

    1. Gait Recognition with Multiple-Temporal-Scale 3D Convolutional Neural Network

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      MM '20: Proceedings of the 28th ACM International Conference on Multimedia
      October 2020
      4889 pages
      ISBN:9781450379885
      DOI:10.1145/3394171
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 12 October 2020

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. gait recognition
      2. local transform
      3. spatial-temporal features

      Qualifiers

      • Research-article

      Funding Sources

      • National Natural Science Foundation of China
      • Fundamental Research Funds for the Central Universities
      • Beijing Natural Science Foundation

      Conference

      MM '20
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 995 of 4,171 submissions, 24%

      Upcoming Conference

      MM '24
      The 32nd ACM International Conference on Multimedia
      October 28 - November 1, 2024
      Melbourne , VIC , Australia

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)151
      • Downloads (Last 6 weeks)18

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Gait image spatio-temporal restoration network and its application under occlusion conditionsJournal of Image and Graphics10.11834/jig.22114229:1(179-191)Online publication date: 2024
      • (2024)Watch Where You Head: A View-biased Domain Gap in Gait Recognition and Unsupervised Adaptation2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)10.1109/WACV57701.2024.00600(6097-6107)Online publication date: 3-Jan-2024
      • (2024)Gait Recognition With Multi-Level Skeleton-Guided RefinementIEEE Transactions on Multimedia10.1109/TMM.2023.332388726(4515-4526)Online publication date: 1-Jan-2024
      • (2024)GaitMPL: Gait Recognition With Memory-Augmented Progressive LearningIEEE Transactions on Image Processing10.1109/TIP.2022.316454333(1464-1475)Online publication date: 2024
      • (2024)Integral Pose Learning via Appearance Transfer for Gait RecognitionIEEE Transactions on Information Forensics and Security10.1109/TIFS.2024.338260619(4716-4727)Online publication date: 2024
      • (2024)Cloth-Imbalanced Gait Recognition via HallucinationIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2024.336023234:7(5665-5676)Online publication date: Jul-2024
      • (2024)GaitNet+ARL: A Deep Learning Algorithm for Interpretable Gait Analysis of Chronic Ankle InstabilityIEEE Journal of Biomedical and Health Informatics10.1109/JBHI.2024.338358828:7(3918-3927)Online publication date: Jul-2024
      • Show More Cited By

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media