skip to main content
10.1145/3628797.3629013acmotherconferencesArticle/Chapter ViewAbstractPublication PagessoictConference Proceedingsconference-collections
research-article

Resnet Video 3D for Gait Retrieval: A Deep Learning Approach to Human Identification

Published: 07 December 2023 Publication History

Abstract

Gait, the distinctive way a person walks, is a useful biometric trait for various applications such as crime prevention, forensic identification, and social security. Gait retrieval, which aims to find the person who matches a given gait, is an active research area, its research has drawn a significant increase. However, learning discriminative temporal features from gait data is difficult due to the subtle variations in the spatial domain of the silhouette. Recent deep learning methods have demonstrated their effectiveness for gait retrieval by learning more robust features from raw video data. In this paper, we propose a baseline network based on ResNet video R3D-18, which can capture both spatial and temporal information from the data, to address the gait retrieval problem. Our experimental results show that our optimized backbone network can extract powerful vector representations of gait and achieve high performance in retrieving the person who matches the gait from the database. On CASIA-B dataset, we obtain Rank-1 accuracy of 97.09% and Rank-10 accuracy of 99.27% under normal walking condition. The source code will be available at.

References

[1]
Hanqing Chao, Kun Wang, Yiwei He, Junping Zhang, and Jianfeng Feng. 2021. GaitSet: Cross-view Gait Recognition through Utilizing Gait as a Deep Set. CoRR abs/2102.03247 (2021).
[2]
Ju Han and Bir Bhanu. 2006. "Individual Recognition Using Gait Energy Image". IEEE transactions on pattern analysis and machine intelligence 28 (03 2006), 316–22. https://doi.org/10.1109/TPAMI.2006.38
[3]
Kensho Hara, Hirokatsu Kataoka, and Yutaka Satoh. 2017. Learning Spatio-Temporal Features with 3D Residual Networks for Action Recognition. 3154–3160. https://doi.org/10.1109/ICCVW.2017.373
[4]
Fabian Caba Heilbron, Victor Escorcia, Bernard Ghanem, and Juan Carlos Niebles. 2015. ActivityNet: A large-scale video benchmark for human activity understanding. In 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 961–970. https://doi.org/10.1109/CVPR.2015.7298698
[5]
Ebenezer Isaac, Susan Elias, Srinivasan Rajagopalan, and K. Easwarakumar. 2017. View-Invariant Gait Recognition Through Genetic Template Segmentation. IEEE Signal Processing Letters 24 (06 2017), 1188 – 1192. https://doi.org/10.1109/LSP.2017.2715179
[6]
Will Kay, João Carreira, Karen Simonyan, Brian Zhang, Chloe Hillier, Sudheendra Vijayanarasimhan, Fabio Viola, Tim Green, Trevor Back, Paul Natsev, Mustafa Suleyman, and Andrew Zisserman. 2017. The Kinetics Human Action Video Dataset. CoRR abs/1705.06950 (2017).
[7]
Rijun Liao, Chunshui Cao, Edel Garcia, Shiqi Yu, and Yongzhen Huang. 2017. Pose-Based Temporal-Spatial Network (PTSN) for Gait Recognition with Carrying and Clothing Variations. 474–483. https://doi.org/10.1007/978-3-319-69923-3_51
[8]
Du Tran, Heng Wang, Lorenzo Torresani, Jamie Ray, Yann LeCun, and Manohar Paluri. 2017. A Closer Look at Spatiotemporal Convolutions for Action Recognition. CoRR abs/1711.11248 (2017).
[9]
Thomas Wolf, Mohammadreza Babaee, and Gerhard Rigoll. 2016. Multi-view gait recognition using 3D convolutional neural networks. 4165–4169. https://doi.org/10.1109/ICIP.2016.7533144
[10]
Zifeng Wu, Yongzhen Huang, Liang Wang, Xiaogang Wang, and Tieniu Tan. 2016. A Comprehensive Study on Cross-View Gait Based Human Identification with Deep CNNs. IEEE Transactions on Pattern Analysis and Machine Intelligence 39 (03 2016), 1–1. https://doi.org/10.1109/TPAMI.2016.2545669
[11]
Jian Yu, Hu Changhui, Xiao-Yuan Jing, and Yu-Jian Feng. 2020. Deep metric learning with dynamic margin hard sampling loss for face verification. Signal, Image and Video Processing 14 (T 2020). https://doi.org/10.1007/s11760-019-01612-3
[12]
Shiqi Yu, Haifeng Chen, Qing Wang, Linlin Shen, and Yongzhen Huang. 2017. Invariant Feature Extraction for Gait Recognition Using Only One Uniform Model. Neurocomputing 239 (02 2017). https://doi.org/10.1016/j.neucom.2017.02.006
[13]
Shiqi Yu, Daoliang Tan, and Tieniu Tan. 2006. A Framework for Evaluating the Effect of View Angle, Clothing and Carrying Condition on Gait Recognition. Proceedings - International Conference on Pattern Recognition 4, 441–444. https://doi.org/10.1109/ICPR.2006.67

Cited By

View all
  • (2025)A Comprehensive Review of Vision-Based Sensor Systems for Human Gait AnalysisSensors10.3390/s2502049825:2(498)Online publication date: 16-Jan-2025

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
SOICT '23: Proceedings of the 12th International Symposium on Information and Communication Technology
December 2023
1058 pages
ISBN:9798400708916
DOI:10.1145/3628797
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 December 2023

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. 3D CNNs
  2. Gait Recognition
  3. Gait Retrieval
  4. Representation Learning
  5. Resnet Video (R3D)

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

  • University of Science, VNU-HCM

Conference

SOICT 2023

Acceptance Rates

Overall Acceptance Rate 147 of 318 submissions, 46%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)25
  • Downloads (Last 6 weeks)3
Reflects downloads up to 01 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2025)A Comprehensive Review of Vision-Based Sensor Systems for Human Gait AnalysisSensors10.3390/s2502049825:2(498)Online publication date: 16-Jan-2025

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media