skip to main content
10.1145/3581783.3611840acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

LandmarkGait: Intrinsic Human Parsing for Gait Recognition

Published:27 October 2023Publication History

ABSTRACT

Gait recognition is an emerging biometric technology for identifying pedestrians based on their unique walking patterns. In past gait recognition, global-based methods are inadequate to meet the growing demand for accuracy, while commonly used part-based methods provided coarse and inaccurate feature representation for specific body parts. Human parsing appears to be a better option for accurately representing specific and complete body parts in gait recognition. However, its practical application in gait recognition is often hindered by missing RGB modality, lack of annotated body parts, and difficulty in balancing parsing quantity and quality. To address this issue, we propose LandmarkGait, an accessible and alternative parsing-based solution for gait recognition. LandmarkGait introduces an unsupervised landmark discovery network to transform the dense silhouette into a finite set of landmarks with remarkable consistency across various conditions. By grouping landmarks subsets corresponding to distinct body part regions, following a reconstruction task and further refinement from high-quality input silhouettes, we can directly obtain fine-grained parsing results from original binary silhouettes in an unsupervised manner. Moreover, we also develop a multi-scale feature extractor that simultaneously captures global and parsing feature representations based on the integrity and flexibility of specific body parts. Extensive experiments demonstrate that our LandmarkGait can extract more stable features and exhibit significant performance improvement under all conditions, especially in various dressing conditions. Code is available at https://github.com/wzb-bupt/LandmarkGait.

References

  1. Hanqing Chao, Yiwei He, Junping Zhang, and Jianfeng Feng. 2019. Gaitset: Regarding gait as a set for cross-view gait recognition. In Proceedings of the AAAI Conference on Artificial Intelligence. 8126--8133.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Bowen Cheng, Omkar Parkhi, and Alexander Kirillov. 2022. Pointly-supervised instance segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2617--2626.Google ScholarGoogle ScholarCross RefCross Ref
  3. Xiaohan Ding, Xiangyu Zhang, Jungong Han, and Guiguang Ding. 2021. Diverse branch block: Building a convolution as an inception-like unit. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 10886--10895.Google ScholarGoogle ScholarCross RefCross Ref
  4. Chao Fan, Junhao Liang, Chuanfu Shen, Saihui Hou, Yongzhen Huang, and Shiqi Yu. 2023. OpenGait: Revisiting Gait Recognition Towards Better Practicality. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 9707--9716.Google ScholarGoogle ScholarCross RefCross Ref
  5. Chao Fan, Yunjie Peng, Chunshui Cao, Xu Liu, Saihui Hou, Jiannan Chi, Yongzhen Huang, Qing Li, and Zhiqiang He. 2020. Gaitpart: Temporal part-based model for gait recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 14225--14233.Google ScholarGoogle ScholarCross RefCross Ref
  6. Junsong Fan, Zhaoxiang Zhang, and Tieniu Tan. 2022. Pointly-supervised panoptic segmentation. In Proceedings of the European Conference on Computer Vision. Springer, 319--336.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Yang Fu, Shibei Meng, Saihui Hou, Xuecai Hu, and Yongzhen Huang. 2023. GPGait: Generalized Pose-based Gait Recognition. arXiv preprint arXiv:2303.05234 (2023).Google ScholarGoogle Scholar
  8. Ke Gong, Xiaodan Liang, Dongyu Zhang, Xiaohui Shen, and Liang Lin. 2017. Look into person: Self-supervised structure-sensitive learning and a new benchmark for human parsing. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 932--940.Google ScholarGoogle ScholarCross RefCross Ref
  9. Xinqian Gu, Hong Chang, Bingpeng Ma, Shutao Bai, Shiguang Shan, and Xilin Chen. 2022. Clothes-Changing person re-identification with RGB modality only. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 1060--1069.Google ScholarGoogle ScholarCross RefCross Ref
  10. Xiao Han, Peishan Cong, Lan Xu, Jingya Wang, Jingyi Yu, and Yuexin Ma. 2022. LiCamGait: Gait recognition in the wild by using LiDAR and camera multi-modal visual sensors. arXiv preprint arXiv:2211.12371 (2022).Google ScholarGoogle Scholar
  11. Kaiming He, Georgia Gkioxari, Piotr Dollár, and Ross Girshick. 2017. Mask r-cnn. In Proceedings of the IEEE International Conference on Computer Vision. 2961--2969.Google ScholarGoogle ScholarCross RefCross Ref
  12. Saihui Hou, Chunshui Cao, Xu Liu, and Yongzhen Huang. 2020. Gait lateral network: Learning discriminative and compact representations for gait recognition. In Proceedings of the European Conference on Computer Vision. Springer, 382--398.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Saihui Hou, Xu Liu, Chunshui Cao, and Yongzhen Huang. 2021. Set residual network for silhouette-based gait recognition. IEEE Transactions on Biometrics, Behavior, and Identity Science, Vol. 3, 3 (2021), 384--393.Google ScholarGoogle ScholarCross RefCross Ref
  14. Saihui Hou, Xu Liu, Chunshui Cao, and Yongzhen Huang. 2022. Gait quality aware network: toward the interpretability of silhouette-based gait recognition. IEEE Transactions on Neural Networks and Learning Systems (2022).Google ScholarGoogle Scholar
  15. Hung-Min Hsu, Yizhou Wang, Cheng-Yen Yang, Jenq-Neng Hwang, Hoang Le Uyen Thuc, and Kwang-Ju Kim. 2022. GAITTAKE: Gait recognition by temporal attention and keypoint-guided embedding. In IEEE International Conference on Image Processing. 2546--2550.Google ScholarGoogle ScholarCross RefCross Ref
  16. Jie Hu, Li Shen, and Gang Sun. 2018. Squeeze-and-excitation networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 7132--7141.Google ScholarGoogle ScholarCross RefCross Ref
  17. Xiaohu Huang, Duowang Zhu, Hao Wang, Xinggang Wang, Bo Yang, Botao He, Wenyu Liu, and Bin Feng. 2021b. Context-sensitive temporal feature learning for gait recognition. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 12909--12918.Google ScholarGoogle ScholarCross RefCross Ref
  18. Zhen Huang, Dixiu Xue, Xu Shen, Xinmei Tian, Houqiang Li, Jianqiang Huang, and Xian-Sheng Hua. 2021a. 3D local convolutional neural networks for gait recognition. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 14920--14929.Google ScholarGoogle ScholarCross RefCross Ref
  19. Tomas Jakab, Ankush Gupta, Hakan Bilen, and Andrea Vedaldi. 2018. Unsupervised learning of object landmarks through conditional image generation. Advances in Neural Information Processing Systems, Vol. 31 (2018).Google ScholarGoogle Scholar
  20. Zhenchao Jin, Bin Liu, Qi Chu, and Nenghai Yu. 2021. ISNet: Integrate image-level and semantic-level context for semantic segmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 7189--7198.Google ScholarGoogle ScholarCross RefCross Ref
  21. Alexander Kirillov, Eric Mintun, Nikhila Ravi, Hanzi Mao, Chloe Rolland, Laura Gustafson, Tete Xiao, Spencer Whitehead, Alexander C Berg, Wan-Yen Lo, et al. 2023. Segment anything. arXiv preprint arXiv:2304.02643 (2023).Google ScholarGoogle Scholar
  22. Tejas D Kulkarni, Ankush Gupta, Catalin Ionescu, Sebastian Borgeaud, Malcolm Reynolds, Andrew Zisserman, and Volodymyr Mnih. 2019a. Unsupervised learning of object keypoints for perception and control. Advances in Neural Information Processing Systems, Vol. 32 (2019).Google ScholarGoogle Scholar
  23. Tejas D Kulkarni, Ankush Gupta, Catalin Ionescu, Sebastian Borgeaud, Malcolm Reynolds, Andrew Zisserman, and Volodymyr Mnih. 2019b. Unsupervised learning of object keypoints for perception and control. Advances in Neural Information Processing Systems, Vol. 32 (2019).Google ScholarGoogle Scholar
  24. Guodong Li, Lijun Guo, Rong Zhang, Jiangbo Qian, and Shangce Gao. 2023 a. TransGait: Multimodal-based gait recognition with set transformer. Applied Intelligence, Vol. 53, 2 (2023), 1535--1547.Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Peike Li, Yunqiu Xu, Yunchao Wei, and Yi Yang. 2020. Self-correction for human parsing. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 44, 6 (2020), 3260--3271.Google ScholarGoogle ScholarCross RefCross Ref
  26. Weijia Li, Saihui Hou, Chunjie Zhang, Chunshui Cao, Xu Liu, Yongzhen Huang, and Yao Zhao. 2023 b. An in-depth exploration of person re-identification and gait recognition in cloth-changing conditions. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.Google ScholarGoogle ScholarCross RefCross Ref
  27. Xiaodan Liang, Si Liu, Xiaohui Shen, Jianchao Yang, Luoqi Liu, Jian Dong, Liang Lin, and Shuicheng Yan. 2015. Deep human parsing with active template regression. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 37, 12 (2015), 2402--2414.Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Rijun Liao, Shiqi Yu, Weizhi An, and Yongzhen Huang. 2020. A model-based gait recognition method with body pose and human prior knowledge. Pattern Recognition, Vol. 98 (2020), 107069.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Beibei Lin, Shunli Zhang, and Feng Bao. 2020. Gait recognition with multiple-temporal-scale 3d convolutional neural network. In Proceedings of the 28th ACM International Conference on Multimedia. 3054--3062.Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Beibei Lin, Shunli Zhang, and Xin Yu. 2021. Gait recognition via effective global-local feature representation and local temporal aggregation. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 14648--14656.Google ScholarGoogle ScholarCross RefCross Ref
  31. Kunliang Liu, Ouk Choi, Jianming Wang, and Wonjun Hwang. 2022. Cdgnet: Class distribution guided network for human parsing. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 4473--4482.Google ScholarGoogle ScholarCross RefCross Ref
  32. Xin Liu, Chen Zhao, Bin Zheng, Qinwei Guo, Xiaoqin Duan, Aziguli Wulamu, and Dezheng Zhang. 2021. Wearable devices for gait analysis in intelligent healthcare. Frontiers in Computer Science, Vol. 3 (2021), 661676.Google ScholarGoogle ScholarCross RefCross Ref
  33. Hao Luo, Youzhi Gu, Xingyu Liao, Shenqi Lai, and Wei Jiang. 2019. Bag of tricks and a strong baseline for deep person re-identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops.Google ScholarGoogle ScholarCross RefCross Ref
  34. Kang Ma, Ying Fu, Dezhi Zheng, Chunshui Cao, Xuecai Hu, and Yongzhen Huang. 2023. Dynamic Aggregated Network for Gait Recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 22076--22085.Google ScholarGoogle ScholarCross RefCross Ref
  35. Yasushi Makihara, Hidetoshi Mannami, Akira Tsuji, Md Altab Hossain, Kazushige Sugiura, Atsushi Mori, and Yasushi Yagi. 2012. The OU-ISIR gait database comprising the treadmill dataset. IPSJ Transactions on Computer Vision and Applications, Vol. 4 (2012), 53--62.Google ScholarGoogle ScholarCross RefCross Ref
  36. Dimitrios Mallis, Enrique Sanchez, Matthew Bell, and Georgios Tzimiropoulos. 2020. Unsupervised learning of object landmarks via self-training correspondence. Advances in Neural Information Processing Systems, Vol. 33 (2020), 4709--4720.Google ScholarGoogle Scholar
  37. Honghu Pan, Yongyong Chen, Tingyang Xu, Yunqi He, and Zhenyu He. 2023. Toward complete-view and high-level pose-based gait recognition. IEEE Transactions on Information Forensics and Security, Vol. 18 (2023), 2104--2118.Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Yunjie Peng, Kang Ma, Yang Zhang, and Zhiqiang He. 2023. Learning rich features for gait recognition by integrating skeletons and silhouettes. Multimedia Tools and Applications (2023), 1--22.Google ScholarGoogle Scholar
  39. Xuqian Ren, Saihui Hou, Chunshui Cao, Xu Liu, and Yongzhen Huang. 2022. Progressive feature learning for realistic cloth-changing gait recognition. arXiv preprint arXiv:2207.11720 (2022).Google ScholarGoogle Scholar
  40. Imad Rida, Noor Almaadeed, and Somaya Almaadeed. 2019. Robust gait recognition: a comprehensive survey. IET Biometrics, Vol. 8, 1 (2019), 14--28.Google ScholarGoogle ScholarCross RefCross Ref
  41. Enrique Sanchez and Georgios Tzimiropoulos. 2019. Object landmark discovery through unsupervised adaptation. Advances in Neural Information Processing Systems, Vol. 32 (2019).Google ScholarGoogle Scholar
  42. Alireza Sepas-Moghaddam and Ali Etemad. 2022. Deep gait recognition: A survey. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 45, 1 (2022), 264--284.Google ScholarGoogle ScholarCross RefCross Ref
  43. Chuanfu Shen, Chao Fan, Wei Wu, Rui Wang, George Q Huang, and Shiqi Yu. 2023. LidarGait: Benchmarking 3D Gait Recognition With Point Clouds. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 1054--1063.Google ScholarGoogle ScholarCross RefCross Ref
  44. Zhixin Shu, Mihir Sahasrabudhe, Riza Alp Guler, Dimitris Samaras, Nikos Paragios, and Iasonas Kokkinos. 2018. Deforming autoencoders: Unsupervised disentangling of shape and appearance. In Proceedings of the European Conference on Computer Vision. 650--665.Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. Jamie D Shutler, Michael G Grant, Mark S Nixon, and John N Carter. 2004. On a large sequence-based human gait database. In Applications and Science in Soft Computing. Springer, 339--346.Google ScholarGoogle Scholar
  46. Chunfeng Song, Yongzhen Huang, Yan Huang, Ning Jia, and Liang Wang. 2019. Gaitnet: An end-to-end network for gait based human identification. Pattern recognition, Vol. 96 (2019), 106988.Google ScholarGoogle Scholar
  47. Christian Szegedy, Sergey Ioffe, Vincent Vanhoucke, and Alexander Alemi. 2017. Inception-v4, inception-resnet and the impact of residual connections on learning. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 31.Google ScholarGoogle ScholarCross RefCross Ref
  48. Noriko Takemura, Yasushi Makihara, Daigo Muramatsu, Tomio Echigo, and Yasushi Yagi. 2018. Multi-view large population gait dataset and its performance evaluation for cross-view gait recognition. IPSJ transactions on Computer Vision and Applications, Vol. 10 (2018), 1--14.Google ScholarGoogle Scholar
  49. Torben Teepe, Johannes Gilg, Fabian Herzog, Stefan Hörmann, and Gerhard Rigoll. 2022. Towards a deeper understanding of skeleton-based gait recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 1569--1577.Google ScholarGoogle ScholarCross RefCross Ref
  50. Torben Teepe, Ali Khan, Johannes Gilg, Fabian Herzog, Stefan Hörmann, and Gerhard Rigoll. 2021. Gaitgraph: Graph convolutional network for skeleton-based gait recognition. In IEEE International Conference on Image Processing. 2314--2318.Google ScholarGoogle ScholarCross RefCross Ref
  51. James Thewlis, Hakan Bilen, and Andrea Vedaldi. 2017. Unsupervised learning of object landmarks by factorized spatial embeddings. In Proceedings of the IEEE International Conference on Computer Vision. 5916--5925.Google ScholarGoogle ScholarCross RefCross Ref
  52. Changsheng Wan, Li Wang, and Vir V Phoha. 2018. A survey on gait recognition. Comput. Surveys, Vol. 51, 5 (2018), 1--35.Google ScholarGoogle ScholarDigital LibraryDigital Library
  53. Ming Wang, Xianda Guo, Beibei Lin, Tian Yang, Zheng Zhu, Lincheng Li, Shunli Zhang, and Xin Yu. 2023. DyGait: Exploiting Dynamic Representations for High-performance Gait Recognition. arXiv preprint arXiv:2303.14953 (2023).Google ScholarGoogle Scholar
  54. Meng Ye, Mikael Kanski, Dong Yang, Qi Chang, Zhennan Yan, Qiaoying Huang, Leon Axel, and Dimitris Metaxas. 2021. Deeptag: An unsupervised deep learning method for motion tracking on cardiac tagging magnetic resonance images. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 7261--7271.Google ScholarGoogle ScholarCross RefCross Ref
  55. Shiqi Yu, Daoliang Tan, and Tieniu Tan. 2006. A framework for evaluating the effect of view angle, clothing and carrying condition on gait recognition. In International Conference on Pattern Recognition, Vol. 4. IEEE, 441--444.Google ScholarGoogle Scholar
  56. Lazaros Zafeiriou, Epameinondas Antonakos, Stefanos Zafeiriou, and Maja Pantic. 2014. Joint unsupervised face alignment and behaviour analysis. In Proceedings of the European Conference on Computer Vision. Springer, 167--183.Google ScholarGoogle ScholarCross RefCross Ref
  57. Dingwen Zhang, Junwei Han, and Yu Zhang. 2017. Supervision by fusion: Towards unsupervised learning of deep salient object detector. In Proceedings of the IEEE International Conference on Computer Vision. 4048--4056.Google ScholarGoogle ScholarCross RefCross Ref
  58. Yuting Zhang, Yijie Guo, Yixin Jin, Yijun Luo, Zhiyuan He, and Honglak Lee. 2018. Unsupervised discovery of object landmarks as structural representations. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2694--2703.Google ScholarGoogle ScholarCross RefCross Ref
  59. Ziyuan Zhang, Luan Tran, Feng Liu, and Xiaoming Liu. 2020. On learning disentangled representations for gait recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence (2020).Google ScholarGoogle Scholar
  60. Jinkai Zheng, Xinchen Liu, Xiaoyan Gu, Yaoqi Sun, Chuang Gan, Jiyong Zhang, Wu Liu, and Chenggang Yan. 2022a. Gait recognition in the wild with multi-hop temporal switch. In Proceedings of the 30th ACM International Conference on Multimedia. 6136--6145.Google ScholarGoogle ScholarDigital LibraryDigital Library
  61. Jinkai Zheng, Xinchen Liu, Wu Liu, Lingxiao He, Chenggang Yan, and Tao Mei. 2022b. Gait recognition in the wild with dense 3d representations and a benchmark. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 20228--20237.Google ScholarGoogle ScholarCross RefCross Ref
  62. Yongsheng Zhu, Fengxin Sun, Changjun Jia, Chaorui Huang, Kuo Wang, Ying Li, Liping Chou, and Yupeng Mao. 2022. A 3D printing triboelectric sensor for gait analysis and virtual control based on human-computer interaction and the internet of things. Sustainability, Vol. 14, 17 (2022), 10875.Google ScholarGoogle ScholarCross RefCross Ref
  63. Zheng Zhu, Xianda Guo, Tian Yang, Junjie Huang, Jiankang Deng, Guan Huang, Dalong Du, Jiwen Lu, and Jie Zhou. 2021. Gait recognition in the wild: A benchmark. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 14789--14799.Google ScholarGoogle Scholar

Index Terms

  1. LandmarkGait: Intrinsic Human Parsing for Gait Recognition

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      MM '23: Proceedings of the 31st ACM International Conference on Multimedia
      October 2023
      9913 pages
      ISBN:9798400701085
      DOI:10.1145/3581783

      Copyright © 2023 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 27 October 2023

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate995of4,171submissions,24%

      Upcoming Conference

      MM '24
      MM '24: The 32nd ACM International Conference on Multimedia
      October 28 - November 1, 2024
      Melbourne , VIC , Australia
    • Article Metrics

      • Downloads (Last 12 months)221
      • Downloads (Last 6 weeks)32

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader