
Research on Multi-scale Pedestrian Attribute Recognition Based on Dual Self-attention Mechanism

  • Conference paper
Mobile Networks and Management (MONAMI 2023)

Abstract

Pedestrian attribute recognition is an important area of computer vision research and has attracted growing attention from researchers worldwide because of its wide range of potential applications. However, recognizing pedestrians captured at long distance in real scenes remains challenging: the available information is scarce, feature extraction is incomplete, and attribute recognition accuracy is low. To address these issues, we propose a multi-scale feature fusion network based on a dual self-attention mechanism. The fusion module merges multi-scale features to enable more complete attribute extraction, while the dual self-attention module focuses the network on important regions. On the PA-100K, RAP, and PETA datasets, the proposed method achieves mean accuracies of 81.97%, 81.53%, and 86.37%, respectively. Extensive experiments demonstrate that the proposed method is highly competitive in pedestrian attribute recognition.
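The abstract reports results as mean accuracy. As a minimal sketch, assuming these figures refer to the label-based mean accuracy (mA) commonly reported on PA-100K, RAP, and PETA, the metric averages the recall on positive and negative samples for each attribute and then averages over attributes. The function name and the toy labels below are hypothetical and are not taken from the paper.

```python
def mean_accuracy(y_true, y_pred):
    """Label-based mean accuracy (mA) over binary attributes.

    y_true, y_pred: lists of samples, each a list of 0/1 attribute labels.
    For every attribute, average the recall on positives and the recall on
    negatives, then average that balanced score over all attributes.
    """
    num_attrs = len(y_true[0])
    per_attr = []
    for j in range(num_attrs):
        tp = sum(1 for t, p in zip(y_true, y_pred) if t[j] == 1 and p[j] == 1)
        tn = sum(1 for t, p in zip(y_true, y_pred) if t[j] == 0 and p[j] == 0)
        pos = sum(1 for t in y_true if t[j] == 1)
        neg = len(y_true) - pos
        # Guard against attributes with no positive or no negative samples.
        tpr = tp / pos if pos else 0.0
        tnr = tn / neg if neg else 0.0
        per_attr.append((tpr + tnr) / 2)
    return sum(per_attr) / num_attrs


# Hypothetical toy data: 4 pedestrians, 2 attributes (e.g. "backpack", "hat").
y_true = [[1, 0], [1, 1], [0, 0], [0, 1]]
y_pred = [[1, 0], [0, 1], [0, 0], [0, 0]]
print(f"mA = {mean_accuracy(y_true, y_pred):.2%}")  # prints "mA = 75.00%"
```

Balancing positive and negative recall per attribute keeps rare attributes from being swamped by the majority class, which is why this metric is usually preferred over plain accuracy on imbalanced attribute datasets.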

The Jiangxi Province Office of Education provided funding support for this research.



Acknowledgment

This research received partial support from the National Natural Science Foundation of China (No. 62067003) and the Foundation of Jiangxi Educational Committee (No. GJJ200824).

Corresponding author

Correspondence to He Xiao.


Copyright information

© 2024 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering

About this paper


Cite this paper

Xiao, H., Xie, W., Zhou, Y., Luo, Y., Zhang, R., Xu, X. (2024). Research on Multi-scale Pedestrian Attribute Recognition Based on Dual Self-attention Mechanism. In: Wu, C., Chen, X., Feng, J., Wu, Z. (eds) Mobile Networks and Management. MONAMI 2023. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 559. Springer, Cham. https://doi.org/10.1007/978-3-031-55471-1_16


  • DOI: https://doi.org/10.1007/978-3-031-55471-1_16


  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-55470-4

  • Online ISBN: 978-3-031-55471-1

  • eBook Packages: Computer Science (R0)
