Research on Multi-scale Pedestrian Attribute Recognition Based on Dual Self-attention Mechanism

Xiao, He; Xie, Wenbiao; Zhou, Yang; Luo, Yong; Zhang, Ruoni; Xu, Xiao

doi:10.1007/978-3-031-55471-1_16

He Xiao^19,20,
Wenbiao Xie¹⁹,
Yang Zhou²¹,
Yong Luo²²,
Ruoni Zhang¹⁹ &
…
Xiao Xu¹⁹

Part of the book series: Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering ((LNICST,volume 559))

Included in the following conference series:

International Conference on Mobile Networks and Management

50 Accesses

Abstract

As one of the important fields of computer vision research, pedestrian attribute recognition has gained increasing attention from domestic and foreign researchers due to its huge potential applications. However, obtaining long-distance pedestrian information in actual scenes poses challenges such as lack of information, incomplete feature extraction, and low attribute recognition accuracy. To address these issues, we propose a multi-scale feature fusion network based on a dual self-attention mechanism. The fusion module merges multi-scale features to enable more complete attribute extraction, while the dual self-attention module focuses the network on important regions. Experimental results on PA-100K, RAP, and PETA datasets achieved mean accuracies of 81.97%, 81.53%, and 86.37%, respectively. Extensive experiments demonstrate that the proposed method is highly competitive in pedestrian attribute recognition.

The Jiangxi Province Office of Education provided funding support for this research.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 59.99; Price excludes VAT (USA)

Softcover Book: USD 79.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Feris, R., Bobbitt, R., Brown, L., Pankanti, S.: Attribute-based people search: lessons learnt from a practical surveillance system. In: Proceedings of the ACM International Conference on Multimedia Retrieval 2014, Glasgow, United Kingdom, pp. 153–160, April 2014
Google Scholar
Lin, Y., Zheng, L., Zheng, Z., Wu, Y., Hu, Z., Yan, C., et al.: Improving person re-identification by attribute and identity learning. Pattern Recognit. 95, 151–161 (2019)
Article Google Scholar
Di, X., Zhang, H., Patel, V.M.: Polarimetric thermal to visible face verification via attribute preserved synthesis. In: 2018 IEEE 9th International Conference on Biometrics Theory, Applications and Systems, United states, October 2018
Google Scholar
Di, X., Riggan, B.S., Hu, S., Short, N.J., Patel, V.M.: Multi-scale thermal to visible face verification via attribute guided synthesis. IEEE Trans. Biom. Behav. Identity Sci. 266–280 (2021)
Google Scholar
Shi, Y., Ling, H., Wu, L., Shen, J., Li, P.: Learning refined attribute-aligned network with attribute selection for person re-identification. Neurocomputing 402, 124–133 (2020)
Article Google Scholar
Li, H., Chen, Y., Tao, D., Yu, Z., Qi, G.: Attribute-aligned domain-invariant feature learning for unsupervised domain adaptation person re-identification. IEEE Trans. Inf. Forensics Secur. 16, 1480–1494 (2021)
Article Google Scholar
Tang, C., Sheng, L., Zhang, Z.-X., Hu, X.: Improving pedestrian attribute recognition with weakly-supervised multi-scale attribute-specific localization. In: 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Korea (South), pp. 4996–5005, October2019
Google Scholar
Zhong, J., Qiao, H., Chen, L., Shang, M., Liu, Q.: Improving pedestrian attribute recognition with multi-scale spatial calibration. In: International Joint Conference on Neural Networks (IJCNN) 2021, pp. 1–8 (2021)
Google Scholar
Liu, Z., Zhang, Z., Li, D., Zhang, P., Shan, C.: Dual-branch self-attention network for pedestrian attribute recognition. Pattern Recognit. Lett. 163, 112–120 (2022)
Article Google Scholar
Fan, Z., Guan, Y.: Pedestrian attribute recognition based on dual self-attention mechanism. Comput. Sci. Inf. Syst. 20, 793–812 (2023)
Article Google Scholar
Wang, X., Zheng, S., Yang, R., Luo, B., Chen, Z., Tang, J.: Pedestrian Attribute Recognition: A (2019)
Google Scholar
Sudowe, P., Spitzer, H., Leibe, B.: Person attribute recognition with a jointly-trained holistic CNN model. In: 2015 IEEE International Conference on Computer Vision Workshop (ICCVW), Santiago, Chile, pp. 329–337, December 2015. Survey. Pattern Recognit., 121, 108220
Google Scholar
Li, D., Chen, X., Huang, K.: Multi-attribute learning for pedestrian attribute recognition in surveillance scenarios. In: 2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR), Kuala Lumpur, Malaysia, pp. 111–115, November 2015
Google Scholar
Abdulnabi, A.H., Wang, G., Lu, J., Jia, K.: Multi-Task CNN model for attribute prediction. IEEE Trans. Multimed. 17, 1949–1959 (2015)
Article Google Scholar
Zhu, J., Liao, S., Yi, D., Lei, Z.: Multi-label CNN based pedestrian attribute learning for soft biometrics. In: 2015 International Conference on Biometrics (ICB), pp. 535–540 (2015)
Google Scholar
Yang, L., Zhu, L., Wei, Y., Liang, S., Tan, P.: Attribute Recognition from Adaptive Parts (2016). ArXiv, abs/1607.01437
Google Scholar
Diba, A., Pazandeh, A.M., Pirsiavash, H., Gool, L.V.: DeepCAMP: deep convolutional action & attribute mid-level patterns. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2016, pp. 3557–3565 (2016)
Google Scholar
Li, D., Chen, X., Zhang, Z., Huang, K.: Pose guided deep model for pedestrian attribute recognition in surveillance scenarios. In: IEEE International Conference on Multimedia and Expo (ICME) 2018, pp. 1–6 (2018)
Google Scholar
Yu, K., Leng, B., Zhang, Z., Li, D., Huang, K.: Weakly-supervised Learning of Mid-level Features for Pedestrian Attribute Recognition and Localization (2016). ArXiv, abs/1611.05603
Google Scholar
Liu, X., et al.: HydraPlus-Net: attentive deep features for pedestrian analysis. In: IEEE International Conference on Computer Vision (ICCV) 2017, pp. 350–359 (2017)
Google Scholar
Sarfraz, M.S., Schumann, A., Wang, Y.: Deep View-Sensitive Pedestrian Attribute Inference in an end-to-end Model (2017). ArXiv, abs/1707.06089
Google Scholar
Guo, H., Fan, X., Wang, S.: Human attribute recognition by refining attention heat map. Pattern Recognit. Lett. 94, 38–45 (2017)
Article Google Scholar
Li, D., Zhang, Z., Chen, X., Ling, H., Huang, K.: A Richly Annotated Dataset for Pedestrian Attribute Recognition (2016). ArXiv, abs/1603.07054
Google Scholar
Deng, Y., Luo, P., Loy, C.C.: Pedestrian attribute recognition at far distance. In: Proceedings of the 22nd ACM International Conference on Multimedia (2014)
Google Scholar
Xie, S., Girshick, R.B., Dollár, P., Tu, Z., He, K.: Aggregated residual transformations for deep neural networks. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2017, pp. 5987–5995 (2016)
Google Scholar
Liu, P., Liu, X., Yan, J., Shao, J.: Localization Guided Learning for Pedestrian Attribute Recognition (2018). ArXiv, abs/1808.09102
Google Scholar
Han, K., Wang, Y., Shu, H., Liu, C., Xu, C., Xu, C.: Attribute Aware Pooling for Pedestrian Attribute Recognition (2019). ArXiv, abs/1907.11837
Google Scholar
Ji, Z., Hu, Z., He, E., Han, J., Pang, Y.: Pedestrian attribute recognition based on multiple time steps attention. Pattern Recognit. Lett. 138, 170–176 (2020)
Article Google Scholar
Jia, J., Huang, H., Yang, W., Chen, X., Huang, K.: Rethinking of Pedestrian Attribute Recognition: Realistic Datasets with Efficient Method (2020). ArXiv, abs/2005.11909
Google Scholar

Download references

Acknowledgment

This research received partial support from the National Natural Science Foundation of China(No. 62067003) and the Foundation of Jiangxi Educational Committee (No. GJJ200824).

Author information

Authors and Affiliations

School of Software Engineering, Jiangxi University of Science and Technology, NanChang, 330013, Jiangxi, People’s Republic of China
He Xiao, Wenbiao Xie, Ruoni Zhang & Xiao Xu
Nanchang Key laboratory of Virtual Digital Factory and Cultural Communications, Nanchang, 330013, People’s Republic of China
He Xiao
Information and Communication Branch, State Grid Jiangxi Electric Power Co, Nanchang, 330095, China
Yang Zhou
School of Software, Jiangxi Normal University, Nanchang, 330022, China
Yong Luo

Authors

He Xiao
View author publications
You can also search for this author in PubMed Google Scholar
Wenbiao Xie
View author publications
You can also search for this author in PubMed Google Scholar
Yang Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Yong Luo
View author publications
You can also search for this author in PubMed Google Scholar
Ruoni Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Xiao Xu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to He Xiao .

Editor information

Editors and Affiliations

The University of Electro-Communications, Tokyo, Japan
Celimuge Wu
VTT Technical Research Centre, Helsinki, Finland
Xianfu Chen
Xidian University, Xi'an, China
Jie Feng
Jiangxi University of Technology, Nanchang, China
Zhen Wu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Xiao, H., Xie, W., Zhou, Y., Luo, Y., Zhang, R., Xu, X. (2024). Research on Multi-scale Pedestrian Attribute Recognition Based on Dual Self-attention Mechanism. In: Wu, C., Chen, X., Feng, J., Wu, Z. (eds) Mobile Networks and Management. MONAMI 2023. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 559. Springer, Cham. https://doi.org/10.1007/978-3-031-55471-1_16

Download citation

DOI: https://doi.org/10.1007/978-3-031-55471-1_16
Published: 17 March 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-55470-4
Online ISBN: 978-3-031-55471-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Research on Multi-scale Pedestrian Attribute Recognition Based on Dual Self-attention Mechanism