Abstract
As one of the important fields of computer vision research, pedestrian attribute recognition has gained increasing attention from domestic and foreign researchers due to its huge potential applications. However, obtaining long-distance pedestrian information in actual scenes poses challenges such as lack of information, incomplete feature extraction, and low attribute recognition accuracy. To address these issues, we propose a multi-scale feature fusion network based on a dual self-attention mechanism. The fusion module merges multi-scale features to enable more complete attribute extraction, while the dual self-attention module focuses the network on important regions. Experimental results on PA-100K, RAP, and PETA datasets achieved mean accuracies of 81.97%, 81.53%, and 86.37%, respectively. Extensive experiments demonstrate that the proposed method is highly competitive in pedestrian attribute recognition.
The Jiangxi Province Office of Education provided funding support for this research.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Feris, R., Bobbitt, R., Brown, L., Pankanti, S.: Attribute-based people search: lessons learnt from a practical surveillance system. In: Proceedings of the ACM International Conference on Multimedia Retrieval 2014, Glasgow, United Kingdom, pp. 153–160, April 2014
Lin, Y., Zheng, L., Zheng, Z., Wu, Y., Hu, Z., Yan, C., et al.: Improving person re-identification by attribute and identity learning. Pattern Recognit. 95, 151–161 (2019)
Di, X., Zhang, H., Patel, V.M.: Polarimetric thermal to visible face verification via attribute preserved synthesis. In: 2018 IEEE 9th International Conference on Biometrics Theory, Applications and Systems, United states, October 2018
Di, X., Riggan, B.S., Hu, S., Short, N.J., Patel, V.M.: Multi-scale thermal to visible face verification via attribute guided synthesis. IEEE Trans. Biom. Behav. Identity Sci. 266–280 (2021)
Shi, Y., Ling, H., Wu, L., Shen, J., Li, P.: Learning refined attribute-aligned network with attribute selection for person re-identification. Neurocomputing 402, 124–133 (2020)
Li, H., Chen, Y., Tao, D., Yu, Z., Qi, G.: Attribute-aligned domain-invariant feature learning for unsupervised domain adaptation person re-identification. IEEE Trans. Inf. Forensics Secur. 16, 1480–1494 (2021)
Tang, C., Sheng, L., Zhang, Z.-X., Hu, X.: Improving pedestrian attribute recognition with weakly-supervised multi-scale attribute-specific localization. In: 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Korea (South), pp. 4996–5005, October2019
Zhong, J., Qiao, H., Chen, L., Shang, M., Liu, Q.: Improving pedestrian attribute recognition with multi-scale spatial calibration. In: International Joint Conference on Neural Networks (IJCNN) 2021, pp. 1–8 (2021)
Liu, Z., Zhang, Z., Li, D., Zhang, P., Shan, C.: Dual-branch self-attention network for pedestrian attribute recognition. Pattern Recognit. Lett. 163, 112–120 (2022)
Fan, Z., Guan, Y.: Pedestrian attribute recognition based on dual self-attention mechanism. Comput. Sci. Inf. Syst. 20, 793–812 (2023)
Wang, X., Zheng, S., Yang, R., Luo, B., Chen, Z., Tang, J.: Pedestrian Attribute Recognition: A (2019)
Sudowe, P., Spitzer, H., Leibe, B.: Person attribute recognition with a jointly-trained holistic CNN model. In: 2015 IEEE International Conference on Computer Vision Workshop (ICCVW), Santiago, Chile, pp. 329–337, December 2015. Survey. Pattern Recognit., 121, 108220
Li, D., Chen, X., Huang, K.: Multi-attribute learning for pedestrian attribute recognition in surveillance scenarios. In: 2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR), Kuala Lumpur, Malaysia, pp. 111–115, November 2015
Abdulnabi, A.H., Wang, G., Lu, J., Jia, K.: Multi-Task CNN model for attribute prediction. IEEE Trans. Multimed. 17, 1949–1959 (2015)
Zhu, J., Liao, S., Yi, D., Lei, Z.: Multi-label CNN based pedestrian attribute learning for soft biometrics. In: 2015 International Conference on Biometrics (ICB), pp. 535–540 (2015)
Yang, L., Zhu, L., Wei, Y., Liang, S., Tan, P.: Attribute Recognition from Adaptive Parts (2016). ArXiv, abs/1607.01437
Diba, A., Pazandeh, A.M., Pirsiavash, H., Gool, L.V.: DeepCAMP: deep convolutional action & attribute mid-level patterns. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2016, pp. 3557–3565 (2016)
Li, D., Chen, X., Zhang, Z., Huang, K.: Pose guided deep model for pedestrian attribute recognition in surveillance scenarios. In: IEEE International Conference on Multimedia and Expo (ICME) 2018, pp. 1–6 (2018)
Yu, K., Leng, B., Zhang, Z., Li, D., Huang, K.: Weakly-supervised Learning of Mid-level Features for Pedestrian Attribute Recognition and Localization (2016). ArXiv, abs/1611.05603
Liu, X., et al.: HydraPlus-Net: attentive deep features for pedestrian analysis. In: IEEE International Conference on Computer Vision (ICCV) 2017, pp. 350–359 (2017)
Sarfraz, M.S., Schumann, A., Wang, Y.: Deep View-Sensitive Pedestrian Attribute Inference in an end-to-end Model (2017). ArXiv, abs/1707.06089
Guo, H., Fan, X., Wang, S.: Human attribute recognition by refining attention heat map. Pattern Recognit. Lett. 94, 38–45 (2017)
Li, D., Zhang, Z., Chen, X., Ling, H., Huang, K.: A Richly Annotated Dataset for Pedestrian Attribute Recognition (2016). ArXiv, abs/1603.07054
Deng, Y., Luo, P., Loy, C.C.: Pedestrian attribute recognition at far distance. In: Proceedings of the 22nd ACM International Conference on Multimedia (2014)
Xie, S., Girshick, R.B., Dollár, P., Tu, Z., He, K.: Aggregated residual transformations for deep neural networks. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2017, pp. 5987–5995 (2016)
Liu, P., Liu, X., Yan, J., Shao, J.: Localization Guided Learning for Pedestrian Attribute Recognition (2018). ArXiv, abs/1808.09102
Han, K., Wang, Y., Shu, H., Liu, C., Xu, C., Xu, C.: Attribute Aware Pooling for Pedestrian Attribute Recognition (2019). ArXiv, abs/1907.11837
Ji, Z., Hu, Z., He, E., Han, J., Pang, Y.: Pedestrian attribute recognition based on multiple time steps attention. Pattern Recognit. Lett. 138, 170–176 (2020)
Jia, J., Huang, H., Yang, W., Chen, X., Huang, K.: Rethinking of Pedestrian Attribute Recognition: Realistic Datasets with Efficient Method (2020). ArXiv, abs/2005.11909
Acknowledgment
This research received partial support from the National Natural Science Foundation of China(No. 62067003) and the Foundation of Jiangxi Educational Committee (No. GJJ200824).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
About this paper
Cite this paper
Xiao, H., Xie, W., Zhou, Y., Luo, Y., Zhang, R., Xu, X. (2024). Research on Multi-scale Pedestrian Attribute Recognition Based on Dual Self-attention Mechanism. In: Wu, C., Chen, X., Feng, J., Wu, Z. (eds) Mobile Networks and Management. MONAMI 2023. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 559. Springer, Cham. https://doi.org/10.1007/978-3-031-55471-1_16
Download citation
DOI: https://doi.org/10.1007/978-3-031-55471-1_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-55470-4
Online ISBN: 978-3-031-55471-1
eBook Packages: Computer ScienceComputer Science (R0)