A Depth-Guided Attention Strategy for Crowd Counting

Chen, Hao; Li, Zhan; Bhanu, Bir; Lu, Dongping; Han, Xuming

doi:10.1007/978-3-031-44204-9_3

Hao Chen¹¹,
Zhan Li¹¹,
Bir Bhanu¹²,
Dongping Lu¹¹ &
…
Xuming Han¹¹

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14263))

Included in the following conference series:

International Conference on Artificial Neural Networks

1100 Accesses

Abstract

Crowd counting, an essential technology with numerous applications, often encounters challenges due to non-uniform crowd distributions and noisy backgrounds in congested scenes. To address these issues, this paper proposes the utilization of depth information as an independent indicator. Specifically, we introduce a depth-guided attention strategy (DAS) to fuse depth and crowd density information, effectively modeling the relationship between crowd density and depth of field. Additionally, we propose a depth-guided method to generate the target density map by leveraging the negative correlation between the depth of field and head size in crowd scenes, enabling better supervised learning. To achieve fast inference speed, we design two lightweight crowd counting networks within a knowledge distillation framework that require only a small number of parameters. Furthermore, we propose a two-step network inference algorithm to reduce counting errors. Extensive experiments conducted on four challenging datasets demonstrate that our proposed methods significantly improve counting accuracy over baseline networks.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 79.99; Price excludes VAT (USA)

Softcover Book: USD 99.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

HRANet: Hierarchical region-aware network for crowd counting

Article 02 February 2022

Self-attention Guidance Based Crowd Localization and Counting

Article 22 February 2024

Single-column CNN for crowd counting with pixel-wise attention mechanism

Article 13 October 2018

References

Abousamra, S., Hoai, M., Samaras, D., Chen, C.: Localization in the crowd with topological constraints. In: AAAI Conference on Artificial Intelligence, vol. 35, pp. 872–881 (2021)
Google Scholar
Cao, X., Wang, Z., Zhao, Y., Su, F.: Scale aggregation network for accurate and efficient crowd counting. In: European Conference on Computer Vision, pp. 734–750 (2018)
Google Scholar
Han, K., Wang, Y., Tian, Q., Guo, J., Xu, C., Xu, C.: Ghostnet: more features from cheap operations. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1580–1589 (2020)
Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Google Scholar
Howard, A.G., et al.: Mobilenets: efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861 (2017)
Hua, C., Xu, K., Tong, T.: Crowd counting with dilated inception convolution. In: International Conference on Computing and Artificial Intelligence, pp. 208–215 (2021)
Google Scholar
Idrees, H., Saleemi, I., Seibert, C., Shah, M.: Multi-source multi-scale counting in extremely dense crowd images. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2547–2554 (2013)
Google Scholar
Idrees, H., et al.: Composition loss for counting, density map estimation and localization in dense crowds. In: European Conference on Computer Vision, pp. 532–546 (2018)
Google Scholar
Jiang, G., Wu, R., Huo, Z., Zhao, C., Luo, J.: Ligmsaet: lightweight multi-scale adaptive convolutional neural network for dense crowd counting. Expert Syst. Appl. 197, 116662 (2022)
Article Google Scholar
Li, Y., Zhang, X., Chen, D.: Csrnet: dilated convolutional neural networks for understanding the highly congested scenes. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1091–1100 (2018)
Google Scholar
Li, Z., Lu, S., Dong, Y., Guo, J.: Msffa: a multi-scale feature fusion and attention mechanism network for crowd counting. Visual Comput. 39(3), 1045–1056 (2023)
Article Google Scholar
Lian, D., Li, J., Zheng, J., Luo, W., Gao, S.: Density map regression guided detection network for rgb-d crowd counting and localization. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1821–1830 (2019)
Google Scholar
Liu, L., Chen, J., Wu, H., Chen, T., Li, G., Lin, L.: Efficient crowd counting via structured knowledge transfer. In: ACM International Conference on Multimedia, pp. 2645–2654 (2020)
Google Scholar
Liu, W., Salzmann, M., Fua, P.: Context-aware crowd counting. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 5099–5108 (2019)
Google Scholar
Ma, X., Du, S., Liu, Y.: A lightweight neural network for crowd analysis of images with congested scenes. In: IEEE International Conference on Image Processing, pp. 979–983 (2019)
Google Scholar
Ma, Z., Wei, X., Hong, X., Lin, H., Qiu, Y., Gong, Y.: Learning to count via unbalanced optimal transport. In: AAAI Conference on Artificial Intelligence, vol. 35, pp. 2319–2327 (2021)
Google Scholar
Oh, M.h., Olsen, P., Ramamurthy, K.N.: Crowd counting with decomposed uncertainty. In: AAAI Conference on Artificial Intelligence, vol. 34, pp. 11799–11806 (2020)
Google Scholar
Ranftl, R., Lasinger, K., Hafner, D., Schindler, K., Koltun, V.: Towards robust monocular depth estimation: mixing datasets for zero-shot cross-dataset transfer. IEEE Trans. Pattern Anal. Mach. Intell. 44, 1623–1637 (2020)
Article Google Scholar
Shu, W., Wan, J., Tan, K.C., Kwong, S., Chan, A.B.: Crowd counting in the frequency domain. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 19618–19627 (2022)
Google Scholar
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)
Wan, J., Liu, Z., Chan, A.B.: A generalized loss function for crowd counting and localization. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1974–1983 (2021)
Google Scholar
Wang, B., Liu, H., Samaras, D., Nguyen, M.H.: Distribution matching for crowd counting. Adv. Neural Inf. Process. Syst. 33, 1595–1607 (2020)
Google Scholar
Wang, M., Cai, H., Dai, Y., Gong, M.: Dynamic mixture of counter network for location-agnostic crowd counting. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp. 167–177 (2023)
Google Scholar
Xu, C., et al.: Autoscale: learning to scale for crowd counting. Int. J. Comput. Vision 130(2), 405–434 (2022)
Article MathSciNet Google Scholar
Yang, S.D., Su, H.T., Hsu, W.H., Chen, W.C.: Deccnet: depth enhanced crowd counting. In: IEEE International Conference on Computer Vision Workshops, p. 4521–4530 (2019)
Google Scholar
, Zhai, W., Gao, M., Li, Q., Jeon, G., Anisetti, M.: Fpanet: feature pyramid attention network for crowd counting. Appl. Intell. 1–18 (2023)
Google Scholar
Zhang, Y., Zhou, D., Chen, S., Gao, S., Ma, Y.: Single-image crowd counting via multi-column convolutional neural network. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 589–597 (2016)
Google Scholar
Zhou, F., et al.: Comal: compositional multi-scale feature enhanced learning for crowd counting. Multimedia Tools Appl., 1–20 (2022)
Google Scholar
Zhu, F., Yan, H., Chen, X., Li, T.: Real-time crowd counting via lightweight scale-aware network. Neurocomputing 472, 54–67 (2022)
Article Google Scholar

Download references

Acknowledgements

This work was financially supported by the Guangdong Basic and Applied Basic Research Foundation (No. 2022A1515010119) and the National Natural Science Foundation of China (No. 62071201, No. U2031104).

Author information

Authors and Affiliations

Department of Computer Science, Jinan University, Guangdong, 510632, China
Hao Chen, Zhan Li, Dongping Lu & Xuming Han
Department of Electrical and Computer Engineering, University of California, Riverside, CA, USA
Bir Bhanu

Authors

Hao Chen
View author publications
You can also search for this author in PubMed Google Scholar
Zhan Li
View author publications
You can also search for this author in PubMed Google Scholar
Bir Bhanu
View author publications
You can also search for this author in PubMed Google Scholar
Dongping Lu
View author publications
You can also search for this author in PubMed Google Scholar
Xuming Han
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zhan Li .

Editor information

Editors and Affiliations

Democritus University of Thrace, Xanthi, Greece
Lazaros Iliadis
Democritus University of Thrace, Xanthi, Greece
Antonios Papaleonidas
Lancaster University, Lancaster, UK
Plamen Angelov
Teesside University, Middlesbrough, UK
Chrisina Jayne

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chen, H., Li, Z., Bhanu, B., Lu, D., Han, X. (2023). A Depth-Guided Attention Strategy for Crowd Counting. In: Iliadis, L., Papaleonidas, A., Angelov, P., Jayne, C. (eds) Artificial Neural Networks and Machine Learning – ICANN 2023. ICANN 2023. Lecture Notes in Computer Science, vol 14263. Springer, Cham. https://doi.org/10.1007/978-3-031-44204-9_3

Download citation

DOI: https://doi.org/10.1007/978-3-031-44204-9_3
Published: 22 September 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-44203-2
Online ISBN: 978-3-031-44204-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

A Depth-Guided Attention Strategy for Crowd Counting