Skip to main content

Cross-Modal Information Aggregation and Distribution Method for Crowd Counting

  • Conference paper
  • First Online:
Advances in Computer Graphics (CGI 2023)

Abstract

Crowd counting is a fundamental and challenging task in computer vision. However, existing methods are relatively limited in dealing with scale and illumination changes simultaneously. To improve the accuracy of crowd counting and address the challenges of illumination and scale changes, we adopt the concept of crowding degree information. Due to the fact that a count map can accurately obtain the population in an image and solve the occlusion problem, we use the count map as a specific form of crowding degree information and propose a new cross-modal information aggregation and distribution model for crowd counting. We first input the crowding degree information into LibraNet and modify it with Information Aggregation Transfer (IAT) and Information Distribution Transfer (IDT) modules to obtain a count map. Then, light information, thermal information and crowding degree information are respectively input into the network through RGB image, themal image, and count map. A more accurate density map can be obtained through multiple convolution operations and IADM processing to improve counting accuracy. Finally, the density map is integrated to obtain the number of people. Experiments demonstrate that our methods provide superior quality and higher parallelism. Therefore, we can obtain higher-accuracy density maps by using light information, thermal information, and crowding degree information.

Supported by the National Key R &D Program of China under Grant No. 2021ZD0200403 and 2018YFB1404102.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 59.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 74.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Lempitsky, V., Zisserman, A.: Learning to count objects in images. In: Advances in Neural Information Processing Systems, vol. 23 (2010)

    Google Scholar 

  2. Chen, X., Yu, X., Di, H., Wang, S.: SA-InterNet: scale-aware interaction network for joint crowd counting and localization. In: Ma, H., et al. (eds.) PRCV 2021. LNCS, vol. 13019, pp. 203–215. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-88004-0_17

    Chapter  Google Scholar 

  3. Senthilkumar, R., Ritika, S., Manikandan, M., Shyam, B.: Crowd counting using federated learning and domain adaptation. In: Badica, C., Paprzycki, M., Kharb, L., Chahal, D. (eds.) ICICCT 2022. Communications in Computer and Information Science, pp. 97–111. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-20977-2_8

    Chapter  Google Scholar 

  4. Ilyas, N., Ahmad, Z., Lee, B., Kim, K.: An effective modular approach for crowd counting in an image using convolutional neural networks. Sci. Rep. 12(1), 5795 (2022)

    Article  Google Scholar 

  5. Wang, Q., Gao, J., Lin, W., Li, X.: NWPU-crowd: a large-scale benchmark for crowd counting and localization. IEEE Trans. Pattern Anal. Mach. Intell. 43(6), 2141–2149 (2020)

    Article  Google Scholar 

  6. Zhang, C., Li, H., Wang, X., Yang, X.: Cross-scene crowd counting via deep convolutional neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 833–841 (2015)

    Google Scholar 

  7. Zhang, Y., Zhou, D., Chen, S., Gao, S., Ma, Y.: Single-image crowd counting via multi-column convolutional neural network. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 589–597 (2016)

    Google Scholar 

  8. Liu, L., Chen, J., Wu, H., Li, G., Li, C., Lin, L.: Cross-modal collaborative representation learning and a large-scale RGBT benchmark for crowd counting. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4823–4833 (2021)

    Google Scholar 

  9. Babu Sam, D., Surya, S., Venkatesh Babu, R.: Switching convolutional neural network for crowd counting. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2017)

    Google Scholar 

  10. Liu, L., Hao, L., Xiong, H., Xian, K., Cao, Z., Shen, C.: Counting objects by blockwise classification. IEEE Trans. Circ. Syst. Video Technol. 30(10), 3513–3527 (2019)

    Article  Google Scholar 

  11. Mnih, V., et al.: Human-level control through deep reinforcement learning. Nature 518(7540), 529–533 (2015)

    Article  Google Scholar 

  12. Mnih, V., et al.: Human-level control through deep reinforcement learning. Nature 518(7540), 529–533 (2015)

    Article  Google Scholar 

  13. Idrees, H., Saleemi, I., Seibert, C., Shah, M.: Multi-source multi-scale counting in extremely dense crowd images. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2547–2554 (2013)

    Google Scholar 

  14. Zhao, M., Zhang, J., Zhang, C., Zhang, W.: Leveraging heterogeneous auxiliary tasks to assist crowd counting. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12736–12745 (2019)

    Google Scholar 

  15. Khan, S.D., Salih, Y., Zafar, B., Noorwali, A.: A deep-fusion network for crowd counting in high-density crowded scenes. Int. J. Comput. Intell. Syst. 14(1), 168 (2021)

    Article  Google Scholar 

  16. Sindagi, V.A., Patel, V.M.: Multi-level bottom-top and top-bottom feature fusion for crowd counting. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 1002–1012 (2019)

    Google Scholar 

  17. Sindagi, V.A., Yasarla, R., Patel, V.M.: Pushing the frontiers of unconstrained crowd counting: new dataset and benchmark method. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 1221–1231 (2019)

    Google Scholar 

  18. Fang, Y., Gao, S., Li, J., Luo, W., He, L., Bo, H.: Multi-level feature fusion based locality-constrained spatial transformer network for video crowd counting. Neurocomputing 392, 98–107 (2020)

    Article  Google Scholar 

  19. Yan, Z., et al.: Perspective-guided convolution networks for crowd counting. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 952–961 (2019)

    Google Scholar 

  20. Ma, Z., Wei, X., Hong, X., Gong, Y.: Bayesian loss for crowd count estimation with point supervision. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 6142–6151 (2019)

    Google Scholar 

  21. Liu, N., Long, Y., Zou, C., Niu, Q., Pan, L., Wu, H.: Adcrowdnet: an attention-injective deformable convolutional network for crowd understanding. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3225–3234 (2019)

    Google Scholar 

  22. Szegedy, C., et al.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–9 (2015)

    Google Scholar 

  23. Dai, J., et al.: Deformable convolutional networks. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 764–773 (2017)

    Google Scholar 

  24. Chan, A.B., Liang, Z.S.J., Vasconcelos, N.: Privacy preserving crowd monitoring: counting people without people models or tracking. In: 2008 IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–7. IEEE (2008)

    Google Scholar 

  25. Huang, S., et al.: Body structure aware deep crowd counting. IEEE Trans. Image Process. 27(3), 1049–1059 (2017)

    Article  MathSciNet  Google Scholar 

  26. Shi, M., Yang, Z., Xu, C., Chen, Q.: Perspective-aware CNN for crowd counting. PhD thesis, Inria Rennes-Bretagne Atlantique (2018)

    Google Scholar 

  27. Cao, X., Wang, Z., Zhao, Y., Su, F.: Scale aggregation network for accurate and efficient crowd counting. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 734–750 (2018)

    Google Scholar 

  28. Li, Y., Zhang, X., Chen, D.: CSRNet: dilated convolutional neural networks for understanding the highly congested scenes. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1091–1100 (2018)

    Google Scholar 

  29. Liu, X., Yang, J., Ding, W., Wang, T., Wang, Z., Xiong, J.: Adaptive mixture regression network with local counting map for crowd counting. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12369, pp. 241–257. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58586-0_15

    Chapter  Google Scholar 

  30. Wang, B., Liu, H., Samaras, D., Nguyen, M.H.: Distribution matching for crowd counting. In: Advances in Neural Information Processing Systems, vol. 33, pp. 1595–1607 (2020)

    Google Scholar 

Download references

Acknowledgements

This research was supported by STI 2030-Major Projects 2021ZD0200400 and the National Key R &D Program of China under Grant No. 2018YFB1404102.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tianyang Dong .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Chen, Y., Zhou, Y., Dong, T. (2024). Cross-Modal Information Aggregation and Distribution Method for Crowd Counting. In: Sheng, B., Bi, L., Kim, J., Magnenat-Thalmann, N., Thalmann, D. (eds) Advances in Computer Graphics. CGI 2023. Lecture Notes in Computer Science, vol 14498. Springer, Cham. https://doi.org/10.1007/978-3-031-50078-7_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-50078-7_9

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-50077-0

  • Online ISBN: 978-3-031-50078-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics