Skip to main content
Log in

Dense crowd counting based on adaptive scene division

  • Original Article
  • Published:
International Journal of Machine Learning and Cybernetics Aims and scope Submit manuscript

Abstract

With the rapid development of computer vision and artificial intelligence, crowd counting has attracted significant attention from researchers and many well-known methods were proposed. However, due to interocclusions, perspective distortion, and uneven crowd distribution, crowd counting is still a highly challenging task in crowd analysis. Motivated by granular computing, a novel end-to-end crowd counting network (GrCNet) is proposed to enable the problem of crowd counting to be conceptualized at different levels of granularity, and to map problem into computationally tractable subproblems. It shows that by adaptively dividing the image into granules and then feeding the granules into different counting subnetworks separately, the scale variation range of image is narrowed and the the adaptability of counting algorithm to different scenarios is improved. Experiments on four well-known crowd counting benchmark datasets indicate that GrCNet achieves state-of-the-art counting performance and high robustness in dense crowd counting.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig.5
Fig. 6
Fig. 7

Similar content being viewed by others

References

  1. Sindagi VA, Patel VM (2018) A survey of recent advances in CNN-based single image crowd counting and density estimation. Pattern Recognit Lett 107:3–16

    Article  Google Scholar 

  2. Zhou B, Tang X, Wang X (2015) Learning collective crowd behaviors with dynamic pedestrian-agents. Int J Comput Vis 111(1):50–68

    Article  Google Scholar 

  3. Huang L, Chen T, Wang Y et al (2015) Congestion detection of pedestrians using the velocity entropy: a case study of Love Parade 2010 disaster. Phys A 440:200–209

    Article  Google Scholar 

  4. Qin XH, Wang XF, Zhou X et al (2013) Crowd count in a variety of crowd density scenarios. J Image Graphics 04:37–43 (in Chinese)

    Google Scholar 

  5. Sam DB, Surya S, Babu RV, et al (2017) Switching convolutional neural nNetwork for crowd counting. In: IEEE conference on computer vision and pattern recognition, Honolulu, USA, 2017, pp 4031–4039

  6. Zhang H, Kyaw Z, Chang S, et al (2017) Visual translation embedding network for visual relation detection. In: IEEE Conference on computer vision and pattern recognition, Honolulu, USA, 2017, pp 3107–3115

  7. Zhang S, Wu G, Costeiraz JP, et al (2017) FCN-rLSTM: Deep spatio-temporal neural networks for vehicle counting in city cameras. In: 2017 IEEE International Conference on Computer Vision (ICCV) Venice, Italy ,IEEE, 2017, pp 3687–3696

  8. Arteta C, Lempitsky V, Zisserman A, et al (2016) Counting in the Wild. In: European Conference on computer vision, Amsterdam, The Netherlands, 2016, pp 483–498

  9. Liu X (2018) Research on target counting method in video surveillance. Doctoral thesis, University of Science and Technology of China, Hefei (in Chinese)

  10. Zhang Y, Zhou D, Chen S, et al (2016) Single-image crowd counting via multi-column convolutional neural network. In: Proceedings of the IEEE Conference on computer vision and pattern recognition. Las Vegas, USA: IEEE, 2016, pp 589–597

  11. Lecun Y, Bottou L, Bengio Y et al (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324

    Article  Google Scholar 

  12. Kok VJ, Chan CS (2017) GrCS: granular computing-based crowd segmentation. IEEE Trans Cybern 47(5):1157–1168

    Article  Google Scholar 

  13. Viola P, Jones MJ (2004) Robust real-time face detection. Int J Comput Vis 57(2):137–154

    Article  Google Scholar 

  14. Wu B, Nevatia R (2005) Detection of multiple, partially occluded humans in a single image by Bayesian combination of edgelet part detectors. In: Tenth IEEE International Conference on Computer Vision. Beijing, China, 2005, pp 90–97

  15. Sabzmeydani P, Mori G (2007) Detecting pedestrians by learning shapelet features. In: Proceedings of the IEEE Conference on computer vision and pattern recognition. Minneapolis: IEEE, 2007, pp 1–8

  16. Felzenszwalb PF, Girshick RB, McAllester D et al (2010) Object detection with discriminatively trained part-based models. IEEE Trans Pattern Anal Mach Intell 32(9):1627–1645

    Article  Google Scholar 

  17. Li M, Zhang Z, Huang K, et al (2008) Estimating the number of people in crowded scenes by mid based foreground segmentation and head-shoulder detection. In: International Conference on pattern recognition. Tampa, USA, 2008, pp 1–4

  18. Chan AB, Vasconcelos N (2009) Bayesian poisson regression for crowd counting. In: IEEE 12th International Conference on computer vision. Kyoto, Japan: IEEE, 2009, pp 545–551

  19. Ryan D, Denman S, Fookes CB, et al (2009) Crowd counting using multiple local features. In: Proceeding of digital image computing: techniques and applications. Melbourne, Australia: IEEE, 2009, pp 81–88

  20. Lempitsky V, Zisserman A (2010) Learning to count objects in images. In: Advances in neural information processing systems, Vancouver, Canada, 2010, pp 1324–1332

  21. Pham VQ, Kozakaya T, Yamaguchi O, et al (2015) COUNT Forest: CO-voting uncertain number of targets using random forest for crowd density estimation. In: International Conference on computer vision (ICCV 2015). Santiago,Chile: IEEE, 2015, pp3253–3261.

  22. Zhang C, Li H, Wang X, et al (2015) Cross-scene crowd counting via deep convolutional neural networks. In: Proceedings of the IEEE Conference on computer vision and pattern recognition. Boston, USA: IEEE, 2015, pp 833–841

  23. Sam, DB, Surya S, Babu RV (2017) Switching convolutional neural network for crowd counting. In: Proceedings of the IEEE Conference on computer vision and pattern recognition. Honolulu, USA: IEEE, 2017, pp 4031–4039

  24. Sindagi VA, Patel VM (2017) Generating High-quality crowd density maps using contextual pyramid CNNs. In: Proceedings of the IEEE International Conference on computer vision (ICCV). Venice, Italy: IEEE, 2017, pp 1879–1888

  25. Li Y, Zhang X, Chen D (2019) CSRNet: dilated convolutional neural networks for understanding the highly congested scenes. In: Proceedings of the IEEE Conference on computer vision and pattern recognition. Salt Lake City, USA: IEEE, 2018, pp 1091–1100.

  26. Yu F, Koltun V (2015) Multi-scale context aggregation by dilated convolutions. In: 4th International Conference on learning representations. San Juan, Puerto Rico, : Arxiv, 2015: 1511.07122

  27. Tian Y, Lei Y, Zhang J et al (2020) PaDNet: pan-density crowd counting. IEEE Trans Image Process 29:2714–2727

    Article  Google Scholar 

  28. Ranjan V, Le H, Hoai M (2018) Iterative crowd counting. In: Proceedings of the European Conference on computer vision . Munich, Germany : IEEE, 2018, pp 270–285

  29. Liu J, Gao C, Meng D, et al (2018) Decidenet: counting varying density crowds through attention guided detection and density estimation. In: Proceedings of the IEEE Conference on computer vision and pattern recognition. Salt Lake City, USA: IEEE, 2018, pp 5197–5206

  30. Idrees H, Saleemi I, Seibert C, et al (2013) Multi-source multi-scale counting in extremely dense crowd images. Proceedings of the IEEE Conference on computer vision and pattern recognition. Portland, USA: IEEE, 2013, pp 2547–2554

  31. Chan AB, Liang ZSJ, Vasconcelos N (2008) Privacy preserving crowd monitoring: counting people without people models or tracking. Proceedings of the IEEE Conference on computer vision and pattern recognition. Anchorage, USA: IEEE, 2008, pp 1–7

  32. Gao C, Zhou J, Miao D, Wen J, Yue X (2020) Three-way decision with co-training for partially labeled data. Inf Sci. https://doi.org/10.1016/j.ins.2020.08.104

    Article  Google Scholar 

Download references

Acknowledgements

The authors would like to thank the editors for their kindly help and the anonymous referees for their valuable comments and helpful suggestions. The work is partially supported by the National Natural Science Foundation of China (Serial No. 61563016, 61762036), the Natural Science Foundation of Jiangxi Provincial (Serial No. 20181BAB202023, 20171BAB202012).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ying Yu.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Yu, Y., Zhu, H., Wang, L. et al. Dense crowd counting based on adaptive scene division. Int. J. Mach. Learn. & Cyber. 12, 931–942 (2021). https://doi.org/10.1007/s13042-020-01212-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13042-020-01212-5

Keywords

Navigation