Abstract
With the rapid development of computer vision and artificial intelligence, crowd counting has attracted significant attention from researchers and many well-known methods were proposed. However, due to interocclusions, perspective distortion, and uneven crowd distribution, crowd counting is still a highly challenging task in crowd analysis. Motivated by granular computing, a novel end-to-end crowd counting network (GrCNet) is proposed to enable the problem of crowd counting to be conceptualized at different levels of granularity, and to map problem into computationally tractable subproblems. It shows that by adaptively dividing the image into granules and then feeding the granules into different counting subnetworks separately, the scale variation range of image is narrowed and the the adaptability of counting algorithm to different scenarios is improved. Experiments on four well-known crowd counting benchmark datasets indicate that GrCNet achieves state-of-the-art counting performance and high robustness in dense crowd counting.
Similar content being viewed by others
References
Sindagi VA, Patel VM (2018) A survey of recent advances in CNN-based single image crowd counting and density estimation. Pattern Recognit Lett 107:3–16
Zhou B, Tang X, Wang X (2015) Learning collective crowd behaviors with dynamic pedestrian-agents. Int J Comput Vis 111(1):50–68
Huang L, Chen T, Wang Y et al (2015) Congestion detection of pedestrians using the velocity entropy: a case study of Love Parade 2010 disaster. Phys A 440:200–209
Qin XH, Wang XF, Zhou X et al (2013) Crowd count in a variety of crowd density scenarios. J Image Graphics 04:37–43 (in Chinese)
Sam DB, Surya S, Babu RV, et al (2017) Switching convolutional neural nNetwork for crowd counting. In: IEEE conference on computer vision and pattern recognition, Honolulu, USA, 2017, pp 4031–4039
Zhang H, Kyaw Z, Chang S, et al (2017) Visual translation embedding network for visual relation detection. In: IEEE Conference on computer vision and pattern recognition, Honolulu, USA, 2017, pp 3107–3115
Zhang S, Wu G, Costeiraz JP, et al (2017) FCN-rLSTM: Deep spatio-temporal neural networks for vehicle counting in city cameras. In: 2017 IEEE International Conference on Computer Vision (ICCV) Venice, Italy ,IEEE, 2017, pp 3687–3696
Arteta C, Lempitsky V, Zisserman A, et al (2016) Counting in the Wild. In: European Conference on computer vision, Amsterdam, The Netherlands, 2016, pp 483–498
Liu X (2018) Research on target counting method in video surveillance. Doctoral thesis, University of Science and Technology of China, Hefei (in Chinese)
Zhang Y, Zhou D, Chen S, et al (2016) Single-image crowd counting via multi-column convolutional neural network. In: Proceedings of the IEEE Conference on computer vision and pattern recognition. Las Vegas, USA: IEEE, 2016, pp 589–597
Lecun Y, Bottou L, Bengio Y et al (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324
Kok VJ, Chan CS (2017) GrCS: granular computing-based crowd segmentation. IEEE Trans Cybern 47(5):1157–1168
Viola P, Jones MJ (2004) Robust real-time face detection. Int J Comput Vis 57(2):137–154
Wu B, Nevatia R (2005) Detection of multiple, partially occluded humans in a single image by Bayesian combination of edgelet part detectors. In: Tenth IEEE International Conference on Computer Vision. Beijing, China, 2005, pp 90–97
Sabzmeydani P, Mori G (2007) Detecting pedestrians by learning shapelet features. In: Proceedings of the IEEE Conference on computer vision and pattern recognition. Minneapolis: IEEE, 2007, pp 1–8
Felzenszwalb PF, Girshick RB, McAllester D et al (2010) Object detection with discriminatively trained part-based models. IEEE Trans Pattern Anal Mach Intell 32(9):1627–1645
Li M, Zhang Z, Huang K, et al (2008) Estimating the number of people in crowded scenes by mid based foreground segmentation and head-shoulder detection. In: International Conference on pattern recognition. Tampa, USA, 2008, pp 1–4
Chan AB, Vasconcelos N (2009) Bayesian poisson regression for crowd counting. In: IEEE 12th International Conference on computer vision. Kyoto, Japan: IEEE, 2009, pp 545–551
Ryan D, Denman S, Fookes CB, et al (2009) Crowd counting using multiple local features. In: Proceeding of digital image computing: techniques and applications. Melbourne, Australia: IEEE, 2009, pp 81–88
Lempitsky V, Zisserman A (2010) Learning to count objects in images. In: Advances in neural information processing systems, Vancouver, Canada, 2010, pp 1324–1332
Pham VQ, Kozakaya T, Yamaguchi O, et al (2015) COUNT Forest: CO-voting uncertain number of targets using random forest for crowd density estimation. In: International Conference on computer vision (ICCV 2015). Santiago,Chile: IEEE, 2015, pp3253–3261.
Zhang C, Li H, Wang X, et al (2015) Cross-scene crowd counting via deep convolutional neural networks. In: Proceedings of the IEEE Conference on computer vision and pattern recognition. Boston, USA: IEEE, 2015, pp 833–841
Sam, DB, Surya S, Babu RV (2017) Switching convolutional neural network for crowd counting. In: Proceedings of the IEEE Conference on computer vision and pattern recognition. Honolulu, USA: IEEE, 2017, pp 4031–4039
Sindagi VA, Patel VM (2017) Generating High-quality crowd density maps using contextual pyramid CNNs. In: Proceedings of the IEEE International Conference on computer vision (ICCV). Venice, Italy: IEEE, 2017, pp 1879–1888
Li Y, Zhang X, Chen D (2019) CSRNet: dilated convolutional neural networks for understanding the highly congested scenes. In: Proceedings of the IEEE Conference on computer vision and pattern recognition. Salt Lake City, USA: IEEE, 2018, pp 1091–1100.
Yu F, Koltun V (2015) Multi-scale context aggregation by dilated convolutions. In: 4th International Conference on learning representations. San Juan, Puerto Rico, : Arxiv, 2015: 1511.07122
Tian Y, Lei Y, Zhang J et al (2020) PaDNet: pan-density crowd counting. IEEE Trans Image Process 29:2714–2727
Ranjan V, Le H, Hoai M (2018) Iterative crowd counting. In: Proceedings of the European Conference on computer vision . Munich, Germany : IEEE, 2018, pp 270–285
Liu J, Gao C, Meng D, et al (2018) Decidenet: counting varying density crowds through attention guided detection and density estimation. In: Proceedings of the IEEE Conference on computer vision and pattern recognition. Salt Lake City, USA: IEEE, 2018, pp 5197–5206
Idrees H, Saleemi I, Seibert C, et al (2013) Multi-source multi-scale counting in extremely dense crowd images. Proceedings of the IEEE Conference on computer vision and pattern recognition. Portland, USA: IEEE, 2013, pp 2547–2554
Chan AB, Liang ZSJ, Vasconcelos N (2008) Privacy preserving crowd monitoring: counting people without people models or tracking. Proceedings of the IEEE Conference on computer vision and pattern recognition. Anchorage, USA: IEEE, 2008, pp 1–7
Gao C, Zhou J, Miao D, Wen J, Yue X (2020) Three-way decision with co-training for partially labeled data. Inf Sci. https://doi.org/10.1016/j.ins.2020.08.104
Acknowledgements
The authors would like to thank the editors for their kindly help and the anonymous referees for their valuable comments and helpful suggestions. The work is partially supported by the National Natural Science Foundation of China (Serial No. 61563016, 61762036), the Natural Science Foundation of Jiangxi Provincial (Serial No. 20181BAB202023, 20171BAB202012).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Yu, Y., Zhu, H., Wang, L. et al. Dense crowd counting based on adaptive scene division. Int. J. Mach. Learn. & Cyber. 12, 931–942 (2021). https://doi.org/10.1007/s13042-020-01212-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13042-020-01212-5