Abstract
Crowd counting plays an important role in crowd analysis and monitoring. To this end, we propose a novel method called Adaptive Weighted Crowd Receptive Field Network (AWRFN) for crowd counting to estimate the number of people and the spatial distribution of input crowd images. The proposed AWRFN is composed of four modules: backbone, crowd receptive field block (CRFB), recurrent block (RB), and channel attention block (CAB). Backbone utilizes the first ten layers of VGG16 to extract base features of input images. CRFB is a multi-branch architecture simulating a real human visual system for further obtaining refined and discriminative crowd features. RB generates strong semantic and global information by recurrently stacking convolutional layers with the same parameters. CAB outputs appropriate weights to supervise each channel of the feature maps output from CRFB, which uses the outputs of RB as guidance. Different from previous works using Euclidean Loss, we employ L1_Smooth Loss to train our network in an end-to-end fashion. To demonstrate the effectiveness of our proposed method, we implement AWRFN on two representative datasets including the ShanghaiTech dataset and the UCF_CC_50 dataset. The experimental results prove that our method is both effective and robust compared with the state-of-the-art approaches.
References
Shen Z, Xu Y, Ni B, et al. (2017) Crowd counting via adversarial cross-scale consistency pursuit. In: IEEE conference on computer vision and pattern recognition. pp 5245-5254
Sindagi VA, Patel VM (2017) Cnn-based cascaded multi-task learning of high-level prior and density estimation for crowd counting. In: IEEE international conference on advanced video and signal based surveillance. pp 1-6
Wang L, Yin B, Tang X, et al. (2019) Removing background interference for crowd counting via de-background detail convolutional network. In: Neurocomputing, pp 332: 360–371
Wang L, Yin B, Guo A, Ma H, Cao J (2018) Skip-connection convolutional neural network for still image crowd counting. In: Applied intelligence. pp 1–12
Zhang Y, Zhou D, Chen S, et al. (2016) Single-image crowd counting via multi-column convolutional neural network. In: IEEE conference on computer vision and pattern recognition. pp 589–597
Zhang L, Shi M, Chen Q (2018) Crowd counting via scale-adaptive convolutional neural network. In: IEEE winter conference on applications of computer vision. pp 1113–1121
Liu J, Gao C, Meng D, et al. (2018) Decidenet: counting varying density crowds through attention guided detection and density estimation. In: IEEE conference on computer vision and pattern recognition. pp 5197-5206
Wandell BA, Winawer J (2015) Computational neuroimaging and population receptive fields. Trends Cognitive Sci, 19(6) pp 349–357
Cao X, Wang Z, Zhao Y, et al. (2018) Scale aggregation network for accurate and efficient crowd counting. In: European conference on computer vision. pp 734–750
Li Y, Zhang X, Chen D (2018) Csrnet: dilated convolutional neural networks for understanding the highly congested scenes. In: IEEE conference on computer vision and pattern recognition. pp 1091–1100
Liu S, Huang D (2018) Receptive field block net for accurate and fast object detection. In: European conference on computer vision. pp 385–400
Yu F, Koltun V (2015) Multi-scale context aggregation by dilated convolutions. In: arXiv preprint
Woo S, Park J, Lee J Y, et al. (2018) Cbam: Convolutional block attention module. In: European conference on computer vision. pp 3–19
Girshick R (2015) Fast r-cnn. In: IEEE International conference on computer vision. pp 1440–1448
Idrees H, Saleemi I, Seibert C, et al. (2013) Multi-source multi-scale counting in extremely dense crowd images. In: IEEE conference on computer vision and pattern recognition. pp 2547–2554
Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: International conference on computer vision and pattern recognition. pp 886–893
Wu B, Nevatia R (2007) Detection and tracking of multiple, partially occluded humans by bayesian combination of edgelet based part detectors. Int J Comput Vis, 75: 247–266
Lempitsky V, Zisserman A (2010) Learning to count objects in images. In: Advances in neural information processing systems. pp 1324–1332
Pham V Q, Kozakaya T, Yamaguchi O, et al. (2015) Count forest: co-voting uncertain number of targets using random forest for crowd density estimation. In: IEEE international conference on computer vision. pp 3253–3261
Wang Y, Zou Y (2016) Fast visual object counting via example-based density estimation. In: IEEE international conference on image processing. pp 3653–3657
Wang C, Zhang H, Yang L, et al. (2015) Deep people counting in extremely dense crowds. In: International conference on multimedia. pp 1299–1302
Fu M, Xu P, Li X, et al. (2015) Fast crowd density estimation with convolutional neural networks. Eng Appl Artif Intell, 43: 81–88
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. Adv Neural Inf Process Syst, 60: 1097–1105
Zhang C, Li H, Wang X, et al. (2015) Cross-scene crowd counting via deep convolutional neural networks. In: IEEE conference on computer vision and pattern recognition. pp 833–841
Walach E, Wolf L (2016) Learning to count with cnn boosting. In: European conference on computer vision. pp 660–676
Boominathan L, Kruthiventi SSS, Babu RV (2016) Crowdnet: A deep convolutional network for dense crowd counting. In: International conference on multimedia. pp 640–644
Onororubio D, Lopezsastre RJ (2016) Towards perspective-free object counting with deep learning. In: European conference on computer vision. pp 615–629
Sam DB, Surya S, Babu RV (2017) Switching convolutional neural network for crowd counting. In: IEEE conference on computer vision and pattern recognition. pp 4031–4039
Sindagi VA, Patel VM (2017) Generating high-quality crowd density maps using contextual pyramid cnns. In: IEEE international conference on computer vision. pp 1861–1870
Liu L, Wang H, Li G, et al. (2018) Crowd counting using deep recurrent spatial-aware network. In: International joint conference on artificial intelligence. pp 849–855
Sam DB, Babu RV (2018) Top-down feedback for crowd counting convolutional neural network. In: National conference on artificial intelligence. pp 7323–7330
Jiang X, Zhang L, Lv P, et al. (2019) Learning multi-level density maps for crowd counting. IEEE Trans Neural Netw, pp 1–11
Sang J, Wu W, Luo H, et al. (2019) Improved crowd counting method based on scale-adaptive convolutional neural network. In: IEEE Access. pp 24411–24419
Yang B, Zhan W, Wang N, et al. (2019) Counting crowds using a scale-distribution-aware network and adaptive human-shaped Kernel. Neurocomputing, 390: 207–216
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. In: arXiv preprint arXiv:1409.1556, pp 31(6):645–654
Szegedy C, Vanhoucke V, Ioffe S, et al. (2016) Rethinking the inception architecture for computer vision. In: IEEE conference on computer vision and pattern recognition. pp 2818–2826
Tai Y, Yang J, Liu X (2017) Image super-resolution via deep recursive residual network. In: IEEE conference on computer vision and pattern recognition. pp 3147–3155
Lim B, Son S, Kim H, et al. (2017) Enhanced deep residual networks for single image super-resolution. In: IEEE conference on computer vision and pattern recognition workshops. pp 136–144
Kingma DP, Ba J (2014) Adam: A method for stochastic optimization. In: arXiv preprint
Liu M, Jiang J, Guo Z, et al. (2018) Crowd counting with fully convolutional neural network. In: IEEE international conference on image processing. pp 953-957
Ranjan V, Le H, Hoai M (2018) Iterative crowd counting. In: European conference on computer vision. pp 270–285
Babu Sam D, Sajjan NN, Venkatesh Babu R, et al. (2018) Divide and grow: capturing huge diversity in crowd images with incrementally growing cnn. In: IEEE conference on computer vision and pattern recognition. pp 3618–3626
Gao J, Wang Q, Li X (2019) PCC net: perspective crowd counting via spatial convolutional network. IEEE Trans Circuits Syst Video Technol, 30: 3486
Jiang H, Jin W (2019) Effective use of convolutional neural networks and diverse deep supervision for better crowd counting. Appl Intell, 49: 2415
Shi Z, Zhang L, Liu Y, et al. (2019) Crowd counting with deep negative correlation learning. In: Computer vision and pattern recognition. pp 5382–5390
Liu X, van de Weijer J, Bagdanov AD (2018) Leveraging unlabeled data for crowd counting by learning to rank. In: IEEE conference on computer vision and pattern recognition. pp 7661–7669
Xiong H, Lu H, Liu C, et al. (2019) From open set to closed set: counting objects by spatial divide-and-conquer. In: International conference on computer vision. pp: 8362–8371
Liu W, Salzmann M, Fua P, et al. (2019) Context-aware crowd counting. In: Computer vision and pattern recognition, pp 5099–5108
Shen Z, Xu Y, Ni B, et al. (2018) Crowd counting via adversarial cross-scale consistency pursuit. In: IEEE conference on computer vision and pattern recognition. pp 5245–5254
Hossain M, Hosseinzadeh M, Chanda O, et al. (2019) Crowd counting using scale-aware attention networks. In: IEEE winter conference on applications of computer vision. pp 1280–1288
Acknowledgements
This work is supported by the Equipment Pre-Research Foundation of China under Grant No.61403120201.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Peng, S., Wang, L., Yin, B. et al. Adaptive weighted crowd receptive field network for crowd counting. Pattern Anal Applic 24, 805–817 (2021). https://doi.org/10.1007/s10044-020-00934-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10044-020-00934-0