ABSTRACT
In order to solve the problem of crowd counting and crowd density statistics in the security intelligent video surveillance system, this paper adopts the method based on deep learning to optimize the algorithm. This method mainly uses the VGG16 backbone network with 1*1 and 3*3 small convolution Classification and feature extraction of people information in a crowd. In order to reduce the sharp reduction in the number of positive samples after increasing the threshold, and to avoid the situation where using different thresholds during training and testing will cause the performance of the detector to degrade, this paper draws on the cascade structure of the Cascade R-CNN network for input video. The frame images are analyzed and processed, and different IoU thresholds are set at different stages to obtain enough positive samples to reduce over-fitting, and use the multi-task loss function and the Hadamard product to obtain the pedestrian detection network, and output the final number of people. The improved pedestrian counting algorithm in this paper is tested in the public dataset WorldExpo'10 Crowd Counting, and compared with other algorithms to verify the feasibility and effectiveness of this method.
- DiCarlo, James J, Davide Zoccolan, and Nicole C. Rust. How Does the Brain Solve Visual Object Recognition? [J] Neuron 73, no. 3 (February 2012): 415–434.Google ScholarCross Ref
- Fruin J.J. Crowd Dynamics and the Design and Management of Public Places(C].International Conference on Building Use and Safety Technology, National Institute of Building Sciences, Washington DC,1985,110-113.Google Scholar
- Lempitsky V, Zisserman A. Learning to count objects in images[C]//Advances in Neural Information Processing Systems. 2010: 1324-1332.Google Scholar
- Ren S, He K, Girshick R, Faster r-cnn: Towards real-time object detection with region proposal networks[C]//Advances in neural information processing systems. 2015: 91-99.Google Scholar
- Girshick R, Donahue J, Darrell T, Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2014: 580-587.Google Scholar
- Simonyan K, Zisserman A. Very Deep Convolutional Networks for Large-Scale Image Recognition[J]. Computer Science, 2014. pp. 66-79.Google Scholar
Index Terms
- A pedestrian detection algorithm based on deep learning
Recommendations
Pedestrian Detection Algorithm Based on ViBe and YOLO
ICVIP '21: Proceedings of the 2021 5th International Conference on Video and Image ProcessingAs more and more monitoring devices are deployed in various cities around the world, the technology of intelligent analysis and processing of video image data based on the computer is becoming more and more mature. This paper adopts an algorithm based ...
Pushing the Limits of Deep CNNs for Pedestrian Detection
Compared with other applications in computer vision, convolutional neural networks (CNNs) have underperformed on pedestrian detection. A breakthrough was made very recently using sophisticated deep CNN (DCNN) models, with a number of handcrafted ...
Deep Convolutional Neural Networks for pedestrian detection
Pedestrian detection is a popular research topic due to its paramount importance for a number of applications, especially in the fields of automotive, surveillance and robotics. Despite the significant improvements, pedestrian detection is still an open ...
Comments