Abstract
With the development of pedestrian detection technologies, existing methods can not simultaneously satisfy high quality detection and fast calculation for practical applications. Therefore, the goal of our research is to balance of pedestrian detection in aspects of the accuracy and efficiency, then get a relatively better method compared with current advanced pedestrian detection algorithms. Inspired from recent outstanding multi-category objects detector SSD (Single Shot MultiBox Detector), we proposed a hierarchical convolution based pedestrians detection algorithm, which can provide competitive accuracy of pedestrian detection at real-time speed. In this work, we proposed a fully convolutional network where the features from lower layers are responsible for small-scale pedestrians and the higher layers are for large-scale, which will further improve the recall rate of pedestrians with different scales, especially for small-scale. Meanwhile, a novel prediction box with a single specific aspect ratio is designed to reduce the miss rate and accelerate the speed of pedestrian detection. Then, the original loss function of SSD is also optimized by eliminating interference of the classifier to more adapt pedestrian detection while also reduce the time complexity. Experimental results on Caltech Benchmark demonstrates that our proposed deep model can reach 11.88% average miss rate with the real-time level speed of 20 fps in pedestrian detection compared with current state-of-the-art methods, which can be the most suitable model for practical pedestrian detection applications.
Similar content being viewed by others
References
Benenson R, Omran M, Hosang J, Schiele B (2014) Ten years of pedestrian detection, what have we learned? In: In European conference on computer vision, pp 613–627
Cai Z, Saberian M, Vasconcelos N (2015) Learning complexity-aware cascades for deep pedestrian detection. In: 2015 IEEE International conference on computer vision (ICCV), pp 3361–3369
Cai Z, Fan Q, Feris R S, Vasconcelos N (2016) A unified multi-scale deep convolutional neural network for fast object detection. In: European conference on computer vision, pp 354–370
Chen L C, Papandreou G, Kokkinos I, Murphy K, Yuille A L (2014) Semantic image segmentation with deep convolutional nets and fully connected crfs. Comput Sci, 357–361
Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: 2005 IEEE Computer society conference on computer vision and pattern recognition (CVPR’05), vol 1, pp 886–893
Dollar P, Wojek C, Schiele B, Perona P (2012) Pedestrian detection: an evaluation of the state of the art. IEEE Trans Pattern Anal Mach Intell 34(4):743
Dollar P, Appel R, Belongie S, Perona P (2014) Fast feature pyramids for object detection. IEEE Trans Pattern Anal Mach Intell 36(8):1532–1545
Girshick R (2015) Fast r-cnn. In: IEEE International conference on computer vision, pp 1440–1448
Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: IEEE Conference on computer vision and pattern recognition, pp 580–587
Hosang J, Omran M, Benenson R, Schiele B (2015) Taking a deeper look at pedestrians. In: 2015 IEEE Conference on computer vision and pattern recognition, pp 4073–4082
Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Cheng Yang F, Berg A C (2016) Ssd: single shot multibox detector. In: Computer vision ECCV 2016: 14th European conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part I, pp 21–37
Luo P, Tian Y, Wang X, Tang X (2014) Switchable deep network for pedestrian detection. In: Computer vision and pattern recognition, pp 899–906
Ojala T, Pietikainen M, Maenpaa T (2002) Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans Pattern Anal Mach Intell 24(7):971–987
Ouyang W, Wang X (2013) Joint deep learning for pedestrian detection. In: IEEE International conference on computer vision, pp 2056–2063
Redmon J, Farhadi A (2017) Yolo9000: better, faster, stronger. In: 2017 IEEE Conference on computer vision and pattern recognition (CVPR), pp 6517–6525
Ren S, Girshick R, Girshick R, Sun J (2015) Faster r-cnn: towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell 39(6):1137
Tian Y, Luo P, Wang X, Tang X (2015) Deep learning strong parts for pedestrian detection. In: IEEE International conference on computer vision, pp 1904–1912
Zeng X, Ouyang W, Wang X (2013) Multi-stage contextual deep learning for pedestrian detection. In: IEEE International conference on computer vision, pp 121–128
Zhang S, Benenson R, Schiele B (2015) Filtered channel features for pedestrian detection. In: 2015 IEEE Conference on computer vision and pattern recognition, pp 1751–1760
Zhang L, Lin L, Liang X, He K (2016) Is faster r-cnn doing well for pedestrian detection. In: Computer Vision ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part II, pp 443–457
Acknowledgements
This work is supported in part by National Natural Science Foundation of China with Nos. 61620106003, 91646207, 61671451, 61771026, 61502490, and in part by Project 6140001010207.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Yang, D., Zhang, J., Xu, S. et al. Real-time pedestrian detection via hierarchical convolutional feature. Multimed Tools Appl 77, 25841–25860 (2018). https://doi.org/10.1007/s11042-018-5819-6
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-018-5819-6