Skip to main content
Log in

Real-time pedestrian detection via hierarchical convolutional feature

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

With the development of pedestrian detection technologies, existing methods can not simultaneously satisfy high quality detection and fast calculation for practical applications. Therefore, the goal of our research is to balance of pedestrian detection in aspects of the accuracy and efficiency, then get a relatively better method compared with current advanced pedestrian detection algorithms. Inspired from recent outstanding multi-category objects detector SSD (Single Shot MultiBox Detector), we proposed a hierarchical convolution based pedestrians detection algorithm, which can provide competitive accuracy of pedestrian detection at real-time speed. In this work, we proposed a fully convolutional network where the features from lower layers are responsible for small-scale pedestrians and the higher layers are for large-scale, which will further improve the recall rate of pedestrians with different scales, especially for small-scale. Meanwhile, a novel prediction box with a single specific aspect ratio is designed to reduce the miss rate and accelerate the speed of pedestrian detection. Then, the original loss function of SSD is also optimized by eliminating interference of the classifier to more adapt pedestrian detection while also reduce the time complexity. Experimental results on Caltech Benchmark demonstrates that our proposed deep model can reach 11.88% average miss rate with the real-time level speed of 20 fps in pedestrian detection compared with current state-of-the-art methods, which can be the most suitable model for practical pedestrian detection applications.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

References

  1. Benenson R, Omran M, Hosang J, Schiele B (2014) Ten years of pedestrian detection, what have we learned? In: In European conference on computer vision, pp 613–627

  2. Cai Z, Saberian M, Vasconcelos N (2015) Learning complexity-aware cascades for deep pedestrian detection. In: 2015 IEEE International conference on computer vision (ICCV), pp 3361–3369

  3. Cai Z, Fan Q, Feris R S, Vasconcelos N (2016) A unified multi-scale deep convolutional neural network for fast object detection. In: European conference on computer vision, pp 354–370

  4. Chen L C, Papandreou G, Kokkinos I, Murphy K, Yuille A L (2014) Semantic image segmentation with deep convolutional nets and fully connected crfs. Comput Sci, 357–361

  5. Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: 2005 IEEE Computer society conference on computer vision and pattern recognition (CVPR’05), vol 1, pp 886–893

  6. Dollar P, Wojek C, Schiele B, Perona P (2012) Pedestrian detection: an evaluation of the state of the art. IEEE Trans Pattern Anal Mach Intell 34(4):743

    Article  Google Scholar 

  7. Dollar P, Appel R, Belongie S, Perona P (2014) Fast feature pyramids for object detection. IEEE Trans Pattern Anal Mach Intell 36(8):1532–1545

    Article  Google Scholar 

  8. Girshick R (2015) Fast r-cnn. In: IEEE International conference on computer vision, pp 1440–1448

  9. Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: IEEE Conference on computer vision and pattern recognition, pp 580–587

  10. Hosang J, Omran M, Benenson R, Schiele B (2015) Taking a deeper look at pedestrians. In: 2015 IEEE Conference on computer vision and pattern recognition, pp 4073–4082

  11. Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Cheng Yang F, Berg A C (2016) Ssd: single shot multibox detector. In: Computer vision ECCV 2016: 14th European conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part I, pp 21–37

  12. Luo P, Tian Y, Wang X, Tang X (2014) Switchable deep network for pedestrian detection. In: Computer vision and pattern recognition, pp 899–906

  13. Ojala T, Pietikainen M, Maenpaa T (2002) Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans Pattern Anal Mach Intell 24(7):971–987

    Article  MATH  Google Scholar 

  14. Ouyang W, Wang X (2013) Joint deep learning for pedestrian detection. In: IEEE International conference on computer vision, pp 2056–2063

  15. Redmon J, Farhadi A (2017) Yolo9000: better, faster, stronger. In: 2017 IEEE Conference on computer vision and pattern recognition (CVPR), pp 6517–6525

  16. Ren S, Girshick R, Girshick R, Sun J (2015) Faster r-cnn: towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell 39(6):1137

    Article  Google Scholar 

  17. Tian Y, Luo P, Wang X, Tang X (2015) Deep learning strong parts for pedestrian detection. In: IEEE International conference on computer vision, pp 1904–1912

  18. Zeng X, Ouyang W, Wang X (2013) Multi-stage contextual deep learning for pedestrian detection. In: IEEE International conference on computer vision, pp 121–128

  19. Zhang S, Benenson R, Schiele B (2015) Filtered channel features for pedestrian detection. In: 2015 IEEE Conference on computer vision and pattern recognition, pp 1751–1760

  20. Zhang L, Lin L, Liang X, He K (2016) Is faster r-cnn doing well for pedestrian detection. In: Computer Vision ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part II, pp 443–457

Download references

Acknowledgements

This work is supported in part by National Natural Science Foundation of China with Nos. 61620106003, 91646207, 61671451, 61771026, 61502490, and in part by Project 6140001010207.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shibiao Xu.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Yang, D., Zhang, J., Xu, S. et al. Real-time pedestrian detection via hierarchical convolutional feature. Multimed Tools Appl 77, 25841–25860 (2018). https://doi.org/10.1007/s11042-018-5819-6

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-018-5819-6

Keywords

Navigation