Skip to main content
Log in

Supervised Local High-Order Differential Channel Feature Learning for Pedestrian Detection

  • Published:
Neural Processing Letters Aims and scope Submit manuscript

Abstract

In this paper, a novel supervised local high-order differential channel feature is proposed for fast pedestrian detection. This method is motivated by the recent successful use of filtering on the multiple channel maps, which can improve the performance. This method firstly compute the multiple channel maps for the input RGB image, and average pooling is acted on the channel maps in order to reduce the effect of noise and sample misalignment. Then, each of the pooled channel maps is convolved with our proposed local high-order filter bank, which can enhance the discriminative information in the feature space. Finally, due to the increasing memory consumption incurred by the higher dimension of resulting feature, we have proposed a local structure preserved supervised dimension reduction method which aims to keep the manifold structure of samples in the feature space. This method is formulated as a classical spectral graph embedding problem which can be solved by the LPP algorithms. Thorough experiments and comparative studies show that our method can achieve very competitive result compared with many state-of-art methods on the INRIA and Caltech datasets. Besides, our detector can run about 20 fps in 480 \(\times \) 640 resolution images.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  1. Htike KK, Hogg D (2016) Adapting pedestrian detectors to new domains: a comprehensive review. Eng Appl Artif Intell 50:142–158

    Article  Google Scholar 

  2. Girshick R, Donahue J, Darrell T, Malik J (2016) Region-based convolutional networks for accurate object detection and segmentation. IEEE Trans Pattern Anal Mach Intell 38(1):142–158

    Article  Google Scholar 

  3. Hosang J, Omran M, Benenson R, Schiele B (2015) Taking a deeper look at pedestrians. In: IEEE conference on computer vision and pattern recognition, pp 4073–4082

  4. Wang H, Yuan C, Hu W et al (2014) Action recognition using nonnegative action component representation and sparse basis selection. IEEE Trans Image Process 23(2):570–581

    Article  MathSciNet  Google Scholar 

  5. Geronimo D, Lopez AM, Sappa AD (2010) Survey of pedestrian detection for advanced driver assistance systems. IEEE Trans Pattern Anal Mach Intell 32(7):1239–1258

    Article  Google Scholar 

  6. Dollár P, Wojek C, Schiele B, Perona P (2012) Pedestrian detection: an evaluation of the state of the art. IEEE Trans Pattern Anal Mach Intell 34(4):743–761

    Article  Google Scholar 

  7. Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: Proceeding of IEEE conference on computer vision and pattern recognition, vol 1, pp 886–893

  8. Zhe L, Davis LS, Doermann D, DeMenthon D (2007) Hierarchical part-template matching for human detection and segmentation. In: IEEE 11th international conference on computer vision, pp 1–8

  9. Dollar P, Tu Z, Perona P, Belongie S (2009) Integral channel features. In: British machine vision conference

  10. Shen J, Yang W, Sun C (2013) Real-time human detection based on gentle MILBoost with variable granularity HOG-CSLBP. Neural Comput Appl 23(7):1937–1948

  11. ouyang W, Wang X (2015) A discriminative deep model for pedestrian detection with occlusion handling. In: IEEE conference on computer vision and pattern recognition, pp 3258–3265

  12. Yang B, Yan J, Lei Z, Li SZ (2015) Convolutional channel features. In: Proceedings of the IEEE international conference on computer vision, pp 82–90

  13. Wang X, Han T, Yan S (2009) An HOG-LBP human detector with partial occlusion handling. In: IEEE international conference on computer vision

  14. Zhang S, Bauckhage C, Cremers AB (2014) Informed haar-like features improve pedestrian detection. In: IEEE conference on computer vision and pattern recognition

  15. Girshick R (2015) Fast R-CNN. In: IEEE international conference on computer vision (ICCV), pp 1440–1448

  16. Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: unified, real-time object detection. In: IEEE conference on computer vision and pattern recognition

  17. Angelova A, Krizhevsky A, Vanhoucke V (2015) Pedestrian detection with a large-field-of-view deep network. In: IEEE international conference on robotics and automation (ICRA), pp 704–711

  18. Lecun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324

    Article  Google Scholar 

  19. Hinton GE, Salakhutdinov RR (2006) Reducing the dimensionality of data with neural networks. Science 313(5786):504–507

    Article  MathSciNet  MATH  Google Scholar 

  20. Krizhevsky A, Ilya S, Hinton GE (2012) ImageNet classification with deep convolutional neural networks. Adv Neural Inf Process Syst 25:1097–1105

    Google Scholar 

  21. Sermanet P, Eigen D, Zhang X, Mathieu M, Fergus R, LeCun Y (2014) OverFeat: integrated recognition. In: Localization and detection using convolutional networks, proceedings of international conference on learning representations

  22. Mohamed AR, Dahl GE, Hinton G (2012) Acoustic modeling using deep belief networks. IEEE Trans Audio Speech Lang Process 20(1):14–22

    Article  Google Scholar 

  23. Zhang S, Benenson R, Schiele B (2015) Filtered channel features for pedestrian detection. In: IEEE conference on computer vision and pattern recognition

  24. Zuo X, Shen J et al (2015) Haarlike Feature revisited: fast human detection based on multiple channel maps. In: 12th International symposium on neural networks, pp 240–247

  25. Benenson R, Mathias M, Tuytelaars T, Van Gool L (2013) Seeking the strongest rigid detector. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 3666–3673

  26. Shen J, Zuo X, Yang W, Yu H, Liu G (2016) Learning discriminative shape statistics distribution features for pedestrian detection. Neurocomputing 184:66–77

    Article  Google Scholar 

  27. He X, Niyogi P (2004) Locality preserving projections. Adv Neural Inf Process Syst 16:153–160

    Google Scholar 

  28. Dollar P, Appel R, Belongie S, Perona P (2014) Fast feature pyramids for object detection. IEEE Trans Pattern Anal Mach Intell 36(8):1532–1545

    Article  Google Scholar 

  29. Felzenszwalb P, McAllester D, Ramanan D (2008) A discriminatively trained, multiscale, deformable part model. In: IEEE conference on computer vision and pattern recognition

  30. Nam W, Dollar P, Han JH (2014) Local decorrelation for improved pedestrian detection. Adv Neural Inf Process Syst 27:424–432

    Google Scholar 

Download references

Acknowledgments

This project is supported by the NSF of China (61305058, 61473086), the Fundamental Research Funds for the Jiangsu University (13JDG093), the NSF of the Jiangsu Higher Education Institutes of China (15KJB520008), the NSF of Jiangsu Province (Grants Nos. BK20130471, BK20140566, BK20150470, BK20130501), China Postdoctoral science Foundation (2014M561586) and the Fundamental Research Fund for the Central Universities of China (N150403006).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jifeng Shen.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Shen, J., Zuo, X., Liu, H. et al. Supervised Local High-Order Differential Channel Feature Learning for Pedestrian Detection. Neural Process Lett 45, 1025–1037 (2017). https://doi.org/10.1007/s11063-016-9561-7

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11063-016-9561-7

Keywords

Navigation