Abstract
In this paper, a novel supervised local high-order differential channel feature is proposed for fast pedestrian detection. The method is motivated by the recent success of filtering multiple channel maps, which has been shown to improve detection performance. It first computes multiple channel maps for the input RGB image and applies average pooling to them to reduce the effect of noise and sample misalignment. Each pooled channel map is then convolved with the proposed local high-order filter bank, which enhances the discriminative information in the feature space. Finally, because the higher dimension of the resulting feature increases memory consumption, we propose a local-structure-preserving supervised dimension reduction method that aims to keep the manifold structure of the samples in the feature space. This method is formulated as a classical spectral graph embedding problem, which can be solved by the locality preserving projections (LPP) algorithm. Thorough experiments and comparative studies show that our method achieves very competitive results compared with many state-of-the-art methods on the INRIA and Caltech datasets. In addition, our detector runs at about 20 fps on 480 \(\times \) 640 images.
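The pooling and filtering stages described above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the filter bank, so first- and second-order finite-difference kernels are assumed here as a stand-in for "local high-order differential" filters, and all function names, the pooling size, and the toy input are hypothetical.

```python
# Illustrative sketch only. Kernel choices, names, and sizes are assumptions,
# not details taken from the paper.

def average_pool(channel, size=2):
    """Non-overlapping size x size average pooling of a 2-D list of floats."""
    rows = len(channel) // size
    cols = len(channel[0]) // size
    pooled = []
    for r in range(rows):
        row = []
        for c in range(cols):
            block = [channel[r * size + i][c * size + j]
                     for i in range(size) for j in range(size)]
            row.append(sum(block) / len(block))
        pooled.append(row)
    return pooled


def correlate_valid(channel, kernel):
    """'Valid' 2-D cross-correlation: no padding, kernel is not flipped."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(channel) - kh + 1
    out_w = len(channel[0]) - kw + 1
    return [[sum(kernel[i][j] * channel[r + i][c + j]
                 for i in range(kh) for j in range(kw))
             for c in range(out_w)]
            for r in range(out_h)]


# Assumed filter bank: first- and second-order differences along x and y.
FILTER_BANK = {
    "dx": [[-1, 0, 1]],
    "dy": [[-1], [0], [1]],
    "dxx": [[1, -2, 1]],
    "dyy": [[1], [-2], [1]],
}


def filtered_features(channel, pool_size=2):
    """Pool one channel map, apply every filter, and flatten the responses."""
    pooled = average_pool(channel, pool_size)
    feats = []
    for name in sorted(FILTER_BANK):  # fixed order: dx, dxx, dy, dyy
        response = correlate_valid(pooled, FILTER_BANK[name])
        feats.extend(v for row in response for v in row)
    return feats


# Toy 6 x 6 channel map whose value depends only on the column index x (x * x).
toy = [[float(c * c) for c in range(6)] for _ in range(6)]
pooled = average_pool(toy)
features = filtered_features(toy)
```

On this toy input only the horizontal filters respond, because the map is constant along the vertical axis. The dimension-reduction stage is not sketched here; in standard LPP it amounts to solving the generalized eigenproblem \(X L X^{T} a = \lambda X D X^{T} a\), where \(L = D - W\) is the Laplacian of a neighborhood weight matrix \(W\) and \(D\) is its diagonal degree matrix. How the supervised weights are constructed is not stated in the abstract.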











Acknowledgments
This project is supported by the NSF of China (61305058, 61473086), the Fundamental Research Funds for Jiangsu University (13JDG093), the NSF of the Jiangsu Higher Education Institutes of China (15KJB520008), the NSF of Jiangsu Province (Grant Nos. BK20130471, BK20140566, BK20150470, BK20130501), the China Postdoctoral Science Foundation (2014M561586), and the Fundamental Research Fund for the Central Universities of China (N150403006).
Cite this article
Shen, J., Zuo, X., Liu, H. et al. Supervised Local High-Order Differential Channel Feature Learning for Pedestrian Detection. Neural Process Lett 45, 1025–1037 (2017). https://doi.org/10.1007/s11063-016-9561-7
DOI: https://doi.org/10.1007/s11063-016-9561-7