Abstract
In this paper, we propose the Deep Channel Features as an extension to Channel Features for pedestrian detection. Instead of using hand-crafted features, our method automatically learns deep channel features as a mid-level feature by using a convolutional neural network. The network is pretrained by the unsupervised sparse filtering and a group of filters is learned for each channel. Combining the learned deep channel features with other low-level channel features (i.e. LUV channels, gradient magnitude channel and histogram of gradient channels) as the final feature, a boosting classifier with depth-2 decision tree as the weak classifier is learned. Our method achieves a significant detection performance on public datasets (i.e. INRIA, ETH, TUD, and CalTech).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Felzenszwalb, P.F., Girshick, R.B., McAllester, D., Ramanan, D.: Object detection with discriminative trained part based models. IEEE Trans. Pattern Anal. Mach. Intell. 32(9), 1627–1645 (2010)
Wang, X., Han, T.X., Yan, S.: An HOG-LBP human detector with partial occlusion handling. In: 2009 IEEE 12th International Conference on Computer Visionn, ICCV, pp. 32–39 (2009)
Zhang, S., Benenson, R., Schiele, B.: Filtered channel features for pedestrian detection. arXiv preprint arXiv:1501.05759 (2015)
Liu, W., Yu, B., Duan, C., et al.: A pedestrian-detection method based on heterogeneous features and ensemble of multi-view? Pose Parts. IEEE Trans. Intell. Transp. Syst. 16(2), 813–824 (2015)
Luo, P., Tian, Y., Wang, X., Tang, X.: Switchable deep network for pedestrian detection. In: Conference on Computer Vision and Pattern Recognition, pp. 899–906 (2014)
Angelova, A., Krizhevsky, A., Vanhoucke, V.: Pedestrian detection with a large-field-of-view deep network. In: 2015 IEEE International Conference on Robotics and Automation (ICRA), pp. 704–711. IEEE (2015)
Cai, Z., Saberian, M., Vasconcelos, N.: Learning complexity-aware cascades for deep pedestrian detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 3361–3369 (2015)
Dollár, P., Tu, Z., Perona, P., Belongie, S.: Integral channel features. In: BMVC 2009, London, England, pp. 1–11 (2009)
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradientbased learning applied to document recognition. Proc. IEEE 86(11), 2278–2323 (1998)
Schmidt, M.: minFunc (2005). http://www.cs.ubc.ca//schmidtm/Software/minFunc.html
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: CVPR Proceedings of 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 886–893 (2005)
Ess, A., Leibe, B., Schindler, K., Van Gool, L.: A mobile vision system for robust multi-person tracking. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2008, pp. 1–8 (2008)
Wojek, C., Walk, S., Schiele, B.: Multi-cue onboard pedestrian detection. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, pp. 794–801 (2009)
Dollár, P., Wojek, C., Schiele, B., Perona, P.: Pedestrian detection: an evaluation of the state of the art. IEEE Trans. Pattern Anal. Mach. Intell. 34(4), 743–761 (2012)
Viola, P., Jones, M.J., Snow, D.: Detecting pedestrians using patterns of motion and appearance. Int. J. Comput. Vis. 63(2), 153–161 (2005)
Dollar, P., Appel, R., Belongie, S., Perona, P.: Fast feature pyramids for object detection. IEEE Trans. Pattern Anal. Mach. Intell. 36(8), 1532–1545 (2014)
Sermanet, P., Kavukcuoglu, K., Chintala, S., Lecun, Y.: Pedestrian detection with unsupervised multi-stage feature learning. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition, pp. 3626–3633 (2013)
Costea, A.D.: Word channel based multiscale pedestrian detection without image resizing and using only one classifier. In: CVPR, pp. 4321–4328 (2014)
Zhang, S., Bauckhage, C., Cremers, A.B.: Informed haar-like features improve pedestrian detection. In: CVPR 2014, pp. 947–954 (2014)
Walk, S., Majer, N., Schindler, K., Schiele, B.: New features and insights for pedestrian detection. In: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 1030–1037 (2010)
Lim, J.J., Zitnick, C.L., Dollar, P.: Sketch tokens: a learned mid-level representation for contour and object detection. In: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 3158–3165 (2013)
Benenson, R., Mathias, M., Tuytelaars, T., Van Gool, L.: Seeking the strongest rigid detector. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 3666–3673 (2013)
Nam, W., Dollár, P., Han, J.H.: Local decorrelation for improved detection. In: NIPS, pp. 1–9 (2014)
Ouyang, W., Zeng, X., Wang, X.: Modeling mutual visibility relationship in pedestrian detection. In: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 3222–3229 (2013)
Dollar, P., Belongie, S., Perona, P.: The fastest pedestrian detector in the west. In: Proceedings of British Machine Vision Conference 2010, pp. 1–68 (2010)
Dollár, P., Appel, R., Kienzle, W.: Crosstalk cascades for frame-rate pedestrian detection. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part II. LNCS, vol. 7573, pp. 645–659. Springer, Heidelberg (2012)
Benenson, R., Mathias, M., Timofte, R., Van Gool, L.: Pedestrian detection at 100 frames per second. In: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 2903–2910 (2012)
Acknowledgments
This work was supported in part by the Natural Science Foundation of China (NSFC) under Grant No. 61472038 and No. 61375044.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Liu, Z., He, Y., Xie, Y., Gu, H., Liu, C., Pei, M. (2016). Pedestrian Detection Using Deep Channel Features in Monocular Image Sequences. In: Hirose, A., Ozawa, S., Doya, K., Ikeda, K., Lee, M., Liu, D. (eds) Neural Information Processing. ICONIP 2016. Lecture Notes in Computer Science(), vol 9949. Springer, Cham. https://doi.org/10.1007/978-3-319-46675-0_67
Download citation
DOI: https://doi.org/10.1007/978-3-319-46675-0_67
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-46674-3
Online ISBN: 978-3-319-46675-0
eBook Packages: Computer ScienceComputer Science (R0)