Pedestrian Detection Using Deep Channel Features in Monocular Image Sequences

Liu, Zhao; He, Yang; Xie, Yi; Gu, Hongyan; Liu, Chao; Pei, Mingtao

doi:10.1007/978-3-319-46675-0_67

Zhao Liu^19,20,
Yang He¹⁹,
Yi Xie^19,20,
Hongyan Gu¹⁹,
Chao Liu¹⁹ &
…
Mingtao Pei¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 9949))

Included in the following conference series:

International Conference on Neural Information Processing

3045 Accesses

Abstract

In this paper, we propose the Deep Channel Features as an extension to Channel Features for pedestrian detection. Instead of using hand-crafted features, our method automatically learns deep channel features as a mid-level feature by using a convolutional neural network. The network is pretrained by the unsupervised sparse filtering and a group of filters is learned for each channel. Combining the learned deep channel features with other low-level channel features (i.e. LUV channels, gradient magnitude channel and histogram of gradient channels) as the final feature, a boosting classifier with depth-2 decision tree as the weak classifier is learned. Our method achieves a significant detection performance on public datasets (i.e. INRIA, ETH, TUD, and CalTech).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Felzenszwalb, P.F., Girshick, R.B., McAllester, D., Ramanan, D.: Object detection with discriminative trained part based models. IEEE Trans. Pattern Anal. Mach. Intell. 32(9), 1627–1645 (2010)
Article Google Scholar
Wang, X., Han, T.X., Yan, S.: An HOG-LBP human detector with partial occlusion handling. In: 2009 IEEE 12th International Conference on Computer Visionn, ICCV, pp. 32–39 (2009)
Google Scholar
Zhang, S., Benenson, R., Schiele, B.: Filtered channel features for pedestrian detection. arXiv preprint arXiv:1501.05759 (2015)
Liu, W., Yu, B., Duan, C., et al.: A pedestrian-detection method based on heterogeneous features and ensemble of multi-view? Pose Parts. IEEE Trans. Intell. Transp. Syst. 16(2), 813–824 (2015)
Google Scholar
Luo, P., Tian, Y., Wang, X., Tang, X.: Switchable deep network for pedestrian detection. In: Conference on Computer Vision and Pattern Recognition, pp. 899–906 (2014)
Google Scholar
Angelova, A., Krizhevsky, A., Vanhoucke, V.: Pedestrian detection with a large-field-of-view deep network. In: 2015 IEEE International Conference on Robotics and Automation (ICRA), pp. 704–711. IEEE (2015)
Google Scholar
Cai, Z., Saberian, M., Vasconcelos, N.: Learning complexity-aware cascades for deep pedestrian detection. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 3361–3369 (2015)
Google Scholar
Dollár, P., Tu, Z., Perona, P., Belongie, S.: Integral channel features. In: BMVC 2009, London, England, pp. 1–11 (2009)
Google Scholar
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradientbased learning applied to document recognition. Proc. IEEE 86(11), 2278–2323 (1998)
Article Google Scholar
Schmidt, M.: minFunc (2005). http://www.cs.ubc.ca//schmidtm/Software/minFunc.html
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: CVPR Proceedings of 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 886–893 (2005)
Google Scholar
Ess, A., Leibe, B., Schindler, K., Van Gool, L.: A mobile vision system for robust multi-person tracking. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2008, pp. 1–8 (2008)
Google Scholar
Wojek, C., Walk, S., Schiele, B.: Multi-cue onboard pedestrian detection. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, pp. 794–801 (2009)
Google Scholar
Dollár, P., Wojek, C., Schiele, B., Perona, P.: Pedestrian detection: an evaluation of the state of the art. IEEE Trans. Pattern Anal. Mach. Intell. 34(4), 743–761 (2012)
Article Google Scholar
Viola, P., Jones, M.J., Snow, D.: Detecting pedestrians using patterns of motion and appearance. Int. J. Comput. Vis. 63(2), 153–161 (2005)
Article Google Scholar
Dollar, P., Appel, R., Belongie, S., Perona, P.: Fast feature pyramids for object detection. IEEE Trans. Pattern Anal. Mach. Intell. 36(8), 1532–1545 (2014)
Article Google Scholar
Sermanet, P., Kavukcuoglu, K., Chintala, S., Lecun, Y.: Pedestrian detection with unsupervised multi-stage feature learning. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition, pp. 3626–3633 (2013)
Google Scholar
Costea, A.D.: Word channel based multiscale pedestrian detection without image resizing and using only one classifier. In: CVPR, pp. 4321–4328 (2014)
Google Scholar
Zhang, S., Bauckhage, C., Cremers, A.B.: Informed haar-like features improve pedestrian detection. In: CVPR 2014, pp. 947–954 (2014)
Google Scholar
Walk, S., Majer, N., Schindler, K., Schiele, B.: New features and insights for pedestrian detection. In: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 1030–1037 (2010)
Google Scholar
Lim, J.J., Zitnick, C.L., Dollar, P.: Sketch tokens: a learned mid-level representation for contour and object detection. In: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 3158–3165 (2013)
Google Scholar
Benenson, R., Mathias, M., Tuytelaars, T., Van Gool, L.: Seeking the strongest rigid detector. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 3666–3673 (2013)
Google Scholar
Nam, W., Dollár, P., Han, J.H.: Local decorrelation for improved detection. In: NIPS, pp. 1–9 (2014)
Google Scholar
Ouyang, W., Zeng, X., Wang, X.: Modeling mutual visibility relationship in pedestrian detection. In: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 1, pp. 3222–3229 (2013)
Google Scholar
Dollar, P., Belongie, S., Perona, P.: The fastest pedestrian detector in the west. In: Proceedings of British Machine Vision Conference 2010, pp. 1–68 (2010)
Google Scholar
Dollár, P., Appel, R., Kienzle, W.: Crosstalk cascades for frame-rate pedestrian detection. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part II. LNCS, vol. 7573, pp. 645–659. Springer, Heidelberg (2012)
Chapter Google Scholar
Benenson, R., Mathias, M., Timofte, R., Van Gool, L.: Pedestrian detection at 100 frames per second. In: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 2903–2910 (2012)
Google Scholar

Download references

Acknowledgments

This work was supported in part by the Natural Science Foundation of China (NSFC) under Grant No. 61472038 and No. 61375044.

Author information

Authors and Affiliations

Beijing Lab of Intelligent Information, School of Computer Science, Beijing Institute of Technology, Beijing, 100081, People’s Republic of China
Zhao Liu, Yang He, Yi Xie, Hongyan Gu, Chao Liu & Mingtao Pei
People’s Public Security University of China, Beijing, 100038, People’s Republic of China
Zhao Liu & Yi Xie

Authors

Zhao Liu
View author publications
You can also search for this author in PubMed Google Scholar
Yang He
View author publications
You can also search for this author in PubMed Google Scholar
Yi Xie
View author publications
You can also search for this author in PubMed Google Scholar
Hongyan Gu
View author publications
You can also search for this author in PubMed Google Scholar
Chao Liu
View author publications
You can also search for this author in PubMed Google Scholar
Mingtao Pei
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mingtao Pei .

Editor information

Editors and Affiliations

The University of Tokyo , Tokyo, Japan
Akira Hirose
Kobe University , Kobe, Japan
Seiichi Ozawa
Okinawa Institute of Science and Technology Graduate University, Onna, Japan
Kenji Doya
Nara Institute of Science and Technology , Ikoma, Japan
Kazushi Ikeda
Kyungpook National University , Daegu, Korea (Republic of)
Minho Lee
Chinese Academy of Sciences , Beijing, China
Derong Liu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Liu, Z., He, Y., Xie, Y., Gu, H., Liu, C., Pei, M. (2016). Pedestrian Detection Using Deep Channel Features in Monocular Image Sequences. In: Hirose, A., Ozawa, S., Doya, K., Ikeda, K., Lee, M., Liu, D. (eds) Neural Information Processing. ICONIP 2016. Lecture Notes in Computer Science(), vol 9949. Springer, Cham. https://doi.org/10.1007/978-3-319-46675-0_67

Download citation

DOI: https://doi.org/10.1007/978-3-319-46675-0_67
Published: 29 September 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-46674-3
Online ISBN: 978-3-319-46675-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics