Skip to main content

Learning to Integrate Occlusion-Specific Detectors for Heavily Occluded Pedestrian Detection

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 10112))

Abstract

It is a challenging problem to detect partially occluded pedestrians due to the diversity of occlusion patterns. Although training occlusion-specific detectors can help handle various partial occlusions, it is a non-trivial problem to integrate these detectors properly. A direct combination of all occlusion-specific detectors can be affected by unreliable detectors and usually does not favor heavily occluded pedestrian examples, which can only be recognized by few detectors. Instead of combining all occlusion-specific detectors into a generic detector for all occlusions, we categorize occlusions based on how pedestrian examples are occluded into K groups. Each occlusion group selects its own occlusion-specific detectors and fuses them linearly to obtain a classifer. An L1-norm linear support vector machine (SVM) is adopted to select and fuse occlusion-specific detectors for the K classifiers simultaneously. Thanks to the L1-norm linear SVM, unreliable and irrelevant detectors are removed for each group. Experiments on the Caltech dataset show promising performance of our approach for detecting heavily occluded pedestrians.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2005)

    Google Scholar 

  2. Dollar, P., Tu, Z., Perona, P., Belongie, S.: Integral channel features. In: British Machine Vision Conference (BMVC) (2009)

    Google Scholar 

  3. Dollar, P., Appel, R., Belongie, S., Perona, P.: Fast feature pyramids for object detection. IEEE Trans. Pattern Anal. Mach. Intell. (PAMI) 36, 1532–1545 (2014)

    Article  Google Scholar 

  4. Zhang, S., Benenson, R., Schiele, B.: Filtered channel features for pedestrian detection. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2015)

    Google Scholar 

  5. Tian, Y., Luo, P., Wang, X., Tang, X.: Deep learning strong parts for pedestrian detection. In: International Conference on Computer Vision (ICCV) (2015)

    Google Scholar 

  6. Cai, Z., Saberian, M., Vasconcelos, N.: Learning complexity-aware cascades for deep pedestrian detection. In: International Conference on Computer Vision (ICCV) (2015)

    Google Scholar 

  7. Dollar, P., Wojek, C., Schiele, B., Perona, P.: Pedestrian detection: an evaluation of the state of the art. IEEE Trans. Pattern Anal. Mach. Intell. (PAMI) 34, 743–761 (2012)

    Article  Google Scholar 

  8. Mathias, M., Benenson, R., Timofte, R., Van Gool, L.: Handling occlusions with franken-classifiers. In: International Conference on Computer Vision (ICCV) (2013)

    Google Scholar 

  9. Zhu, J., Rosset, S., Hastie, T., Tibshirani, R.: 1-norm support vector machines. In: Advances in Neural Information Processing Systems (NIPS) (2004)

    Google Scholar 

  10. Wang, X., Han, T., Yan, S.: An HOG-LBP human detector with partial occlusion handling. In: International Conference on Computer Vision (ICCV) (2009)

    Google Scholar 

  11. Leibe, B., Leonardis, A., Schiele, B.: Combined object categorization and segmentation with an implicit shape model. In: ECCV Workshop on Statistical Learning in Computer Vision (2004)

    Google Scholar 

  12. Leibe, B., Seemann, E., Schiele, B.: Pedestrian detection in crowded scenes. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2005)

    Google Scholar 

  13. Tang, S., Andriluka, M., Schiele, B.: Detection and tracking of occluded people. In: British Machine Vision Conference (BMVC) (2012)

    Google Scholar 

  14. Pepik, B., Stark, M., Gehler, P., Schiele, B.: Occlusion patterns for object class detection. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2013)

    Google Scholar 

  15. Felzenszwalb, P., Girshick, R., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part-based models. IEEE Trans. Pattern Anal. Mach. Intell. (PAMI) 32, 1627–1645 (2010)

    Article  Google Scholar 

  16. Chen, D., Batra, D., Freeman, W.: Group norm for learning structured SVMs with unstructured latent variables. In: International Conference on Computer Vision (ICCV) (2013)

    Google Scholar 

  17. Ouyang, W., Wang, X.: Single-pedestrian detection aided by multi-pedestrian detection. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2013)

    Google Scholar 

  18. Shet, V., Neumann, J., Ramesh, V., Davis, L.: Bilattice-based logical reasoning for human detection. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2007)

    Google Scholar 

  19. Enzweiler, M., Eigenstetter, A., Schiele, B., Gavrila, D.: Multi-cue pedestrian classification with partial occlusion handling. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2010)

    Google Scholar 

  20. Wu, B., Nevatia, R.: Detection of multiple, partially occluded humans in a single image by Bayesian combination of edgelet part detectors. In: International Conference on Computer Vision (ICCV) (2005)

    Google Scholar 

  21. Ouyang, W., Wang, X.: A discriminative deep model for pedestrian detection with occlusion handling. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2012)

    Google Scholar 

  22. Ouyang, W., Wang, X.: Joint deep learning for pedestrian detection. In: International Conference on Computer Vision (ICCV) (2013)

    Google Scholar 

  23. Ouyang, W., Zeng, X., Wang, X.: Modeling mutual visibility relationship in pedestrian detection. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2013)

    Google Scholar 

  24. Duan, G., Ai, H., Lao, S.: A structural filter approach to human detection. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010. LNCS, vol. 6316, pp. 238–251. Springer, Heidelberg (2010). doi:10.1007/978-3-642-15567-3_18

    Chapter  Google Scholar 

  25. Nam, W., Dollar, P., Han, J.: Local decorrelation for improved pedestrian detection. In: Advances in Neural Information Processing Systems (NIPS) (2014)

    Google Scholar 

  26. Friedman, J., Hastie, T., Tibshirani, R.: Additive logistic regression: a statistical view of boosting. Ann. Stat. 28, 337–407 (2000)

    Article  MathSciNet  MATH  Google Scholar 

  27. Benenson, R., Mathias, M., Tuytelaars, T., Van Gool, L.: Seeking the strongest rigid detector. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2013)

    Google Scholar 

  28. Tibshirani, R.: Regression shrinkage and selection via the lasso. J. Roy. Stat. Soc. Ser. B 58, 267–288 (1994)

    MathSciNet  MATH  Google Scholar 

  29. Fan, R., Chang, K., Hsieh, C., Wang, X., Lin, C.: LIBLINEAR: a library for large linear classification. J. Mach. Learn. Res. 9, 1871–1874 (2008)

    MATH  Google Scholar 

  30. Flamary, R., Jrad, N., Phlypo, R., Congedo, M., Rakotomamonjy, A.: Mixed-norm regularization for brain decoding. Comput. Math. Methods Med. (2014)

    Google Scholar 

  31. Paisitkriangkrai, S., Shen, C., Hengel, A.: Strengthening the effectiveness of pedestrian detection with spatially pooled features. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8692, pp. 546–561. Springer, Heidelberg (2014). doi:10.1007/978-3-319-10593-2_36

    Google Scholar 

  32. Tian, Y., Luo, P., Wang, X., Tang, X.: Pedestrian detection aided by deep learning semantic tasks. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2015)

    Google Scholar 

  33. Yang, B., Yan, J., Lei, Z., Li, S.: Convolutional channel features. In: International Conference on Computer Vision (ICCV) (2015)

    Google Scholar 

Download references

Acknowledgement

This work is supported in part by Singapore Ministry of Education Academic Research Fund Tier 2 MOE2015-T2-2-114 and Tier 1 RG27/14.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Chunluan Zhou .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Zhou, C., Yuan, J. (2017). Learning to Integrate Occlusion-Specific Detectors for Heavily Occluded Pedestrian Detection. In: Lai, SH., Lepetit, V., Nishino, K., Sato, Y. (eds) Computer Vision – ACCV 2016. ACCV 2016. Lecture Notes in Computer Science(), vol 10112. Springer, Cham. https://doi.org/10.1007/978-3-319-54184-6_19

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-54184-6_19

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-54183-9

  • Online ISBN: 978-3-319-54184-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics