Skip to main content
Log in

Human detection using orientation shape histogram and coocurrence textures

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

In this article, we present a framework to detect pedestrians in presence of various real world challenges. The depth-level occlusion is addressed by a stereo-aided triangulation mechanism, where the ORB (Oriented FAST and Rotated BRIEF) descriptor is used to speed up the disparity estimation. An empirical formulation has been made to compute the maximum feasible window size during region proposals generation. The variation of unusual articulated postures is tackled with a shape-histogram representation that uses a set of oriented, high-frequency kernels to compute the gradient details; a set of co-occurrence texture cues is further taken into consideration to strengthen the resulting descriptor. We validate the efficacy of our method on three benchmark pedestrian datasets, where the obtained results are expressed in terms of five performance metric.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  1. Arthur D, Vassilvitskii S (2007) K-means++: The advantages of careful seeding. In: Proceedings of the 18th annual ACM-SIAM symposium on discrete algorithms. Society for Industrial and Applied Mathematics, pp 1027–1035

  2. Benenson R, Omran M, Hosang J, Schiele B (2014) Ten years of pedestrian detection, what have we learned?. In: European conference on computer vision. Springer, pp 613–627. https://doi.org/10.1007/978-3-319-16181-5_47

  3. Brown LM, Feris R, Pankanti S (2014) Temporal non-maximum suppression for pedestrian detection using self-calibration. In: International conference on pattern recognition. IEEE, pp 2239–2244. https://doi.org/10.1109/ICPR.2014.389

  4. Brunetti A, Buongiorno D, Trotta GF, Bevilacqua V (2018) Computer vision and deep learning techniques for pedestrian detection and tracking: a survey. Neurocomputing 300:17–33. https://doi.org/10.1016/j.neucom.2018.01.092

    Article  Google Scholar 

  5. Comaniciu D, Ramesh V, Meer P (2003) Kernel-based object tracking. IEEE Trans Pattern Anal Mach Intell 25(5):564–577. https://doi.org/10.1109/TPAMI.2003.1195991

    Article  Google Scholar 

  6. Cutler R, Davis LS (2000) Robust real-time periodic motion detection, analysis, and applications. IEEE Trans Pattern Anal Mach Intell 22(8):781–796. https://doi.org/10.1109/34.868681

    Article  Google Scholar 

  7. Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: IEEE conference on computer vision and pattern recognition, vol 1. IEEE, pp 886–893. https://doi.org/10.1109/CVPR.2005.177

  8. Dalal N, Triggs B, Schmid C (2006) Human detection using oriented histograms of flow and appearance. In: European conference on computer vision. Springer, pp 428–441. https://doi.org/10.1007/11744047_33

  9. Dollar P, Wojek C, Schiele B, Perona P (2012) Pedestrian detection: an evaluation of the state of the art. IEEE Trans Pattern Anal Mach Intell 34(4):743–761. https://doi.org/10.1109/TPAMI.2011.155

    Article  Google Scholar 

  10. Enzweiler M, Gavrila DM (2009) Monocular pedestrian detection: survey and experiments. IEEE Trans Pattern Anal Mach Intell 31(12):2179–2195. https://doi.org/10.1109/TPAMI.2008.260

    Article  Google Scholar 

  11. Errami M, Rziza M (2016) Improving pedestrian detection using support vector regression. In: 13th international conference on computer graphics, imaging and visualization. IEEE, pp 156–160. https://doi.org/10.1109/CGiV.2016.38

  12. Ess A, Leibe B, Schindler K, van Gool L (2008) A mobile vision system for robust multi-person tracking. In: IEEE conference on computer vision and pattern recognition. IEEE. https://doi.org/10.1109/CVPR.2008.4587581

  13. Everingham M, Van Gool L, Williams CK, Winn J, Zisserman A (2010) The pascal visual object classes (VOC) challenge. Int J Comput Vis 88(2):303–338. https://doi.org/10.1007/s11263-009-0275-4

    Article  Google Scholar 

  14. Gavrila DM (2007) A Bayesian, exemplar-based approach to hierarchical shape matching. IEEE Trans Pattern Anal Mach Intell 29(8):1408–1421. https://doi.org/10.1109/TPAMI.2007.1062

    Article  Google Scholar 

  15. Gerónimo D, Sappa A, López A, Ponsa D (2007) Adaptive image sampling and windows classification for on-board pedestrian detection. In: International conference on computer vision systems, vol 39

  16. Geronimo D, Lopez AM, Sappa AD, Graf T (2010) Survey of pedestrian detection for advanced driver assistance systems. IEEE Trans Pattern Anal Mach Intell 32(7):1239–1258. https://doi.org/10.1109/TPAMI.2009.122

    Article  Google Scholar 

  17. Gurbuz SZ, Melvin WL, Williams DB (2012) Kinematic model-based human detectors for multi-channel radar. IEEE Trans Aerosp Electron Syst 48(2):1306–1318. https://doi.org/10.1109/TAES.2012.6178063

    Article  Google Scholar 

  18. Haralick RM, Shanmugam K, Dinstein I (1973) Textural features for image classification. IEEE Trans Syst Man Cybern SMC-3(6):610–621. https://doi.org/10.1109/TSMC.1973.4309314

    Article  Google Scholar 

  19. Hirschmuller H (2008) Stereo processing by semiglobal matching and mutual information. IEEE Trans Pattern Anal Mach Intell 30(2):328–341. https://doi.org/10.1109/TPAMI.2007.1166

    Article  Google Scholar 

  20. Li J, Liang X, Shen S, Xu T, Feng J, Yan S (2018) Scale-aware fast r-cnn for pedestrian detection. IEEE Trans Multimed 20(4):985–996. https://doi.org/10.1109/TMM.2017.2759508

    Google Scholar 

  21. Liang F, Wang D, Liu Y, Jiang Y, Tang S (2012) Fast pedestrian detection based on sliding window filtering. In: Pacific-rim conference on multimedia. Springer, pp 811–822. https://doi.org/10.1007/978-3-642-34778-8_76

  22. Lin Z, Davis LS (2008) A pose-invariant descriptor for human detection and segmentation. In: European conference on computer vision, pp 423–436. Springer. https://doi.org/10.1007/978-3-540-88693-8_31

  23. Lin Z, Davis LS (2010) Shape-based human detection and segmentation via hierarchical part-template matching. IEEE Trans Pattern Anal Machine Intell 32 (4):604–618. https://doi.org/10.1109/TPAMI.2009.204

    Article  Google Scholar 

  24. Liu Y, Lasang P, Siegel M, Sun Q (2016) Multi-sparse descriptor: a scale invariant feature for pedestrian detection. Neurocomputing 184:55–65. https://doi.org/10.1016/j.neucom.2015.07.143

    Article  Google Scholar 

  25. Lv Q, Josephson W, Wang Z, Charikar M, Li K (2007) Multi-probe LSH: efficient indexing for high-dimensional similarity search. In: Proceedings of the 33rd international conference on very large data bases. VLDB Endowment, pp 950–961

  26. Maji S, Berg AC, Malik J (2008) Classification using intersection kernel support vector machines is efficient. In: IEEE conference on computer vision and pattern recognition. IEEE, pp 1–8. https://doi.org/10.1109/CVPR.2008.4587630

  27. Mohan A, Papageorgiou C, Poggio T (2001) Example-based object detection in images by components. IEEE Trans Pattern Anal Mach Intell 23(4):349–361. https://doi.org/10.1109/34.917571

    Article  Google Scholar 

  28. Mu Y, Yan S, Liu Y, Huang T, Zhou B (2008) Discriminative local binary patterns for human detection in personal album. In: IEEE conference on computer vision and pattern recognition. IEEE, pp 1–8. https://doi.org/10.1109/CVPR.2008.4587800

  29. Nguyen DT, Li W, Ogunbona PO (2016) Human detection from images and videos: a survey. Pattern Recogn 51:148–175. https://doi.org/10.1016/j.patcog.2015.08.027

    Article  Google Scholar 

  30. Ouyang W, Wang X (2012) A discriminative deep model for pedestrian detection with occlusion handling. In: 2012 IEEE conference on computer vision and pattern recognition. IEEE, pp 3258–3265. https://doi.org/10.1109/CVPR.2012.6248062

  31. Ouyang W, Zeng X, Wang X (2013) Modeling mutual visibility relationship in pedestrian detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition. IEEE, pp 3222–3229. https://doi.org/10.1109/CVPR.2013.414

  32. Ouyang W, Zhou H, Li H, Li Q, Yan J, Wang X (2018) Jointly learning deep features, deformable parts, occlusion and classification for pedestrian detection. IEEE Trans Pattern Anal Mach Intell 40(8):1874–1887. https://doi.org/10.1109/TPAMI.2017.2738645

    Article  Google Scholar 

  33. Rittscher J, Tu PH, Krahnstoever N (2005) Simultaneous estimation of segmentation and shape. In: IEEE conference on computer vision and pattern recognition, vol 2. IEEE, pp 486–493. https://doi.org/10.1109/CVPR.2005.323

  34. Rublee E, Rabaud V, Konolige K, Bradski G (2011) ORB: An efficient alternative to SIFT or SURF. In: 2011 international conference on computer vision. IEEE, pp 2564–2571. https://doi.org/10.1109/ICCV.2011.6126544

  35. Schwartz WR, Kembhavi A, Harwood D, Davis LS (2009) Human detection using partial least squares analysis. In: 12th international conference on computer vision. IEEE, pp 24–31. https://doi.org/10.1109/ICCV.2009.5459205

  36. Shen J, Zuo X, Yang W, Prokhorov D, Mei X, Ling H (2018) Differential features for pedestrian detection: A taylor series perspective. IEEE Trans Intell Transp Syst. https://doi.org/10.1109/TITS.2018.2869087

  37. Walk S, Majer N, Schindler K, Schiele B (2010) New features and insights for pedestrian detection. In: IEEE conference on computer vision and pattern recognition. IEEE, pp 1030–1037. https://doi.org/10.1109/CVPR.2010.5540102

  38. Wang X, Han TX, Yan S (2009) An HOG-LBP human detector with partial occlusion handling. In: 12th international conference on computer vision. IEEE, pp 32–39. https://doi.org/10.1109/ICCV.2009.5459207

  39. Wei Y, Tian Q, Guo T (2013) An improved pedestrian detection algorithm integrating Haar-like features and HOG descriptors. Adv Mech Eng 5(546):206. https://doi.org/10.1155/2013/546206

    Google Scholar 

  40. Wu B, Nevatia R (2005) Detection of multiple, partially occluded humans in a single image by Bayesian combination of edgelet part detectors. In: IEEE international conference on computer vision, vol 1. IEEE, pp 90–97. https://doi.org/10.1109/ICCV.2005.74

  41. Yao S, Pan S, Wang T, Zheng C, Shen W, Chong Y (2015) A new pedestrian detection method based on combined HOG and LSS features. Neurocomputing 151:1006–1014. https://doi.org/10.1016/j.neucom.2014.08.080

    Article  Google Scholar 

  42. Zhang S, Benenson R, Schiele B (2015) Filtered channel features for pedestrian detection. In: IEEE conference on computer vision and pattern recognition. IEEE, pp 1751–1760. https://doi.org/10.1109/CVPR.2015.7298784

  43. Zhang X, Hu HM, Jiang F, Li B (2015) Pedestrian detection based on hierarchical co-occurrence model for occlusion handling. Neurocomputing 168:861–870. https://doi.org/10.1016/j.neucom.2015.05.038

    Article  Google Scholar 

  44. Zhang S, Benenson R, Omran M, Hosang J, Schiele B (2018) Towards reaching human performance in pedestrian detection. IEEE Trans Pattern Anal Mach Intell 40(4):973–986. https://doi.org/10.1109/TPAMI.2017.2700460

    Article  Google Scholar 

  45. Zhao T, Nevatia R, Wu B (2008) Segmentation and tracking of multiple humans in crowded environments. IEEE Trans Pattern Anal Mach Intell 30(7):1198–1211. https://doi.org/10.1109/TPAMI.2007.70770

    Article  Google Scholar 

  46. Zhu Q, Yeh MC, Cheng KT, Avidan S (2006) Fast human detection using a cascade of histograms of oriented gradients. In: IEEE conference on computer vision and pattern recognition, vol 2. IEEE, pp 1491–1498. https://doi.org/10.1109/CVPR.2006.119

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Suman Kumar Choudhury.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Choudhury, S.K., Padhy, R.P., Sa, P.K. et al. Human detection using orientation shape histogram and coocurrence textures. Multimed Tools Appl 78, 13949–13969 (2019). https://doi.org/10.1007/s11042-018-6866-8

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-018-6866-8

Keywords

Navigation