Pedestrian detection using first- and second-order aggregate channel features

  • Short Paper
International Journal of Multimedia Information Retrieval

Abstract

Content-based analysis of visual multimedia such as images and videos is urgently needed to automate difficult tasks for human society. Pedestrian detection serves as a backbone for many image processing and machine learning algorithms and underpins numerous real-world applications. With this in mind, we address the design of features suited to identifying human/pedestrian instances in images with high accuracy. Accordingly, we introduce second-order aggregate channel features (SOACF) to enhance the performance of the well-known aggregate channel features (ACF) detector, a pedestrian detection algorithm based mainly on first-order information in an image. We experimentally demonstrate the complementary nature of ACF and SOACF. To exploit both feature sets together, instead of simple concatenation or direct merging of the two detectors, we employ a weighted non-maximum suppression merging algorithm. The proposed detector not only performs well on the INRIA, Caltech and KITTI pedestrian data sets but also reduces the miss rate by \(\sim 4\%\) on the Caltech data set and \(\sim 2\%\) on the KITTI data set in comparison with the ACF detector. Although our detector uses only a few channels, it surpasses many state-of-the-art methods based on the baseline ACF detector. Moreover, its detection speed is 100 times faster than that of the top-performing pedestrian detector based on ACF.
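The abstract names a weighted non-maximum suppression merging step but does not spell it out. The following Python sketch shows one plausible form of such a merge, not the authors' implementation. The function name `weighted_nms_merge`, the per-detector weights `w_a` and `w_b`, the IoU threshold of 0.5, and the score-weighted averaging of overlapping boxes are all assumptions made for illustration. Scores are assumed to be non-negative.

```python
from typing import List, Tuple

# A detection is (x1, y1, x2, y2, score); this layout is an assumption.
Box = Tuple[float, float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def weighted_nms_merge(
    dets_a: List[Box],
    dets_b: List[Box],
    w_a: float = 1.0,
    w_b: float = 1.0,
    iou_thr: float = 0.5,
) -> List[Box]:
    """Pool two detectors' outputs and merge overlapping boxes.

    Hypothetical scheme: each detector's scores are scaled by its weight,
    then boxes are clustered greedily around the highest-scoring box and
    each cluster is replaced by its score-weighted average box.
    """
    pool = [(d[0], d[1], d[2], d[3], d[4] * w_a) for d in dets_a]
    pool += [(d[0], d[1], d[2], d[3], d[4] * w_b) for d in dets_b]
    pool.sort(key=lambda d: d[4], reverse=True)

    merged: List[Box] = []
    while pool:
        top, rest = pool[0], pool[1:]
        # Boxes that overlap the current best box join its cluster.
        cluster = [top] + [d for d in rest if iou(top, d) >= iou_thr]
        pool = [d for d in rest if iou(top, d) < iou_thr]

        total = sum(d[4] for d in cluster)
        if total > 0:
            coords = [sum(d[i] * d[4] for d in cluster) / total for i in range(4)]
        else:
            coords = list(top[:4])  # fall back to the best box unchanged
        # The merged box keeps the cluster's highest weighted score.
        merged.append((coords[0], coords[1], coords[2], coords[3], top[4]))
    return merged


if __name__ == "__main__":
    acf_like = [(0, 0, 10, 10, 0.9)]
    soacf_like = [(1, 1, 11, 11, 0.6), (50, 50, 60, 60, 0.8)]
    for box in weighted_nms_merge(acf_like, soacf_like):
        print(box)
```

Unlike plain NMS, which discards every box that overlaps the top-scoring one, this variant lets lower-scoring boxes from either detector shift the final box position. That is one way complementary detectors could reinforce each other. The returned list is ordered by descending weighted score.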


Author information

Corresponding author

Correspondence to Blossom Treesa Bastian.

About this article

Cite this article

Bastian, B.T., C.V., J. Pedestrian detection using first- and second-order aggregate channel features. Int J Multimed Info Retr 8, 127–133 (2019). https://doi.org/10.1007/s13735-019-00171-0

  • DOI: https://doi.org/10.1007/s13735-019-00171-0
