Abstract
The scientific community witnessed revolutionary changes with algorithms and data sets aiming for precise human detection from images and videos which are largely driven by the quality of features extracted. Regardless of the labyrinth of the existing detectors, featured human detection with near accuracy from complicated real-time data sets remains a major challenge. Here we propose an improved feature set by merging the fast and accurate aggregate channel features (ACF) and the data specific dictionary learned histogram of sparse codes (HSC) for human detection. This integrated feature set efficiently fuses the first-order information from the histogram of oriented gradient channels embedded in the ACF detector and the data specific intelligence contained in the HSC channels. The proposed detector outperforms the state-of-the-art ACF detector in terms of miss rate and average precision on challenging datasets. It is worth to be noted that there is a decrease with the miss rate by a factor of 13% and 5% for INRIA and Caltech pedestrian datasets respectively in comparison with baseline detector. Along with the detection of more instances, our detector reduced the number of false positives compared to other existing detectors. Although further modifications are warranted, our proposed detector could produce a tangible and palpable response with human detection in the vast arena of computer vision.
Similar content being viewed by others
References
Al-Hazaimeh OM, Al-Nawashi M, Saraee M (2018) Geometrical-based approach for robust human image detection. Multimed Tools Appl 78(6):7029–7053
Bai X, Zhang T, Wang C, El-Latif AAA, Niu X (2013) A fully automatic player detection method based on one-class svm. IEICE Trans Inf Sys 96(2):387–391
Benenson R, Mathias M, Timofte R, Van Gool L (2012) Pedestrian detection at 100 frames per second. In: Proceedings of the IEEE conference on computer vision and pattern recognition
Benenson R, Mathias M, Tuytelaars T, Van Gool L (2013) Seeking the strongest rigid detector. In: Proceedings of the IEEE conference on computer vision and pattern recognition
Benenson R, Omran M, Hosang J, Schiele B (2014) Ten years of pedestrian detection, what have we learned?. In: European conference on computer vision. Springer, pp 613–627
Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition
Ding J, Wang Y, Geng W (2013) An hog-ct human detector with histogram-based search. Multimed Tools Appl 63(3):791–807
Dollár P Piotr’s Computer Vision Matlab Toolbox (PMT). https://github.com/pdollar/toolbox
Dollár P, Appel R, Belongie S, Perona P (2014) Fast feature pyramids for object detection. IEEE Trans Pattern Anal Mach Intell 36(8):1532–1545
Dollár P, Tu Z, Perona P, Belongie S (2009) Integral channel features. In: Proceedings of the British machine vision conference. BMVC Press
Dollar P, Wojek C, Schiele B, Perona P (2012) Pedestrian detection: an evaluation of the state of the art. IEEE Trans Patt Anal Mach Intell 34(4):743–761
Fang Y, Sun L, Fu H, Wu T, Wang R, Dai B (2016) Learning deep compact channel features for object detection in traffic scenes. In: 2016 IEEE International conference on image processing (ICIP). IEEE, pp 1052–1056
Felzenszwalb PF, Girshick RB, McAllester D, Ramanan D (2010) Object detection with discriminatively trained part based models. IEEE Trans Pattern Anal Mach Intell 32(9):1627–1645
Gad R, Talha M, El-Latif AAA, Zorkany M, Ayman ES, Nawal EF, Muhammad G (2018) Iris recognition using multi-algorithmic approaches for cognitive internet of things (cIoT) framework. Futur Gener Comput Syst 89:178–191
Jiang Y, Ma J (2015) Combination features and models for human detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 240–248
Jing H, He X, Han Q, El-Latif AAA, Niu X (2014) Saliency detection based on integrated features. Neurocomputing 129:114–121
Li A, Miao Z, Cen Y, Cen Y (2017) Anomaly detection using sparse reconstruction in crowded scenes. Multimed Tools Appl 76(24):26,249–26,271
Li J, Liang X, Shen S, Xu T, Feng J, Yan S (2017) Scale-aware fast r-cnn for pedestrian detection. IEEE Transactions on Multimedia 20(4):985–996
Nigam S, Khare A (2015) Multiresolution approach for multiple human detection using moments and local binary patterns. Multimed Tools Appl 74(17):7037–7062
Peng J, Li Q, El-Latif AAA, Niu X (2015) Linear discriminant multi-set canonical correlations analysis (ldmcca): an efficient approach for feature fusion of finger biometrics. Multimed Tools Appl 74(13):4469–4486
Peng J, Wang N, El-Latif AAA, Li Q, Niu X (2012) Finger-vein verification using gabor filter and sift feature matching. In: 2012 Eighth international conference on intelligent information hiding and multimedia signal processing. IEEE, pp 45–48
Ren X, Ramanan D (2013) Histograms of sparse codes for object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3246–3253
Sermanet P, Kavukcuoglu K, Chintala S, LeCun Y (2013) Pedestrian detection with unsupervised multi-stage feature learning. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3626–3633
Viola P, Jones MJ, Snow D (2005) Detecting pedestrian using patterns of motion and appearance. Int J Comput Vis 63(2):153–161
Wang X, Han TX, Yan S (2009) An hog-lbp human detector with partial occlusion handling. In: 2009 IEEE 12th international conference on computer vision. IEEE, pp 32–39
Wang HH, Tu CW, Chiang CK (2019) Sparse representation for image classification via paired dictionary learning. Multimed Tools Appl 78(12):16,945–16,963
Wang N, Li Q, El-Latif AAA, Peng J, Niu X (2014) An enhanced thermal face recognition method based on multiscale complex fusion for gabor coefficients. Multimed Tools Appl 72(3):2339–2358
Xiong J, Tang Q, He X, Cai L, Wang F (2016) Tracking in multimedia data via robust reweighted local multi-task sparse representation for transportation surveillance. Multimed Tools Appl 75(24):17,531–17,552
Yang BQ, Gu CC, Wu KJ, Zhang T, Guan XP (2017) Simultaneous dimensionality reduction and dictionary learning for sparse representation based classification. Multimed Tools Appl 76(6):8969–8990
Zhang L, Lin L, Liang X, He K (2016) Is faster r-cnn doing well for pedestrian detection?. In: European conference on computer vision. Springer, pp 443–457
Zhang S, Bauckhage C, Cremers AB (2014) Informed haar-like features improve pedestrian detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition
Zhang T, El-Latif AAA, Wang N, Li Q, Niu X (2012) A new image segmentation method via fusing NCut eigenvectors maps. In: Fourth international conference on digital image processing (ICDIP 2012), vol 8334. SPIE, pp 593–596
Zhang T, Han Q, El-Latif AAA, Bai X, Niu X (2013) 2-d cartoon character detection based on scalable-shape context and hough voting. Inf Technol J 12(12):2342–2349
Zhao ZQ, Bian H, Hu D, Cheng W, Glotin H (2017) Pedestrian detection based on fast r-cnn and batch normalization. In: International conference on intelligent computing. Springer, pp 735–746
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Bastian, B.T., C.V., J. Integrated feature set using aggregate channel features and histogram of sparse codes for human detection. Multimed Tools Appl 79, 2931–2944 (2020). https://doi.org/10.1007/s11042-019-08498-w
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-019-08498-w