Abstract
This paper presents an approach for estimating the number of pedestrians. The proposed counting framework combines the two main pedestrian counting strategies, the direct approach and the indirect approach, through mixed features and an extreme learning machine (ELM). The ELM maps the mixed features to the number of pedestrians. The mixed features consist of holistic low-level features and rectangular local binary pattern (rLBP) features; rLBP features are new features designed to describe the statistical and structural information of explicit pedestrian detection rectangles. Through the mixed features, the algorithm uses information from both the direct approach (rLBP features) and the indirect approach (low-level features), and so takes advantage of both counting strategies. The detection rectangles are obtained with the pedestrian detector described in the paper "The fastest pedestrian detector in the west" (FPDW) by Dollár et al. Based on integral channel features and a soft cascade classifier, FPDW provides outstanding detection results at high speed. Experimental results on the PETS 2009 datasets show that combining the two counting strategies improves counting accuracy significantly. rLBP features effectively describe the useful information in detection rectangles for regression models, and mixed features are more effective than either feature type alone.
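The regression step described above, in which an ELM maps a mixed feature vector to a pedestrian count, can be illustrated with a minimal sketch. This is not the authors' implementation: it is a generic single-hidden-layer ELM (random, fixed hidden weights; output weights solved by pseudo-inverse) written with NumPy. The class name `ELMRegressor`, the helper `mixed_features`, the feature dimensions, and the synthetic data are all assumptions made for illustration. In particular, the rLBP features are replaced by random placeholder values, since their construction is not specified in the abstract.

```python
import numpy as np


class ELMRegressor:
    """Generic ELM sketch: random fixed hidden layer, least-squares output weights."""

    def __init__(self, n_hidden=50, seed=0):
        self.n_hidden = n_hidden
        self.rng = np.random.default_rng(seed)

    def _hidden(self, X):
        # Sigmoid activations of the random hidden layer.
        return 1.0 / (1.0 + np.exp(-(X @ self.W + self.b)))

    def fit(self, X, y):
        n_features = X.shape[1]
        # Hidden weights and biases are drawn once and never trained.
        self.W = self.rng.uniform(-1.0, 1.0, size=(n_features, self.n_hidden))
        self.b = self.rng.uniform(-1.0, 1.0, size=self.n_hidden)
        H = self._hidden(X)
        # Output weights: minimum-norm least-squares solution via the pseudo-inverse.
        self.beta = np.linalg.pinv(H) @ y
        return self

    def predict(self, X):
        return self._hidden(X) @ self.beta


def mixed_features(low_level, rlbp):
    # Hypothetical helper: concatenate the two per-frame feature groups.
    return np.hstack([low_level, rlbp])


# Synthetic stand-in data (assumption): 200 frames, 6 low-level and 4 "rLBP" values each.
rng = np.random.default_rng(42)
n_frames = 200
low_level = rng.uniform(0.0, 1.0, size=(n_frames, 6))
rlbp = rng.uniform(0.0, 1.0, size=(n_frames, 4))  # placeholder, not real rLBP
X = mixed_features(low_level, rlbp)

# Synthetic "counts": a linear function of the features, for demonstration only.
true_w = rng.uniform(0.5, 3.0, size=X.shape[1])
counts = X @ true_w

# Train on the first 150 frames, evaluate on the remaining 50.
model = ELMRegressor(n_hidden=40, seed=1).fit(X[:150], counts[:150])
pred = model.predict(X[150:])

mae = float(np.mean(np.abs(pred - counts[150:])))
baseline_mae = float(np.mean(np.abs(counts[:150].mean() - counts[150:])))
print(f"ELM MAE: {mae:.3f}  mean-predictor MAE: {baseline_mae:.3f}")
```

Only the output weights `beta` are fitted, which is why ELM training reduces to a single linear solve. A regularized solve could be substituted for the pseudo-inverse if the hidden-layer matrix is ill-conditioned.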

References
Taylor J. Cognitive computation. Cogn Comput. 2009;1(1):4–16.
Wöllmer M, Eyben F, Graves A, Schuller B, Rigoll G. Bidirectional LSTM networks for context-sensitive keyword detection in a cognitive virtual agent framework. Cogn Comput. 2010;2(3):180–190.
Cambria E, Hussain A. Sentic computing: techniques, tools, and applications, vol 2. Berlin: Springer; 2012.
Wang Q-F, Cambria E, Liu C-L, Hussain A. Common sense knowledge for handwritten Chinese text recognition. Cogn Comput. 2013;5(2):234–42.
Mital PK, Smith TJ, Hill RL, Henderson JM. Clustering of gaze during dynamic scene viewing is predicted by motion. Cogn Comput. 2011;3(1):5–24.
He B, Xu D, Nian R, van Heeswijk M, Yu Q, Miche Y, Lendasse A. Fast face recognition via sparse coding and extreme learning machine. Cogn Comput. 2013. doi:10.1007/s12559-013-9224-1.
Viola P, Jones MJ, Snow D. Detecting pedestrians using patterns of motion and appearance. In: Proceedings of the ninth IEEE international conference on computer vision. IEEE; 2003. p. 734–741.
Zhao T, Nevatia R. Bayesian human segmentation in crowded situations. In: Proceedings of the 2003 IEEE computer society conference on computer vision and pattern recognition, vol 2. IEEE; 2003. p. II–459.
Zhao T, Nevatia R, Wu B. Segmentation and tracking of multiple humans in crowded environments. IEEE Trans Pattern Anal Mach Intell. 2008;30(7):1198–1211.
Leibe B, Seemann E, Schiele B. Pedestrian detection in crowded scenes. In: IEEE computer society conference on computer vision and pattern recognition (CVPR 2005), vol 1; 2005. p. 878–85.
Wu B, Nevatia R. Detection of multiple, partially occluded humans in a single image by Bayesian combination of edgelet part detectors. In: Tenth IEEE international conference on computer vision (ICCV 2005), vol 1; 2005. p. 90–97.
Lin S-F, Chen J-Y, Chao H-X. Estimation of number of people in crowded scenes using perspective transformation. IEEE Trans Syst Man Cybern Part A Syst Hum. 2001;31(6):645–54.
Rabaud V, Belongie S. Counting crowded moving objects. In: IEEE computer society conference on computer vision and pattern recognition, vol 1. 2006. p. 705–11.
Brostow GJ, Cipolla R. Unsupervised Bayesian detection of independent motion in crowds. In: IEEE computer society conference on computer vision and pattern recognition, vol 1. 2006. p. 594–601.
Leibe B, Schindler K, Van Gool L. Coupled detection and trajectory estimation for multi-object tracking. In: IEEE 11th international conference on computer vision (ICCV 2007). 2007. p. 1–8.
Dalal N, Triggs B. Histograms of oriented gradients for human detection. In: IEEE computer society conference on computer vision and pattern recognition (CVPR 2005), vol 1. 2005. p. 886–93.
Felzenszwalb P, McAllester D, Ramanan D. A discriminatively trained, multiscale, deformable part model. In: IEEE conference on computer vision and pattern recognition (CVPR 2008). 2008. p. 1–8.
Dollár P, Tu Z, Perona P, Belongie S. Integral channel features. In: British machine vision conference. 2009. p. 1–11.
Paragios N, Ramesh V. A MRF-based approach for real-time subway monitoring. In: Proceedings of the 2001 IEEE computer society conference on computer vision and pattern recognition (CVPR 2001), vol 1. 2001. p. I–1034.
Kong D, Gray D, Tao H. Counting pedestrians in crowds using viewpoint invariant training. In: British machine vision conference, Citeseer. 2005.
Dong L, Parameswaran V, Ramesh V, Zoghlami I. Fast crowd segmentation using shape indexing. In: IEEE 11th international conference on computer vision (ICCV 2007). 2007. p. 1–8.
Chan AB, Morrow M, Vasconcelos N. Analysis of crowded scenes using holistic properties. In: Performance evaluation of tracking and surveillance workshop at CVPR 2009. p. 101–8.
Chan AB, Liang Z-S, Vasconcelos N. Privacy preserving crowd monitoring: counting people without people models or tracking. In: IEEE conference on computer vision and pattern recognition (CVPR 2008). 2008. p. 1–7.
Huang G-B, Zhu Q-Y, Siew C-K. Extreme learning machine: theory and applications. Neurocomputing. 2006;70(1):489–501.
Huang G-B, Zhou H, Ding X, Zhang R. Extreme learning machine for regression and multiclass classification. IEEE Trans Syst Man Cybern Part B Cybern. 2012;42(2):513–29.
Cambria E, Huang G-B, Kasun LLC, Zhou H, Vong CM, Lin J, Yin J, Cai Z, Liu Q, Li K, Leung VCM, Feng L, Ong Y-S, Lim M-H, Akusok A, Lendasse A, Corona F, Nian R, Miche Y, Gastaldo P, Zunino R, Decherchi S, Yang X, Mao K, Oh B-S, Jeon J, Toh K-A, Teoh ABJ, Kim J, Yu H, Chen Y, Liu J. Extreme learning machines [trends & controversies]. IEEE Intell Syst. 2013;28(6):30–59.
Ojala T, Pietikäinen M, Harwood D. A comparative study of texture measures with classification based on featured distributions. Pattern Recognit. 1996;29(1):51–9.
Dollár P, Belongie S, Perona P. The fastest pedestrian detector in the west. In: British machine vision conference, vol 55; 2010.
The dataset of the PETS 2009 conference. http://www.cvg.rdg.ac.uk/PETS2009/.
Li Y, Zhu E, Zhao J, Zhu X, Yin J. Detecting and counting pedestrians in real time. J Comput Inf Syst. 2014;10(2):827–35.
Kalman RE et al. A new approach to linear filtering and prediction problems. J Basic Eng. 1960;82(1):35–45.
Acknowledgments
This work is supported by the Opening Fund of Top Key Discipline of Computer Software and Theory in Zhejiang Provincial Colleges at Zhejiang Normal University (Grant No. ZSDZZZZXK38), the National Natural Science Foundation of China (Grant Nos. 60970034, 61170287, 61232016), and the Foundation for the Author of National Excellent Doctoral Dissertation (Grant No. 2007B4).
Cite this article
Li, Y., Zhu, E., Zhu, X. et al. Counting Pedestrian with Mixed Features and Extreme Learning Machine. Cogn Comput 6, 462–476 (2014). https://doi.org/10.1007/s12559-014-9248-1
DOI: https://doi.org/10.1007/s12559-014-9248-1