Skip to main content
Log in

The large-scale crowd analysis based on sparse spatial-temporal local binary pattern

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

As a particular class of public security issues, the large-scale crowd analysis plays a very important role in video surveillance application. This paper proposes a sparse spatial-temporal local binary pattern (SST-LBP) descriptor to extract dynamic texture of the walking crowd which can be applied to the crowd density estimation and distribution analysis. The proposed approach consists of four steps. First of all, sparse selected locations are extracted, which vary notably in both spatial domain and temporal domain. Afterwards, we propose a SST-LBP algorithm to extract the local dynamic feature and utilize the local feature’s statistical property to describe the crowd feature. Thirdly, the overall crowd density level can be determined by classifying the crowd feature with support vector machine. Finally, the local feature is used to represent the local density and then the overall density distribution can be described. To improve the accuracy, we introduce the perspective correction into the detection of sparse selected locations and the spectrum analysis of SST-LBP code. The experiments on different datasets not only show that the proposed SST-LBP method is effective and robust on the large-scale crowd density estimation and distribution, but also indicate that the deformity correction is useful. Compared with other methods, the proposed method has the advantage of low computation complexity and high efficiency. In addition, it performs well on all density levels and can present local crowd distribution.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16

Similar content being viewed by others

References

  1. Beran V, Hradis M, Zemcik P et al (2008) Video summarization at Brno university of technology. Proceeding of the 2nd ACM workshop on Video summarization, pp. 31–34

  2. Chan AB, Vasconcelos N (2012) Counting people with low-level features and Bayesian regression. IEEE Trans Image Process 21(4):2160–2177

    Article  MathSciNet  Google Scholar 

  3. Chan AB, Liang Z-SJ, Vasconcelos N (2008) Privacy preserving crowd monitoring: counting people without people models or tracking. IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–7

  4. Chapelle O, Haffner P, Vapnik VN (1999) Support vector machines for histogram-based image classification. IEEE Trans Neural Netw 10(5):1055–1064

    Article  Google Scholar 

  5. Chetverikov D, Peteri R (2005) A brief survey of dynamic texture description and recognition. Adv Soft Comput 30:17–26

    Article  Google Scholar 

  6. Cho S-Y, Chow TWS, Leung C-T (1999) A neural-based crowd estimation by hybrid global learning algorithm. IEEE Trans Syst Man Cybern B Cybern 29:535–541

    Article  Google Scholar 

  7. Davies AC, Yin JH, Velastin SA (1995) Crowd monitoring using image processing. Electron Comm Eng J 7(1):37–47

    Article  Google Scholar 

  8. Ge W, Collins RT (2010) Crowd density analysis with marked point processes [applications corner]. IEEE Signal Process Mag 27(5):107–123

    Article  Google Scholar 

  9. Huang D, Chow TWS (2003) A people-counting system using a hybrid RBF neural network. Neural Process Lett 18(2):97–113

    Article  Google Scholar 

  10. Kreßel UH-G (1999) Pairwise classification and support vector machines. Advances in kernel methods: support vector learning, pp. 255–268

  11. Lempitsky V, Zisserman A (2010) Learning to count objects in images. Machine Vision Learning/Statistics & Optimisation

  12. Lin S-F, Chen J-Y, Chao H-X (2001) Estimation of number of people in crowded scenes using perspective transformation. IEEE Trans Syst Man Cybern Syst Hum 31(6):645–654

    Article  Google Scholar 

  13. Ma R, Li L, Huang W, Tian Q (2004) On pixel count based crowd density estimation for visual surveillance. Cybernetics and Intelligent Systems, 2004 IEEE Conference 1:170–173

    Google Scholar 

  14. Marana AN, Verona V (2001) Wavelet packet analysis for crowd density estimation, Proceedings of the IASTED International Symposia on Applied Informatics, pp. 535–540

  15. Marana AN, Velastin SA, Costa L et al (1998) Automatic estimation of crowd density using texture. Saf Sci 28:165–175

    Article  Google Scholar 

  16. Marana AN, Da Fontoura Costa L, Lotufo RA, Velastin SA (1999) Estimating crowd density with Minkowski fractal dimension. Acoust Speech Signal Process 6:3521–3524

    Google Scholar 

  17. Mikolajczyk K, Tuytelaars T, Schmid C et al (2005) A comparison of affine region detectors. Int J Comput Vis 1–2(65):43–72

    Article  Google Scholar 

  18. Ojala T, Pietikainen M, Maenpaa T (2002) Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans Pattern Anal Mach Intell 24(7):971–987

    Article  Google Scholar 

  19. Polus A, Schofer JL, Ushpiz A (1983) Pedestrian flow and level of service. J Transport Eng 109(1):46–56

    Article  Google Scholar 

  20. Rodriguez M, Laptev I, Sivic J et al (2011) Density-aware person detection and tracking in crowds. 2011 IEEE International Conference on Computer Vision (ICCV), pp. 2423–2430

  21. Roqueiro, Petrushin VA (2007) Counting people using video cameras. International Journal of Parallel, Emergent and Distributed Systems 22(3):193–209

    Article  MathSciNet  Google Scholar 

  22. Sen G, Liu Wei, Yan He Ping (2009) Counting people in crowd open scene based on grey level dependence matrix. International Conference on Information and Automation, pp. 228–231

  23. Su H, Yang H, Zheng S (2011) The large-scale crowd density estimation based on effective region feature extraction method. Lect Notes Comput Sci 6494:302–313

    Article  Google Scholar 

  24. Wu X, Liang G, Lee KK et al (2006) Crowd density estimation using texture analysis and learning. IEEE International Conference on Robotics and Biomimetic, pp. 214–219

  25. Yang H, Su H, Zheng S (2011) The Large-scale Crowd Density Estimation Based on Sparse Spatio-temporal Local Binary Pattern, IEEE International Conference on Multimedia and Expo (ICME), pp. 1–6

  26. Zhao G, Pietikainen M (2007) Dynamic texture recognition using local binary patterns with an application to facial expressions. IEEE Trans Pattern Anal Mach Intell 29(6):915–928

    Article  Google Scholar 

Download references

Acknowledgments

This research is partly supported by NSFC (No.61102099, No.61171172), Scientific and Technological Committee of Shanghai (No.11231203102, No.10231204002) and National Basic Research Program (973 Program, No. 2010CB731406). We sincerely thank for the testing video datasets from University of Reading and permission (PETS2009) and University of Minnesota.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hua Yang.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Yang, H., Cao, Y., Su, H. et al. The large-scale crowd analysis based on sparse spatial-temporal local binary pattern. Multimed Tools Appl 73, 41–60 (2014). https://doi.org/10.1007/s11042-012-1264-0

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-012-1264-0

Keywords

Navigation