Abstract
The paper presents a method for estimating the number of moving people in a scene for video surveillance applications. The method performance has been characterized on the public database used for the PETS 2009 and 2010 international competitions; the proposed method has been compared, on the same database, with the PETS competitions participants. The system exhibits a high accuracy, and revealed to be so fast that it can be used in real time surveillance applications. The rationale of the method lies on the extraction of suited scale-invariant feature points and the successive selection among them of the moving ones, under the hypothesis that the latter are associated to moving people. The perspective distortions are taken into account by dividing the input frames into smaller horizontal zones, each having (approximately) the same perspective effects. Therefore, the evaluation of the number of people is separately carried out for each zone, and the results are summed up. The most important peculiarity of the proposed method is the availability of a simple training procedure using a brief video sequence that shows a person walking around in the scene; the procedure automatically evaluates all the parameters needed by the system, thus making the method particularly suited for end-user applications.
Similar content being viewed by others
References
Albiol, A., Silla, M.J., Albiol, A., Mossi, J.M.: Video analysis using corner motion statistics. In: IEEE International Workshop on Performance Evaluation of Tracking and Surveillance, pp. 31–38 (2009)
Bay, H., Ess, A., Tuytelaars, T., Gool, L.V.: Surf: Speeded up robust features. Comput. Vis. Image Underst. 110(3), 346–359 (2008)
Brostow, G.J., Cipolla, R.: Unsupervised bayesian detection of independent motion in crowds. In: IEEE Conf. on Computer Vision and, Pattern Recognition, pp. 594–601 (2006)
Chan, A.B., Liang, Z.S.J., Vasconcelos, N.: Privacy preserving crowd monitoring: counting people without people models or tracking. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1–7 (2008)
Chan, A.B., Vasconcelos, N.: Modeling, clustering, and segmenting video with mixtures of dynamic textures. IEEE Trans. Pattern Anal. Machine Intell. 30:909–926 (2008). http://doi.ieeecomputersociety.org/10.1109/TPAMI.2007.70738
Cho, S.Y., Chow, T.W.S., Leung, C.T.: A neural-based crowd estimation by hybrid global learning algorithm. IEEE Trans. Syst. Man Cybern. B 29(4), 535–541 (1999)
Conte, D., Foggia, P., Percannella, G., Tufano, F., Vento, M.: A method for counting people in crowded scenes. In: 2010 Seventh IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), pp. 225–232 (2010). doi:10.1109/AVSS.2010.78
Harris, C., Stephens, M.: A combined corner and edge detection. In: Proceedings of The Fourth Alvey Vision Conference, pp. 147–151 (1988)
Kong, D., Gray, D., Tao, H.: A viewpoint invariant approach for crowd counting. In: International Conference on, Pattern Recognition, pp. 1187–1190 (2006)
Love, N.S., Kamath, C.: An empirical study of block matching techniques for the detection of moving objects. Tech. Rep. UCRL - TR - 218038, University of California, Lawrence Livermore National Laboratory (2006)
Marana, A.N., da F. Costa, L., Lotufo, R.A., Velastin, S.A.: Estimating crowd density with mikowski fractal dimension. In: Int. Conf. on Acoustics, Speech and, Signal Processing (1999)
Mikolajczyk, K., Schmid, C.: Scale and Affine Invariant Interest Point Detectors. Int. J. Comput. Vision 60(1), 63–86 (2004). doi:10.1023/B:VISI.0000027790.02288.f2
Mikolajczyk, K., Schmid, C.: A performance evaluation of local descriptors. IEEE Trans. Pattern Anal. Machine Intell. 27(10), 1615–1630 (2005)
PETS: http://www.cvg.rdg.ac.uk/PETS2009/ (2009)
Rahmalan, H., Nixon, M.S., Carter, J.N.: On crowd density estimation for surveillance. In: The Institution of Engineering and Technology Conference on Crime and Security (2006)
Rittscher, J., Tu, P., Krahnstoever, N.: Simultaneous estimation of segmentation and shape. In: IEEE Conf. on Computer Vision and, Pattern Recognition, pp. 486–493 (2005)
Tan, S., Dale, J., Anderson, A., Johnston, A.: Inverse perspective mapping and optic flow: a calibration method and a quantitative analysis. Image Vision Comput. 24, 153–165 (2006)
Zhao, T., Nevatia, R., Wu, B.: Segmentation and tracking of multiple humans in crowded environments. IEEE Trans. Pattern Anal. Mach. Intell. 30(7), 1198–1211 (2008)
Author information
Authors and Affiliations
Corresponding author
Additional information
This research has been partially supported by A.I.Tech s.r.l., a spin-off company of the University of Salerno (http://www.aitech-solutions.eu).
Rights and permissions
About this article
Cite this article
Conte, D., Foggia, P., Percannella, G. et al. Counting moving persons in crowded scenes. Machine Vision and Applications 24, 1029–1042 (2013). https://doi.org/10.1007/s00138-013-0491-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00138-013-0491-3