Abstract
This work describes a model for understanding people motion in video sequences using Voronoi diagrams, focusing on group detection and classification. We use the position of each individual as a site for the Voronoi diagram at each frame, and determine the temporal evolution of some sociological and psychological parameters, such as distance to neighbors and personal spaces. These parameters are used to compute individual characteristics (such as perceived personal space and comfort levels), that are analyzed to detect the formation of groups and their classification as voluntary or involuntary. Experimental results based on videos obtained from real life as well as from a crowd simulator were analyzed and discussed.









Similar content being viewed by others
Notes
A blob is a set of connected foreground pixels
CROMOS Lab: http://www.inf.unisinos.br/~cromoslab
In Fig. 7, frame 0 represents the first frame when a group was detected.
References
Hall ET (1959) The silent language. Doubleday Company, Garden City
Sommer R (1969) Personal space: the behavioural basis of design. Prentice Hall, Englewood CLiffs
Valera M, Velastin SA (2005) Intelligent distributed surveillance systems: a review. IEE Vis Image Signal Process 152(2):192–204
Moeslund TB, Hilton A, Kruger V (2006) A survey of advances in vision-based human motion capture and analysis. Comput Vis Image Underst 104
Stauffer C, Eric W, Grimson L (2000) Learning patterns of activity using real-time tracking. IEEE Trans Pattern Anal Mach Intell 22(8):747–757
Buxton H, Gong S (1995) Advanced visual surveillance using bayesian networks. In: IEEE International Conference on Computer Vision. Cambridge, Massachusetts, June 1995
Hosie R, Venkatesh S, West GAW (1998) Classifying and detecting group behaviour from visual surveillance data. IEEE Int Conf Pattern Recognit 1:602–604
Oliver NM, Rosario B, Pentland AP (2000) A bayesian computer vision system for modeling human interactions. IEEE Trans Pattern Anal Mach Intell 22(8):831–843
Du Y, Chen G, Xu W, Li Y (2006) Recognizing interaction activities using dynamic bayesian network. IEEE Int Conf Pattern Recognit 1:618–621
Gong S, Xiang T (2003) Recognition of group activities using dynamic probabilistic networks. In: IEEE International Conference on Computer Vision. IEEE Computer Society, Washington, p 742
Xiang T, Gong S (2006) Beyond tracking: Modelling activity and understanding behaviour. Int J Comput Vis 67(1):21–51
Liu XH, Chua CS (2006) Multi-agent activity recognition using observation decomposed hidden markov models. Image Vis Comput 24(2):166–175
Fuentes LM, Velastin A (2004) Tracking-based event detection for CCTV systems. Pattern Anal Appl 7(4):356–364
Fuentes LM, Velastin SA (2006) People tracking in surveillance applications. Image Vis Comput 24(11):1165–1171
Haritaoglu I, Harwood D, Davis LS (2000) W4: Real-time surveillance of people and their activities. IEEE Trans Pattern Anal Mach Intell 22(8):809–830
Elgammal AM, Duraiswami R, Harwood D, Davis LS (2002) Background and foreground modeling using nonparametric kernel density estimation for visual surveillance. Proc IEEE 90(7):1151–1163
Cheng FH, Chen YL (2006) Real time multiple objects tracking and identification based on discrete wavelet transform. Pattern Recognit 39(6):1126–1139
Chien S-Y, Ma S-Y, Chen L-G (2002) Efficient moving object segmentation algorithm using background registration technique. IEEE Trans Circuits Syst Video Technol 12(7):577–586
Cucchiara R, Grana C, Piccardi M, Prati A (2003) Detecting moving objects, ghosts, and shadows in video streams. IEEE Trans Pattern Anal Mach Intell 25(10):1337–1342
Ning H, Tan T, Wang L, Hu W (2004) People tracking based on motion model and motion constraints with automatic initialization. Pattern Recognit 37(7):1423–1440
Xu D, Li X, Liu Z, Yuan Y (2005) Cast shadow detection in video segmentation. Pattern Recognit Lett 26(1):5–26
Wang Y, Tan T, Loe KF, Wu JK (2005) A probabilistic approach for foreground and shadow segmentation in monocular image sequences. Pattern Recognit 38(11):1937–1946
Tian YL, Lu M, Hampapur A (2005) Robust and efficient foreground analysis for real-time video surveillance. IEEE Comput Vis Pattern Recognit I:1182–1187
Jacques JCS Jr., Jung CR, Musse SR (2006) A background subtraction model adapted to illumination changes. In: IEEE international conference on image processing, IEEE Press, pp 1817–1820
Martin J, Crowley JL (1995) Comparison of correlation techniques. In: Conference on intelligent autonomous systems. Karsluhe, Germany
Porikli F, Tuzel O, Meer P (2006) Covariance tracking using model update based on lie algebra. IEEE Computer Visi Pattern Recognit I:728–735
Aurenhammer F (1991) Voronoi diagrams: a survey of a fundamental geometric data structure. ACM Comput Surv 23(3):345–405
Vaughan DG, Asbury T, Riordan-Eva PR (1995) General Ophtalmology. Lange Medical Publications, New York
Fuchs E (1898) Text-book of ophthalmology. D. Appleton
Braun A, Bodmann BEJ, Oliveira LPL, Musse SR (2003) Modelling individual behavior in crowd simulation. In: Proceedings of Computer Animation and Social Agents 2003. IEEE Computer Society, New Brunswick, USA, pp 143–148
Barros LM, da Silva AT, Musse SR (2004) Petrosim: An architecture to manage virtual crowds in panic situations. In: Proceedings of Computer Animation and Social Agents 2004. Geneva, Switzerland, pp 111–120
Courty N, Musse SR (2005) Simulation of large crowds in emergency situations including gaseous phenomena. In: Proceedings of Computer Graphics International 2005. Stony Brook, NY, pp 206–212
Braun A, Bodmann BE, Musse SR (2004) Crowd simulation in emergency situations. In: Short Paper in ACM Symposium on Computer Animation 2004. Grenoble, France
Acknowledgments
This work was developed in collaboration with HP Brazil R&D. The authors would like to thank the anonymous reviewers for their fruitful contributions to improve this work.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Jacques, J.C.S., Braun, A., Soldera, J. et al. Understanding people motion in video sequences using Voronoi diagrams. Pattern Anal Applic 10, 321–332 (2007). https://doi.org/10.1007/s10044-007-0070-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10044-007-0070-1