Abstract
This work proposes a complete framework for human activity discovery, modeling, and recognition using videos. The framework uses trajectory information as input and goes up to video interpretation. The work reduces the gap between low-level vision information and semantic interpretation, by building an intermediate layer composed of Primitive Events. The proposed representation for primitive events aims at capturing meaningful motions (actions) over the scene with the advantage of being learned in an unsupervised manner. We propose the use of Primitive Events as descriptors to discover, model, and recognize activities automatically. The activity discovery is performed using only real tracking data. Semantics are added to the discovered activities (e.g., “Preparing Meal”, “Eating”) and the recognition of activities is performed with new datasets.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Antonini, G., Thiran, J.: Trajectories clustering in ICA space: an application to automatic counting of pedestrians in video sequences. In: ACIVS 2004. Proc. Intl. Soc. Mag. Reson. Med. IEEE, Los Alamitos (2004)
Bobick, A.F., Wilson, A.D.: A state-based approach to the representation and recognition of gesture. IEEE Trans. Pattern Anal. Mach. Intell. 19(12), 1325–1337 (1997)
Bouguet, J.Y.: Pyramidal implementation of the Lucas Kanade feature tracker description of the algorithm (2000)
Georgescu, B., Shimshoni, I., Meer, P.: Mean shift based clustering in high dimensions: A texture classification example. In: 9th ICCV, pp. 456–463 (2003)
Gong, S., Xiang, T.: Recognition of group activities using dynamic probabilistic networks. In: ICC 2003, pp. 742–749 (2003)
Hamid, R., Maddi, S., Johnson, A., Bobick, A., Essa, I., Isbell, C.: A novel sequence representation for unsupervised analysis of human activities. A.I. Journal (2009)
Haritaoglu, I., Harwood, D., Davis, L.S.: W4: Real-time surveillance of people and their activities. TPAMI 22(8), 809–830 (2000)
Hu, W., Xiao, X., Fu, Z., Xie, D.: A system for learning statistical motion patterns. TPAMI 28(9), 1450–1464 (2006)
Johnson, N., Hogg, D.: Learning the distribution of object trajectories for event recognition. In: BMVC 1995, Surrey, UK, pp. 583–592 (1995)
Khalid, S., Naftel, A.: Classifying spatiotemporal object trajectories using unsupervised learning of basis function coefficients. In: VSSN 2005: Proc. of Intl Workshop on Video Surveillance & Sensor Networks (2005)
Laptev, I., Lindeberg, T.: Space-time interest points. In: ICCV, pp. 432–439 (2003)
Morris, B.T., Trivedi, M.M.: Learning and classification of trajectories in dynamic scenes: A general framework for live video analysis. In: AVSS 2008 (2008)
Owens, J., Hunter, A.: Application of the self-organizing map to trajectory classification. In: VS 2000: Proc. of the Third IEEE Int. Workshop on Visual Surveillance (2000)
Piciarelli, C., Foresti, G.L.: On-line trajectory clustering for anomalous events detection. Pattern Recogn. Lett. 27(15), 1835–1842 (2006)
Porikli, F.: Learning object trajectory patterns by spectral clustering. In: IEEE International Conference on Multimedia and Expo., ICME 2004, vol. 2, pp. 1171–1174 (2004)
Romdhane, R., Mulin, E., Derreumeaux, A., Zouba, N., Piano, J., Lee, L., Leroi, I., Mallea, P., David, R., Thonnat, M., Bremond, F., Robert, P.: Automatic video monitoring system for assessment of alzheimer’s disease symptoms. JNHA - The Journal of Nutrition, Health and Aging Ms. No. JNHA-D-11-00004R1 (2011)
Calderara, S., Cucchiara, R., Prati, A.: Detection of abnormal behaviors using a mixture of von mises distributions. In: IEEE AVSS (2007)
Shi, J., Tomasi, C.: Good features to track. In: IEEE CVPR 1994, p. 593–600 (1994)
Zouba, N., Bremond, F., Thonnat, M.: Multisensor fusion for monitoring elderly activities at home. In: AVSS 2009, Genoa, Italy (September 2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pusiol, G., Bremond, F., Thonnat, M. (2011). Unsupervised Discovery, Modeling, and Analysis of Long Term Activities. In: Crowley, J.L., Draper, B.A., Thonnat, M. (eds) Computer Vision Systems. ICVS 2011. Lecture Notes in Computer Science, vol 6962. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23968-7_11
Download citation
DOI: https://doi.org/10.1007/978-3-642-23968-7_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23967-0
Online ISBN: 978-3-642-23968-7
eBook Packages: Computer ScienceComputer Science (R0)