Abstract
This chapter provides an overview of various methods for motion trajectory-based video contentmodeling, retrieval and classification. The techniques discussed form the foundation for content-based video indexing and retrieval (CBVIR) systems. We focus on view-invariant representations of single and multiple motion trajectories based on null-space invariants that allows for video retrieval and classification from unknown and moving camera views. We introduce methods based on matrix and tensor decomposition for efficient storage and retrieval of single and multiple motion trajectories, respectively. We subsequently explore the use of one- and multi-dimensional hidden Markov models for video classification and recognition based on single and multiple motion trajectories. We summarize the basic concepts and present computer simulation results to demonstrate the fundamental notions introduced throughout the chapter.We finally discuss several open problems in the field of motion trajectory analysis and future trends in content-based video modeling, retrieval and classification.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Yuan, J., Wang, H., Zheng, W., Li, J., Lin, F., Zhang, B.: A Formal Study of Shot Boundary Detection. IEEE Transactions on Circuit and Systems for Video Technology, 168–186 (2007)
Hanjalic, A.: Shot-boundary detection: unraveled or resolved? IEEE Transactions on Circuit and Systems for Video Technoligy 12(2), 90–105 (2002)
Lienhart, R.: Reliable Transition Detection in Videos: A Survey and Practitioner’s Guide. International Journal of Image and Graphics 1, 469–486 (2001)
Lelescu, D., Schonfeld, D.: Statistical sequential analysis for real-time scene hange detection on compressed multimedia bitstream. IEEE Transactions on Multimedia 5, 106–117 (2003)
Johansson, G.: Visual Perception of Biological Motion and a Model for its Analysis. Perception and Psychophysics 14(2), 201–211 (1973)
Yilmaz, A., Javed, O., Shah, M.: Object tracking: A survey. ACM Comput. Surv. 38(4), 45 (2006)
Zhao, T., Nevatia, R.: Tracking multiple humans in crowded environment. In: Proc. IEEE Int. Conf. Compt. Vision and Pattern Recognit., vol. 2, pp. 406–413 (2004)
Chang, C., Ansari, R., Khokhar, A.: Multiple Object Tracking with Kernal Particle Filter. In: Proc. IEEE Int. Conf. Compt. Vision and Pattern Recognit., vol. 1, pp. 566–573 (2005)
Schonfeld, D., Lelescu, D.: VORTEX: Video retrieval and tracking from compressed multimedia databases-multiple object tracking from MPEG-2 bitstream. Journal of Visual Communications and Image Representation, Special Issue on Multimedia Database Management 11, 154–182 (2000) (invited paper)
Hariharakrishnan, K., Schonfeld, D.: Fast object tracking using adaptive block matching. IEEE Transactions on Multimedia 7(5), 853–859 (2005)
Isard, M., Blake, A.: Condensation-Conditional Density Propagation for Visual Tracking. International Journal of Computer Vision 29(1) (1998)
Qu, W., Schonfeld, D., Mohamed, M.: Real-time distributed multi-object tracking using multiple interactive trackers and a magnetic-inertia potential model. IEEE Transactions on Multimedia 9, 511–519 (2007)
Chang, S.F., Chen, W., Meng, H.J., Sundaram, H., Zhong, D.: A Fully Automated Content-Based Video Search Engine Supporting Spatiotemporal Queries. IEEE Trans. Circuits Syst. Video Technol. 8(5), 602–615 (1998)
AbouGhazaleh, N., Gamal, Y.E.: Compressed Video Indexing Based on Object’s Motion. In: Int. Conf. Visual Communication and Image Processing, VCIP 2000, Perth, Australia, pp. 986–993 (2000)
Katz, B., Lin, J., Stauffer, C., Grimson, E.: Answering questions about moving objects in surveillance videos. In: Proceedings of 2003 AAAI Spring Symp. New Directions in Question Answering, Palo Alto, CA (2003)
Rea, N., Dahyot, R., Kokaram, A.: Semantic Event Detection in Sports Through Motion Understanding. In: Enser, P.G.B., Kompatsiaris, Y., O’Connor, N.E., Smeaton, A., Smeulders, A.W.M. (eds.) CIVR 2004. LNCS, vol. 3115, pp. 21–23. Springer, Heidelberg (2004)
Bashir, F.I., Khokhar, A.A., Schonfeld, D.: A Hybrid System for Affine- Invariant Trajectory Retrieval. In: Proceedings of the 6th ACM SIGMM International Workshop on Multimedia Information Retrieval, New York (2004)
Bashir, F.I., Khokhar, A.A., Schonfeld, D.: Object trajectory-based activity classification and recognition using hidden Markov models. IEEE Transactions on Image Processing 16, 1912–1919 (2007)
Bashir, F.I., Khokhar, A.A., Schonfeld, D.: Real-time motion trajectory-based indexing and retrieval of video sequences. IEEE Transactions on Multimedia 9, 58–65 (2007)
Bashir, F.I., Khokhar, A.A., Schonfeld, D.: Real-time affine-invariant motion trajectory-based retrieval and classification of video sequences from arbitrary camera view. ACM Multimedia Systems Journal, Special Issue on Machine Learning Approaches to Multimedia Information Retrieval 12, 45–54 (2006)
The University of California at Irvine Knowledge Discovery in Databases (KDD) archive, http://kdd.ics.uci.edu
Fawcett, T.: Roc graphs: Notes and practical considerations for researchers, Technical Report, HP Labs, HPL-2003-4 (2004)
Vaswani, N., Chellappa, R.: Principal Components Null Space Analysis for Image and Video Classification. IEEE Trans. Image Processing (July 2006)
Chen, X., Schonfeld, D., Khokhar, A.: Robust null space representation and sampling for view invariant motion trajectory analysis. In: IEEE Conference on Computer Vision and Pattern Recognition (2008)
Chen, X., Schonfeld, D., Khokhar, A.: Localized Null Space Representation for Dynamic Updating and Downdating in Image and Video Databases. In: IEEE International Conference on Image Processing, ICIP (2009)
Sahouria, E., Zakhor, A.: A Trajectory Based Video Indexing System For Street Surveillance. In: IEEE Int. Conf. on Image Processing (ICIP), pp. 24–28 (1999)
Chen, W., Chang, S.F.: Motion Trajectory Matching of Video Objects. In: IS&T/ SPIE, pp. 544–553 (2000)
Bashir, F.I., Khokhar, A.A., Schonfeld, D.: Segmented trajectory based indexing and retrieval of video data. In: Proc. IEEE Int. Conf. Image Processing, pp. 623–626 (2003)
Ma, X., Bashir, F., Knokhar, A., Schonfeld, D.: Event Analysis Based on Multiple Interactive Motion Trajectories. IEEE Trans. on Circuits and Syst. for Video Technology 19(3) (2009)
Ma, X., Bashir, F., Knokhar, A., Schonfeld, D.: Tensor-based Multiple Object Trajectory Indexing and Retrieval. In: Proc. IEEE Int. Conf. on Multimedia and Expo. (ICME), Toronto, Canada, pp. 341–344 (2006)
Lathauwer, L., Moor, B.D., Vandewalle, J.: A multilinear singular value decomposition. SIAM Journal on Matrix Analysis and Applicat. (SIMAX) 21(4), 1253–1278 (2000)
Lathauwer, L.D., Moor, B.D.: From Matrix to Tensor: Multilinear Algebra and Signal Processing. In: Proc. 4th IMA Int. Conf. Mathmatics in Signal Process., pp. 1–15 (1996)
Harshman, R.A.: Foundations of the PARAFAC procedure: Model and Conditions for an “explanatory” multi-mode factor analysis. UCLA Working Papers in Phonetics, pp.1-84 (1970)
The Context Aware Vision using Image-based Active Recognition (CAVIAR) dataset, http://homepages.inf.ed.ac.uk/rbf/CAVIAR/
Brezeale, D., Cook, D.J.: Automatic Video Classification: A Survey of the Literature. IEEE Transactions on Systems, Man and Cybernetics, Part C: Applications and Reviews 38(3) (2008)
Rabiner, L.R.: A tutorial on hidden markov models and selected applications in speech recognition. Proceedings of the IEEE 77, 257–286 (1989)
Starner, T., Pentland, A.: Real-Time American Sign Language Recognition From Video Using Hidden Markov Models, Technical Report, MIT Media Lab, Perceptual Computing Group, vol. 375 (1995)
Raphael, C.: Automatic Segmentation of Acoustic Musical Signals Using Hidden Markov Models. IEEE Transactions on Pattern Analysis and Machine Intelligence 21(4), 360–370 (1999)
Bashir, F.I., Khokhar, A.A., Schonfeld, D.: HMM based motion recognitionsystem using segmented pca. In: IEEE International Conference on Image Processing (ICIP 2005), vol. 3, pp. 1288–1291 (2005)
Ma, X., Schonfeld, D., Khokhar, A.: Distributed multidimensional hidden Markov Model: theory and application in multiple-object trajectory classification and recognition. In: SPIE International Conference on Multimedia Content Access: Algorithms and Systems, San Jose, California (2008)
Ma, X., Schonfeld, D., Khokhar, A.: Image segmentation and classification based on a 2D distributed hidden Markov model. In: SPIE International Conference on Visual Communications and Image Processing (VCIP 2008), San Jose, California (2008)
Ma, X., Schonfeld, D., Khokhar, A.: Distributed Multi-dimensional Hidden Markov Models for Image and Trajectory-Based Video Classification. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2008), Las Vegas, Nevada (2008)
Ma, X., Schonfeld, D., Khokhar, A.: Video Event Classification and Image Segmentation Based on Non-Causal Multi-Dimensional Hidden Markov Models. IEEE Transactions on Image Processing, T-IP (May 2009) (to appear)
Baum, L.E., Petrie, T., Soules, G., Weiss, N.: A maximization technique occuring in the statistical analysis of probabilistic functions of markov chains. Ann. Math. Stat. 1, 164–171 (1970)
Li, J., Najmi, A., Gray, R.M.: Image classification by a two-dimensional hidden markov model. IEEE Trans. on Signal Processing 48, 517–533 (2000)
Schonfeld, D., Bouaynaya, N.: A new method for multidimensional optimization and its application in image and video processing. IEEE Signal Processing Letters 13, 485–488 (2006)
Ma, X., Khokhar, A., Schonfeld, D.: Robust video mining based on local similarity alignment of motion trajectories. In: IEEE Conference on Image Processing (ICIP 2009), Cairo, Egypt (2009)
Ma, X., Schonfeld, D., Khokhar, A.: Dynamic updating and downdating matrix SVD and tensor HOSVD for adaptive indexing and retrieval of motion trajectories. In: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan (2009)
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Ma, X., Chen, X., Khokhar, A., Schonfeld, D. (2010). Motion Trajectory-Based Video Retrieval, Classification, and Summarization. In: Schonfeld, D., Shan, C., Tao, D., Wang, L. (eds) Video Search and Mining. Studies in Computational Intelligence, vol 287. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12900-1_3
Download citation
DOI: https://doi.org/10.1007/978-3-642-12900-1_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12899-8
Online ISBN: 978-3-642-12900-1
eBook Packages: EngineeringEngineering (R0)