Abstract
This paper presents a framework for identifying and aligning nearly-repetitive content in a video stream using spatio-temporal manifold embedding. Similarities between frame sequences are captured by two types of correlation graph: an intra-correlation graph in the spatial domain and an inter-correlation graph in the temporal domain. The work is novel in that it uses no prior information, such as the length or content of the repetitive scenes. It requires no template and involves no learning process. Instead, it analyses the video content using a spatio-temporal extension of SIFT combined with a coding technique, and then reconstructs the underlying structure using manifold embedding. Experiments on a TRECVID rushes video showed that the framework improved the embedding of repetitive sequences over conventional methods, and was thus able to identify repetitive content in complex scenes.
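To make the idea of aligning repeated takes through manifold embedding concrete, the following is a minimal sketch, not the method of the paper. It assumes a generic Isomap-style embedding (k-nearest-neighbour graph, geodesic distances, classical MDS) and synthetic per-frame descriptors. It omits the spatio-temporal SIFT features, the coding step and the intra- and inter-correlation graphs. All function names and parameters (`toy_descriptors`, `knn_graph`, `isomap`, `k`, `dim`) are hypothetical and chosen for illustration only.

```python
import numpy as np


def toy_descriptors(n_frames=20, noise=0.01, seed=0):
    """Synthetic stand-in for per-frame descriptors: two noisy takes of one scene."""
    rng = np.random.default_rng(seed)
    t = np.linspace(0.0, 3.0, n_frames)
    curve = np.stack([np.cos(t), np.sin(t), 0.5 * t], axis=1)
    takes = [curve + noise * rng.standard_normal(curve.shape) for _ in range(2)]
    return np.vstack(takes)  # rows 0..n-1: take 1, rows n..2n-1: take 2


def knn_graph(X, k):
    """Symmetric k-NN graph with Euclidean edge weights (inf means no edge)."""
    n = X.shape[0]
    d = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    g = np.full((n, n), np.inf)
    np.fill_diagonal(g, 0.0)
    for i in range(n):
        order = np.argsort(d[i])
        nbrs = order[order != i][:k]  # k closest frames, excluding the frame itself
        g[i, nbrs] = d[i, nbrs]
        g[nbrs, i] = d[i, nbrs]
    return g


def geodesic_distances(g):
    """All-pairs shortest paths over the graph (Floyd-Warshall)."""
    dist = g.copy()
    for m in range(dist.shape[0]):
        dist = np.minimum(dist, dist[:, m:m + 1] + dist[m:m + 1, :])
    return dist


def classical_mds(dist, dim):
    """Embed points so that Euclidean distances approximate the given distances."""
    n = dist.shape[0]
    J = np.eye(n) - np.ones((n, n)) / n       # centring matrix
    B = -0.5 * J @ (dist ** 2) @ J            # double-centred squared distances
    vals, vecs = np.linalg.eigh(B)
    top = np.argsort(vals)[::-1][:dim]        # largest eigenvalues first
    return vecs[:, top] * np.sqrt(np.maximum(vals[top], 0.0))


def isomap(X, k=4, dim=2):
    """Isomap-style embedding: k-NN graph, geodesic distances, classical MDS."""
    dist = geodesic_distances(knn_graph(X, k))
    if not np.all(np.isfinite(dist)):
        raise ValueError("k-NN graph is disconnected; try a larger k")
    return classical_mds(dist, dim)


# Usage: for each frame of take 1, find the closest frame of take 2 in the embedding.
X = toy_descriptors()
Y = isomap(X, k=4, dim=2)
n = X.shape[0] // 2
matches = [int(np.argmin(np.linalg.norm(Y[n:] - Y[i], axis=1))) for i in range(n)]
print(matches)
```

In this toy setting, frames showing the same moment in the two takes land close together in the embedded space, so a nearest-neighbour lookup gives a rough alignment. Real footage would need robust frame descriptors and a graph construction suited to video, which is the part the paper addresses.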
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Al Ghamdi, M., Gotoh, Y. (2013). Spatio-temporal Manifold Embedding for Nearly-Repetitive Contents in a Video Stream. In: Wilson, R., Hancock, E., Bors, A., Smith, W. (eds) Computer Analysis of Images and Patterns. CAIP 2013. Lecture Notes in Computer Science, vol 8047. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40261-6_8
DOI: https://doi.org/10.1007/978-3-642-40261-6_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40260-9
Online ISBN: 978-3-642-40261-6
eBook Packages: Computer Science (R0)