Abstract
In this paper, we propose a novel Spatio-Temporal Analysis and Retrieval model to extract attributes for video category classification. First, the spatial relationships and temporal nature of the video object in a frame is coded as the sequence of binary string –VRstring. Then, the similarity between shots is matched as sequential features in hyperspaces. The results show that VRstring allows us to define higher level semantic features capturing the main narrative structures of the video. We also compare our algorithm with state of the art longest common substring finding video retrieval model by Adjeroh et.al.[1] on the Minerva international video benchmark.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Niblack, W., Zhu, X., Hafner, J.L., Bruel, T., Ponceleon, D.B., Petkovic, D., Flickner, M., Upfal, E., Nin, S.I., Sull, S., Dom, B.E.: Updates to the QBIC system. In: Proceedings of IS&T SPIE, Storage and Retrieval for Image and Video Databases VI, San Jose, vol. 3312, pp. 150–161 (1998)
Hampapur, A., Gupta, B., Horowitz, C.-F., Shu, C., Fuller, J.R., Bach, M., Gorkani, R.: Virage video engine. In: Proceedings of SPIE Storage and Retrieval for Image and Video Databases V, vol. 3022, pp. 188–198 (1997)
Chang, S.-F., Chen, W., Meng, H.J., Sundaram, H., Zhong, D.: A fully automated content-based video search engine supporting spatio-temporal queries. IEEE Transactions on Circuits and Systems for Video Technology 8(5), 602–615 (1998)
Smith, J.R., Chang, S.-F.: An image and video search engine for the world-wide web. In: Proceedings of Symposium on Electronic Imaging: Science and Technology - Storage & Retrieval for Image and Video Databases V, vol. 3022, pp. 84–95 (1997)
Jasinschi, R.S., et al.: Integrated multimedia processing for topic segmentation and classification. In: Proceedings of IEEE International Conference Image Processing (ICIP 2001). IEEE CS Press, Los Alamitos (2001)
Kim, S.H., Park, R.-H.: An efficient algorithm for video sequence matching using the modified Hausdorff distance and the directed divergence. IEEE Transactions on Circuits and Systems for Video Technology 12(7), 592–596 (2002)
Adjeroh, D.A., Lee, M.C., King, I.: A distance measure for video sequences. Computer Vision and Image Understanding 75(1/2), 25–45 (1999)
Hsieh, J.-W., Yu, S.-L., Chen, Y.-S.: Motion-based video retrieval by trajectory matching. IEEE Transactions on Circuits and Systems for Video Technology 16(3), 396–409 (2006)
Dao, M.-S., DeNatale, F.G.B., Massa, A.: Video retrieval using video object-trajectory and edge potential function. In: Proceedings of 2004 International Symposium on Intelligent Multimedia, Video and Speech Processing, October 20-22, pp. 454–457 (2004)
Lie, W.-N., Hsiao, W.-C.: Content-based video retrieval based on object motion trajectory. In: Proceeding of IEEE Workshop on Multimedia Signal Processing, December 9-11, pp. 237–240 (2002)
Deng, Y., Manjunath, B.S.: NeTra-V: Towards an object-based video representation. IEEE Transactions on Circuits and Systems for Video Technology 8, 616–627 (1998)
Ren, W., Singh, S.: Automatic video segmentation using machine learning. In: Yang, Z.R., Yin, H., Everson, R.M. (eds.) IDEAL 2004. LNCS, vol. 3177, pp. 285–292. Springer, Heidelberg (2004)
Ren, W., Singh, S.: An Automatic Video Annotation System. In: Proceedings of 3rd International Conference on Advances in Pattern Recognition (ICAPR), Bath, UK (August 2005)
Bimbo, D., Vicario, E., Zingoni, D.: Symbolic description and visual querying of image sequences using spatiotemporal logic. IEEE Transactions in Knowledge Data Engineering 7, 609–622 (1995)
Lee, S., Hsu, F.: Spatial reasoning and similarity retrieval of images using 2D C-string knowledge representation. Pattern Recognition 25, 305–318 (1992)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Singh, S., Ren, W., Singh, M. (2009). A Novel Approach to Spatio-Temporal Video Analysis and Retrieval. In: Gagalowicz, A., Philips, W. (eds) Computer Vision/Computer Graphics CollaborationTechniques. MIRAGE 2009. Lecture Notes in Computer Science, vol 5496. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-01811-4_10
Download citation
DOI: https://doi.org/10.1007/978-3-642-01811-4_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-01810-7
Online ISBN: 978-3-642-01811-4
eBook Packages: Computer ScienceComputer Science (R0)