From Low-Level Features to Semantic Classes: Spatial and Temporal Descriptors for Video Indexing

Zampoglou, Markos; Papadimitriou, Theophilos; Diamantaras, Konstantinos I.

doi:10.1007/s11265-008-0314-3

From Low-Level Features to Semantic Classes: Spatial and Temporal Descriptors for Video Indexing

Published: 26 November 2008

Volume 61, pages 75–83, (2010)
Cite this article

Journal of Signal Processing Systems Aims and scope Submit manuscript

Markos Zampoglou¹,
Theophilos Papadimitriou² &
Konstantinos I. Diamantaras³

177 Accesses
4 Citations
Explore all metrics

Abstract

As the quantity of publicly available multimedia material becomes larger and larger, automatic indexing becomes increasingly important in accessing multimedia databases. In this paper, a novel set of low-level descriptors is presented for the aim of content-based video classification. Concerning temporal features, we use a modified PMES descriptor for the spatial distribution of local motion and a Dominant Direction Histogram we have developed to represent the temporal distribution of camera motion. Concerning color, we present the Weighted Color Histogram we have designed in order to model color distribution. The histogram models the H parameter of the HSV color space, and we combine it with weighted means for the S and V parameters. For the selection of key-frames from which to extract the spatial descriptors we use a modified version of a simple efficient method. We then proceed to evaluate our descriptor set on a database of video shots resulting from the temporal segmentation of the archive of a real-world TV station. Results demonstrate that our approach can achieve high success rates on a wide range of semantic classes.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Attention mechanisms in computer vision: A survey

Article Open access 15 March 2022

Meng-Hao Guo, Tian-Xing Xu, … Shi-Min Hu

The Pascal Visual Object Classes Challenge: A Retrospective

Article 25 June 2014

Mark Everingham, S. M. Ali Eslami, … Andrew Zisserman

Perceptual image quality assessment: a survey

Article 26 April 2020

Guangtao Zhai & Xiongkuo Min

References

Datta, R., Li, J., Wang, J. Z. (2005). Content-based image retrieval: approaches and trends of the new age. Proceedings of the 7th International Workshop on Multimedia Information Retrieval, in conjunction with ACM International Conference on Multimedia, pp. 253–262.
Smeulders, A. W. M., Worring, M., Santini, S., Gupta, A., & Jain, R. (2000). Content-based image retrieval at the end of the early years. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22, 1349–1380. doi:10.1109/34.895972.
Article Google Scholar
Koprinska, I., & Carrato, S. (2001). Temporal video segmentation: a survey. Signal Processing: Image Communication, 8, 477–500. doi:10.1016/S0923-5965(00)00011-4.
Article Google Scholar
Nagasaka, A., Tanaka, Y. (1991) Automatic video indexing and full-video search for object appearances. Proceedings of the IFIP TC2/WG 2.6 Second Working Conference on Visual Database Systems II, pp 113–127.
Ardizzone, E., Gatani, L., La Cascia, M., Lo Re, G., Ortolani, M. (2006) Advances in Multimedia Modelling. Springer Berlin, chapter “A P2P Architecture for Multimedia Content Retrieval,” pp. 462–474.
Chen, J. F., Liao, H. Y. M., Lin, C. W. (2005) Knowledge-Based Intelligent Information and Engineering Systems. Springer Berlin/Heidelberg, chapter “Fast Video Retrieval via the Statistics of Motion Within the Regions-of-Interest”.
Fablet, R., Bouthemy, P., & Pérez, P. (2002). Non-parametric motion characterization using causal probabilistic models for video indexing and retrieval. IEEE Transactions on Image Processing, 11, 393–407. doi:10.1109/TIP.2002.999674.
Article Google Scholar
Fablet, R., & Bouthemy, P. (2000). Statistical motion-based object indexing using optic flow field. IEEE International Conference on Pattern Recognition, 4, 287–290.
Google Scholar
Piriou, G., Bouthemy, P., & Yao, J. F. (2006). Recognition of dynamic video contents with global probabilistic models of visual motion. IEEE Transactions on Image Processing, 15, 3418–3431.
Article Google Scholar
Shih, H. C., Huang, C. L. (2003). Image analysis and interpretation for semantics categorization in baseball video. IEEE International Conference on Information Technology: Coding and Computing [Computers and Communications], pp 379–383.
Ferman, A. M., Tekalp, A. M., & Mehrotra, R. (1998). Effective content representation for video. IEEE International Conference on Image Processing, 3, 521–525.
Google Scholar
Jeannin, S., & Divakaran, A. (2001). MPEG-7 visual motion descriptors. IEEE Transactions on Circuits and Systems for Video Technology, 11, 720–724. doi:10.1109/76.927428.
Article Google Scholar
Chia-Han, L., & Chen, A. L. P. (2001). Processing concept queries with object motions in video databases. IEEE International Conference on Image Processing, 2, 641–644.
Google Scholar
Zhen-Hua Zhang, Yong Quan, Wen-Hui Li, Wu Guo (2006). A new content-based image retrieval. Machine Learning and Cybernetics, IEEE International Conference on, pp 4013–4018.
Sural, S., Quian, G., & Pramanik, S. (2002). Segmentation and Histogram Generation Using the HSV Color Space for Image Retrieval. Proceedings. International Conference on Image Processing, 2, 589–592.
Google Scholar
Rautiainen, M., & Doermann, D. (2002). Temporal Color Correlograms for Video Retrieval. Proceedings, International Conference on Pattern Recognition, 2, 589–592.
Google Scholar
Williams, A., & Yoon, P. (2007). Content-based image retrieval using joint correlograms. Multimedia Tools and Application, 34, 239–248. doi:10.1007/s11042-006-0087-2.
Article Google Scholar
Yu-Fei, Ma, & Hong-Jiang, Zhang (2001). A new perceived motion based shot content representation. IEEE International Conference on Image Processing, 3, 426–429.
Google Scholar
Zampoglou, M., Papadimitriou, T., Diamantaras, K. I. (2007). Support Vector Machines Content-Based Video Retrieval Based Solely on Motion Information. Proc. 17th Int. Workshop on Machine Learning for Signal Processing, IEEE, Thessaloniki, Greece, pp 176–180.
Zampoglou, M., Papadimitriou, T., Diamantaras, K. I. (2008). Integrating Motion and Color for Content-Based Video Classification. 2008 IAPR Workshop on Cognitive Information Processing, Santorini, Greece.
Ferman, A. M., Tekalp, A. M., & Mehrotra, R. (2002). Robust Color Histogram Descriptors for Video Segment Retrieval and Identification. IEEE Transactions on Image Processing, 11, 497–508. doi:10.1109/TIP.2002.1006397.
Article Google Scholar
Cristianini, N., Shawe-Taylor, J. (2000). An Introduction to Support Vector Machines. Cambridge University Press.
Zhang, L., Fuzong Lin, Bo Zhang (2001). Support vector machine learning for image retrieval. International Conference on Image Processing, pp 721–724.
Mezaris, V., Kompatsiaris, I., Boulgouris, N. V., & Strintzis, M. G. (2004). Real-time compressed-domain spatiotemporal segmentation and ontologies for video indexing and retrieval. IEEE Transactions on Circuits and Systems for Video Technology, 14, 606–621. doi:10.1109/TCSVT.2004.826768.
Article Google Scholar
Joachims, T. Schϕlkopf, B., Burges, C., Smola, A. (eds.) (1999). Advances in Kernel Methods - Support Vector Learning. MIT, chapter “Making large-scale SVM learning practical,” pp. 169–184.

Download references

Author information

Authors and Affiliations

Department of Applied Informatics, University of Macedonia, Thessaloniki, 54006, Greece
Markos Zampoglou
Department International Economic Relations and Development, Democritus University of Thrace, Komotini, 69100, Greece
Theophilos Papadimitriou
Department of Informatics, TEI of Thessaloniki, Thessaloniki, 57400, Greece
Konstantinos I. Diamantaras

Authors

Markos Zampoglou
View author publications
You can also search for this author in PubMed Google Scholar
Theophilos Papadimitriou
View author publications
You can also search for this author in PubMed Google Scholar
Konstantinos I. Diamantaras
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Markos Zampoglou.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zampoglou, M., Papadimitriou, T. & Diamantaras, K.I. From Low-Level Features to Semantic Classes: Spatial and Temporal Descriptors for Video Indexing. J Sign Process Syst 61, 75–83 (2010). https://doi.org/10.1007/s11265-008-0314-3

Download citation

Received: 17 February 2008
Revised: 03 October 2008
Accepted: 27 October 2008
Published: 26 November 2008
Issue Date: October 2010
DOI: https://doi.org/10.1007/s11265-008-0314-3

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

From Low-Level Features to Semantic Classes: Spatial and Temporal Descriptors for Video Indexing

Abstract

Access this article

Similar content being viewed by others

Attention mechanisms in computer vision: A survey

The Pascal Visual Object Classes Challenge: A Retrospective

Perceptual image quality assessment: a survey

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

From Low-Level Features to Semantic Classes: Spatial and Temporal Descriptors for Video Indexing

Abstract

Access this article

Similar content being viewed by others

Attention mechanisms in computer vision: A survey

The Pascal Visual Object Classes Challenge: A Retrospective

Perceptual image quality assessment: a survey

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation