TV Genre Classification Using Multimodal Information and Multilayer Perceptrons

Montagnuolo, Maurizio; Messina, Alberto

doi:10.1007/978-3-540-74782-6_63

Maurizio Montagnuolo¹ &
Alberto Messina²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4733))

Included in the following conference series:

Congress of the Italian Association for Artificial Intelligence

1664 Accesses
9 Citations

Abstract

Multimedia content annotation is a key issue in the current convergence of audiovisual entertainment and information media. In this context, automatic genre classification (AGC) provides a simple and effective solution to describe video contents in a structured and well understandable way. In this paper a method for classifying the genre of TV broadcasted programmes is presented. In our approach, we consider four groups of features, which include both low-level visual descriptors and higher level semantic information. For each type of these features we derive a characteristic vector and use it as input data of a multilayer perceptron (MLP). Then, we use a linear combination of the outputs of the four MLPs to perform genre classification of TV programmes. The experimental results on more than 100 hours of broadcasted material showed the effectiveness of our approach, achieving a classification accuracy of ~92%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Albiol, A., Fullà, M.J.C., Albiol, A., Torres, L.: Commercials detection using HMMs. In: Proc. of the Int. Workshop on Image Analysis for Multimedia Interactive Services (2004)
Google Scholar
Brugnara, F., Cettolo, M., Federico, M., Giuliani, D.: A system for the segmentation and transcription of Italian radio news. In: Proc. of RIAO, Content-Based Multimedia Information Access (2000)
Google Scholar
Dimitrova, N., Agnihotri, L., Wei, G.: Video classification based on HMM using text and faces. In: Proc. of the European Conference on Signal Processing (2000)
Google Scholar
Dinh, P.Q., Dorai, C., Venkatesh, S.: Video genre categorization using audio wavelet coefficients. In: Proc. of the 5th Asian Conference on Computer Vision (2002)
Google Scholar
Fräba, B., Küblbeck, C.: Orientation Template Matching for Face Localization in Complex Visual Scenes. In: Proc. of the IEEE Int. Conf. on Image Processing, pp. 251–254. IEEE Computer Society Press, Los Alamitos (2000)
Google Scholar
Glasberg, R., Samour, A., Elazouzi, K., Sikora, T.: Cartoon-Recognition using Video & Audio-Descriptors. In: Proc. of the 13th European Signal Processing Conference (2005)
Google Scholar
Kohavi, R.: A study of cross-validation and bootstrap for accuracy estimation and model selection. In: Proc. of the Int. Conf. on Artificial Intelligence (1995)
Google Scholar
Ianeva, T.I., de Vries, A.P., Rohrig, H.: Detecting cartoons: a case study in automatic video-genre classification. In: Int. Conf. on Multimedia and Expo (2003)
Google Scholar
Igel, C., Hüsken, M.: Improving the Rprop Learning Algorithm. In: Proc. of the 2nd Int. ICSC Symposium on Neural Computation (2000)
Google Scholar
ISO/IEC 15398: Multimedia Content Description Interface (2001)
Google Scholar
Jain, A.K., Duin, R.P.W., Mao, J.: Statistical Pattern Recognition: A Review. IEEE Trans. on Pattern Analysis and Machine Intelligence 22(1), 4–37
Google Scholar
Liu, Z., Huang, J., Wang, Y., Chen, T.: Audio feature extraction and analysis for scene classification. In: IEEE Workshop on Multimedia Signal Processing, IEEE Computer Society Press, Los Alamitos (1997)
Google Scholar
Liu, Z., Huang, J., Wang, Y.: Classification of TV programs based on audio information using Hidden Markov Model. In: Proc. of IEEE Workshop on Multimedia Signal Processing, IEEE Computer Society Press, Los Alamitos (1998)
Google Scholar
Messina, A., Airola Gnota, D.: Automatic Archive Documentation Based on Content Analysis. In: IBC 2005 Conference Pubblication
Google Scholar
Messina, A., Montagnuolo, M., Sapino, M.L: Characterizing multimedia objects through multimodal content analysis and fuzzy fingerprints. In: IEEE Int. Conf. on Signal-Image Technology and Internet-Based Systems, IEEE Computer Society Press, Los Alamitos (2006)
Google Scholar
Montagnuolo, M., Messina, A.: Multimedia Knowledge Representation for Automatic Annotation of Broadcast TV Archives. In: Proc. of the 4th Int. Workshop on Multimedia Semantics (2006)
Google Scholar
Polikar, R.: Ensemble Based Systems in Decision Making. IEEE Circuits and Systems Magazine 6(3), 21–45 (2006)
Article Google Scholar
Quinlan, J.R.: C4.5 - Programs for Machine Learning. Morgan Kaufmann, San Francisco (1993)
Google Scholar
Roach, M.J., Mason, J.S.D., Pawlewski, M.: Video genre classification using dynamics. In: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, IEEE Computer Society Press, Los Alamitos (2001)
Google Scholar
Sánchez, J.M., Binefa, X., Vitriá, J., Radeva, P.: Local Color Analysis for Scene Break Detection Applied to TV Commercials Recognition. In: Huijsmans, D.P., Smeulders, A.W.M. (eds.) VISUAL 1999. LNCS, vol. 1614, pp. 237–244. Springer, Heidelberg (1999)
Google Scholar
Schwenker, F., Marinai, S.: Artificial Neural Networks in Pattern Recognition. In: Schwenker, F., Marinai, S. (eds.) ANNPR 2006. LNCS (LNAI), vol. 4087, Springer, Heidelberg (2006)
Google Scholar
Snoek, C.G., Worring, M.: Multimodal video indexing: A review of the state-of-the-art. Multimedia Tools and Applications 25(1), 5–35 (2005)
Article Google Scholar
Swain, M.J., Ballard, D.H.: Color indexing. Int. Journal of Computer Vision 7(1), 11–32 (1991)
Article Google Scholar
Tamura, H., Mori, S., Yamawaki, T.: Texture features corresponding to visual perception. IEEE Trans. on Systems, Man and Cybernetics 8(6), 460–473 (1978)
Article Google Scholar
Taylor, J.S., Cristianini, N.: Support Vector Machines and other kernel-based learning methods. Cambridge University Press, Cambridge (2000)
Google Scholar
Tekalp, M.: Digital Video Processing. Prentice-Hall, Englewood Cliffs (1995)
Google Scholar
Tomasi, C.: Estimating Gaussian Mixture Densities with EM – A Tutorial. Duke University (2005)
Google Scholar
Truong, B.T., Dorai, C., Venkatesh, S.: Automatic Genre Identification for Content-Based Video Categorization. In: Proc. of the 15th IEEE Int. Conf. on Pattern Recognition, IEEE Computer Society Press, Los Alamitos (2000)
Google Scholar
Xu, L.Q., Li, Y.: Video classification using spatial-temporal features and PCA. In: Proc. of the IEEE Int. Conf. on Multimedia and Expo, IEEE Computer Society Press, Los Alamitos (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

Università degli Studi di Torino, Dip. di Informatica, Torino, Italy
Maurizio Montagnuolo
RAI Centro Ricerche e Innovazione Tecnologica, Torino, Italy
Alberto Messina

Authors

Maurizio Montagnuolo
View author publications
You can also search for this author in PubMed Google Scholar
Alberto Messina
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Roberto Basili Maria Teresa Pazienza

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Montagnuolo, M., Messina, A. (2007). TV Genre Classification Using Multimodal Information and Multilayer Perceptrons. In: Basili, R., Pazienza, M.T. (eds) AI*IA 2007: Artificial Intelligence and Human-Oriented Computing. AI*IA 2007. Lecture Notes in Computer Science(), vol 4733. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74782-6_63

Download citation

DOI: https://doi.org/10.1007/978-3-540-74782-6_63
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74781-9
Online ISBN: 978-3-540-74782-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics