Skip to main content

TV Genre Classification Using Multimodal Information and Multilayer Perceptrons

  • Conference paper
AI*IA 2007: Artificial Intelligence and Human-Oriented Computing (AI*IA 2007)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4733))

Included in the following conference series:

Abstract

Multimedia content annotation is a key issue in the current convergence of audiovisual entertainment and information media. In this context, automatic genre classification (AGC) provides a simple and effective solution to describe video contents in a structured and well understandable way. In this paper a method for classifying the genre of TV broadcasted programmes is presented. In our approach, we consider four groups of features, which include both low-level visual descriptors and higher level semantic information. For each type of these features we derive a characteristic vector and use it as input data of a multilayer perceptron (MLP). Then, we use a linear combination of the outputs of the four MLPs to perform genre classification of TV programmes. The experimental results on more than 100 hours of broadcasted material showed the effectiveness of our approach, achieving a classification accuracy of ~92%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Albiol, A., Fullà, M.J.C., Albiol, A., Torres, L.: Commercials detection using HMMs. In: Proc. of the Int. Workshop on Image Analysis for Multimedia Interactive Services (2004)

    Google Scholar 

  2. Brugnara, F., Cettolo, M., Federico, M., Giuliani, D.: A system for the segmentation and transcription of Italian radio news. In: Proc. of RIAO, Content-Based Multimedia Information Access (2000)

    Google Scholar 

  3. Dimitrova, N., Agnihotri, L., Wei, G.: Video classification based on HMM using text and faces. In: Proc. of the European Conference on Signal Processing (2000)

    Google Scholar 

  4. Dinh, P.Q., Dorai, C., Venkatesh, S.: Video genre categorization using audio wavelet coefficients. In: Proc. of the 5th Asian Conference on Computer Vision (2002)

    Google Scholar 

  5. Fräba, B., Küblbeck, C.: Orientation Template Matching for Face Localization in Complex Visual Scenes. In: Proc. of the IEEE Int. Conf. on Image Processing, pp. 251–254. IEEE Computer Society Press, Los Alamitos (2000)

    Google Scholar 

  6. Glasberg, R., Samour, A., Elazouzi, K., Sikora, T.: Cartoon-Recognition using Video & Audio-Descriptors. In: Proc. of the 13th European Signal Processing Conference (2005)

    Google Scholar 

  7. Kohavi, R.: A study of cross-validation and bootstrap for accuracy estimation and model selection. In: Proc. of the Int. Conf. on Artificial Intelligence (1995)

    Google Scholar 

  8. Ianeva, T.I., de Vries, A.P., Rohrig, H.: Detecting cartoons: a case study in automatic video-genre classification. In: Int. Conf. on Multimedia and Expo (2003)

    Google Scholar 

  9. Igel, C., Hüsken, M.: Improving the Rprop Learning Algorithm. In: Proc. of the 2nd Int. ICSC Symposium on Neural Computation (2000)

    Google Scholar 

  10. ISO/IEC 15398: Multimedia Content Description Interface (2001)

    Google Scholar 

  11. Jain, A.K., Duin, R.P.W., Mao, J.: Statistical Pattern Recognition: A Review. IEEE Trans. on Pattern Analysis and Machine Intelligence 22(1), 4–37

    Google Scholar 

  12. Liu, Z., Huang, J., Wang, Y., Chen, T.: Audio feature extraction and analysis for scene classification. In: IEEE Workshop on Multimedia Signal Processing, IEEE Computer Society Press, Los Alamitos (1997)

    Google Scholar 

  13. Liu, Z., Huang, J., Wang, Y.: Classification of TV programs based on audio information using Hidden Markov Model. In: Proc. of IEEE Workshop on Multimedia Signal Processing, IEEE Computer Society Press, Los Alamitos (1998)

    Google Scholar 

  14. Messina, A., Airola Gnota, D.: Automatic Archive Documentation Based on Content Analysis. In: IBC 2005 Conference Pubblication

    Google Scholar 

  15. Messina, A., Montagnuolo, M., Sapino, M.L: Characterizing multimedia objects through multimodal content analysis and fuzzy fingerprints. In: IEEE Int. Conf. on Signal-Image Technology and Internet-Based Systems, IEEE Computer Society Press, Los Alamitos (2006)

    Google Scholar 

  16. Montagnuolo, M., Messina, A.: Multimedia Knowledge Representation for Automatic Annotation of Broadcast TV Archives. In: Proc. of the 4th Int. Workshop on Multimedia Semantics (2006)

    Google Scholar 

  17. Polikar, R.: Ensemble Based Systems in Decision Making. IEEE Circuits and Systems Magazine 6(3), 21–45 (2006)

    Article  Google Scholar 

  18. Quinlan, J.R.: C4.5 - Programs for Machine Learning. Morgan Kaufmann, San Francisco (1993)

    Google Scholar 

  19. Roach, M.J., Mason, J.S.D., Pawlewski, M.: Video genre classification using dynamics. In: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, IEEE Computer Society Press, Los Alamitos (2001)

    Google Scholar 

  20. Sánchez, J.M., Binefa, X., Vitriá, J., Radeva, P.: Local Color Analysis for Scene Break Detection Applied to TV Commercials Recognition. In: Huijsmans, D.P., Smeulders, A.W.M. (eds.) VISUAL 1999. LNCS, vol. 1614, pp. 237–244. Springer, Heidelberg (1999)

    Google Scholar 

  21. Schwenker, F., Marinai, S.: Artificial Neural Networks in Pattern Recognition. In: Schwenker, F., Marinai, S. (eds.) ANNPR 2006. LNCS (LNAI), vol. 4087, Springer, Heidelberg (2006)

    Google Scholar 

  22. Snoek, C.G., Worring, M.: Multimodal video indexing: A review of the state-of-the-art. Multimedia Tools and Applications 25(1), 5–35 (2005)

    Article  Google Scholar 

  23. Swain, M.J., Ballard, D.H.: Color indexing. Int. Journal of Computer Vision 7(1), 11–32 (1991)

    Article  Google Scholar 

  24. Tamura, H., Mori, S., Yamawaki, T.: Texture features corresponding to visual perception. IEEE Trans. on Systems, Man and Cybernetics 8(6), 460–473 (1978)

    Article  Google Scholar 

  25. Taylor, J.S., Cristianini, N.: Support Vector Machines and other kernel-based learning methods. Cambridge University Press, Cambridge (2000)

    Google Scholar 

  26. Tekalp, M.: Digital Video Processing. Prentice-Hall, Englewood Cliffs (1995)

    Google Scholar 

  27. Tomasi, C.: Estimating Gaussian Mixture Densities with EM – A Tutorial. Duke University (2005)

    Google Scholar 

  28. Truong, B.T., Dorai, C., Venkatesh, S.: Automatic Genre Identification for Content-Based Video Categorization. In: Proc. of the 15th IEEE Int. Conf. on Pattern Recognition, IEEE Computer Society Press, Los Alamitos (2000)

    Google Scholar 

  29. Xu, L.Q., Li, Y.: Video classification using spatial-temporal features and PCA. In: Proc. of the IEEE Int. Conf. on Multimedia and Expo, IEEE Computer Society Press, Los Alamitos (2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Roberto Basili Maria Teresa Pazienza

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Montagnuolo, M., Messina, A. (2007). TV Genre Classification Using Multimodal Information and Multilayer Perceptrons. In: Basili, R., Pazienza, M.T. (eds) AI*IA 2007: Artificial Intelligence and Human-Oriented Computing. AI*IA 2007. Lecture Notes in Computer Science(), vol 4733. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74782-6_63

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-74782-6_63

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-74781-9

  • Online ISBN: 978-3-540-74782-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics