Stochastic Models of Video Structure for Program Genre Detection

Taskiran, Cuneyt M.; Pollak, Ilya; Bouman, Charles A.; Delp, Edward J.

doi:10.1007/978-3-540-39798-4_13

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2849)

Included in the following conference series:

International Workshop on Visual Content Processing and Representation

Abstract

In this paper we introduce stochastic models that characterize the structure of typical television program genres. We show how video sequences can be represented as discrete-symbol sequences derived from shot features. We then use these sequences to build hidden Markov models (HMMs) and hybrid HMM-SCFG (stochastic context-free grammar) models, which are used to automatically classify the sequences into genres. In contrast to previous methods that use SCFGs for video processing, we use unsupervised training without an a priori grammar.
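
The pipeline the abstract describes (shot features, then discrete symbols, then one model per genre, then classification) can be illustrated with a minimal sketch. It is not the authors' implementation: the shot-length binning, the bin edges, the genre labels `fast_cut` and `slow_cut`, and all model parameters are hypothetical and hand-set, whereas the paper trains its models without supervision. Only the plain HMM case is shown, using the standard scaled forward algorithm; the hybrid HMM-SCFG model is not covered.

```python
import math


def quantize_shots(shot_lengths, edges=(2.0, 6.0)):
    """Map each shot length (seconds) to a symbol index by binning.

    The bin edges are illustrative assumptions, not values from the paper.
    """
    return [sum(length >= edge for edge in edges) for length in shot_lengths]


def log_likelihood(symbols, start, trans, emit):
    """Scaled forward algorithm: log P(symbols | discrete HMM)."""
    if not symbols:
        return 0.0
    n_states = len(start)
    alpha = [start[i] * emit[i][symbols[0]] for i in range(n_states)]
    log_prob = 0.0
    for t in range(len(symbols)):
        if t > 0:
            # Propagate through the transition matrix, then weight by emission.
            alpha = [
                sum(alpha[i] * trans[i][j] for i in range(n_states))
                * emit[j][symbols[t]]
                for j in range(n_states)
            ]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")  # sequence is impossible under this model
        alpha = [a / scale for a in alpha]  # rescale to avoid underflow
        log_prob += math.log(scale)
    return log_prob


def classify(symbols, models):
    """Return the genre label whose HMM gives the highest log-likelihood."""
    return max(models, key=lambda genre: log_likelihood(symbols, *models[genre]))


# Hypothetical hand-set models: (start, transition, emission) per genre.
# Symbols: 0 = short shot, 1 = medium shot, 2 = long shot.
MODELS = {
    "fast_cut": (
        [0.5, 0.5],
        [[0.9, 0.1], [0.1, 0.9]],
        [[0.80, 0.15, 0.05], [0.60, 0.30, 0.10]],
    ),
    "slow_cut": (
        [0.5, 0.5],
        [[0.9, 0.1], [0.1, 0.9]],
        [[0.05, 0.15, 0.80], [0.10, 0.30, 0.60]],
    ),
}

if __name__ == "__main__":
    symbols = quantize_shots([1.2, 0.8, 1.5, 3.0, 1.1])
    print(symbols, classify(symbols, MODELS))
```

In a real system the per-genre parameters would be estimated from training sequences (for example with Baum-Welch) rather than written by hand, and the symbols would typically combine several shot features rather than shot length alone.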

Author information

Authors and Affiliations

School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN, 47907-1285, USA
Cuneyt M. Taskiran, Ilya Pollak, Charles A. Bouman & Edward J. Delp

Editor information

Editors and Affiliations

Grupo de Tratamiento de Imágenes, Universidad Politécnica de Madrid, 28040, Madrid, Spain
Narciso García & Luis Salgado
Grupo de Tratamiento de Imágenes, Escuela Politécnica Superior, Universidad Autónoma de Madrid, E-28049, Madrid, Spain
José M. Martínez

About this paper

Cite this paper

Taskiran, C.M., Pollak, I., Bouman, C.A., Delp, E.J. (2003). Stochastic Models of Video Structure for Program Genre Detection. In: García, N., Salgado, L., Martínez, J.M. (eds) Visual Content Processing and Representation. VLBV 2003. Lecture Notes in Computer Science, vol 2849. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39798-4_13

DOI: https://doi.org/10.1007/978-3-540-39798-4_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20081-9
Online ISBN: 978-3-540-39798-4
eBook Packages: Springer Book Archive
