Skip to main content

Stochastic Models of Video Structure for Program Genre Detection

  • Conference paper
Visual Content Processing and Representation (VLBV 2003)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2849))

Included in the following conference series:

Abstract

In this paper we introduce stochastic models that characterize the structure of typical television program genres. We show how video sequences can be represented using discrete-symbol sequences derived from shot features. We then use these sequences to build HMM and hybrid HMM-SCFG models which are used to automatically classify the sequences into genres. In contrast to previous methods for using SCGFs for video processing, we use unsupervised training without an a priori grammar.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Taskiran, C., Bouman, C., Delp, E.J.: The ViBE video database system: An update and further studies. In: Proceedings of the SPIE/IS&T Conference on Storage and Retrieval for Media Databases 2000, San Jose, CA, pp. 199–207 (2000)

    Google Scholar 

  2. Adams, B., Dorai, C., Venkatesh, S.: Study of shot length and motion as contributing factors to movie tempo. In: Proceedings of the ACM International Conference on Multimedia, Los Angeles, CA, pp. 353–355 (2000)

    Google Scholar 

  3. Vasconcelos, N., Lippman, A.: Statistical models of video structure for content analysis and characterization. IEEE Transactions in Image Processing 9, 3–19 (2000)

    Article  Google Scholar 

  4. Rissanen, J.: A universal prior for integers and estimation by minimum description length. The Annals of Statistics 11, 417–431 (1983)

    Article  MathSciNet  Google Scholar 

  5. Liu, Z., Huang, J., Wang, Y.: Classification of TV programs based on audio information using hidden Markov model. In: IEEE Second Workshop on Multimedia Signal Processing, Redondo Beach, CA, pp. 27–32 (1998)

    Google Scholar 

  6. Alatan, A.A., Akansu, A.N., Wolf, W.: Multi-modal dialog scene detection using hidden Markov models for content-based multimedia indexing. Multimedia Tools and Applications 14, 137–151 (2001)

    Article  MATH  Google Scholar 

  7. Brand, M., Kettnaker, V.: Discovery and segmentation of activities in video. IEEE Transactions on Pattern Analysis and Machine Intelligence 22, 844–851 (2000)

    Article  Google Scholar 

  8. Xie, L., Chang, S.F., Divakaran, A., Sun, H.: Structure analysis of soccer video with hidden Markov models. In: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Orlando, Fl (2002)

    Google Scholar 

  9. Rabiner, L.R.: A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE 77, 257–285 (1989)

    Article  Google Scholar 

  10. Manning, C.D., Schutze, H.: Foundations of Statistical Natural Language Processing. MIT Press, Cambridge (1999)

    MATH  Google Scholar 

  11. Ivanov, Y.A., Bobick, A.: Recogition of visual activities and interactions by stochastic parsing. IEEE Transactions on Pattern Analysis and Machine Intelligence 22, 852–872 (2000)

    Article  Google Scholar 

  12. Moore, D., Essa, I.: Recognizing multitasked activities from video using stochastic context-free grammar. In: Workshop on Models versus Exemplars in Computer Vision in IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Kauai, Hawaii (2001)

    Google Scholar 

  13. Lari, K., Young, S.J.: The estimation of stochastic context-free grammars using the inside-outside algorithm. Computer Speech and Language 4, 35–56 (1990)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Taskiran, C.M., Pollak, I., Bouman, C.A., Delp, E.J. (2003). Stochastic Models of Video Structure for Program Genre Detection. In: García, N., Salgado, L., Martínez, J.M. (eds) Visual Content Processing and Representation. VLBV 2003. Lecture Notes in Computer Science, vol 2849. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39798-4_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-39798-4_13

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-20081-9

  • Online ISBN: 978-3-540-39798-4

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics