Skip to main content

A Framework for Semantic Classification of Scenes Using Finite State Machines

  • Conference paper
Image and Video Retrieval (CIVR 2004)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3115))

Included in the following conference series:

  • 1011 Accesses

Abstract

We address the problem of classifying scenes from feature films into semantic categories and propose a robust framework for this problem. We propose that the Finite State Machines (FSM) are suitable for detecting and classifying scenes and demonstrate their usage for three types of movie scenes; conversation, suspense and action. Our framework utilizes the structural information of the scenes together with the low and mid-level features. Low level features of video including motion and audio energy and a mid-level feature, face detection, are used in our approach. The transitions of the FSMs are determined by the features of each shot in the scene. Our FSMs have been experimented on over 60 clips and convincing results have been achieved.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Adams, B., Dorai, C., Venkatesh, S.: Novel Approach to Determining Tempo and Dramatic Story Sections in Motion Pictures, ICIP (2000)

    Google Scholar 

  2. Rasheed, Z., Shah, M.: Scene Detection In Hollywood Movies and TV Shows. In: IEEE Computer Vision and Pattern Recognition Conference, Madison,Wisconsin, June 16-22 (2003)

    Google Scholar 

  3. Yoshitaka, A., Ishii, T., Hirakawa, M., Ichikawa, T.: Content-Based Retrieval of Video Data by the Grammar of Film. In: IEEE Symposium on Visual Languages (1997)

    Google Scholar 

  4. Lienhart, R., Pfeiffer, S., Effelsberg, W.: Scene Determination Based on Video and Audio Features. In: Proc. IEEE Conf. on Multimedia Computing and Systems, Florence, Italy (1999)

    Google Scholar 

  5. Li, Y., Narayanan, S., Jay Kuo, C.-C.: Movie Content Analysis Indexing, and Skimming. In: Video Mining. ch.5, Kluwer Academic Publishers, Dordrecht (2003)

    Google Scholar 

  6. Viola, P., Jones, M.: Robust Real-Time Object Detection, International Journal of Computer Vision (2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zhai, Y., Rasheed, Z., Shah, M. (2004). A Framework for Semantic Classification of Scenes Using Finite State Machines. In: Enser, P., Kompatsiaris, Y., O’Connor, N.E., Smeaton, A.F., Smeulders, A.W.M. (eds) Image and Video Retrieval. CIVR 2004. Lecture Notes in Computer Science, vol 3115. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-27814-6_35

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-27814-6_35

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22539-3

  • Online ISBN: 978-3-540-27814-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics