Mining Semantic Structures in Movies

Shirahama, Kimiaki; Matsuo, Yuya; Uehara, Kuniaki

doi:10.1007/11415763_8

Kimiaki Shirahama²²,
Yuya Matsuo²² &
Kuniaki Uehara²²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3392))

Included in the following conference series:

507 Accesses

Abstract

‘Video data mining’ is a technique to discover useful patterns from videos. It plays an important role in efficient video management. Particularly, we concentrate on extracting useful editing patterns from movies. These editing patterns are useful for an amateur editor to produce a new, more attractive video. But, it is essential to extract editing patterns associated with their semantic contents, called ‘semantic structures’. Otherwise the amateur editor can’t determine how to use the extracted editing patterns during the process of editing a new video.

In this paper, we propose two approaches to extract semantic structures from a movie, based on two different time series models of the movie. In one approach, the movie is represented as a multi-stream of metadata derived from visual and audio features in each shot. In another approach, the movie is represented as one-dimensional time series consisting of durations of target character’s appearance and disappearance. To both time series models, we apply data mining techniques. As a result, we extract the semantic structures about shot transitions and about how the target character appears on the screen and disappears from the screen.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Movie Lens: Discovering and Characterizing Editing Patterns in the Analysis of Short Movie Sequences

Semantic Sequence Analysis for Human Activity Prediction

Spatio-Temporal Video Segmentation

References

Matsuo, Y., Shirahama, K., Uehara, K.: Video Data Mining: Extracting Cinematic Rules from Movie. In: Proc. of 4th International Workshop on Multimedia Data Mining MDM/KDD, pp. 18–27 (2003)
Google Scholar
Kumano, M., Ariki, Y., Amano, M., Uehara, K., Shunto, K., Tsukada, K.: Video Editing Support System Based on Video Grammar and Content Analysis. In: Proc. of 16th International Conference on Pattern Recognition, pp. 346–354 (2002)
Google Scholar
Boyer, R., Moore, S.: A Fast String Searching Algorithm. Communications of the ACM 20, 762–772 (1977)
Article Google Scholar
Guralnik, V., Srivastava, J.: Event Detection from Time Series Data. In: Proc. of the 5th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 33–42 (1999)
Google Scholar
Navarro, G., Baeza-Yates, R.: Fast Multi-Dimensional Approximate Pattern Matching. In: Crochemore, M., Paterson, M. (eds.) CPM 1999. LNCS, vol. 1645, pp. 243–257. Springer, Heidelberg (1999)
Chapter Google Scholar
Smyth, P., Goodman, R.M.: An Information Theoretic Approach to Rule Induction from Databases. IEEE Transactions on Knowledge and Data Engineering 4(4), 301–316 (1992)
Article Google Scholar
Kleinberg, J.: Bursty and Hierarchical Structure in Streams. In: Proc. of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 91–101 (2002)
Google Scholar
Cohen, P., Heeringa, B., Adams, N.: An Unsupervised Algorithm for Segmenting Categorical Timeseries in Episodes. In: Hand, D.J., Adams, N.M., Bolton, R.J. (eds.) Pattern Detection and Discovery. LNCS (LNAI), vol. 2447, pp. 49–62. Springer, Heidelberg (2002)
Chapter Google Scholar
Hoppner, F.: Discovery of Core Episodes from Sequences Using Generalization for Defragmentation of Rule Sets. In: Hand, D.J., Adams, N.M., Bolton, R.J. (eds.) Pattern Detection and Discovery. LNCS (LNAI), vol. 2447, pp. 199–213. Springer, Heidelberg (2002)
Chapter Google Scholar
Arijon, D.: Grammar of File Language. Focal Press Limited Publishers (1976)
Google Scholar
Hampapur, A.: Designing Video Data Management Systems. Ph.D dissertation, University of Michigan (1995)
Google Scholar
Wijesekera, D., Barbara, D.: Mining Cinematic Knowledge: Work in Progress. In: Proc. of the International Workshop on Multimedia Data Mining, pp. 98–103 (2000)
Google Scholar
Oh, J., Bandi, B.: Multimedia Data Mining Framework for Raw Video Sequences. In: Zaïane, O.R., Simoff, S.J., Djeraba, C. (eds.) MDM/KDD 2002 and KDMCD 2002. LNCS (LNAI), vol. 2797, pp. 1–10. Springer, Heidelberg (2003)
Chapter Google Scholar
Pan, J., Faloutsos, C.: VideoCube: a novel tool for video mining and classification. In: Lim, E.-p., Foo, S.S.-B., Khoo, C., Chen, H., Fox, E., Urs, S.R., Costantino, T. (eds.) ICADL 2002. LNCS, vol. 2555, pp. 194–205. Springer, Heidelberg (2002)
Chapter Google Scholar
Uchihashi, S., Foote, J., Girgensohn, A., Boreczky, J.: Video Manga: Generating Semantically Meaningful Video Summaries. In: Proc. of 7th ACM Multimedia 1999, pp. 383–392 (1999)
Google Scholar
Fan, J., Ji, Y.: Automatic Moving Object Extraction toward Content-Based Video Representation and Indexing. Jour. of Visual Communications and Image Representation 12(3), 217–239 (2001)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Graduate School of Science and Technology, Kobe University, Nada, Kobe, 657-8501, Japan
Kimiaki Shirahama, Yuya Matsuo & Kuniaki Uehara

Authors

Kimiaki Shirahama
View author publications
You can also search for this author in PubMed Google Scholar
Yuya Matsuo
View author publications
You can also search for this author in PubMed Google Scholar
Kuniaki Uehara
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Universität Würzburg, Am Hubland, 97074, Würzburg, Germany
Dietmar Seipel
Institut für Informatik, CAU Kiel, Germany
Michael Hanus
Fraunhofer FIRST, Berlin
Ulrich Geske
IF Computer Japan, 5-28-2 Sendagi, Bunkyo-ku, 113-0022, Tokyo, Japan
Oskar Bartenstein

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Shirahama, K., Matsuo, Y., Uehara, K. (2005). Mining Semantic Structures in Movies. In: Seipel, D., Hanus, M., Geske, U., Bartenstein, O. (eds) Applications of Declarative Programming and Knowledge Management. INAP WLP 2004 2004. Lecture Notes in Computer Science(), vol 3392. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11415763_8

Download citation

DOI: https://doi.org/10.1007/11415763_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25560-4
Online ISBN: 978-3-540-32124-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics