Abstract
‘Video data mining’ is a technique to discover useful patterns from videos. It plays an important role in efficient video management. Particularly, we concentrate on extracting useful editing patterns from movies. These editing patterns are useful for an amateur editor to produce a new, more attractive video. But, it is essential to extract editing patterns associated with their semantic contents, called ‘semantic structures’. Otherwise the amateur editor can’t determine how to use the extracted editing patterns during the process of editing a new video.
In this paper, we propose two approaches to extract semantic structures from a movie, based on two different time series models of the movie. In one approach, the movie is represented as a multi-stream of metadata derived from visual and audio features in each shot. In another approach, the movie is represented as one-dimensional time series consisting of durations of target character’s appearance and disappearance. To both time series models, we apply data mining techniques. As a result, we extract the semantic structures about shot transitions and about how the target character appears on the screen and disappears from the screen.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Matsuo, Y., Shirahama, K., Uehara, K.: Video Data Mining: Extracting Cinematic Rules from Movie. In: Proc. of 4th International Workshop on Multimedia Data Mining MDM/KDD, pp. 18–27 (2003)
Kumano, M., Ariki, Y., Amano, M., Uehara, K., Shunto, K., Tsukada, K.: Video Editing Support System Based on Video Grammar and Content Analysis. In: Proc. of 16th International Conference on Pattern Recognition, pp. 346–354 (2002)
Boyer, R., Moore, S.: A Fast String Searching Algorithm. Communications of the ACM 20, 762–772 (1977)
Guralnik, V., Srivastava, J.: Event Detection from Time Series Data. In: Proc. of the 5th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 33–42 (1999)
Navarro, G., Baeza-Yates, R.: Fast Multi-Dimensional Approximate Pattern Matching. In: Crochemore, M., Paterson, M. (eds.) CPM 1999. LNCS, vol. 1645, pp. 243–257. Springer, Heidelberg (1999)
Smyth, P., Goodman, R.M.: An Information Theoretic Approach to Rule Induction from Databases. IEEE Transactions on Knowledge and Data Engineering 4(4), 301–316 (1992)
Kleinberg, J.: Bursty and Hierarchical Structure in Streams. In: Proc. of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 91–101 (2002)
Cohen, P., Heeringa, B., Adams, N.: An Unsupervised Algorithm for Segmenting Categorical Timeseries in Episodes. In: Hand, D.J., Adams, N.M., Bolton, R.J. (eds.) Pattern Detection and Discovery. LNCS (LNAI), vol. 2447, pp. 49–62. Springer, Heidelberg (2002)
Hoppner, F.: Discovery of Core Episodes from Sequences Using Generalization for Defragmentation of Rule Sets. In: Hand, D.J., Adams, N.M., Bolton, R.J. (eds.) Pattern Detection and Discovery. LNCS (LNAI), vol. 2447, pp. 199–213. Springer, Heidelberg (2002)
Arijon, D.: Grammar of File Language. Focal Press Limited Publishers (1976)
Hampapur, A.: Designing Video Data Management Systems. Ph.D dissertation, University of Michigan (1995)
Wijesekera, D., Barbara, D.: Mining Cinematic Knowledge: Work in Progress. In: Proc. of the International Workshop on Multimedia Data Mining, pp. 98–103 (2000)
Oh, J., Bandi, B.: Multimedia Data Mining Framework for Raw Video Sequences. In: Zaïane, O.R., Simoff, S.J., Djeraba, C. (eds.) MDM/KDD 2002 and KDMCD 2002. LNCS (LNAI), vol. 2797, pp. 1–10. Springer, Heidelberg (2003)
Pan, J., Faloutsos, C.: VideoCube: a novel tool for video mining and classification. In: Lim, E.-p., Foo, S.S.-B., Khoo, C., Chen, H., Fox, E., Urs, S.R., Costantino, T. (eds.) ICADL 2002. LNCS, vol. 2555, pp. 194–205. Springer, Heidelberg (2002)
Uchihashi, S., Foote, J., Girgensohn, A., Boreczky, J.: Video Manga: Generating Semantically Meaningful Video Summaries. In: Proc. of 7th ACM Multimedia 1999, pp. 383–392 (1999)
Fan, J., Ji, Y.: Automatic Moving Object Extraction toward Content-Based Video Representation and Indexing. Jour. of Visual Communications and Image Representation 12(3), 217–239 (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Shirahama, K., Matsuo, Y., Uehara, K. (2005). Mining Semantic Structures in Movies. In: Seipel, D., Hanus, M., Geske, U., Bartenstein, O. (eds) Applications of Declarative Programming and Knowledge Management. INAP WLP 2004 2004. Lecture Notes in Computer Science(), vol 3392. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11415763_8
Download citation
DOI: https://doi.org/10.1007/11415763_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25560-4
Online ISBN: 978-3-540-32124-8
eBook Packages: Computer ScienceComputer Science (R0)