Abstract
Automatic indexing to video data is in strong demand to cope with the increasing amount. We propose an automatic indexing method for television news video, which indexes to shots considering the correspondence of image contents and semantic attributes of keywords. This is realized by first, (1) classifying shots by graphical feature, and (2) analyzing semantic attributes of accompanying captions. Next, keywords are selectively indexed to shots according to appropriate correspondence of typical shot classes and semantic attributes of keywords. The method was applied to 75 minutes of actual news video, and resulted in indexing successfully to approximately 50% of the typical shots (60% of the shots were classified as typical), and 80% of the typical shots where captions existed.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Ide, I. and Tanaka, H.; “Automatic Semantic Analysis of Television News Captions”; Proc. 3rd Intl. Workshop on Information Retrieval with Asian Languages, Oct 1998 (to appear).
Ide, I., Hamada, R., Tanaka, H. and Sakai, S.; “News Video Classification based on Semantic Attributes of Captions”; Proc. 6th ACM Intl. Multimedia Conf.-Art Demos-Techinical Demos-Poster Papers-, pp.60–61, Sep 1998.
Kaneko, T., and Hori, O.; “Cut Detection Technique from MPEG Compressed Video Using Likelihood Ratio Test”; Proc. 14th Intl. Conf. on Pattern Recognition, Aug 1998.
Ide, I. and Tanaka, H.; “Semantic Analysis of Television News Captions by Suffixes”; Trans. IPS Japan, Vol.39, No.8, pp.2543–2546, Aug 1998 (in Japanese).
Nakamura, Y. and Kanade, T.; “Semantic Analysis for Video Contents Extraction-Spotting by Association in News Video-”; Proc. 5th ACM Intl. Multimedia Conf., pp.393–402, Nov 1997.
Satoh, S., Nakamura, Y. and Kanade, T.; “Name-It: Naming and Detecting Faces in Video by the Integration of Image and Natural Language Processing”; Proc. IJCAI’97, pp.1488–1493, Aug 1997.
Nasukawa, T.; “Keyword Categorization based on Discourse Information”; Proc. 11th Annual Conf. JSAI, pp.348–349, Jun 1997 (in Japanese).
Hauptmann, A.G. and Witbrock, M. J.; “Informedia News-on-Demand: Using Speech Recognition to Create a Digital Video Library”; Proc. AAAI’97 Spring Symp. on Intelligent Integration and Use of Text, Image, Video and Audio Corpora, pp.120–126, Mar 1997.
Kurakake, S., Kuwano, H. and Odaka, K.; “Recognition and Visual Feature Matching of Text Region in Video for Conceptual Indexing”; SPIE Proc. of Storage and Retrieval for Image and Video Database V, Vol.3022, Feb 1997.
Smith, M.A. and Kanade, T.; “Video Skimming and Characterization through the Combination of Image and Language Understanding Techniques”; CMU Tech. Rep. CMU-CS-97-111, Feb 1997.
Watanabe, Y., Okada, Y. and Nagao, M.; “Semantic Analysis of Telops in TV Newscasts”; Tech. Rep. IPS Japan 96-NL-116, Vol.96, No.89, pp.107–114, Nov 1996 (in Japanese).
Ariki, Y. and Saito, Y.; “Extraction of TV News Articles Based on Scene Cut Detection Using DCT Clustering”; Proc. 1996 Intl. Conf. on Image Processing, pp.847–850, Sep 1996.
Wactlar, H. D., Kanade, T., Smith, M.A. and Stevens, S. M.; “Intelligent Access to Digital Video: Informedia Project”; IEEE Computer, Vol.29, No.3, pp.46–52, May 1996.
Motegi, Y. and Ariki, Y.; “Indexing to TV News Articles Based on Character Recognition”; Tech. Report IEICE, PRU-95-240, Vol.95, No.584, pp.33–40, Mar 1996 (in Japanese).
United States Defense Advanced Research Projects Agency (DARPA), Information Technology Office; “Named Entity Task Definition, Version 2.1”; Proc. 6th Message Understanding Conference, pp.317–332, Nov 1995.
Matsuhashi, S., Nakamura, O. and Minami, T.; “Human-Face Extraction Using Modified HSV Color System and Personal Identification Through Facial Image Based on Isodensity Maps”; Canadian Conf. on Elec. and Comp. Eng.’ 95, Vol.2, pp.909–912, Sep 1995.
Nagasaka, A. and Tanaka, Y.; “Automatic Video Indexing and Full-Video Search for Object Appearances”; IFIP Trans., Vol.A, No.7, 1992.
Kurohsashi, S., Saito, Y. and Nagao, M.; “Kyoto University Corpus version 2.0”; Jun 1998. Available from http://www-lab25.kuee.kyoto-u.ac.jp/nl-resource/corpus.html
Real World Computing Partnership (RWCP); “RWC Text Database”; Mar 1996.
Kurohsashi, S. and Nagao, M.; “Japanese Morphological Analysis System JUMAN version 3.5”; Mar 1998. Available from http://www-lab25.kuee.kyoto-u.ac.jp/nl-resource/juman-e.html
“The Informedia Project”; http://www.informedia.cs.cmu.edu/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ide, I., Yamamoto, K., Tanaka, H. (1999). Automatic Video Indexing Based on Shot Classification. In: Nishio, S., Kishino, F. (eds) Advanced Multimedia Content Processing. AMCP 1998. Lecture Notes in Computer Science, vol 1554. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48962-2_7
Download citation
DOI: https://doi.org/10.1007/3-540-48962-2_7
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65762-0
Online ISBN: 978-3-540-48962-7
eBook Packages: Springer Book Archive