Automatic, Context-of-Capture-Based Categorization, Structure Detection and Segmentation of News Telecasts

Jacobs, Arne; Ioannidis, George T.; Christodoulakis, Stavros; Moumoutzis, Nektarios; Georgoulakis, Stratos; Papachristoudis, Yiannis

doi:10.1007/978-3-540-77088-6_27

Arne Jacobs¹,
George T. Ioannidis¹,
Stavros Christodoulakis²,
Nektarios Moumoutzis²,
Stratos Georgoulakis² &
…
Yiannis Papachristoudis²

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4877))

Included in the following conference series:

International DELOS Conference

699 Accesses
1 Citations

Abstract

The objective of the work reported here is to provide an automatic, context-of-capture categorization, structure detection and segmentation of news broadcasts employing a multimodal semantic based approach. We assume that news broadcasts can be described with context-free grammars that specify their structural characteristics. We propose a system consisting of two main types of interoperating units: The recognizer unit consisting of several modules and a parser unit. The recognizer modules (audio, video and semantic recognizer) analyze the telecast and each one identifies hypothesized instances of features in the audiovisual input. A probabilistic parser analyzes the identifications provided by the recognizers. The grammar represents the possible structures a news telecast may have, so the parser can identify the exact structure of the analyzed telecast.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Boreczky, J.S., Wilcox, L.D.: A Hidden Markov Model Framework for Video Segmentation Using Audio and Image Features. In: proc. IEEE ICASSP, Seattle (USA) (1998)
Google Scholar
Brand, M., Kettnaker, V.: Discovery and Segmentation of Activities in Video. IEEE Trans. Pattern Anal. Mach. Intel 22(8), 844–851 (2000)
Article Google Scholar
Bruckmann, A., Lerbs, B., Gao, D., Eidtmann, J., Mozogovenko, L., Buczilowski, M., Jughardt, T., Xu, Y., Jacobs, A., Lüdtke, A.: Trecvid 2006 high level feature extraction. In: TRECVID 2006 Workshop Notebook Papers (2006)
Google Scholar
Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines (2001), Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm
Dimitrova, N., Agnihotri, L., Wei, G.: Video classification based on HMM using text and faces. In: European Signal Processing Conference. Tampere (Finland) (2000)
Google Scholar
Eickeler, S., Muller, S.: Content-based video indexing of TV broadcast news using hidden markov models. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, Phoenix (USA), pp. 2997–3000. IEEE Computer Society Press, Los Alamitos (1999)
Google Scholar
Greiff, W., Morgan, A., Fish, R., Richards, M., Kundu, A.: Fine-Grained Hidden Markov Modeling for Broadcast-News Story Segmentation. In: Proceedings of the first international conference on Human language technology research, San Diego (USA), pp. 1–5 (2001)
Google Scholar
Hoiem, D., Ke, Y., Sukthankar, R.: Solar: Sound object localization and retrieval in complex audio environments. In: Proc. of the IEEE International Conference on Acoustics, Speech and Signal (2005)
Google Scholar
Huang, J., Kumar, R., Mitra, M., Zhu, W.J., Zabih, R.: Image indexing using color correlograms. In: CVPR 1997. Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (1997)
Google Scholar
Huang, J., Liu, Z., Wang, Y., Chen, Y., Wong, E.K.: Integration of multimodal features for video scene classification based on HMM. In: IEEE Workshop on Multimedia Signal Processing, Copenhagen (Denmark) (1999)
Google Scholar
Ivanov, Y., Bobick, A.F.: Recognition of Visual Activities and Interactions by Stochastic Parsing. IEEE Transactions on Pattern Analysis and Machine Intelligence 22(8) (2000)
Google Scholar
Joachims, T.: Making large-Scale SVM Learning Practical. MIT Press, Cambridge (1999)
Google Scholar
Johnston, M.: Deixis and Conjunction in Multimodal Systems. In: Proceedings of the 18th conference on Computational linguistics, vol. 1, pp. 362–368 (2000)
Google Scholar
Miller, G., Beckwith, R., Fellbaum, C., Gross, D., Miller, K.J.: Introduction to WordNet: an on-line lexical database. International Journal of Lexicography 3(4), 235–244 (1990)
Article Google Scholar
Moore, D., Essa, I.: Recognizing multitasked activities using stochastic context-free grammar. In: Proceedings of Workshop on Models vs Exemplars in Computer Vision (2001)
Google Scholar
Stokes, N., Carthy, J., Smeaton, A.F.: SeLeCT: A lexical Cohesion Based News Story Segmentation System. Journal of AI Communications 17(1), 3–12 (2004)
MATH MathSciNet Google Scholar
Stolcke, A.: An efficient Probabilistic Context-Free Parsing Algorithm That Computes Prefix Probabilities. Computational Linguistics 21(2), 165–201 (1995)
MathSciNet Google Scholar
Tamura, H., Mori, S., Yamawaki, T.: Textural feratures corresponding to visual perception. IEEE Trans. Syst., Man, Cyb. 8(6), 460–473 (1978)
Article Google Scholar
Wilkens, N.: Detektion von Videoframes mit Texteinblendungen in Echtzeit. Diploma thesis. Universität Bremen (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

Center for Computing Technologies (TZI), University of Bremen, Germany
Arne Jacobs & George T. Ioannidis
Laboratory of Distributed Multimedia Information Systems and Applications, Technical University of Crete (TUC/MUSIC), 73100 Chania, Greece
Stavros Christodoulakis, Nektarios Moumoutzis, Stratos Georgoulakis & Yiannis Papachristoudis

Authors

Arne Jacobs
View author publications
You can also search for this author in PubMed Google Scholar
George T. Ioannidis
View author publications
You can also search for this author in PubMed Google Scholar
Stavros Christodoulakis
View author publications
You can also search for this author in PubMed Google Scholar
Nektarios Moumoutzis
View author publications
You can also search for this author in PubMed Google Scholar
Stratos Georgoulakis
View author publications
You can also search for this author in PubMed Google Scholar
Yiannis Papachristoudis
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Costantino Thanos Francesca Borri Leonardo Candela

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jacobs, A., Ioannidis, G.T., Christodoulakis, S., Moumoutzis, N., Georgoulakis, S., Papachristoudis, Y. (2007). Automatic, Context-of-Capture-Based Categorization, Structure Detection and Segmentation of News Telecasts. In: Thanos, C., Borri, F., Candela, L. (eds) Digital Libraries: Research and Development. DELOS 2007. Lecture Notes in Computer Science, vol 4877. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77088-6_27

Download citation

DOI: https://doi.org/10.1007/978-3-540-77088-6_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-77087-9
Online ISBN: 978-3-540-77088-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics