Abstract
This paper presents an integrated information mining technique for broadcast TV news. The approach draws on techniques from the fields of acoustic, image, and video analysis to extract information on news titles, reporters, and news background. The goal is to construct a compact yet meaningful abstraction of broadcast TV news, allowing users to browse large amounts of data in a non-linear fashion with flexibility and efficiency. Using acoustic analysis, a news program can be partitioned into news and commercial clips with 90% accuracy on a data set of 400 hours of TV news recorded off the air from July 2005 to August 2006. By additionally applying speaker identification and/or image detection techniques, each news story can be segmented with a higher accuracy of 95.92%. On-screen captions and characters are recognized by video OCR techniques to produce the title of each news story. Keywords can then be extracted from the titles to link related news content on the WWW. Combined with facial and scene analysis and recognition techniques, the OCR results allow users to issue multimodal queries on specific news stories.
This research was supported in part by the National Science Council under Grant NSC 94-2213-E009-139.
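The keyword-linking step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' method: the abstract does not say how keywords are extracted or matched, so the stop-word list, the whitespace-style tokenization (which assumes English titles, whereas the OCR output in the paper may be in another language), and the names `extract_keywords`, `link_related`, and `min_shared` are all hypothetical choices made for illustration.

```python
import re
from collections import defaultdict

# Hypothetical stop-word list (an assumption, not from the paper).
STOP_WORDS = {"a", "an", "the", "of", "in", "on", "to", "for", "and", "at", "after", "as"}


def extract_keywords(title):
    """Lowercase the title, split it into alphanumeric tokens, drop stop words."""
    tokens = re.findall(r"[a-z0-9]+", title.lower())
    return {t for t in tokens if t not in STOP_WORDS}


def link_related(titles, min_shared=2):
    """Return sorted pairs of story ids whose titles share >= min_shared keywords."""
    # Inverted index: keyword -> set of story ids whose title contains it.
    index = defaultdict(set)
    for story_id, title in titles.items():
        for keyword in extract_keywords(title):
            index[keyword].add(story_id)

    # Count, for every pair of stories, how many keywords they have in common.
    shared = defaultdict(int)
    for story_ids in index.values():
        ordered = sorted(story_ids)
        for i, first in enumerate(ordered):
            for second in ordered[i + 1:]:
                shared[(first, second)] += 1

    return sorted(pair for pair, count in shared.items() if count >= min_shared)


# Made-up example titles standing in for video OCR output.
titles = {
    "s1": "Typhoon hits northern Taiwan",
    "s2": "Northern Taiwan recovers after typhoon",
    "s3": "Stock market closes higher",
}
print(link_related(titles))
```

Counting shared keywords through an inverted index avoids comparing every pair of titles directly; a real archive would likely need a language-appropriate tokenizer and some weighting of rare versus common terms.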
© 2007 Springer Berlin Heidelberg
About this paper
Cite this paper
Pao, H.T., Xu, Y.Y., Chung, S.C., Fu, H.C. (2007). Constructing and Application of Multimedia TV News Archives. In: Sebe, N., Liu, Y., Zhuang, Y., Huang, T.S. (eds) Multimedia Content Analysis and Mining. MCAM 2007. Lecture Notes in Computer Science, vol 4577. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73417-8_22
DOI: https://doi.org/10.1007/978-3-540-73417-8_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73416-1
Online ISBN: 978-3-540-73417-8