Abstract
In this paper, we propose a new method for searching and browsing news videos, based on a multi-modal approach. In the proposed scheme, we use closed caption (CC) data to index the contents of TV news articles effectively. To achieve the time alignment between the CC text and the video data that multi-modal search and visualization require, a supervised speech recognition technique is employed. Our implementation provides two mechanisms for browsing news video: a search engine driven by textual queries, and a topic-based browser that serves as an assistant tool for finding the desired news articles. Compared with systems that depend mainly on visual features, the proposed scheme retrieves semantically more relevant articles.
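As an illustration of the textual query search over time-aligned captions, the following is a minimal sketch and not the authors' implementation. It assumes each caption segment already carries start and end times (in the paper these come from the speech-recognition alignment step, which is not reproduced here). The names `CaptionSegment`, `build_index`, and `search` are hypothetical, and the simple all-terms-must-match rule is an assumption made for brevity.

```python
from collections import defaultdict
from dataclasses import dataclass
import re


@dataclass(frozen=True)
class CaptionSegment:
    """One caption segment with its (assumed, pre-computed) time alignment."""
    article_id: str
    start_sec: float
    end_sec: float
    text: str


def tokenize(text):
    """Lowercase the text and split it into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(segments):
    """Build an inverted index: term -> set of positions in `segments`."""
    index = defaultdict(set)
    for pos, seg in enumerate(segments):
        for term in tokenize(seg.text):
            index[term].add(pos)
    return index


def search(index, segments, query):
    """Return the segments containing every query term, in original order."""
    terms = tokenize(query)
    if not terms:
        return []
    hits = set.intersection(*(index.get(t, set()) for t in terms))
    return [segments[i] for i in sorted(hits)]


if __name__ == "__main__":
    # Hypothetical caption data, for illustration only.
    segments = [
        CaptionSegment("news-001", 12.0, 18.5, "Election results were announced today."),
        CaptionSegment("news-002", 95.0, 101.2, "Heavy rain is expected this weekend."),
    ]
    index = build_index(segments)
    for seg in search(index, segments, "election results"):
        print(seg.article_id, seg.start_sec, seg.end_sec)
```

Because each hit keeps its start and end time, a browser built on such an index could seek directly to the matching point in the video, which is the role the time alignment plays in the proposed scheme.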
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kim, Yt., Kim, JG., Chang, H.S., Kang, K., Kim, J. (2001). Content-Based News Video Retrieval with Closed Captions and Time Alignment. In: Shum, HY., Liao, M., Chang, SF. (eds) Advances in Multimedia Information Processing — PCM 2001. PCM 2001. Lecture Notes in Computer Science, vol 2195. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45453-5_115
DOI: https://doi.org/10.1007/3-540-45453-5_115
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42680-6
Online ISBN: 978-3-540-45453-3
eBook Packages: Springer Book Archive