Abstract
Multimedia data is the fastest growing media type in many areas and especially on the Internet. The text in videos is one powerful high-level index for retrieval. Efficient indexing and retrieval of digital video is an important aspect of multimedia databases. Detecting, extracting and recognizing text can build such an index. Segmenting and recognizing text in the non-text parts o f w e b pages is also a very important issue. More and more w e b pages present text in images. Existing text segmentation and text recognition algorithms cannot extract the text. Thus, all existing search engines cannot index the content o f image-rich web pages properly! A new, robust, and true multi-resolution approach to localizing and segmenting text in videos and images is proposed in this paper. It has been tested extensively on large variety o f video sizes such 352×240 up to 1920×1280 and a large representative set of video sequences such as home videos, newscast, title sequences and commercials as well as images.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
J.D. Foley, A. Dam, S.K. Feiner und J.F. Hughes. Computer Graphics: Principles and Practice. Addison-Wesley, Reading, MA, USA, 1990.
Bernd Jaehne. Practical Handbook on Image Processing for Scientific Applications. CRC Press, Boca Raton, 1997.
R. Lienhart. Automatic Text Recognition for Video Indexing. Proc. ACM Multimedia, Bosten, MA, Nov. 1996, pp. 11-20.
R. Lienhart and W. Effelsberg. Automatic Text Segmentation and Text Recognition for Video Indexing. ACM/Springer Multimedia Systems Magazine, to appear.2
H. Li, D. Doermann, and O. Kia.Automatic Text Detection and Tracking in Digital Video. IEEE Trans, on Image Processing, to appear.
H.A. Rowley, S. Baluja, and T. Kanade. Neural Network-Based Face Detection. IEEE PAMI, vol. 20, no. 1, pp. 23–38, January 1998.
T. Sato, T. Kanade, E. K. Hughes, M. A. Smith. Video OCR for Digital News Archives. IEEE Int. Workshop on Content-Based Access of Image and Video Database, 1998.
K.-K. Sung. Learning and Example Selection for Object and Pattern Detection. PhD Thesis, MIT AI Lab, January 1996.
V. Wu, R. Manmatha and E.M. Riseman.Finding Text in Images. In Proc. of Second ACM International Conference on Digital Libraries, Philadelphia, PA, pp. 23-26, July 1997.
X. Wu. YIQ Vector Quantization in a New Color Palette Architecture. IEEE Trans, on Image Processing, vol. 5. no. 2, p. 321–329, 1996.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wernicke, A., Lienhart, R. (2000). Text Localization and Text Segmentation in Images, Videos and Web Pages. In: Sommer, G., Krüger, N., Perwass, C. (eds) Mustererkennung 2000. Informatik aktuell. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-59802-9_21
Download citation
DOI: https://doi.org/10.1007/978-3-642-59802-9_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67886-1
Online ISBN: 978-3-642-59802-9
eBook Packages: Springer Book Archive