Abstract
In this paper, we propose a new text detection algorithm for images/video frames in a coarse-to-fine framework. Firstly, in the coarse detection, multiscale wavelet energy feature is employed to locate all possible text pixels and then a density-based region growing method is developed to connect these pixels into text lines. Secondly, in the fine detection, four kinds of texture features are combined to represent a text line and a SVM classifier is employed to identify texts from the candidate ones. Experimental results on two datasets show the encouraging performance of the proposed algorithm.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Zhong, Y., Zhang, H.J., Jain, A.K.: Automatic Caption Localization in Compressed Video. IEEE Trans. on PAMI 22(4), 385–392 (2000)
Wu, V., Manmatha, R., Riseman, E.M.: Textfinder: An Automatic System to Detect and Recognize Text in Images. IEEE Trans. on PAMI 20, 1224–1229 (1999)
Lienhart, R., Wernicke, A.: Localizing and Segmenting Text in Images and Videos. IEEE Trans. on CSVTÂ 12(4) (2002)
Jain, A.K., Yu, B.: Automatic Text Location in Images and Video Frames. Pattern Recognition 31(12), 2055–2076 (1998)
Zhong, Y., Karu, K., Jain, A.K.: Locating text in complex color images. Pattern Recognition 28, 1523–1535 (1995)
Li, H., Doermann, D., Kia, O.: Automatic Text Detection and Tracking in Digital Video. IEEE Trans. on Image Processing 9(1) (2000)
Tang, X., Gao, X.B., Liu, J., Zhang, H.: Spatial-Temporal Approach for Video Caption Detection and Recognition. IEEE Trans. Neural Networks 13, 961–971 (2002)
Luo, B., Tang, X.O., Liu, J.Z., Zhang, H.: Video Caption Detection and Extraction Using Temporal Feature Vector. In: Int. Conf. on Image Processing (2003)
Chen, D.T., Bourlard, H., Thiran, J.-P.: Text Identification in Complex Background Using SVM. In: Int. Conf. on CVPR (2001)
Mallat, S.G.: A Theory for Multiresolution Signal Decomposition: The Wavelet Representation. IEEE Trans. on PAMIÂ 11(7) (1989)
Vapnik, V.N.: The Nature of Statistical Learning Theory. Springer, Heidelberg (1995)
Jain, A.K.: Statistical Pattern Recognition: A Review. IEEE Trans. on PAMI 2(1), 4–37 (2001)
Sung, K., Paggio, T.: Example-based Learning for View-based Human Face Detection. Mass, Inst. Technol., Cambridge, MA, A.I. Memo 1521 (1994)
Hua, X.S., Liu, W.Y., Zhang, H.J.: Automatic Performance Evaluation for Video Text Detection. In: Int. Conf. on Document Analysis and Recognition (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ye, Q., Huang, Q. (2004). A New Text Detection Algorithm in Images/Video Frames. In: Aizawa, K., Nakamura, Y., Satoh, S. (eds) Advances in Multimedia Information Processing - PCM 2004. PCM 2004. Lecture Notes in Computer Science, vol 3332. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30542-2_106
Download citation
DOI: https://doi.org/10.1007/978-3-540-30542-2_106
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23977-2
Online ISBN: 978-3-540-30542-2
eBook Packages: Computer ScienceComputer Science (R0)