Abstract
This paper presents an automatic segmentation system for characters in text color images cropped from natural images or videos based on a new neuronal architecture insuring fast processing and robustness against noise, variations in illumination, complex background and low resolution. An off-line training phase on a set of synthetic text color images, where the exact character positions are known, allows adjusting the neural parameters and thus building an optimal non linear filter which extracts the best features in order to robustly detect the border positions between characters. The proposed method is tested on a set of synthetic text images to precisely evaluate its performance according to noise, and on a set of complex text images collected from video frames and web pages to evaluate its performance on real images. The results are encouraging with a good segmentation rate of 89.12% and a recognition rate of 81.94% on a set of difficult text images collected from video frames and from web pages.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Niblack, W.: An Introduction to Digital Image Processing. Prentice-Hall, N.J (1986)
Sauvola, J., Seppänen, T., Haapakoski, S., Pietikäinen, M.: Adaptive document binarization. In: International Conference on Document Analysis and Recognition, vol. 1 (1997)
Liao, P.S., Chen, T.S., Chung, P.C.: A fast algorithm for multilevel thresholding. Journal of information science and engineering (2001)
Garcia, C., Apostolidis, X.: Text detection and segmentation in complex color images. In: IEEE International Conference on Acoustics, Speech, and Signal Processing. ICASSP 2000, vol. 4 (2000)
Lienhart, R., Wernicke, A.: Localizing and segmenting text in images and videos. IEEE Transactions on circuits and systems for video technology 12(14) (2002)
Saidane, Z., Garcia, C.: Robust binarization for video text recognition. In: Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR), vol. 2 (2007)
Sato, T., Kanade, T., Hughes, E., Smith, M.: Video ocr for digital news archive. In: Proceedings of the International Workshop on Content-Based Access of Image and Video Databases (CAIVD 1998) (1998)
Horowitz, S., Pavlidis, T.: Picture segmentation by a tree traversal algorithm. ACM 2 (1976)
Karatzas, D., Antonacopoulos, A.: Text extraction from web images based on a split-and-merge segmentation method using colour perception. In: Proceedings of the 17th International Conference on Pattern Recognition. ICPR 2004, vol. 2 (2004)
Kopf, S., Haenselmann, T., Effelsberg, W.: Robust character recognition in low-resolution images and videos. Technical report, Department for Mathematics and Computer Science, University of Mannheim (2005)
Burges, C., Matan, O., LeCun, Y., Denker, J., Jackel, L., Stenard, C., Nohl, C., Ben, J.: Shortest path segmentation: A method for training a neural network to recognize character strings. In: Proceedings of the International Joint Conf. Neural Networks, vol. 3 (1992)
Chen, D., Odobez, J., Thiran, J.: Monte carlo text segmentation. Int. journal of Pattern Recognition and Artificial Intelligence (2005)
Sato, T., Kanade, T., Hughes, E., Smith, M., Satoh, S.: Video ocr: Indexing digital news libraries by recognition of superimposed captions. In: ACM Multimedia Systems Special Issue on Video Libraries (1998)
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient based learning applied to document recognition. Proc. of the IEEE (1998)
Garcia, C., Delakis, M.: Convolutional face finder: A neural architecture for fast and robust face detection. IEEE Transactions on pattern analysis and machine intelligence 26 (2004)
Saidane, Z., Garcia, C.: Automatic scene text recognition using a convolutional neural network. In: Proceedings of the Second International Workshop on Camera-Based Document Analysis and Recognition (CBDAR) (2007)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Saidane, Z., Garcia, C. (2008). An Automatic Method for Video Character Segmentation. In: Campilho, A., Kamel, M. (eds) Image Analysis and Recognition. ICIAR 2008. Lecture Notes in Computer Science, vol 5112. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69812-8_55
Download citation
DOI: https://doi.org/10.1007/978-3-540-69812-8_55
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69811-1
Online ISBN: 978-3-540-69812-8
eBook Packages: Computer ScienceComputer Science (R0)