An Automatic Method for Video Character Segmentation

Saidane, Zohra; Garcia, Christophe

doi:10.1007/978-3-540-69812-8_55


Part of the book series: Lecture Notes in Computer Science (LNIP, volume 5112)

Included in the following conference series:

International Conference Image Analysis and Recognition


Abstract

This paper presents an automatic system for segmenting characters in color text images cropped from natural images or videos. It is based on a new neural architecture that ensures fast processing and robustness against noise, variations in illumination, complex backgrounds and low resolution. An off-line training phase on a set of synthetic color text images, where the exact character positions are known, adjusts the neural parameters and thus builds an optimal nonlinear filter that extracts the best features for robustly detecting the border positions between characters. The proposed method is tested on a set of synthetic text images to precisely evaluate its performance with respect to noise, and on a set of complex text images collected from video frames and web pages to evaluate its performance on real images. The results are encouraging: on the set of difficult text images collected from video frames and web pages, the method achieves a segmentation rate of 89.12% and a recognition rate of 81.94%.
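The abstract does not specify how border positions are encoded or read out, so the following is only a minimal sketch under stated assumptions. It assumes that each synthetic image comes with known character column extents, that the training target is a per-column 0/1 vector marking the gap between consecutive characters, and that the trained filter outputs one score per column. The names `border_targets`, `pick_borders` and the threshold value are hypothetical and are not taken from the paper.

```python
from typing import List, Tuple


def border_targets(char_spans: List[Tuple[int, int]], width: int) -> List[int]:
    """Build a 0/1 target vector of length `width` for one synthetic text image.

    `char_spans` lists (first_column, last_column), inclusive, for each
    character from left to right. This encoding is an assumption.
    """
    targets = [0] * width
    for (_, right), (left, _) in zip(char_spans, char_spans[1:]):
        # Mark the midpoint of the gap between two consecutive characters.
        targets[(right + left) // 2] = 1
    return targets


def pick_borders(scores: List[float], threshold: float = 0.5) -> List[int]:
    """Return columns whose score exceeds `threshold` and is a local maximum.

    `scores` stands in for the per-column output of a trained filter
    (hypothetical; the paper's actual output format is not given here).
    """
    borders = []
    for x, s in enumerate(scores):
        left = scores[x - 1] if x > 0 else float("-inf")
        right = scores[x + 1] if x + 1 < len(scores) else float("-inf")
        # On a plateau of equal scores, only the rightmost column is kept.
        if s > threshold and s >= left and s > right:
            borders.append(x)
    return borders


if __name__ == "__main__":
    # Three characters occupying columns 0-3, 6-9 and 12-15 of a 16-column image.
    print(border_targets([(0, 3), (6, 9), (12, 15)], 16))
    print(pick_borders([0.1, 0.2, 0.9, 0.3, 0.1, 0.7, 0.7, 0.2]))
```

Because synthetic images are rendered from known text, targets of this kind can be produced without manual annotation, which is what makes the off-line training phase described above practical.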



Author information

Authors and Affiliations

Orange Labs, 4 rue Clos Courtel, 35510, Cesson Sévigné, France
Zohra Saidane & Christophe Garcia


Editor information

Aurélio Campilho, Mohamed Kamel


About this paper

Cite this paper

Saidane, Z., Garcia, C. (2008). An Automatic Method for Video Character Segmentation. In: Campilho, A., Kamel, M. (eds) Image Analysis and Recognition. ICIAR 2008. Lecture Notes in Computer Science, vol 5112. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69812-8_55


DOI: https://doi.org/10.1007/978-3-540-69812-8_55
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69811-1
Online ISBN: 978-3-540-69812-8
eBook Packages: Computer Science, Computer Science (R0)
