Skip to main content

An Automatic Method for Video Character Segmentation

  • Conference paper
Image Analysis and Recognition (ICIAR 2008)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 5112))

Included in the following conference series:

Abstract

This paper presents an automatic segmentation system for characters in text color images cropped from natural images or videos based on a new neuronal architecture insuring fast processing and robustness against noise, variations in illumination, complex background and low resolution. An off-line training phase on a set of synthetic text color images, where the exact character positions are known, allows adjusting the neural parameters and thus building an optimal non linear filter which extracts the best features in order to robustly detect the border positions between characters. The proposed method is tested on a set of synthetic text images to precisely evaluate its performance according to noise, and on a set of complex text images collected from video frames and web pages to evaluate its performance on real images. The results are encouraging with a good segmentation rate of 89.12% and a recognition rate of 81.94% on a set of difficult text images collected from video frames and from web pages.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Niblack, W.: An Introduction to Digital Image Processing. Prentice-Hall, N.J (1986)

    Google Scholar 

  2. Sauvola, J., Seppänen, T., Haapakoski, S., Pietikäinen, M.: Adaptive document binarization. In: International Conference on Document Analysis and Recognition, vol. 1 (1997)

    Google Scholar 

  3. Liao, P.S., Chen, T.S., Chung, P.C.: A fast algorithm for multilevel thresholding. Journal of information science and engineering (2001)

    Google Scholar 

  4. Garcia, C., Apostolidis, X.: Text detection and segmentation in complex color images. In: IEEE International Conference on Acoustics, Speech, and Signal Processing. ICASSP 2000, vol. 4 (2000)

    Google Scholar 

  5. Lienhart, R., Wernicke, A.: Localizing and segmenting text in images and videos. IEEE Transactions on circuits and systems for video technology 12(14) (2002)

    Google Scholar 

  6. Saidane, Z., Garcia, C.: Robust binarization for video text recognition. In: Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR), vol. 2 (2007)

    Google Scholar 

  7. Sato, T., Kanade, T., Hughes, E., Smith, M.: Video ocr for digital news archive. In: Proceedings of the International Workshop on Content-Based Access of Image and Video Databases (CAIVD 1998) (1998)

    Google Scholar 

  8. Horowitz, S., Pavlidis, T.: Picture segmentation by a tree traversal algorithm. ACM 2 (1976)

    Google Scholar 

  9. Karatzas, D., Antonacopoulos, A.: Text extraction from web images based on a split-and-merge segmentation method using colour perception. In: Proceedings of the 17th International Conference on Pattern Recognition. ICPR 2004, vol. 2 (2004)

    Google Scholar 

  10. Kopf, S., Haenselmann, T., Effelsberg, W.: Robust character recognition in low-resolution images and videos. Technical report, Department for Mathematics and Computer Science, University of Mannheim (2005)

    Google Scholar 

  11. Burges, C., Matan, O., LeCun, Y., Denker, J., Jackel, L., Stenard, C., Nohl, C., Ben, J.: Shortest path segmentation: A method for training a neural network to recognize character strings. In: Proceedings of the International Joint Conf. Neural Networks, vol. 3 (1992)

    Google Scholar 

  12. Chen, D., Odobez, J., Thiran, J.: Monte carlo text segmentation. Int. journal of Pattern Recognition and Artificial Intelligence (2005)

    Google Scholar 

  13. Sato, T., Kanade, T., Hughes, E., Smith, M., Satoh, S.: Video ocr: Indexing digital news libraries by recognition of superimposed captions. In: ACM Multimedia Systems Special Issue on Video Libraries (1998)

    Google Scholar 

  14. LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient based learning applied to document recognition. Proc. of the IEEE (1998)

    Google Scholar 

  15. Garcia, C., Delakis, M.: Convolutional face finder: A neural architecture for fast and robust face detection. IEEE Transactions on pattern analysis and machine intelligence 26 (2004)

    Google Scholar 

  16. Saidane, Z., Garcia, C.: Automatic scene text recognition using a convolutional neural network. In: Proceedings of the Second International Workshop on Camera-Based Document Analysis and Recognition (CBDAR) (2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Aurélio Campilho Mohamed Kamel

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Saidane, Z., Garcia, C. (2008). An Automatic Method for Video Character Segmentation. In: Campilho, A., Kamel, M. (eds) Image Analysis and Recognition. ICIAR 2008. Lecture Notes in Computer Science, vol 5112. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69812-8_55

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-69812-8_55

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-69811-1

  • Online ISBN: 978-3-540-69812-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics