Skip to main content
Log in

Morphology-based text line extraction

  • Original Paper
  • Published:
Machine Vision and Applications Aims and scope Submit manuscript

Abstract

This paper presents a morphology-based text line extraction algorithm for extracting text regions from cluttered images. First of all, the method defines a novel set of morphological operations for extracting important contrast regions as possible text line candidates. The contrast feature is robust to lighting changes and invariant against different image transformations like image scaling, translation, and skewing. In order to detect skewed text lines, a moment-based method is then used for estimating their orientations. According to the orientation, an x-projection technique can be applied to extract various text geometries from the text-analogue segments for text verification. However, due to noise, a text line region is often fragmented to different pieces of segments. Therefore, after the projection, a novel recovery algorithm is then proposed for recovering a complete text line from its pieces of segments. After that, a verification scheme is then proposed for verifying all extracted potential text lines according to their text geometries. Experimental results show that the proposed method improves the state-of-the-art work in terms of effectiveness and robustness for text line detection.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Jung K., Kim K.I. and Jain A.K. (2004). Text information extraction in images and video: a survey. Patt. Recognit. 37(5): 977–997

    Article  Google Scholar 

  2. Dekun Z. and Shi Y.Q. (2005). Formatted text document data hiding robust to printing, copying and scanning. IEEE Int. Sym. Circuits Syst. 5: 4971–4974

    Article  Google Scholar 

  3. Smith, M.A., Kanade, T.: Video skimming for quick browsing based on audio and image characterization. Technical Report CMU-CS-95–186, Carnegei Mellon University, July 1995

  4. Sato, T., Kanade, T., Hughes, E.K., Smith, M.A.: Video OCR for digital news archive. 1998 IEEE International Workshop on Content-Based Access of Image and Video Database, pp. 52–60, Bombay India, 1998

  5. Lyu M.R., Song J. and Cai M. (2005). A comprehensive method for multilingual video text detection, localization, and extraction. IEEE Trans. Circuits Syst. Video Technol. 15(2): 243–255

    Article  Google Scholar 

  6. Zhang, N., Tao, T., Satya, R.V., Mukheriee, A.: Modified LZW algorithm for efficient compressed text retrieval. In: Proceeding of International Conference on Information Technology, Coding and Computer, pp. 224–228 (2004)

  7. Hoogs, A., Mundy, J., Cross, G.: Multi-modal fusion for video understanding. In: Proceeding 30th Applied Imagery Pattern Recognition, pp. 103–108 (2001)

  8. Zhong Y., Karu K. and Jain A.K. (1995). Locating text in complex color images. Patt. Recognit. 28(10): 1523–1536

    Article  Google Scholar 

  9. Lienhart, R., Stuber, F.: Automatic text recognition in digital videos. In: Proceeding of SPIE, pp. 180–188 (1996)

  10. Hasan Y.M.Y. and Karam L.J. (2000). Morphological text extraction from images. IEEE Trans. Image Process. 9(11): 1978–1983

    Article  Google Scholar 

  11. Wong E.K. and Chen M. (2003). A new robust algorithm for video text extraction. Patt. Recognit. 36(6): 1397–1406

    Article  MATH  Google Scholar 

  12. Sin, B., Kim, S., Cho, B.: Locating characters in scene images using frequency features. In: Proceedings of International Conference on Pattern Recognition, vol. 3, Canada, pp. 489–492 (2002)

  13. Mao, W., Chung, F., Lanm, K., Siu, W.: Hybrid Chinese/English text detection in images and video frames. In: Proceedings of International Conference on Pattern Recognition, vol. 3, Canada, pp. 1015–1018 (2002)

  14. Kim K.I., Jung K., Park S.H. and Kim H.J. (2001). Support vector machine-based text detection in digital video. Patt. Recognit. 34(2): 527–529

    Article  Google Scholar 

  15. Xiangrong, C., Yuille, A.L.: Detecting and reading text in natural scenes. In: Proceeding of the IEEE Computer Vision and Pattern Recognition, vol.2, pp. 366–373 (2004)

  16. Sonka M., Hlavac V. and Boyle R. (1993). Image Processing, Analysis, and Machine Vision. Chapman & Hall, London

    Google Scholar 

  17. Press, W.H., Teukolsky, S.A., Vetterling, W.T., Flannery, B.P.: Numerical Recipes in C (1992)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jun-Wei Hsieh.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wu, JC., Hsieh, JW. & Chen, YS. Morphology-based text line extraction. Machine Vision and Applications 19, 195–207 (2008). https://doi.org/10.1007/s00138-007-0092-0

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00138-007-0092-0

Keywords

Navigation