Morphology-based text line extraction

Wu, Jui-Chen; Hsieh, Jun-Wei; Chen, Yung-Sheng

doi:10.1007/s00138-007-0092-0

Morphology-based text line extraction

Original Paper
Published: 03 August 2007

Volume 19, pages 195–207, (2008)
Cite this article

Machine Vision and Applications Aims and scope Submit manuscript

Jui-Chen Wu¹,
Jun-Wei Hsieh¹ &
Yung-Sheng Chen¹

319 Accesses
23 Citations
Explore all metrics

Abstract

This paper presents a morphology-based text line extraction algorithm for extracting text regions from cluttered images. First of all, the method defines a novel set of morphological operations for extracting important contrast regions as possible text line candidates. The contrast feature is robust to lighting changes and invariant against different image transformations like image scaling, translation, and skewing. In order to detect skewed text lines, a moment-based method is then used for estimating their orientations. According to the orientation, an x-projection technique can be applied to extract various text geometries from the text-analogue segments for text verification. However, due to noise, a text line region is often fragmented to different pieces of segments. Therefore, after the projection, a novel recovery algorithm is then proposed for recovering a complete text line from its pieces of segments. After that, a verification scheme is then proposed for verifying all extracted potential text lines according to their text geometries. Experimental results show that the proposed method improves the state-of-the-art work in terms of effectiveness and robustness for text line detection.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Jung K., Kim K.I. and Jain A.K. (2004). Text information extraction in images and video: a survey. Patt. Recognit. 37(5): 977–997
Article Google Scholar
Dekun Z. and Shi Y.Q. (2005). Formatted text document data hiding robust to printing, copying and scanning. IEEE Int. Sym. Circuits Syst. 5: 4971–4974
Article Google Scholar
Smith, M.A., Kanade, T.: Video skimming for quick browsing based on audio and image characterization. Technical Report CMU-CS-95–186, Carnegei Mellon University, July 1995
Sato, T., Kanade, T., Hughes, E.K., Smith, M.A.: Video OCR for digital news archive. 1998 IEEE International Workshop on Content-Based Access of Image and Video Database, pp. 52–60, Bombay India, 1998
Lyu M.R., Song J. and Cai M. (2005). A comprehensive method for multilingual video text detection, localization, and extraction. IEEE Trans. Circuits Syst. Video Technol. 15(2): 243–255
Article Google Scholar
Zhang, N., Tao, T., Satya, R.V., Mukheriee, A.: Modified LZW algorithm for efficient compressed text retrieval. In: Proceeding of International Conference on Information Technology, Coding and Computer, pp. 224–228 (2004)
Hoogs, A., Mundy, J., Cross, G.: Multi-modal fusion for video understanding. In: Proceeding 30th Applied Imagery Pattern Recognition, pp. 103–108 (2001)
Zhong Y., Karu K. and Jain A.K. (1995). Locating text in complex color images. Patt. Recognit. 28(10): 1523–1536
Article Google Scholar
Lienhart, R., Stuber, F.: Automatic text recognition in digital videos. In: Proceeding of SPIE, pp. 180–188 (1996)
Hasan Y.M.Y. and Karam L.J. (2000). Morphological text extraction from images. IEEE Trans. Image Process. 9(11): 1978–1983
Article Google Scholar
Wong E.K. and Chen M. (2003). A new robust algorithm for video text extraction. Patt. Recognit. 36(6): 1397–1406
Article MATH Google Scholar
Sin, B., Kim, S., Cho, B.: Locating characters in scene images using frequency features. In: Proceedings of International Conference on Pattern Recognition, vol. 3, Canada, pp. 489–492 (2002)
Mao, W., Chung, F., Lanm, K., Siu, W.: Hybrid Chinese/English text detection in images and video frames. In: Proceedings of International Conference on Pattern Recognition, vol. 3, Canada, pp. 1015–1018 (2002)
Kim K.I., Jung K., Park S.H. and Kim H.J. (2001). Support vector machine-based text detection in digital video. Patt. Recognit. 34(2): 527–529
Article Google Scholar
Xiangrong, C., Yuille, A.L.: Detecting and reading text in natural scenes. In: Proceeding of the IEEE Computer Vision and Pattern Recognition, vol.2, pp. 366–373 (2004)
Sonka M., Hlavac V. and Boyle R. (1993). Image Processing, Analysis, and Machine Vision. Chapman & Hall, London
Google Scholar
Press, W.H., Teukolsky, S.A., Vetterling, W.T., Flannery, B.P.: Numerical Recipes in C (1992)

Download references

Author information

Authors and Affiliations

Department of Electrical Engineering, Yuan Ze University, 135 Yuan-Tung Road, Chung-Li, 320, Taiwan, ROC
Jui-Chen Wu, Jun-Wei Hsieh & Yung-Sheng Chen

Authors

Jui-Chen Wu
View author publications
You can also search for this author in PubMed Google Scholar
Jun-Wei Hsieh
View author publications
You can also search for this author in PubMed Google Scholar
Yung-Sheng Chen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jun-Wei Hsieh.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wu, JC., Hsieh, JW. & Chen, YS. Morphology-based text line extraction. Machine Vision and Applications 19, 195–207 (2008). https://doi.org/10.1007/s00138-007-0092-0

Download citation

Received: 13 September 2006
Revised: 03 March 2007
Accepted: 24 April 2007
Published: 03 August 2007
Issue Date: May 2008
DOI: https://doi.org/10.1007/s00138-007-0092-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Morphology-based text line extraction

Abstract

Access this article

Similar content being viewed by others

Automated Text Detection and Text-Line Construction in Natural Images

An Efficient Detection Method for Text of Arbitrary Orientations in Natural Images

Residual Dual Scale Scene Text Spotting by Fusing Bottom-Up and Top-Down Processing

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Morphology-based text line extraction

Abstract

Access this article

Similar content being viewed by others

Automated Text Detection and Text-Line Construction in Natural Images

An Efficient Detection Method for Text of Arbitrary Orientations in Natural Images

Residual Dual Scale Scene Text Spotting by Fusing Bottom-Up and Top-Down Processing

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation