Skip to main content

Recursive Projection Profiling for Text-Image Separation

  • Conference paper
  • First Online:
Innovations in Computing Sciences and Software Engineering

Abstract

This paper presents an efficient and very simple method for separating text characters from graphical images in a given document image. This is based on a Recursive Projection Profiling (RPP) of the document image. The algorithm tries to use the projection profiling method [4] [6] to its maximum bent to bring out almost all that is possible with the method. The projection profile reveals the empty space along the horizontal and vertical axes, projecting the gaps between the characters/images. The algorithm turned out to be quite efficient, accurate and least complex in nature. Though some exceptional cases were encountered owing to the drawbacks of projection profiling, they were well handled with some simple heuristics thus resulting in a very efficient method for text-image separation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. S.N Srihari and V Govindaraju, Analysis of Textual Images Using the Hough Transform, Machine Vision and Applications, 2(3):141-153, 1989.

    Article  Google Scholar 

  2. N. J. Naccache and R Shinghal, Proposed Algorithm for Thinning Binary Patterns, IEEE Transactions on Systems, Man. and Cybernatics, SMC-14:409-418, 1984.

    Google Scholar 

  3. P. C. K Kwok, A Thinning Algorithm by Contour Generation, Communications of the ACM, Vol-31, No. 11, PP. 1314-1324, 1988.

    Article  Google Scholar 

  4. T. Taxt, P. J. Flynn & A. K. Jain, Segmentation of Document Images, IEEE Transaction on Pattern Analysis and Machine Intelligence, 11(12):1322-1329, December 1989.

    Article  Google Scholar 

  5. H. S. Baird, The Skew Angle of Printed Documents, In Proc. of the Conference Society of Photographic Scientists and Engineers, Volume 40, Pages 21-24, Rochester, NY, May, 20-21 1987.

    Google Scholar 

  6. [6]R. Cattani, T. Coianiez, S. Messelodi & C. Modena, Geometric Layout Analysis Techniques for Document Image Understanding: A Review, IRST Technical Report, Trento, Italy, 1998, 68pp.

    Google Scholar 

  7. [7]D. Wang, S. Srihari Classification of Newspaper Image Blocks Using Texture Analysis. Computer Vision, Graphics, and Image Processing, Vol. 47, 1989, pp.327-352.

    Google Scholar 

  8. [8]A. K. Jain and S. Bhattacharjee, Text Segmentation Using Gabor Filters for Automatic Document Processing, Machine Vision and Applications, Vol. 5, No. 3, 1992, pp. 169-184.

    Google Scholar 

  9. [9]O. Okun, D. Doermann, Matti P. Page Segmentation and zone classification. The State of the Art, Nov 1999.

    Google Scholar 

  10. C.L. Tan, Z. Zhang Text block segmentation using pyramid structure. SPIE Document Recognition and Retrieval, Vol. 8, January 24-25, 2001, San Jose, USA, pp. 297-306.

    Google Scholar 

  11. [11]H. Makino. Representation and segmentation of document images. Proc. of IEEE Computer Society Conference on Pattern Recognition and Image Processing, 1983, pp. 291-296.

    Google Scholar 

  12. J. Duong, M. Ct, H. Emptoz, C. Suen. Extraction of Text Areas in Printed Document Images. ACM Symposium on Document Engineering ,DocEng’01, Atlanta (USA), November9-10, 2001, pp. 157-165.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shivsubramani Krishnamoorthy .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer Science+Business Media B.V.

About this paper

Cite this paper

Krishnamoorthy, S., Loganathan, R., Soman, K.P. (2010). Recursive Projection Profiling for Text-Image Separation. In: Sobh, T., Elleithy, K. (eds) Innovations in Computing Sciences and Software Engineering. Springer, Dordrecht. https://doi.org/10.1007/978-90-481-9112-3_1

Download citation

  • DOI: https://doi.org/10.1007/978-90-481-9112-3_1

  • Published:

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-90-481-9111-6

  • Online ISBN: 978-90-481-9112-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics