Efficient Touching Text Line Segmentation in Tamil Script Using Horizontal Projection

  • Conference paper
Mining Intelligence and Knowledge Exploration

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 8284)

Abstract

This paper proposes an efficient method for segmenting machine-printed Tamil documents into text lines. Text line segmentation remains a problem because of interfering lines: the standard horizontal projection method cannot segment lines that overlap or touch. The proposed method builds on the horizontal projection technique to solve the problems of line overlapping and over-segmentation. Experimental results show 100% line segmentation accuracy on Tamil documents with different sizes and fonts that contain overlapping lines.
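As an illustration of the baseline the abstract refers to, the following is a minimal sketch of standard horizontal projection line segmentation, not the method proposed in the paper (whose details are not given here). It assumes a binarized page stored as a NumPy array with 1 for text pixels and 0 for background; the function names, the `threshold` parameter, and the synthetic test page are all hypothetical.

```python
import numpy as np


def horizontal_projection(binary):
    """Count foreground (text) pixels in each row of a 0/1 image."""
    return binary.sum(axis=1)


def segment_lines(binary, threshold=0):
    """Return (start, end) row ranges, end exclusive, where the
    row-wise pixel count is greater than `threshold`."""
    profile = horizontal_projection(binary)
    is_text = profile > threshold
    lines = []
    start = None
    for row, flag in enumerate(is_text):
        if flag and start is None:
            start = row                      # a text band begins
        elif not flag and start is not None:
            lines.append((start, row))       # the band ended on the previous row
            start = None
    if start is not None:                    # band runs to the bottom edge
        lines.append((start, len(is_text)))
    return lines


# Synthetic page (an assumption for illustration): two text lines
# separated by two blank rows.
page = np.zeros((12, 20), dtype=np.uint8)
page[1:4, 2:18] = 1
page[6:9, 2:18] = 1
separated = segment_lines(page)

# The same page with a single stroke bridging the gap, imitating a
# descender that touches the line below.
touching_page = page.copy()
touching_page[4:6, 10] = 1
touching = segment_lines(touching_page)

print(separated)  # [(1, 4), (6, 9)]
print(touching)   # [(1, 9)]
```

On the clean page the profile drops to zero between the lines, so two bands are found. Once a single stroke bridges the gap, no row has a zero count and the baseline merges both lines into one band, which is the failure case the abstract describes. Raising `threshold` to 1 happens to separate this toy example again, but that is only a simple workaround chosen for this sketch and should not be read as the paper's approach.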

Copyright information

© 2013 Springer International Publishing Switzerland

About this paper

Cite this paper

Kathirvalavakumar, T., Selvi, M.K. (2013). Efficient Touching Text Line Segmentation in Tamil Script Using Horizontal Projection. In: Prasath, R., Kathirvalavakumar, T. (eds) Mining Intelligence and Knowledge Exploration. Lecture Notes in Computer Science, vol 8284. Springer, Cham. https://doi.org/10.1007/978-3-319-03844-5_29

  • DOI: https://doi.org/10.1007/978-3-319-03844-5_29

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-03843-8

  • Online ISBN: 978-3-319-03844-5

  • eBook Packages: Computer Science, Computer Science (R0)
