Abstract
This paper proposes an efficient method for segmenting machine-printed Tamil documents into text lines. Text line segmentation remains a problem because adjacent lines interfere with one another. The standard horizontal projection method cannot separate lines that overlap or touch. The proposed method also relies on the horizontal projection technique, but applies it in a way that resolves both line overlapping and over-segmentation. Experimental results show 100% line segmentation accuracy on Tamil documents with different sizes and different fonts, including documents with overlapping lines.
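To make the limitation of the standard technique concrete, the following is a minimal sketch of plain horizontal projection on a binary image. It is not the method proposed in the paper. It assumes a page stored as a list of rows with 1 for ink and 0 for background, and the names `horizontal_projection`, `segment_lines`, and `threshold` are hypothetical, chosen only for illustration.

```python
def horizontal_projection(image):
    """Count foreground (1) pixels in each row of a binary image."""
    return [sum(row) for row in image]


def segment_lines(image, threshold=0):
    """Return (start_row, end_row) pairs, end exclusive, for each run of
    consecutive rows whose pixel count exceeds `threshold`."""
    profile = horizontal_projection(image)
    lines = []
    start = None
    for y, count in enumerate(profile):
        if count > threshold and start is None:
            start = y                   # first inked row of a new line
        elif count <= threshold and start is not None:
            lines.append((start, y))    # blank row closes the current line
            start = None
    if start is not None:               # line runs to the bottom edge
        lines.append((start, len(profile)))
    return lines


# Toy page (an assumption for illustration): two text lines split by a blank row.
page = [
    [0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 1, 0],
    [0, 1, 1, 1, 1, 0],
    [0, 0, 0, 0, 0, 0],
    [0, 1, 0, 1, 1, 0],
    [0, 1, 1, 1, 0, 0],
    [0, 0, 0, 0, 0, 0],
]

# Same page, but one stroke from the upper line reaches into the gap row.
touching = [row[:] for row in page]
touching[3][2] = 1

separated = segment_lines(page)
merged = segment_lines(touching)
print(separated)
print(merged)
```

On the clean page the blank row yields two separate line spans. On the touching page a single inked pixel in the gap row removes the zero valley from the profile, so the two lines are reported as one span. This is the failure case the abstract attributes to the standard method.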
Copyright information
© 2013 Springer International Publishing Switzerland
About this paper
Cite this paper
Kathirvalavakumar, T., Selvi, M.K. (2013). Efficient Touching Text Line Segmentation in Tamil Script Using Horizontal Projection. In: Prasath, R., Kathirvalavakumar, T. (eds) Mining Intelligence and Knowledge Exploration. Lecture Notes in Computer Science, vol 8284. Springer, Cham. https://doi.org/10.1007/978-3-319-03844-5_29
DOI: https://doi.org/10.1007/978-3-319-03844-5_29
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-03843-8
Online ISBN: 978-3-319-03844-5
eBook Packages: Computer Science (R0)