Computer recognition of printed Tamil characters

https://doi.org/10.1016/0031-3203(78)90032-8

Abstract

Computer recognition of machine-printed letters of the Tamil alphabet is described. Each character is represented as a binary matrix and encoded into a string using two different methods. The encoded strings form a dictionary. A given text is presented symbol by symbol; information from each symbol is extracted in the form of a string and compared with the strings in the dictionary. When there is agreement, the letters are recognized and printed out in Roman letters following a special method of transliteration. The lengthening of vowels and hardening of consonants are indicated by numerals printed above each letter.
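
The encode-and-look-up pipeline described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the two encoding methods, so the row-wise run-length encoding below is an assumed stand-in, not the authors' scheme. The 3x3 patterns, the labels `glyph_1` and `glyph_2`, and the function names are all hypothetical and chosen only for illustration; the transliteration step is not modelled.

```python
from typing import Dict, List, Optional

Matrix = List[List[int]]  # binary matrix: 1 = ink, 0 = background


def encode_rows(matrix: Matrix) -> str:
    """Encode a binary matrix as a string (assumed scheme, not the paper's).

    Each non-empty row is run-length encoded as "<bit>x<count>" items
    joined by commas; rows are joined by "/".
    """
    encoded_rows = []
    for row in matrix:
        runs = []
        prev, count = row[0], 0
        for bit in row:
            if bit == prev:
                count += 1
            else:
                runs.append(f"{prev}x{count}")
                prev, count = bit, 1
        runs.append(f"{prev}x{count}")  # flush the final run
        encoded_rows.append(",".join(runs))
    return "/".join(encoded_rows)


def build_dictionary(templates: Dict[str, Matrix]) -> Dict[str, str]:
    """Map each template's encoded string to its label."""
    return {encode_rows(m): label for label, m in templates.items()}


def recognize(symbol: Matrix, dictionary: Dict[str, str]) -> Optional[str]:
    """Return the label whose encoded string agrees exactly, else None."""
    return dictionary.get(encode_rows(symbol))


# Hypothetical toy templates; real characters would use larger matrices.
TEMPLATES: Dict[str, Matrix] = {
    "glyph_1": [[1, 1, 1], [1, 0, 1], [1, 1, 1]],
    "glyph_2": [[0, 1, 0], [0, 1, 0], [0, 1, 0]],
}
DICTIONARY = build_dictionary(TEMPLATES)

if __name__ == "__main__":
    print(recognize([[0, 1, 0], [0, 1, 0], [0, 1, 0]], DICTIONARY))
    print(recognize([[0, 0, 0], [0, 0, 0], [0, 0, 0]], DICTIONARY))
```

Exact string agreement keeps the lookup to a single dictionary access, but it means one flipped pixel defeats recognition; whether the original system tolerated such noise is not stated in the abstract.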

Cited by (46)

  • A multi-scale deep quad tree based feature extraction method for the recognition of isolated handwritten characters of popular Indic scripts

    2017, Pattern Recognition
    Citation Excerpt :

    Among different feature extraction techniques mentioned in contemporary literature, some are generic e.g. basic shape based primitive features [1,2], gradient based features [4,19], shadow features [20,9], moment-based features [7], contour based features [9,10] etc. which have been successfully used for the recognition of isolated handwritten characters and digits of multiple Indic scripts. On the other hand, some methodologies utilize some shape based trait, specific to the script used in their respective approaches e.g. Das et al. [21], Negi et al. [22], Bajaj et al. [23], Siromoney et al. [24] etc. Although the first approach is easier to design, the latter provides better [1,25] performance in terms of recognition accuracy.

  • An effective feature set for enhancing printed Tamil character recognition

    2021, Journal of the National Science Foundation of Sri Lanka