Abstract.
We describe a process of word recognition that has high tolerance for poor image quality, tunability to the lexical content of the documents to which it is applied, and high speed of operation. This process relies on the transformation of text images into character shape codes, and on special lexica that contain information on the shape of words. We rely on the structure of English and the high efficiency of mapping between shape codes and the characters in the words. Remaining ambiguity is reduced by template matching using exemplars derived from surrounding text, taking advantage of the local consistency of font, face and size as well as image quality. This paper describes the effects of lexical content, structure and processing on the performance of a word recognition engine. Word recognition performance is shown to be enhanced by the application of an appropriate lexicon. Recognition speed is shown to be essentially independent of the details of lexical content provided the intersection of the occurrences of words in the document and the lexicon is high. Word recognition accuracy is dependent on both intersection and specificity of the lexicon.
Similar content being viewed by others
Author information
Authors and Affiliations
Additional information
Received May 1, 1998 / Revised October 20, 1998
Rights and permissions
About this article
Cite this article
Spitz, A. Shape-based word recognition. IJDAR 1, 178–190 (1999). https://doi.org/10.1007/s100320050017
Issue Date:
DOI: https://doi.org/10.1007/s100320050017