Shape-based word recognition

Spitz, A. Lawrence

doi:10.1007/s100320050017

A. Lawrence Spitz¹

Abstract.

We describe a process of word recognition that has high tolerance for poor image quality, tunability to the lexical content of the documents to which it is applied, and high speed of operation. This process relies on the transformation of text images into character shape codes, and on special lexica that contain information on the shape of words. We rely on the structure of English and the high efficiency of mapping between shape codes and the characters in the words. Remaining ambiguity is reduced by template matching using exemplars derived from surrounding text, taking advantage of the local consistency of font, face and size as well as image quality. This paper describes the effects of lexical content, structure and processing on the performance of a word recognition engine. Word recognition performance is shown to be enhanced by the application of an appropriate lexicon. Recognition speed is shown to be essentially independent of the details of lexical content provided the intersection of the occurrences of words in the document and the lexicon is high. Word recognition accuracy is dependent on both intersection and specificity of the lexicon.
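
The idea of character shape codes and shape lexica can be illustrated with a minimal sketch in Python. The class assignment used here (ascender, descender, x-height, dotted i, j), the code symbols, and all function names are assumptions made for illustration. They are not taken from the paper and may differ from the coding scheme it actually uses.

```python
from collections import defaultdict

# Hypothetical shape classes: an illustrative assumption, not the paper's table.
ASCENDERS = set("bdfhklt")   # lowercase letters rising above x-height
DESCENDERS = set("gpqy")     # lowercase letters dropping below the baseline


def shape_code(word: str) -> str:
    """Map each character of a word to a coarse shape symbol.

    Assumes plain alphanumeric input; any other character falls
    through to the x-height class "x".
    """
    codes = []
    for ch in word:
        if ch.isupper() or ch.isdigit() or ch in ASCENDERS:
            codes.append("A")   # tall characters
        elif ch in DESCENDERS:
            codes.append("g")   # characters with descenders
        elif ch == "i":
            codes.append("i")   # dotted, x-height
        elif ch == "j":
            codes.append("j")   # dotted, with descender
        else:
            codes.append("x")   # plain x-height characters
    return "".join(codes)


def build_shape_lexicon(words):
    """Group words by shape code so one code can be looked up quickly."""
    lexicon = defaultdict(list)
    for w in words:
        lexicon[shape_code(w)].append(w)
    return lexicon


def candidates(lexicon, code):
    """Return every lexicon word sharing the given shape code."""
    return lexicon.get(code, [])


if __name__ == "__main__":
    lex = build_shape_lexicon(["word", "ward", "the", "shape"])
    print(candidates(lex, shape_code("word")))
```

In this sketch "word" and "ward" share one shape code, so a lookup returns both. This is the kind of remaining ambiguity that the abstract says is reduced by template matching against exemplars from the surrounding text.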

Author information

Authors and Affiliations

Document Recognition Technologies, Inc., 459 Hamilton Avenue, Suite 204, Palo Alto, CA 94301, USA; e-mail: spitz@docrec.com
A. Lawrence Spitz

Additional information

Received May 1, 1998 / Revised October 20, 1998

About this article

Cite this article

Spitz, A. Shape-based word recognition. IJDAR 1, 178–190 (1999). https://doi.org/10.1007/s100320050017

Issue Date: May 1999
DOI: https://doi.org/10.1007/s100320050017

Key words: Word recognition – Character shape coding – Lexical contents – Lexical specificity – Template matching
