A non-contact method of capturing low-resolution text for OCR

Mirmehdi, M.; Clark, P.; Lam, J.

doi:10.1007/s10044-002-0172-8

A non-contact method of capturing low-resolution text for OCR

Published: April 2003

Volume 6, pages 12–21, (2003)
Cite this article

Pattern Analysis & Applications Aims and scope Submit manuscript

M. Mirmehdi¹,
P. Clark¹ &
J. Lam¹

94 Accesses
7 Citations
6 Altmetric
Explore all metrics

Abstract

Document recognition is a lively research area with much effort concentrated on optical character recognition. Less attention is paid to locating and extracting text from the general (non-desktop, non-scanner) environment. Such contact-free extraction of text from a general scene has applications in the context of wearable computing, robotic vision, point and click document capture, or as an aid for visually handicapped people. Here, a novel automatic text reading system is introduced using an active camera focused on text regions already located in the scene (using our recent work). Initially, a located region of text is analysed to determine the optimal zoom that would foveate onto it. Then a number of images are captured over the text region to construct a high-resolution mosaic composite of the whole region. This magnified image of the text is suitable for reading by humans or for recognition by OCR, or even for text-to speech synthesis. Although we employed a low resolution camera, we still obtained very good results.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Systematic Survey on CAPTCHA Recognition: Types, Creation and Breaking Techniques

Article 14 June 2021

OCR with Tesseract, Amazon Textract, and Google Document AI: a benchmarking experiment

Article Open access 22 November 2021

Eye Tracking and Eye-Based Human–Computer Interaction

Author information

Authors and Affiliations

Department of Computer Science, University of Bristol, Bristol BS8 1UB, UK., , , , , , GB
M. Mirmehdi, P. Clark & J. Lam

Authors

M. Mirmehdi
View author publications
You can also search for this author in PubMed Google Scholar
P. Clark
View author publications
You can also search for this author in PubMed Google Scholar
J. Lam
View author publications
You can also search for this author in PubMed Google Scholar

Additional information

ID="A1"Correspondance and offprint requests to: Dr M. Mirmehdi, Department of Computer Science, University of Bristol, Bristol BS8 1UB, UK. Email: majid@cs.bris.ac.uk

Rights and permissions

Reprints and permissions

About this article

Cite this article

Mirmehdi, M., Clark, P. & Lam, J. A non-contact method of capturing low-resolution text for OCR. Pattern Anal Appl 6, 12–21 (2003). https://doi.org/10.1007/s10044-002-0172-8

Download citation

Issue Date: April 2003
DOI: https://doi.org/10.1007/s10044-002-0172-8

Key words: Composite text mosaicing; Document recognition; High resolution text; OCR; Zooming

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A non-contact method of capturing low-resolution text for OCR

Abstract

Access this article

Similar content being viewed by others

A Systematic Survey on CAPTCHA Recognition: Types, Creation and Breaking Techniques

OCR with Tesseract, Amazon Textract, and Google Document AI: a benchmarking experiment

Eye Tracking and Eye-Based Human–Computer Interaction

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Navigation

A non-contact method of capturing low-resolution text for OCR

Abstract

Access this article

Similar content being viewed by others

A Systematic Survey on CAPTCHA Recognition: Types, Creation and Breaking Techniques

OCR with Tesseract, Amazon Textract, and Google Document AI: a benchmarking experiment

Eye Tracking and Eye-Based Human–Computer Interaction

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation