Paper
8 February 2015 Clustering of Farsi sub-word images for whole-book recognition
Mohammad Reza Soheili, Ehsanollah Kabir, Didier Stricker
Author Affiliations +
Proceedings Volume 9402, Document Recognition and Retrieval XXII; 94020C (2015) https://doi.org/10.1117/12.2075931
Event: SPIE/IS&T Electronic Imaging, 2015, San Francisco, California, United States
Abstract
Redundancy of word and sub-word occurrences in large documents can be effectively utilized in an OCR system to improve recognition results. Most OCR systems employ language modeling techniques as a post-processing step; however these techniques do not use important pictorial information that exist in the text image. In case of large-scale recognition of degraded documents, this information is even more valuable. In our previous work, we proposed a subword image clustering method for the applications dealing with large printed documents. In our clustering method, the ideal case is when all equivalent sub-word images lie in one cluster. To overcome the issues of low print quality, the clustering method uses an image matching algorithm for measuring the distance between two sub-word images. The measured distance with a set of simple shape features were used to cluster all sub-word images. In this paper, we analyze the effects of adding more shape features on processing time, purity of clustering, and the final recognition rate. Previously published experiments have shown the efficiency of our method on a book. Here we present extended experimental results and evaluate our method on another book with totally different font face. Also we show that the number of the new created clusters in a page can be used as a criteria for assessing the quality of print and evaluating preprocessing phases.
© (2015) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Mohammad Reza Soheili, Ehsanollah Kabir, and Didier Stricker "Clustering of Farsi sub-word images for whole-book recognition", Proc. SPIE 9402, Document Recognition and Retrieval XXII, 94020C (8 February 2015); https://doi.org/10.1117/12.2075931
Lens.org Logo
CITATIONS
Cited by 3 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Optical character recognition

Distance measurement

Image processing

Image segmentation

Detection and tracking algorithms

Shape analysis

Systems modeling

Back to Top