Clustering of Farsi sub-word images for whole-book recognition

Mohammad Reza Soheili; Ehsanollah Kabir; Didier Stricker

doi:10.1117/12.2075931

8 February 2015

Proceedings Volume 9402, Document Recognition and Retrieval XXII; 94020C (2015) https://doi.org/10.1117/12.2075931
Event: SPIE/IS&T Electronic Imaging, 2015, San Francisco, California, United States

Abstract

Redundancy of word and sub-word occurrences in large documents can be effectively utilized in an OCR system to improve recognition results. Most OCR systems employ language modeling techniques as a post-processing step; however, these techniques do not use the important pictorial information that exists in the text image. In the case of large-scale recognition of degraded documents, this information is even more valuable. In our previous work, we proposed a sub-word image clustering method for applications dealing with large printed documents. In our clustering method, the ideal case is when all equivalent sub-word images lie in one cluster. To overcome the issues of low print quality, the clustering method uses an image matching algorithm for measuring the distance between two sub-word images. The measured distance, together with a set of simple shape features, was used to cluster all sub-word images. In this paper, we analyze the effects of adding more shape features on processing time, purity of clustering, and the final recognition rate. Previously published experiments have shown the efficiency of our method on a book. Here we present extended experimental results and evaluate our method on another book with a completely different font face. We also show that the number of newly created clusters in a page can be used as a criterion for assessing the quality of print and evaluating preprocessing phases.
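The general idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the image matching algorithm, the shape features, or any thresholds, so everything here is an assumption. The sketch uses the fraction of differing pixels as a stand-in distance, uses height, width, and ink-pixel count as stand-in shape features, and the names `shape_features`, `pixel_distance`, `cluster_pages`, `purity`, `max_dist`, and `ink_tol` are hypothetical.

```python
from collections import Counter

def shape_features(img):
    """Return (height, width, ink pixel count) for a binary image given as rows of 0/1."""
    return (len(img), len(img[0]), sum(map(sum, img)))

def pixel_distance(a, b):
    """Fraction of differing pixels: an assumed stand-in for an image matching algorithm."""
    if len(a) != len(b) or len(a[0]) != len(b[0]):
        return 1.0
    diff = sum(pa != pb for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))
    return diff / (len(a) * len(a[0]))

def cluster_pages(pages, max_dist=0.2, ink_tol=2):
    """Incrementally cluster sub-word images page by page.

    Returns the cluster id of every image (in reading order) and the number
    of clusters newly created on each page.
    """
    reps = []          # one (features, image) representative per cluster
    assignments = []
    new_per_page = []
    for page in pages:
        created = 0
        for img in page:
            feats = shape_features(img)
            best, best_d = None, max_dist
            for cid, (rep_feats, rep_img) in enumerate(reps):
                # Cheap shape-feature check before the costlier pixel comparison.
                if rep_feats[:2] != feats[:2] or abs(rep_feats[2] - feats[2]) > ink_tol:
                    continue
                d = pixel_distance(img, rep_img)
                if d <= best_d:
                    best, best_d = cid, d
            if best is None:
                # No sufficiently close cluster: start a new one.
                reps.append((feats, img))
                best = len(reps) - 1
                created += 1
            assignments.append(best)
        new_per_page.append(created)
    return assignments, new_per_page

def purity(assignments, labels):
    """Share of images that carry the majority label of their cluster."""
    by_cluster = {}
    for cid, label in zip(assignments, labels):
        by_cluster.setdefault(cid, Counter())[label] += 1
    return sum(c.most_common(1)[0][1] for c in by_cluster.values()) / len(labels)

# Toy data (invented for illustration): two shapes, one with a noisy pixel.
A = ((1, 1, 0), (1, 0, 0), (1, 1, 0))
A_NOISY = ((1, 1, 0), (1, 0, 0), (1, 1, 1))
B = ((0, 0, 1), (0, 1, 1), (0, 0, 1))
pages = [[A, B], [A_NOISY, B]]
assignments, new_per_page = cluster_pages(pages)
score = purity(assignments, ["a", "b", "a", "b"])
```

In this toy run the second page creates no new clusters, because both of its images match clusters started on the first page. In a sketch like this, a page that suddenly creates many new clusters would suggest that its images are matching poorly against earlier ones, which is the kind of signal the abstract proposes for assessing print quality.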

Citation

Mohammad Reza Soheili, Ehsanollah Kabir, and Didier Stricker "Clustering of Farsi sub-word images for whole-book recognition", Proc. SPIE 9402, Document Recognition and Retrieval XXII, 94020C (8 February 2015); https://doi.org/10.1117/12.2075931
