Character sequence prediction method for training data creation in the task of text recognition

Pavel K. Zlobin; Yulia S. Chernyshova; Alexander V. Sheshkus; Vladimir V. Arlazarov

doi:10.1117/12.2623773

4 March 2022

Proceedings Volume 12084, Fourteenth International Conference on Machine Vision (ICMV 2021); 120840R (2022) https://doi.org/10.1117/12.2623773
Event: Fourteenth International Conference on Machine Vision (ICMV 2021), 2021, Rome, Italy

Abstract

For text line recognition, much attention is paid to augmentation of the training images. Yet the inner structure of the textual information in the images also affects the accuracy of the resulting model. In this paper, we propose an ANN-based method for generating the textual data that is rendered onto background images when creating a synthetic training sample. Our method avoids both completely random sequences and dictionary-based ones. As a result, we obtain data that preserves the basic properties of the target language model, such as the balance of vowels and consonants, but avoids lexicon-based properties, such as the prevalence of specific characters. Moreover, as our method focuses only on high-level features and does not try to generate real words, we can use a small training sample and a lightweight ANN for text generation. To evaluate our method, we train three ANNs with the same architecture but different training samples. We choose machine-readable zones as the target field because their structure does not correspond to an ordinary lexicon. The results of experiments on three public datasets of identity documents demonstrate the effectiveness of our method and improve on the state-of-the-art results for the target field.
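
The abstract does not specify the network or its training setup, so the following is only a toy illustration of the general idea of predicting the next character from the previous ones. It deliberately swaps the authors' ANN for a plain character bigram (Markov) model, which is not the method of the paper; the function names, the sample word list, and the vowel set are all hypothetical.

```python
import random
from collections import Counter, defaultdict

# Assumption: a Latin uppercase alphabet with this vowel set.
VOWELS = set("AEIOU")


def train_bigram_model(words):
    """Count character-to-character transitions.

    '^' marks the start of a word and '$' marks its end.
    """
    if not words:
        raise ValueError("at least one training word is required")
    transitions = defaultdict(Counter)
    for word in words:
        chars = ["^"] + list(word.upper()) + ["$"]
        for prev, nxt in zip(chars, chars[1:]):
            transitions[prev][nxt] += 1
    return transitions


def sample_sequence(transitions, rng, max_len=12):
    """Sample one pseudo-word by walking the transition counts."""
    out = []
    prev = "^"
    while len(out) < max_len:
        counter = transitions[prev]
        candidates = list(counter.keys())
        weights = list(counter.values())
        nxt = rng.choices(candidates, weights=weights, k=1)[0]
        if nxt == "$":
            break
        out.append(nxt)
        prev = nxt
    return "".join(out)


def vowel_ratio(text):
    """Share of vowels among the alphabetic characters of text."""
    letters = [c for c in text if c.isalpha()]
    if not letters:
        return 0.0
    return sum(c in VOWELS for c in letters) / len(letters)


if __name__ == "__main__":
    # Hypothetical tiny training sample; a real one would be larger.
    training_words = ["SMITH", "JOHNSON", "MARIA", "ANDERSON", "OLIVEIRA"]
    model = train_bigram_model(training_words)
    rng = random.Random(0)
    generated = [sample_sequence(model, rng) for _ in range(10)]
    print(generated)
    print("vowel ratio, training: ", vowel_ratio("".join(training_words)))
    print("vowel ratio, generated:", vowel_ratio("".join(generated)))
```

Sequences sampled this way are usually not dictionary words, yet each adjacent character pair was observed in the training sample, which is one simple way to keep coarse statistics such as the vowel and consonant balance close to those of the source text.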

Citation

Pavel K. Zlobin, Yulia S. Chernyshova, Alexander V. Sheshkus, and Vladimir V. Arlazarov "Character sequence prediction method for training data creation in the task of text recognition", Proc. SPIE 12084, Fourteenth International Conference on Machine Vision (ICMV 2021), 120840R (4 March 2022); https://doi.org/10.1117/12.2623773

Proceedings paper, 9 pages

KEYWORDS

Neural networks

Data acquisition

Data modeling

Optical character recognition

Detection and tracking algorithms

Feature extraction

Image segmentation
