Perturbation Models for Generating Synthetic Training Data in Handwriting Recognition

Varga, Tamás; Bunke, Horst

doi:10.1007/978-3-540-76280-5_13

Tamás Varga⁴ &
Horst Bunke⁴

Part of the book series: Studies in Computational Intelligence ((SCI,volume 90))

2638 Accesses
9 Citations

In this chapter, the use of synthetic training data for handwriting recognition is studied. After an overview of the previous works related to the field, the authors' main results regarding this research area are presented and discussed, including a perturbation model for the generation of synthetic text lines from existing cursively handwritten lines of text produced by human writers. The goal of synthetic text line generation is to improve the performance of an off-line cursive handwriting recognition system by providing it with additional training data. It can be expected that by adding synthetic training data the variability of the training set improves, which leads to a higher recognition rate. On the other hand, synthetic training data may bias a recognizer towards unnatural handwriting styles, which could lead to a deterioration of the recognition rate. The proposed perturbation model is evaluated under several experimental conditions, and it is shown that significant improvement of the recognition performance is possible even when the original training set is large and the text lines are provided by a large number of different writers.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

A sigma-lognormal model-based approach to generating large synthetic online handwriting sample databases

Article 26 May 2017

Maximizing Data Efficiency of HTR Models by Synthetic Text

Improving Handwriting Recognition for Historical Documents Using Synthetic Text Lines

References

Bunke, H.: Recognition of Cursive Roman Handwriting - Past, Present and Future. In: Proc. 7th Int. Conf. on Document Analysis and Recognition, Edinburgh, Scotland (2003) 448-459
Chapter Google Scholar
Plamondon, R., Srihari, S.: On-line and Off-line Handwriting Recognition: A Comprehensive Survey. IEEE Trans. on Pattern Analysis and Machine Intelli-gence 22(1) (2000) 63-84
Article Google Scholar
Srihari, S.: Handwritten Address Interpretation: a Task of Many Pattern Recog- nition Problems. Int. Journal of Pattern Recognition and Artificial Intelligence 14(5)(2000) 663-674
Article Google Scholar
. Impedovo, S., Wang, P., Bunke, H., eds.: Automatic Bankcheck Processing. World Scientific (1997)
Google Scholar
Gopisetty, S., Lorie, R., Mao, J., Mohiuddin, M., Sorin, A., Yair, E.: Auto-mated Forms-processing Software and Services. IBM Journal of Research and Development 40(2) (1996) 211-230
Article Google Scholar
Ye, X., Cheriet, M., Suen, C.: A Generic Method of Cleaning and Enhancing Handwritten Data from Business Forms. Int. Journal on Document Analysis and Recognition 4(2) (2001) 84-96
Article Google Scholar
Arica, N., Yarman-Vural, F.: An Overview of Character Recognition Focused on Off-line Handwriting. IEEE Trans. on Systems, Man, and Cybernetics - Part C: Applications and Reviews 31(2) (2001) 216-233
Article Google Scholar
. Mori, S., Suen, C., Yamamoto, K.: Historical Review of OCR Research and Development. In O’Gorman, L., Kasturi, R., eds.: Document Image Analysis. IEEE Computer Society Press (1995) 244-273
Google Scholar
Simon, J.C.: Off-line Cursive Word Recognition. Proceedings of the IEEE 80(7) (1992) 1150-1161
Article Google Scholar
Steinherz, T., Rivlin, E., Intrator, N.: Offline Cursive Script Word Recognition - a Survey. Int. Journal on Document Analysis and Recognition 2(2) (1999) 90-110
Google Scholar
Vinciarelli, A.: A Survey on Off-line Cursive Word Recognition. Pattern Recog- nition 35(7) (2002) 1433-1446
Article MATH Google Scholar
Marti, U.V., Bunke, H.: Using a Statistical Language Model to Improve the Performance of an HMM-based Cursive Handwriting Recognition System. Int. Journal of Pattern Recognition and Artificial Intelligence 15(1) (2001) 65-90
Article Google Scholar
Kittler, J., Hatef, M., Duin, R., Matas, J.: On Combining Classifiers. IEEE Trans. on Pattern Analysis and Machine Intelligence 20(3) (1998) 226-239
Article Google Scholar
Roli, F., Kittler, J., Windeatt, T., eds.: Proc. 5th Int. Workshop on Multiple Classifier Systems, Cagliari, Italy, Springer (2004)
Google Scholar
.Kuncheva, L.: Combining Pattern Classifiers: Methods and Algorithms. Wiley- Interscience (2004)
Google Scholar
Lorette, G.: Handwriting Recognition or Reading? - What is the Situation at the Dawn of the 3rd Millenium? Int. Journal on Document Analysis and Recognition 2(1) (1999) 2-12
Article Google Scholar
Rosenfeld, R.: Two Decades of Statistical Language Modeling: Where do We Go from Here? Proc. of the IEEE 88(8) (2000) 1270-1278
Article Google Scholar
. Elliman, D., Sherkat, N.: A Truthing Tool for Generating a Database of Cursive Words. In: Proc. 6th Int. Conf. on Document Analysis and Recognition, Seattle, WA, USA (2001) 1255-1262
Google Scholar
. Guyon, I., Haralick, R., Hull, J., Phillips, I.: Data Sets for OCR and Document Image Understanding Research. In Bunke, H., Wang, P., eds.: Handbook of Character Recognition and Document Image Analysis. World Scientific (1997) 779-799
Google Scholar
Marti, U.V., Bunke, H.: The IAM-Database: an English Sentence Database for Off-line Handwriting Recognition. Int. Journal on Document Analysis and Recognition 5(1) (2002) 39-46
Article MATH Google Scholar
Cano, J., Pérez-Cortes, J., Arlandis, J., Llobet, R.: Training Set Expansion in Handwritten Character Recognition. In: Proc. 9th SSPR / 4th SPR, Windsor, Ontario, Canada (2002) 548-556
Google Scholar
Günter, S., Bunke, H.: Multiple Classifier Systems in Offline Handwritten Word Recognition - On the Influence of Training Set and Vocabulary Size. Int. Journal of Pattern Recognition and Artificial Intelligence 18(7) (2004) 1302-1320
Google Scholar
Rowley, H., Goyal, M., Bennett, J.: The Effect of Large Training Set Sizes on Online Japanese Kanji and English Cursive Recognizers. In: Proc. 8th Int. Workshop on Frontiers in Handwriting Recognition, Niagara-on-the-Lake, Ontario, Canada (2002) 36-40
Chapter Google Scholar
Velek, O., Nakagawa, M.: The Impact of Large Training Sets on the Recogni-tion Rate of Off-line Japanese Kanji Character Classifiers. In: Proc. 5th IAPR Workshop on Document Analysis Systems, Princeton, New Jersey, USA (2002) 106-109
Google Scholar
Varga, T., Bunke, H.: Generation of Synthetic Training Data for an HMM-based Handwriting Recognition System. In: Proc. 7th Int. Conf. on Document Analysis and Recognition, Edinburgh, Scotland (2003) 618-622
Chapter Google Scholar
Varga, T., Bunke, H.: Off-line Handwritten Textline Recognition Using a Mix-ture of Natural and Synthetic Training Data. In: Proc. 17th Int. Conf. on Pattern Recognition, Cambridge, United Kingdom (2004) 545-549
Chapter Google Scholar
Kasturi, R., O’Gorman, L., Govindaraju, V.: Document Image Analysis: A Primer. Sadhana 27(1) (2002) 3-22
Article Google Scholar
Nagy, G., Nartker, T., Rice, S.: Optical Character Recognition: An Illustrated Guide to the Frontier. In: Proc. IS&T/SPIE Symposium on Electronic Imaging: Science and Technology. Volume 3967., San Jose, CA, USA (2000) 58-69
Google Scholar
Rice, S., Jenkins, F., Nartker, T.: The Fifth Annual Test of OCR Accuracy. Technical Report ISRI-TR-96-01, University of Nevada, Las Vegas, Nevada, USA (1996)
Google Scholar
Baird, H.: The State of the Art of Document Image Degradation Modeling. In: Proc. 4th IAPR Workshop on Document Analysis Systems, Rio de Janeiro, Brasil (2000) 1-13
Google Scholar
. Stork, D.: Toward a Computational Theory of Data Acquisition and Truthing. In Helmbold, D., Williamson, B., eds.: Computational Learning Theory. Volume 2111 of Lecture Notes in Computer Science. Springer (2001) 194-207
Google Scholar
Vuurpijl, L., Niels, R., Erp, M., Schomaker, L., Ratzlaff, E.: Verifying the UNIPEN devset. In: Proc. 9th Int. Workshop on Frontiers in Handwriting Recognition, Kokubunji, Tokyo, Japan (2004) 586-591
Chapter Google Scholar
. Baird, H.: Document Image Defect Models. In Baird, H., Bunke, H., Yamamoto, K., eds.: Structured Document Image Analysis. Springer (1992) 546-556
Google Scholar
Doermann, D., Yao, S.: Generating Synthetic Data for Text Analysis Systems. In: Proc. 4th Annual Symposium on Document Analysis and Information Re-trieval, Las Vegas, Nevada, USA (1995) 449-467
Google Scholar
Kanungo, T., Haralick, R., Phillips, I.: Global and Local Document Degrada-tion Models. In: Proc. 2nd Int. Conf. on Document Analysis and Recognition, Tsukuba Science City, Japan (1993) 730-734
Chapter Google Scholar
Baird, H.: Document Image Defect Models and their Uses. In: Proc. 2nd Int. Conf. on Document Analysis and Recognition, Tsukuba Science City, Japan (1993) 62-67
Chapter Google Scholar
Ho, T., Baird, H.: Large-Scale Simulation Studies in Image Pattern Recognition. IEEE Trans. on Pattern Analysis and Machine Intelligence 19(10) (1997) 1067-1079
Google Scholar
Märgner, V., Pechwitz, M.: Synthetic Data for Arabic OCR System Develop-ment. In: Proc. 6th Int. Conf. on Document Analysis and Recognition, Seattle, WA, USA (2001) 1159-1163
Chapter Google Scholar
Baird, H., Fossey, R.: A 100-Font Classifier. In: Proc. 1st Int. Conf. on Document Analysis and Recognition, St.Malo, France (1991) 332-340
Google Scholar
Baird, H., Nagy, G.: A Self-Correcting 100-Font Classifier. In: Proc. IS&T/SPIE Symposium on Electronic Imaging: Science and Technology. Volume 2181., San Jose, California, USA (1994) 106-115
Google Scholar
Ho, T., Baird, H.: Evaluation of OCR Accuracy Using Synthetic Data. In: Proc. 4th Annual Symposium on Document Analysis and Information Retrieval, Las Vegas, Nevada, USA (1995) 413-422
Google Scholar
Helmers, M., Bunke, H.: Generation and Use of Synthetic Training Data in Cursive Handwriting Recognition. In: Proc. 1st Iberian Conf. on Pattern Recog-nition and Image Analysis, Puerto de Andratx, Mallorca, Spain (2003) 336-345
Google Scholar
. Drucker, H., Schapire, R., Simard, P.: Improving Performance in Neural Net- works Using a Boosting Algorithm. In Hanson et. al., S., ed.: Advances in Neural Information Processing Systems 5. Morgan Kaufmann (1993) 42-49
Google Scholar
Ha, T., Bunke, H.: Off-line Handwritten Numeral Recognition by Perturba- tion Method. IEEE Trans. on Pattern Analysis and Machine Intelligence 19(5) (1997) 535-539
Article Google Scholar
Mao, J., Mohiuddin, K.: Improving OCR Performance Using Character Degra-dation Models and Boosting Algorithm. Pattern Recognition Letters 18 (1997) 1415-1419
Article Google Scholar
Mori, M., Suzuki, A., Shio, A., Ohtsuka, S.: Generating New Samples from Handwritten Numerals based on Point Correspondence. In: Proc. 7th Int. Work-shop on Frontiers in Handwriting Recognition, Amsterdam, The Netherlands (2000) 281-290
Google Scholar
. Kaufmann, G., Bunke, H., Ha, T.: Recognition of Cursively Handwritten Words Using a Combined Normalization/Perturbation Approach. In Downton, A., Impedovo, S., eds.: Progress in Handwriting Recognition. World Scientific (1997) 21-28
Google Scholar
Setlur, S., Govindaraju, V.: Generating Manifold Samples from a Handwritten Word. Pattern Recognition Letters 15 (1994) 901-905
Article Google Scholar
Velek, O., Liu, C.L., Nakagawa, M.: Generating Realistic Kanji Character Im-ages from On-line Patterns. In: Proc. 6th Int. Conf. on Document Analysis and Recognition, Seattle, WA, USA (2001) 556-560
Chapter Google Scholar
Govindan, V., Shivaprasad, A.: Artificial Database for Character Recognition Research. Pattern Recognition Letters 12(10) (1991) 645-648
Article Google Scholar
Tung, C., Lee, H.: Performance Analysis of an OCR System via a Handwritten Character Generator. Pattern Recognition 27(2) (1994) 221-232
Article Google Scholar
. Simard, P., Cun, Y., Denker, J.: Efficient Pattern Recognition Using a New Transformation Distance. In et al., S.H., ed.: Advances in Neural Information Processing Systems 5. Morgan Kaufmann (1993) 50-58
Google Scholar
Guyon, I.: Handwriting Synthesis from Handwritten Glyphs. In: Proc. 5th Int. Workshop Frontiers in Handwriting Recognition, Essex, United Kingdom (1996) 309-312
Google Scholar
Choi, H., Cho, S., Kim, J.: Generation of Handwritten Characters with Bayesian Network based On-line Handwriting Recognizers. In: Proc. 7th Int. Conf. on Document Analysis and Recognition, Edinburgh, Scotland (2003) 995-1001
Chapter Google Scholar
Choi, H., Cho, S., Kim, J.: Writer-Dependent Online Handwriting Generation with Bayesian Networks. In: Proc. 9th Int. Workshop on Frontiers in Handwrit-ing Recognition, Kokubunji, Tokyo, Japan (2004) 130-135
Chapter Google Scholar
Wang, J., Wu, C., Xu, Y.Q., Shum, H.Y., Ji, L.: Learning based Cursive Hand-writing Synthesis. In: Proc. 8th Int. Workshop on Frontiers in Handwriting Recognition, Niagara-on-the-Lake, Ontario, Canada (2002) 157-162
Chapter Google Scholar
Lee, D.H., Cho, H.G.: A New Synthesizing Method for Handwriting Korean Scripts. Int. Journal of Pattern Recognition and Artificial Intelligence 12(1) (1998) 46-61
Google Scholar
Plamondon, R., Guerfali, W.: The Generation of Handwriting with Delta-lognormal Synergies. Biological Cybernetics 78 (1998) 119-132
Article MATH Google Scholar
Baird, H., Coates, A., Fateman, R.: PessimalPrint: a Reverse Turing Test. Int. Journal on Document Analysis and Recognition 5(2-3) (2003) 158-163
Article Google Scholar
. Ahn, L., Blum, M., Hopper, N., Langford, J.: The CAPTCHA Web Page. http://www.captcha.net (2000)
. Ahn, L., Blum, M., Hopper, N., Langford, J.: CAPTCHA: Telling Humans and Computers Apart. In Biham, E., ed.: Advances in Cryptology. Volume 2656 of Lecture Notes in Computer Science. Springer (2003) 294-311
Google Scholar
Ahn, L., Blum, M., Langford, J.: Telling Humans and Computers Apart Au- tomatically - How Lazy Cryptographers Do AI. Communications of the ACM 47(2)(2004) 57-60
Google Scholar
Baird, H., Luk, M.: Protecting Websites with Reading-based CAPTCHAs. In: Proc. 2nd Int. Web Document Analysis Workshop, Edinburgh, Scotland (2003) 53-56
Google Scholar
Baird, H., Riopka, T.: ScatterType: a Reading CAPTCHA Resistant to Segmen-tation Attack. In: Proc. 12th SPIE/IS&T Document Recognition and Retrieval Conference. Volume 5676., San Jose, California, USA (2005)
Google Scholar
Chew, M., Baird, H.: BaffleText: a Human Interactive Proof. In: Proc. 10th SPIE/IS&T Document Recognition and Retrieval Conference. Volume 5010., Santa Clara, California, USA (2003) 305-316
Google Scholar
. Lillibridge, M., Abadi, M., Bharat, K., Broder, A.: Method for Selectively Re-stricting Access to Computer Systems. U.S. Patent No. 6,195,698 (2000)
Google Scholar
Simard, P., Szeliski, R., Couvreur, J., Calinov, I.: Using Character Recognition and Segmentation to Tell Computer from Humans. In: Proc. 7th Int. Conf. on Document Analysis and Recognition, Edinburgh, Scotland (2003) 418-423
Chapter Google Scholar
Rusu, A., Govindaraju, V.: Handwritten CAPTCHA: Using the Difference in the Abilities of Humans and Machines in Reading Handwritten Words. In: 9th Int. Workshop on Frontiers in Handwriting Recognition, Kokubunji, Tokyo, Japan (2004) 226-231
Chapter Google Scholar
Manzanera, A., Bernard, T.: Improved Low Complexity Fully Parallel Thinning Algorithm. In: Proc. 10th Int. Conf. on Image Analysis and Processing, Venice, Italy (1999) 215-220
Google Scholar
. Soille, P.: Morphological Image Analysis. Springer (1999)
Google Scholar
Rabiner, L.: A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition. Proceedings of the IEEE 77(2) (1989) 257-286
Article Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Computer Science and Applied Mathematics (IAM), University of Bern, Neubrückstrasse 10, CH-3012, Bern, Switzerland
Tamás Varga & Horst Bunke

Authors

Tamás Varga
View author publications
You can also search for this author in PubMed Google Scholar
Horst Bunke
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dipartimento di Sistemi e Informatica, University of Florence, Via S. Marta, 3, 50139, Firenze, Italy
Simone Marinai
Hitachi Central Research Laboratory, 1-280, Higashi-Koigakubo, Kokubunji-shi, Tokyo, 185-8601, Japan
Hiromichi Fujisawa

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Varga, T., Bunke, H. (2008). Perturbation Models for Generating Synthetic Training Data in Handwriting Recognition. In: Marinai, S., Fujisawa, H. (eds) Machine Learning in Document Analysis and Recognition. Studies in Computational Intelligence, vol 90. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-76280-5_13

Download citation

DOI: https://doi.org/10.1007/978-3-540-76280-5_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-76279-9
Online ISBN: 978-3-540-76280-5
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Perturbation Models for Generating Synthetic Training Data in Handwriting Recognition

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

A sigma-lognormal model-based approach to generating large synthetic online handwriting sample databases

Maximizing Data Efficiency of HTR Models by Synthetic Text

Improving Handwriting Recognition for Historical Documents Using Synthetic Text Lines

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Perturbation Models for Generating Synthetic Training Data in Handwriting Recognition

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

A sigma-lognormal model-based approach to generating large synthetic online handwriting sample databases

Maximizing Data Efficiency of HTR Models by Synthetic Text

Improving Handwriting Recognition for Historical Documents Using Synthetic Text Lines

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation