Analysis and recognition of highly degraded printed characters

Tonazzini, Anna; Vezzosi, Stefano; Bedini, Luigi

doi:10.1007/s10032-003-0115-y

Analysis and recognition of highly degraded printed characters

Published: April 2003

Volume 6, pages 236–247, (2003)
Cite this article

Document Analysis and Recognition Aims and scope Submit manuscript

Anna Tonazzini¹,
Stefano Vezzosi¹ &
Luigi Bedini¹

101 Accesses
18 Citations
Explore all metrics

Abstract.

This paper proposes an integrated system for the processing and analysis of highly degraded printed documents for the purpose of recognizing text characters. As a case study, ancient printed texts are considered. The system is comprised of various blocks operating sequentially. Starting with a single page of the document, the background noise is reduced by wavelet-based decomposition and filtering, the text lines are detected, extracted, and segmented by a simple and fast adaptive thresholding into blobs corresponding to characters, and the various blobs are analyzed by a feedforward multilayer neural network trained with a back-propagation algorithm. For each character, the probability associated with the recognition is then used as a discriminating parameter that determines the automatic activation of a feedback process, leading the system back to a block for refining segmentation. This block acts only on the small portions of the text where the recognition cannot be relied on and makes use of blind deconvolution and MRF-based segmentation techniques whose high complexity is greatly reduced when applied to a few subimages of small size. The experimental results highlight that the proposed system performs a very precise segmentation of the characters and then a highly effective recognition of even strongly degraded texts.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Aarts E, Korst J (1989) Simulated annealing and Boltzmann machines. Wiley, New York
Aas K, Eikvil L (1996) Text page recognition using grey-level features and hidden Markov models. Patt Recog 29:977-985
Google Scholar
Aloimonos Y, Shulman D (1989) Integration of visual modules: an extension of the Marr paradigm. Academic, Boston
Ayers GR, Dainty JG (1988) Iterative blind deconvolution method and its applications. Opt Lett 13:547-549
Google Scholar
Avi-Itzhak HI, Diep TA, Garland H (1995) High accuracy optical character recognition using neural networks with centroid dithering. IEEE Trans PAMI 17:218-224
Google Scholar
Bedini L, Gerace I, Salerno E, Tonazzini A (1996) Models and algorithms for edge-preserving image reconstruction. In: Hawkes PW (ed) Advances in imaging and electron physics, 97:86-189, Academic, San Diego
Bedini L, Cannata A, Ferraro M, Salerno E, Tonazzini A (1998) Visual module integration for optical flow estimation. In: Theodoridis S, Pitas I, Stouraitis A, Kalouptsidis N (eds) Signal processing IX: Theories and applications. Proceedings of EUSIPCO-98, Rhodes, Greece, 8-11 September 1998, 3:1577-1580
Cohen A (1995) Wavelets and multiscale signal processing. Chapman and Hall, New York
Cohen A, Daubechies I, Jawerth B, Vial P (1993) Multiresolution analysis, wavelets, and fast algorithms on an interval. Comptes Rendus Acad Sci Paris A 316:417-421
Google Scholar
Demoment G (1989) Image reconstruction and restoration: overview of common estimation structures and problems. IEEE Trans Acoust Speech Signal Process 37(12):2024-2036
Google Scholar
Denker JS, Gardner WR, Graf HP, Henderson D, Howard RE, Hubbard W, Jackel LD, Baird HS, Guyon I (1989) Neural network recognizer for handwritten zip code digits. In: Touretzky D (ed) Advances in neural information processing systems 1 (NIPS ‘88), pp 323-331, Morgan Kaufmann, San Mateo, CA
Donoho DL (1995) Denoising by soft-tresholding. IEEE Trans Inf Technol 41:613-627
Google Scholar
Fukushima K, Miyake S (1982) Neocognitron: a new algorithm for pattern recognition tolerant to deformations and shifts in position. Patt Recog 15:455-469
Google Scholar
Gamble E, Poggio T (1987) Visual integration and detection of discontinuities: the key role of intensity edges. MIT-AI Memo No. 970
Geman S, Geman D (1984) Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images. IEEE Trans PAMI 6:721-740
Google Scholar
Kanungo T, Haralick RM, Baird HS, Stuezle W, Madigan D (2000) A statistical, nonparametric methodology for document degradation model validation. IEEE Trans PAMI 22(11):1209-1223
Google Scholar
Katsaggelos AK, Lay KT (1991) Maximum likelihood blur identification and image restoration using the EM algorithm. IEEE Trans Signal Process 39:729-732
Google Scholar
Kopec GE, Chou PA (1994) Document image decoding using Markov source models. IEEE Trans PAMI 16(6):602-617
Google Scholar
Kundur D, Hatzinakos D (1996) Blind image deconvolution. IEEE Signal Process Mag 13(3):43-64
Google Scholar
Lagendijk RL, Tekalp AM, Biemond J (1990) Maximum likelihood image and blur identification: a unifying approach. Opt Eng 29:422-435
Google Scholar
Li SZ (1995) Markov random field modeling in computer vision. Springer, Berlin Heidelberg New York
Liang S, Shridhar M, Ahmadi M (1994) Segmentation of touching characters in printed document recognition. Patt Recog 27:825-840
Google Scholar
Nguyen D, Widrow B (1990) Improving the learning speed of two layer neural networks by choosing initial values of the adaptive weights. In: Proceedings of the 1st international joint conference on neural networks, San Diego, June 1990, 3:21-26
Poggio T, Torre V, Koch C (1985) Computational vision and regularization theory. Nature 317:314-319
Google Scholar
Rocha J, Pavlidis T (1994) A shape analysis model with applications to a character recognition system. IEEE Trans PAMI 16(4):393-404
Google Scholar
Rocha J, Pavlidis T (1995) Character recognition without segmentation. IEEE Trans PAMI 17(9):903-909
Google Scholar
Rumelhart DE, Hinton GE, Williams RJ (1986) Learning internal representations by error propagation. In: Rumelhart DE, McClelland JL (eds) Parallel distributed processing, vol 1. MIT Press, Cambridge, MA, pp 318-362
Sezan MI, Tekalp AM (1990) Survey of recent developments in digital image restoration. Opt Eng 29(5):393-404
Google Scholar
Strang G, Nguyen T (1996) Wavelets and filter banks. Wellesley-Cambridge Press, Wellesley, MA
Taxt T, Flynn PJ, Jain AK (1989) Segmentation of document images. IEEE Trans PAMI 11:1322-1329
Google Scholar
Tonazzini A, Bedini L (1999) Character segmentation in highly blurred ancient printed documents. In: Proceedings of the 10th IAPR international conference on image analysis and processing, Venice, Italy, 27-29 September 1999, pp 836-841
Tonazzini A, Bedini L (2000) Segmentation of printed characters in ancient degraded documents through blind image restoration and labeling. Patt Recog Image Anal 10:273-287
Google Scholar
Trier OD, Taxt T (1995) Evaluation of binarization methods for document images. IEEE Trans PAMI 17:312-315
Google Scholar
Vogl TP, Mangis JK, Rigler AK, Zink WT, Alkon DL (1988) Accelerating the convergence of the back propagation method. Biol Cybern 59:256-264
Google Scholar
Yanowitz SD, Bruckstein AM (1989) A new method for image segmentation. Comput Vision Graph Image Process 46:82-95
Google Scholar
You Y, Kaveh M (1996) A regularization approach to joint blur identification and image restoration. IEEE Trans Image Process 5:416-428
Google Scholar

Download references

Author information

Authors and Affiliations

Istituto di Scienza e Tecnologie dell’Informazione, Area della Ricerca CNR di Pisa, Via G. Moruzzi, 1, 56124, Pisa, Italy
Anna Tonazzini, Stefano Vezzosi & Luigi Bedini

Authors

Anna Tonazzini
View author publications
You can also search for this author in PubMed Google Scholar
Stefano Vezzosi
View author publications
You can also search for this author in PubMed Google Scholar
Luigi Bedini
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Anna Tonazzini.

Additional information

Received: 4 January 2003, Accepted: 22 June 2003, Published online: 17 November 2003

This work has been supported by the Italian CNR Special Project “Safeguard of Cultural Heritage”

Rights and permissions

Reprints and permissions

About this article

Cite this article

Tonazzini, A., Vezzosi, S. & Bedini, L. Analysis and recognition of highly degraded printed characters. IJDAR 6, 236–247 (2003). https://doi.org/10.1007/s10032-003-0115-y

Download citation

Issue Date: April 2003
DOI: https://doi.org/10.1007/s10032-003-0115-y

Keywords:

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Analysis and recognition of highly degraded printed characters

Abstract.

Access this article

Similar content being viewed by others

Text Extraction and Restoration of Old Handwritten Documents

Document segmentation using textural features summarization and feedforward neural network

Document Image Edge Detection Based on a Local Hysteresis Thresholding and Automatic Setting Using PSO

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords:

Navigation

Analysis and recognition of highly degraded printed characters

Abstract.

Access this article

Similar content being viewed by others

Text Extraction and Restoration of Old Handwritten Documents

Document segmentation using textural features summarization and feedforward neural network

Document Image Edge Detection Based on a Local Hysteresis Thresholding and Automatic Setting Using PSO

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords:

Search

Navigation