Abstract
State-of-the-art OCR/ICR algorithms and software are the result of large-scale experiments on the accuracy of OCR systems and proper selection of the size and distribution of training sets. The key factor in improving OCR technology is the degradation models. While it is a leading-edge tool for processing conventional printed materials, the degradation model now faces additional challenges as a result of the appearance in recent years of new imaging media, new definitions of text information, and the need to process low quality document images. In addition to discussing these challenges in this paper, we present well-developed degradation models and suggest some directions for further study. Particular attention is paid to restoration and enhancement of degraded single-sided or multi-sided document images which suffer from bleed-through or shadow-through.
Chapter PDF
Similar content being viewed by others
Keywords
- Independent Component Analysis
- Interference Pattern
- Source Image
- Independent Component Analysis
- Document Image
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Rice, S., Kanai, J., Nartker, T.: A report on the accuracy of ocr devices. Technical Report TR-92-02, Univ. Nevada Las Vegas, Las Vegas, Nevada (1992)
Rice, S., Jenkins, F., Nartker, T.: The fifth test of ocr accuracy. Technical Report TR-96-01, ISRI, Univ. Nevada Las Vegas, Las Vegas, Nevada (April 1996)
Baird, H.: The State of the Art of Document Image Degradation Modelling. In: Digital Document Processing: Major Directions and Recent Advances, pp. 261–279. Springer, Heidelberg (2007)
Baird, H.: Document image defect models. In: Proc. IAPR Workshop Synthetic and Structural Pattern Recognition. Murray Hill, NJ, June 13–15 (1990)
Baird, H.: The state of the art of document image degradation modeling. In: Proc. of 4 th IAPR International Workshop on Document Analysis Systems, Rio de Janeiro, Brazil, pp. 1–16 (2000)
Hale, C., Barney-Smith, E.: Human image preference and document degradation models. In: Barney-Smith, E. (ed.) Ninth International Conference on Document Analysis and Recognition, 2007. ICDAR 2007, vol. 1, pp. 257–261 (2007)
Kanungo, T., Haralick, R.M., Phillips, I.: Nonlinear local and global document degradation models. Int. Journal of Imaging Systems and Technology 5, 220–230 (1994)
Zi, G., Doermann, D.: Document image ground truth generation from electronic text. In: Doermann, D. (ed.) ICPR 2004. Proceedings of the 17th International Conference on Pattern Recognition, 2004, vol. 2, pp. 663–666 (2004)
Ho, T.K., Baird, H.: Large-scale simulation studies in image pattern recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 19(10), 1067–1079 (1997)
Kanungo, T., Zheng, Q.: Estimating degradation model parameters using neighborhood pattern distributions: an optimization approach. Transactions on Pattern Analysis and Machine Intelligence 26(4), 520–524 (2004)
Barney-Smith, E.H.: Estimating scanning characteristics from corners in bilevel images. In: Proceedings of SPIE. Document Recognition and Retrieval VIII, San Jose, CA, January 21-26, vol. 4307, pp. 176–183 (2001)
Yam, H.S., Barney Smith, E.: Estimating degradation model parameters from character images. In: Barney Smith, E. (ed.) Proceedings. Seventh International Conference on Document Analysis and Recognition, 2003, Edinburgh, Scotland, August 3-6, vol. 2, pp. 710–714 (2003)
Kanungo, T., Haralick, R., Baird, H., Stuezle, W., Madigan, D.: A statistical, nonparametric methodology for document degradation model validation. Transactions on Pattern Analysis and Machine Intelligence 22(11), 1209–1223 (2000)
Kanungo, T., Haralick, R., Baird, H., Stuetzle, W., Madigan, D.: Document degradation models: Parameter estimation and model validation. In: Proc. of Int. Workshop on Machine Vision Applications, Kawasaki, Japan, December 1994, pp. 552–557 (1994)
Lesk, M.: Substituting images for books: The economics for libraries. In: Symposium Document Analysis and Information Retieval, pp. 1–6 (1996)
Dubois, E., Dano, P.: Joint compression and restoration of documents with bleed-through. In: Proc. IS&T Archiving 2005, Washington DC, USA, April 2005, pp. 170–174 (2005)
Hyvarinen, A., Oja, E.: Independent component analysis: algorithms and applications. Neural Networks 13(4-5), 411–430 (2000)
Oja, E., Yuan, Z.: The fastica algorithm revisited: Convergence analysis. IEEE Transactions on Neural Networks 17(6), 1370–1381 (2006)
Cichocki, A., Amari, S., Siwek, K., Tanaka, T., Phan, A.H., Zdunek, R.: Icalab matlab toolbox ver. 3 for signal processing (2007)
Tan, C.L., Cao, R., Shen, P., Wang, Q., Chee, J., Chang, J.: Removal of interfering strokes in double-sided document images. In: Cao, R. (ed.) IEEE Workshop on Applications of Computer Vision 2000, pp. 16–21 (2000)
Wang, X., Sun, J.: The researching about water and ink motion model based on soil-water dynamics in simulating for the chinese painting. In: Sun, J. (ed.) Fourth International Conference on Image and Graphics, 2007. ICIG 2007, pp. 880–885 (2007)
Chen, L., Zhu, J., Young, M., Susfalk, R.: Modeling polyacrylamide transport in water delivery canals. In: ASA-CSSA-SSSA International Annual Meetings, Indianapolis, IN, November 12-16, pp. 294–6 (2006)
Roth, K.: Scaling of water flow through porous media and soils. European Journal of Soil Science 59(1), 125–130 (2008)
Vaziri, H.H., Xiao, Y., Islam, R., Nouri, A.: Numerical modeling of seepage-induced sand production in oil and gas reservoirs. Journal of Petroleum Science and Engineering 36(1), 71–86 (2002)
Huang, S.W., Way, D.L., Shih, Z.C.: Physical-based model of ink diffusion in chinese ink paintings. Journal of WSCG 10(3), 520–527 (2003)
Yongxin, S., Jizhou, S., Haijiang, Z.: Graphical simulation algorithm for chinese ink wash drawing by particle system (chinese). Journal of Computer-Aided Design & Computer Graphics 15(6), 667–672 (2003)
Zhang, Q., Sato, Y., Takahashi, J.Y., Muraoka, K., Chiba, N.: Simple cellular automaton-based simulation of ink behaviour and its application to suibokuga-like 3d rendering of trees. The Journal of Visualization and Computer Animation 10(1), 27–37 (1999)
Xiujin, W., Jingshan, J., Jizhou, S.: Graphical simulator for chinese ink-wash drawing. Transactions Of Tianjin University 8(1), 1–7 (2002)
Mei-jun, S., Ji-zhou, S., Bin, Y.: Physical modeling of ”xuan” paper in the simulation of chinese ink-wash drawing. In: Ji-zhou, S. (ed.) International Conference on Computer Graphics, Imaging and Vision: New Trends, 2005, pp. 317–322 (2005)
Yu, Y., Lee, D., Lee, Y., Cho, H.: Interactive rendering technique for realistic oriental painting. Journal of WSCG 11(1), 538–545 (2003)
Zi, G.: Groundtruth generation and document image degradation. Technical Report LAMP-TR-121/CAR-TR-1008/CS-TR-4699/UMIACS-TR-2005-08, University of Maryland, College Park (2005)
Cheriet, M., Farrahi Moghaddam, R.: Degradation modeling and enhancement of low quality documents. In: WOSPA 2008, Sharjah, UAE (to appear, 2008)
Lee, J.H., Allebach, J.: Inkjet printer model-based halftoning. IEEE Transactions on Image Processing 14(5), 674–689 (2005)
Saund, E., Fleet, D., Mahoney, J., Lamer, D.: Rough and degraded document interpretation by perceptual organization. In: Doermann, D. (ed.) Proceedings 5th Symposium on Document Image Understanding Technology (SDIUT), UMD (2003)
Sharma, G.: Show-through cancellation in scans of duplex printed documents. IEEE Transactions on Image Processing 10(5), 736–754 (2001)
Knox, K.T., Rochester, N.: Show-through correction for two-sided documents (July 1997)
Tan, C.L., Cao, R., Shen, P.: Restoration of archival documents using a wavelet technique. Transactions on Pattern Analysis and Machine Intelligence 24(10), 1399–1404 (2002)
Leedham, G., Varma, S., Patankar, A., Govindaraju, V.: Separating text and background in degraded document images - a comparison of global thresholding techniques for multi-stage thresholding. In: Proc. Eighth International Workshop on Frontiers in Handwriting Recognition, August 6-8, pp. 244–249 (2002)
Nishida, H., Suzuki, T.: Correcting show-through effects on document images by multiscale analysis. In: Suzuki, T. (ed.) Proceedings 16th International Conference on Pattern Recognition, 2002, vol. 3, pp. 65–68 (2002)
Gerace, I., Cricco, F., Tonazzini, A.: An extended maximum likelihood approach for the robust blind separation of autocorrelated images from noisy mixtures. Independent Component Analysis and Blind Signal Separation, 954–961 (2004)
Tonazzini, A., Salerno, E., Bedini, L.: Fast correction of bleed-through distortion in grayscale documents by a blind source separation technique. International Journal on Document Analysis and Recognition 10(1), 17–25 (2007)
Salerno, E., Tonazzini, A., Bedini, L.: Digital image analysis to enhance underwritten text in the archimedes palimpsest. International Journal on Document Analysis and Recognition 9(2), 79–87 (2007)
Zhang, X., Lu, J., Yahagi, T.: Blind separation methods for image show-through problem. In: Lu, J. (ed.) 6th International Special Topic Conference on Information Technology Applications in Biomedicine, 2007. ITAB 2007, November 8-11, pp. 255–258 (2007)
Dubois, E., Pathak, A.: Reduction of bleed-through in scanned manuscript documents. In: Proc. IS&T Image Processing, Image Quality, Image Capture Systems Conference (PICS 2001), Montreal, Canada, April 2001, pp. 177–180 (2001)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cheriet, M., Moghaddam, R.F. (2008). DIAR: Advances in Degradation Modeling and Processing. In: Campilho, A., Kamel, M. (eds) Image Analysis and Recognition. ICIAR 2008. Lecture Notes in Computer Science, vol 5112. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69812-8_1
Download citation
DOI: https://doi.org/10.1007/978-3-540-69812-8_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69811-1
Online ISBN: 978-3-540-69812-8
eBook Packages: Computer ScienceComputer Science (R0)