Abstract
There has been intensive research carried out in the field of OCR (Optical Character Recognition). Lots of work has been done and articles have been published. Noise is one of the important factors which have to be handled at the stage of preprocessing before applying other steps of OCR systems. Noise is undesirable signal because it obscures the subject of the image. This paper presents the comparative study of the five noise removal approaches: Weiner, Median, Wavelet, Contourlet, and Curvelet for document images. The different approaches of noise removal were compared visually and by employing Peak Signal to Noise Ratio (PSNR), F-measure and NRM evaluation measures.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Gatos, B., Mantzaris, S.L., Perantonis, S.J., Tsigris, A.: Automatic page analysis for the creation of a digital library from newspaper archives. Int. J. Digit. Libr. 3, 77–84 (2000)
Peerawit, W., Kawtrakul, A.: Marginal noise removal from document images using edge density. In: Proceeding of 4th Information and Computer Engineering Postgraduate Workshop, Phuket, Thailand (January 2004)
Ye, X., Cheriet, M., Suen, C.Y.: A generic method of cleaning and enhancing handwritten data from business forms. Int. J. Doc. Anal. Recog. 4, 84–96 (2001)
Jain, A.K.: Fundamentals of Digital Image Processing. Prentice-Hall, Englewood Cliffs (1989)
Kavallieratou, E., Stamatatos, E.: Improving the quality of degraded document images. In: Proceedings of the Second International Conference on Document Image Analysis for Libraries, pp. 330–339. IEEE (2006)
Cao, H., Govindaraju, V.: Handwritten carbon form pre-processing based on markov random field. In: Proceeding of Computer Vision and Pattern Recognition, pp. 1–7. IEEE (2007)
Lins, R.D., Silva, G.F.P., Simske, S.J., Fan, J., Shaw, M., Sá, P., Thielo, M.: Image classification to improve printing quality of mixed type documents. In: Proceeding of International Conference on Document Analysis and Recognition, pp. 1106–1110. IEEE Press, Barcelona (2009)
Lins, R.D.: A Taxonomy for Noise in Images of Paper Documents - the Physical Noises. In: Kamel, M., Campilho, A. (eds.) ICIAR 2009. LNCS, vol. 5627, pp. 844–854. Springer, Heidelberg (2009)
Lins, R.D., Banerjee, S., Thielo, M.: Automatically detecting and classifying noises in document images. In: Proceeding of ACM Symposium on Applied Computing, vol. 3, pp. 33–39 (2010)
Fan, K.C., Wang, Y.K., Lay, T.R.: Marginal noise removal of document images. Pattern Recognition 35(11), 2593–2611 (2002)
Zheng, Y., Liu, C., Ding, X., Pan, S.: Form frame line detection with directional single-connected chain. In: Proceeding of Sixth International Conference on Document Analysis and Recognition, pp. 699–703 (2001)
Ali, M.: Background noise detection and cleaning in document images. In: Proceeding of 13th International Conference on Pattern Recognition, vol. 3, pp. 758–762 (1996)
Bernsen, J.: Dynamic thresholding of grey-level images. In: Proceeding of 8th International Conference on Pattern Recognition, pp. 1251–1255 (1986)
Niblack, W.: An Introduction to Digital Image Processing, pp. 115–116. Prentice Hall (1986)
Schilling, R.J.: Fundamentals of Robotics Analysis and Control. Prentice-Hall, Englewood Cliffs (1990)
O’Gorman, L.: Image and document processing techniques for the right pages electronic library system. In: Proceeding of 11th International Conference on Pattern Recognition, pp. 260–263 (1992)
Rudin, L., Osher, S., Fatemi, E.: Nonlinear total variation based noise removal algorithms. Physica D 60, 259–268 (1992)
Story, G.A., O’Gorman, L., Fox, D., Schaper, L.L., Jagadish, H.V.: The right pages image-based electronic library for alerting and browsing. Computer 25(9), 17–26 (1992)
Ali, M.B.J.: Background noise detection and cleaning in document images. In: Proceeding of International Conference on Pattern Recognition, Vienna, Austria, pp. 758–762 (1996)
Liang, J., Haralick, R.: Document image restoration using binary morphological filters. In: Proceeding of SPIE Document Recognition III, San Jose, CA, vol. 2660, pp. 274–285 (1996)
Buades, A., Coll, B., Morel, J.M.: A review of image denoising algorithms. SIAM- Multiscale Modeling and Simulation 4, 490–530 (2005)
Loce, R.P., Dougherty, E.R.: Enhancement and restoration of digital documents – Statistical Design of Nonlinear Algorithms. SPIE Optical Engineering Press (1997)
Jain, A.K., Yu, B.: Document representation and its application to page decomposition. IEEE Transaction on Pattern Analysis and Machine Intelligence 20(3), 294–308 (1998)
Chinnasarn, K., Rangsanseri, Y., Thitimajshima, P.: Removing salt-and-pepper noise in text/graphics images. In: Proceeding of IEEE Asia-Pacific Conference on Circuits and Systems, Chiangmai, pp. 459–462 (1998)
Cheriet, M.: Extraction of handwritten data from noisy gray-level images using a multi-scale approach. In: Proceeding of Vision Interface, Vancouver, BC, Canada, vol. 1, pp. 389–396 (1998)
Don, H.S.: A noise attributes thresholding method for document image binarization. International Journal on Document Image Analysis and Recognition 4(2), 131–138 (2000)
Nishiwaki, D., Hayashi, M., Sato, A.: Robust Frame Extraction and Removal for Processing form Documents. In: Blostein, D., Kwon, Y.-B. (eds.) GREC 2001. LNCS, vol. 2390, pp. 36–45. Springer, Heidelberg (2002)
Gonzalez, R.C., Woods, R.E.: Digital image processing (DIP/3e), 3rd edn. Pearson Education, Asia
Siyuan, C., Xiangpeng, C.: The Second-generation Wavelet Transform and its Application in denoising of Seismic Data. Applied Geophysics 2(2), 70–74 (2005)
Do, M.N., Vetterli, M.: The contourlet transform: An Efficient Directional Multiresolution Image Representation. IEEE Transactions on Image Processing 14, 2091–2106
Hostalkova, E., Prochazka, A.: Wavelet Signal and Image Denoising. Institute of Chemical Technology. Department of Computing and Control Engineering
Candès, E.J., Donoho, D.L.: Curvelets- A Surprisingly Adaptive Representation for Object with Edges, pp. 105–120. Vanderbilt University Press, Nashville (2000)
Starck, J.L., Candès, E.J., Donoho, D.L.: The curvelet transform for image Denoising. IEEE Transactions on Image Processing 11(6), 670–684 (2002)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer India Pvt. Ltd.
About this paper
Cite this paper
Singh, B., Mridula, Chand, V., Mittal, A., Ghosh, D. (2012). A Comparative Study of Different Approaches of Noise Removal for Document Images. In: Deep, K., Nagar, A., Pant, M., Bansal, J. (eds) Proceedings of the International Conference on Soft Computing for Problem Solving (SocProS 2011) December 20-22, 2011. Advances in Intelligent and Soft Computing, vol 130. Springer, India. https://doi.org/10.1007/978-81-322-0487-9_80
Download citation
DOI: https://doi.org/10.1007/978-81-322-0487-9_80
Published:
Publisher Name: Springer, India
Print ISBN: 978-81-322-0486-2
Online ISBN: 978-81-322-0487-9
eBook Packages: EngineeringEngineering (R0)