skip to main content
10.1145/2683483.2683511acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicvgipConference Proceedingsconference-collections
research-article

Removing Structural Noise in Handwriting Images using Deep Learning

Authors Info & Claims
Published:14 December 2014Publication History

ABSTRACT

Since handwriting recognition is very sensitive to structural noise, like superimposed objects such as straight lines and other marks, it is necessary to remove noise in a preprocessing stage before recognition. Although numerous denoising approaches have been proposed, it remains a challenge. The difficulties are due to non-locality of structural noise and hard discernment between spurious and the meaningful regions. We propose a supervised approach using deep learning to remove structural noise. Specifically, we generalize the deep autoencoder into the deep denoising autoencoder (DDAE), which consists in training a neural network with noisy and clean pairs to minimize cross-entropy error. Inspired by recurrent neural networks, we introduce feedback loop from the output to enhance the "repaired" image well in the reconstruction stage in our framework. We test the DDAE model on three handwritten image data sets, and show advantages over Wiener filter, robust Boltzmann machines and deep autoencoder.

References

  1. M. Aharon, M. Elad, and A. Bruckstein. K-svd: An algorithm for designing of overcomplete dictionaries for sparse representation. IEEE Trans. On Signal Processing, 54(11):4311–5322, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. J. Besag. On the statistical analysis of dirty pictures. Journal of the Royal Statistical Society. Series B, 48(3):259–302, 1986.Google ScholarGoogle Scholar
  3. A. Buades, B. Coll, and J.-M. Morel. A non-local algorithm for image denoising. In CVPR, pages 60–65, Washington, DC, USA, 2005. IEEE Computer Society. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. G. Chen and S. N. Srihari. A noisy-or discriminative restricted boltzmann machine for recognizing handwriting style development. In ICFHR, 2014.Google ScholarGoogle ScholarCross RefCross Ref
  5. G. Chen, C. Xiong, and J. J. Corso. Dictionary transfer for image denoising via domain adaptation. In ICIP, 2012.Google ScholarGoogle ScholarCross RefCross Ref
  6. M. Elad and M. Aharon. Image denoising via learned dictionaries and sparse representation. In CVPR, pages 17–22, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. J. L. Elman. Finding structure in time. COGNITIVE SCIENCE, 14(2):179–211, 1990.Google ScholarGoogle ScholarCross RefCross Ref
  8. K. Engan, S. O. Aase, and J. H. Husoy. Frame based signal compression using method of optimal directions (mod). 1999.Google ScholarGoogle ScholarCross RefCross Ref
  9. P. Gallinari, Y. LeCun, S. Thiria, and F. Fogelman-Soulie. Memoires associatives distribuees. In Proceedings of COGNITIVA 87, 1987.Google ScholarGoogle Scholar
  10. R. Haddad and A. Akansu. A class of fast gaussian binomial filters for speech and image processing. Trans. Sig. Proc., 39(3):723–727, Mar. 1991. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. G. E. Hinton, S. Osindero, and Y.-W. Teh. A fast learning algorithm for deep belief nets. Neural Comput., 18(7):1527–1554, July 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. G. E. Hinton and R. R. Salakhutdinov. Reducing the dimensionality of data with neural networks. Science, 313(5786):504–507, July 2006.Google ScholarGoogle ScholarCross RefCross Ref
  13. L. Holmström and P. Koistinen. Using additive noise in back-propagation training. Research Reports A3, Rolf Nevanlinna Institute, 1990.Google ScholarGoogle Scholar
  14. J. J. Hopfield. Neurocomputing: Foundations of research. chapter Neural Networks and Physical Systems with Emergent Collective Computational Abilities, pages 457–464. MIT Press, Cambridge, MA, USA, 1988. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. V. Jain and H. S. Seung. Natural image denoising with convolutional networks. In D. Koller, D. Schuurmans, Y. Bengio, and L. Bottou, editors, NIPS, pages 769–776. Curran Associates, Inc., 2008.Google ScholarGoogle Scholar
  16. P. Perona and J. Malik. Scale-space and edge detection using anisotropic diffusion. IEEE PAMI, 12(7):629–639, July 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. J. Portilla, V. Strela, M. J. Wainwright, and E. P. Simoncelli. Image denoising using scale mixtures of gaussians in the wavelet domain. IEEE Trans. Image Process, 12:1338–1351, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. M. Puri, S. N. Srihari, and Y. Tang. Bayesian network structure learning and inference methods for handwriting. In ICDAR, pages 1320–1324, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. L. I. Rudin and S. Osher. Total variation based image restoration with free local constraints. In ICIP, pages 31–35. IEEE, 1994.Google ScholarGoogle Scholar
  20. L. Sendur and I. W. Selesnick. Bivariate Shrinkage With Local Variance Estimation. IEEE SIGNAL PROCESSING LETTERS, 9(12):438–441, Dec. 2002.Google ScholarGoogle ScholarCross RefCross Ref
  21. H. S. Seung. Learning continuous attractors in recurrent networks. In NIPS, pages 654–660. MIT Press, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. C. E. Shannon. A mathematical theory of communication. SIGMOBILE Mob. Comput. Commun. Rev., 5(1):3–55, Jan. 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. H. Takeda, S. Farsiu, and P. Milanfar. Kernel regression for image processing and reconstruction. IEEE TRANSACTIONS ON IMAGE PROCESSING, 16(2):349–366, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Y. Tang and C. Eliasmith. Deep networks for robust visual recognition. In J. F§rnkranz and T. Joachims, editors, ICML, pages 1055–1062. Omnipress, 2010.Google ScholarGoogle Scholar
  25. Y. Tang, R. Salakhutdinov, and G. Hinton. Robust boltzmann machines for recognition and denoising. In CVPR, pages 2264–2271, Washington, DC, USA, 2012. IEEE Computer Society. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. C. Tomasi and R. Manduchi. Bilateral filtering for gray and color images. In ICCV, pages 839–, Washington, DC, USA, 1998. IEEE Computer Society. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. P. Vincent, H. Larochelle, I. Lajoie, Y. Bengio, and P.-A. Manzagol. Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion. JMLR, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. N. Wiener. Extrapolation, Interpolation, and Smoothing of Stationary Time Series. The MIT Press, 1964. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. J. Xie, L. Xu, and E. Chen. Image denoising and inpainting with deep neural networks. In NIPS, 2012.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Removing Structural Noise in Handwriting Images using Deep Learning

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Other conferences
          ICVGIP '14: Proceedings of the 2014 Indian Conference on Computer Vision Graphics and Image Processing
          December 2014
          692 pages
          ISBN:9781450330619
          DOI:10.1145/2683483

          Copyright © 2014 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 14 December 2014

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Research
          • Refereed limited

          Acceptance Rates

          Overall Acceptance Rate95of286submissions,33%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader