Skip to main content

Neural-Based Classification of Blocks from Documents

  • Conference paper
  • First Online:
Artificial Neural Networks — ICANN 2002 (ICANN 2002)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2415))

Included in the following conference series:

  • 83 Accesses

Abstract

The process of obtaining the logical structure of a given document from its geometric structure is known as document understanding. In this process, it is important to classify the distinct blocks or homogeneous regions that the document contains as reliably as possible. In this work, we propose a neural-based method to classify among manuscript text, typed text, drawings and photographic images. The excellent performance of the approach is demonstrated by the experiments performed.

Work partially supported by the Spanish CICYT under contract TIC2000-1153.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. R. Cattoni, T. Coianiz, S. Messelodi, and Modena. Geometric layout analysis techniques for document image understanding: a review. Technical Report 9703-09, IRST-Instituto Trentino di Cultura, 1998.

    Google Scholar 

  2. Y. Y. Tang, M. Cheriet, J. Liu, J. N. Said, and C. Y. Suen. Handbook of Pattern Recognition and Computer Vision, chapter Document analysis and recognition by computers. World Scientific Pub. Co., 1999.

    Google Scholar 

  3. S. W. Lee and D. S. Ryu. Parameter-free geometric document layout analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence, 23(11): 1240–1256, 2001.

    Article  Google Scholar 

  4. D. Dori, D. Doermann, C. Shin, R. Haralick, I. Phillips, M. Buchman, and D. Ross. Handbook on Optical Character Recognition and Document Image Analysis, chapter The representation of document structure: a generic object-process analysis. World Scientific Pub. Co., 1996.

    Google Scholar 

  5. K. Summers. Near-wordless document structure classification. In Proceedings of the International Conference on Document Analysis and Recognition, pages 462–465, 1995.

    Google Scholar 

  6. J. Sauvola and M. Pietikäinen. Adaptative document image binarization. Pattern Recognition, 33:225–236, 2000.

    Article  Google Scholar 

  7. Y. Yang and H. Yan. An adaptative logical method for binarization of degraded document images. Pattern Recognition, 33:787–807, 2000.

    Article  Google Scholar 

  8. H. Noda, M. N. Shirazi, and E. Kawaguchi. MRF-based texture segmentation using wavelet decomposed images. Pattern Recognition, 35:771–782, 2002.

    Article  MATH  Google Scholar 

  9. T. Ojala and M. Pietikäinen. Unsupervised texture segmentation using feature distributions. Pattern Recognition, 33:447–486, 2000.

    Google Scholar 

  10. E. V. Kurmyshev and M. Cervantes. A quasi-statistical approach to digital binary image representation. Revista Mexicana de Física, 42(1):104–116, 1996.

    MathSciNet  Google Scholar 

  11. E.V. Kurmyshev and F. J. Cuevas. Reconocimiento de texturas binarias degradas por ruido aditivo usando la transformada de cúmulos coordinados. In Proceedings of the VI Taller Iberoamericano de Reconocimiento de Patrones (TIARP’01), pages 179–187, 2001.

    Google Scholar 

  12. D. E. Rumelhart, G. E. Hinton, and R. J. Williams. PDP: Computational models of cognition and perception, I, chapter Learning internal representations by error propagation, pages 319–362. MIT Press, 1986.

    Google Scholar 

  13. A. Zell et al. SNNS: Stuttgart Neural Network Simulator. User Manual, Version 4.2. Institute for Parallel and Distributed High Performance Systems, Univ. of Stuttgart, 1998.

    Google Scholar 

  14. R. O. Duda, P. E. Hart, and G. Stork. Pattern classification. John Wiley, 2001.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

López, D., Castro, M.J. (2002). Neural-Based Classification of Blocks from Documents. In: Dorronsoro, J.R. (eds) Artificial Neural Networks — ICANN 2002. ICANN 2002. Lecture Notes in Computer Science, vol 2415. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46084-5_88

Download citation

  • DOI: https://doi.org/10.1007/3-540-46084-5_88

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44074-1

  • Online ISBN: 978-3-540-46084-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics