Neural-Based Classification of Blocks from Documents

López, Damián; Castro, María José

doi:10.1007/3-540-46084-5_88

Damián López⁵ &
María José Castro⁵

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2415))

Included in the following conference series:

International Conference on Artificial Neural Networks

83 Accesses

Abstract

The process of obtaining the logical structure of a given document from its geometric structure is known as document understanding. In this process, it is important to classify the distinct blocks or homogeneous regions that the document contains as reliably as possible. In this work, we propose a neural-based method to classify among manuscript text, typed text, drawings and photographic images. The excellent performance of the approach is demonstrated by the experiments performed.

Work partially supported by the Spanish CICYT under contract TIC2000-1153.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

R. Cattoni, T. Coianiz, S. Messelodi, and Modena. Geometric layout analysis techniques for document image understanding: a review. Technical Report 9703-09, IRST-Instituto Trentino di Cultura, 1998.
Google Scholar
Y. Y. Tang, M. Cheriet, J. Liu, J. N. Said, and C. Y. Suen. Handbook of Pattern Recognition and Computer Vision, chapter Document analysis and recognition by computers. World Scientific Pub. Co., 1999.
Google Scholar
S. W. Lee and D. S. Ryu. Parameter-free geometric document layout analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence, 23(11): 1240–1256, 2001.
Article Google Scholar
D. Dori, D. Doermann, C. Shin, R. Haralick, I. Phillips, M. Buchman, and D. Ross. Handbook on Optical Character Recognition and Document Image Analysis, chapter The representation of document structure: a generic object-process analysis. World Scientific Pub. Co., 1996.
Google Scholar
K. Summers. Near-wordless document structure classification. In Proceedings of the International Conference on Document Analysis and Recognition, pages 462–465, 1995.
Google Scholar
J. Sauvola and M. Pietikäinen. Adaptative document image binarization. Pattern Recognition, 33:225–236, 2000.
Article Google Scholar
Y. Yang and H. Yan. An adaptative logical method for binarization of degraded document images. Pattern Recognition, 33:787–807, 2000.
Article Google Scholar
H. Noda, M. N. Shirazi, and E. Kawaguchi. MRF-based texture segmentation using wavelet decomposed images. Pattern Recognition, 35:771–782, 2002.
Article MATH Google Scholar
T. Ojala and M. Pietikäinen. Unsupervised texture segmentation using feature distributions. Pattern Recognition, 33:447–486, 2000.
Google Scholar
E. V. Kurmyshev and M. Cervantes. A quasi-statistical approach to digital binary image representation. Revista Mexicana de Física, 42(1):104–116, 1996.
MathSciNet Google Scholar
E.V. Kurmyshev and F. J. Cuevas. Reconocimiento de texturas binarias degradas por ruido aditivo usando la transformada de cúmulos coordinados. In Proceedings of the VI Taller Iberoamericano de Reconocimiento de Patrones (TIARP’01), pages 179–187, 2001.
Google Scholar
D. E. Rumelhart, G. E. Hinton, and R. J. Williams. PDP: Computational models of cognition and perception, I, chapter Learning internal representations by error propagation, pages 319–362. MIT Press, 1986.
Google Scholar
A. Zell et al. SNNS: Stuttgart Neural Network Simulator. User Manual, Version 4.2. Institute for Parallel and Distributed High Performance Systems, Univ. of Stuttgart, 1998.
Google Scholar
R. O. Duda, P. E. Hart, and G. Stork. Pattern classification. John Wiley, 2001.
Google Scholar

Download references

Author information

Authors and Affiliations

Departamento de Sistemas Informáticos y Computación, Universidad Politécnica de Valencia, Camino de Vera s/n, 46071, Valencia, Spain
Damián López & María José Castro

Authors

Damián López
View author publications
You can also search for this author in PubMed Google Scholar
María José Castro
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

ETS Informática, Universidad Autónoma de Madrid, 28049, Madrid, Spain
José R. Dorronsoro

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

López, D., Castro, M.J. (2002). Neural-Based Classification of Blocks from Documents. In: Dorronsoro, J.R. (eds) Artificial Neural Networks — ICANN 2002. ICANN 2002. Lecture Notes in Computer Science, vol 2415. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46084-5_88

Download citation

DOI: https://doi.org/10.1007/3-540-46084-5_88
Published: 21 August 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44074-1
Online ISBN: 978-3-540-46084-8
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics