Abstract
The process of obtaining the logical structure of a given document from its geometric structure is known as document understanding. In this process, it is important to classify the distinct blocks or homogeneous regions that the document contains as reliably as possible. In this work, we propose a neural-based method to classify among manuscript text, typed text, drawings and photographic images. The excellent performance of the approach is demonstrated by the experiments performed.
Work partially supported by the Spanish CICYT under contract TIC2000-1153.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
R. Cattoni, T. Coianiz, S. Messelodi, and Modena. Geometric layout analysis techniques for document image understanding: a review. Technical Report 9703-09, IRST-Instituto Trentino di Cultura, 1998.
Y. Y. Tang, M. Cheriet, J. Liu, J. N. Said, and C. Y. Suen. Handbook of Pattern Recognition and Computer Vision, chapter Document analysis and recognition by computers. World Scientific Pub. Co., 1999.
S. W. Lee and D. S. Ryu. Parameter-free geometric document layout analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence, 23(11): 1240–1256, 2001.
D. Dori, D. Doermann, C. Shin, R. Haralick, I. Phillips, M. Buchman, and D. Ross. Handbook on Optical Character Recognition and Document Image Analysis, chapter The representation of document structure: a generic object-process analysis. World Scientific Pub. Co., 1996.
K. Summers. Near-wordless document structure classification. In Proceedings of the International Conference on Document Analysis and Recognition, pages 462–465, 1995.
J. Sauvola and M. Pietikäinen. Adaptative document image binarization. Pattern Recognition, 33:225–236, 2000.
Y. Yang and H. Yan. An adaptative logical method for binarization of degraded document images. Pattern Recognition, 33:787–807, 2000.
H. Noda, M. N. Shirazi, and E. Kawaguchi. MRF-based texture segmentation using wavelet decomposed images. Pattern Recognition, 35:771–782, 2002.
T. Ojala and M. Pietikäinen. Unsupervised texture segmentation using feature distributions. Pattern Recognition, 33:447–486, 2000.
E. V. Kurmyshev and M. Cervantes. A quasi-statistical approach to digital binary image representation. Revista Mexicana de Física, 42(1):104–116, 1996.
E.V. Kurmyshev and F. J. Cuevas. Reconocimiento de texturas binarias degradas por ruido aditivo usando la transformada de cúmulos coordinados. In Proceedings of the VI Taller Iberoamericano de Reconocimiento de Patrones (TIARP’01), pages 179–187, 2001.
D. E. Rumelhart, G. E. Hinton, and R. J. Williams. PDP: Computational models of cognition and perception, I, chapter Learning internal representations by error propagation, pages 319–362. MIT Press, 1986.
A. Zell et al. SNNS: Stuttgart Neural Network Simulator. User Manual, Version 4.2. Institute for Parallel and Distributed High Performance Systems, Univ. of Stuttgart, 1998.
R. O. Duda, P. E. Hart, and G. Stork. Pattern classification. John Wiley, 2001.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
López, D., Castro, M.J. (2002). Neural-Based Classification of Blocks from Documents. In: Dorronsoro, J.R. (eds) Artificial Neural Networks — ICANN 2002. ICANN 2002. Lecture Notes in Computer Science, vol 2415. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46084-5_88
Download citation
DOI: https://doi.org/10.1007/3-540-46084-5_88
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44074-1
Online ISBN: 978-3-540-46084-8
eBook Packages: Springer Book Archive