Abstract
In this paper, a qualitative representation for the layout of structured documents is presented, which is established by the means of supervised learning from a labeled training set of documents. For this formal representation, an inference algorithm has been developed, adopted from error tolerant subgraph isomorphism, which assigns logic labels to layout objects of a test document.
Preview
Unable to display preview. Download preview PDF.
References
James F. Allen. Maintaining knowledge about temporal intervals. Communications of the ACM, 26(11):832–843, November 1983.
Thomas Bayer. Ulrich Bohnacker, and Ingrid Renz. Information Extraction from Paper Documents. Handbook on Optical Character Recognition and Document Image Analysis. P. Wang and Horst Bunke, to be published.
Thomas A. Bayer and Hanno Walischewski. Experiments on extracting structural information from paper documents using syntactic pattern analysis. In Proceedings of the 3rd International Conference on Document Analysis and Recognition, volume 1, pages 476–479, 1995.
F. Esposito, D. Malerba, and G. Semeraro. Multistrategy Learning for Document Recognition. Applied Artificial Intelligence, 8(33–84), 1994.
Amitabha Mukerjee and Gene Joe. A qualitative spatial representation based on rangency and alignment. Technical report, Texas A&M University, Department of Computer Science, 1992.
J. Pearl. Heuristics. Addison Wesley, London, 1984.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1997 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Walischewski, H. (1997). Learning and interpretation of the layout of structured documents. In: Brewka, G., Habel, C., Nebel, B. (eds) KI-97: Advances in Artificial Intelligence. KI 1997. Lecture Notes in Computer Science, vol 1303. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3540634932_39
Download citation
DOI: https://doi.org/10.1007/3540634932_39
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63493-5
Online ISBN: 978-3-540-69582-0
eBook Packages: Springer Book Archive