Abstract
In knowledge-based document image understanding, it is important to distinguish the layout structures of individual documents precisely so that an adaptable document model can be used. Document models that are characterized heuristically by application-specific layout structures are not always applicable to every document. In this paper, we propose a method for categorizing various kinds of documents. The method follows a classification and verification paradigm and divides documents into appropriate document types step by step. First, the classification procedure divides the given documents using rough document features; then the verification procedure is applied to the resulting coarsely categorized document sets, using detailed features.
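The two-stage flow described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the feature names (`ruled_line_ratio`, `column_count`, `cell_count`, `headline_blocks`, `text_lines`), the thresholds, and the document type labels are all hypothetical and chosen only to show how a coarse classification step might hand its result to a finer verification step.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Tuple


@dataclass
class Document:
    name: str
    rough: Dict[str, float]   # cheap global features (hypothetical names)
    detail: Dict[str, float]  # costlier layout features (hypothetical names)


def classify(doc: Document) -> str:
    """Stage 1: assign a coarse group using rough features only."""
    if doc.rough["ruled_line_ratio"] > 0.5:   # assumed threshold
        return "table-like"
    if doc.rough["column_count"] >= 3:        # assumed threshold
        return "multi-column"
    return "single-column"


# Stage 2: for each coarse group, candidate document types with a
# predicate over the detailed features. All entries are illustrative.
Verifier = Tuple[str, Callable[[Dict[str, float]], bool]]
VERIFIERS: Dict[str, List[Verifier]] = {
    "table-like": [("form", lambda d: d["cell_count"] >= 10)],
    "multi-column": [("newspaper", lambda d: d["headline_blocks"] >= 2)],
    "single-column": [("catalog card", lambda d: d["text_lines"] <= 15)],
}


def categorize(doc: Document) -> Optional[str]:
    """Run classification, then verify within the coarse group.

    Returns the first document type whose predicate accepts the
    detailed features, or None if no candidate is verified.
    """
    group = classify(doc)
    for doc_type, accepts in VERIFIERS[group]:
        if accepts(doc.detail):
            return doc_type
    return None
```

Returning `None` when verification fails keeps an unrecognized document from being forced into a type, which matches the abstract's point that a heuristic model does not fit every document.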
Copyright information
© 1997 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Masai, H., Watanabe, T. (1997). Document categorization for document image understanding. In: Chin, R., Pong, TC. (eds) Computer Vision — ACCV'98. ACCV 1998. Lecture Notes in Computer Science, vol 1352. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-63931-4_204
DOI: https://doi.org/10.1007/3-540-63931-4_204
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63931-2
Online ISBN: 978-3-540-69670-4
eBook Packages: Springer Book Archive