Skip to main content

Document categorization for document image understanding

  • Poster Session II
  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1352))

Abstract

In the knowledge-based document image understanding, it is important to distinguish the layout structures of individual documents exactly with a view to making use of adaptable document model. At least, the document models which are characterized heuristically by the application-specific layout structures are not always applicable to every document. In this paper, we propose a categorization method of various kinds of documents. Our categorization method on the basis of the classification and verification paradigm divides various kinds of documents into appropriate document types stepwisely. First, the classification procedure divides the given documents using rough features about documents, and then the verification procedure is applied to the globally categorized document sets, using the detail features.

This is a preview of subscription content, log in via an institution.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. T. Watanabe, Q. Luo, Y. Yoshida, and Y. Inagaki: “A Stepwise Recognition Method of Library Cataloging Cards on the Basis of Various Kinds of Knowledge”, Proc. of 10th IPCCC, pp.821–827(1991).

    Google Scholar 

  2. T. Watanabe, H. Naruse, Q. Luo and N. Sugie: “Structure Analysis of Table-form Documents on the Basis of the Recognition of Vertical and Horizontal Line Segments”, Proc. of 1st ICDAR, pp.638–646(1991).

    Google Scholar 

  3. Q. Luo, T. Watanabe and N. Sugie: “A Structure Recognition Method for Japanese Newspapers”, Proc. of 1st SDAIR, pp.217–234(1992).

    Google Scholar 

  4. T. Watanabe, Q. Luo and N. Sugie: “Structure Recognition Methods for Various Types of Documents”, Int'l Journal of MVA, Vol.6, pp.163–176 (1993).

    Google Scholar 

  5. Y. Ishitani: “Document Layout Analysis Based on Emergent Computation”, MIRU'96, Vol.I, pp.343–348 (1996) (in Japanese).

    Google Scholar 

  6. H. Masai and T. Watanabe: “Identification of Document Types from Various Kinds of Document Images Based on Physical and Layout Features”, Proc. of MVA'96, pp.369–372.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Roland Chin Ting-Chuen Pong

Rights and permissions

Reprints and permissions

Copyright information

© 1997 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Masai, H., Watanabe, T. (1997). Document categorization for document image understanding. In: Chin, R., Pong, TC. (eds) Computer Vision — ACCV'98. ACCV 1998. Lecture Notes in Computer Science, vol 1352. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-63931-4_204

Download citation

  • DOI: https://doi.org/10.1007/3-540-63931-4_204

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-63931-2

  • Online ISBN: 978-3-540-69670-4

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics