Reference Hub1
Understanding Digital Documents Using Gestalt Properties of Isothetic Components

Understanding Digital Documents Using Gestalt Properties of Isothetic Components

Shyamosree Pal, Partha Bhowmick, Arindam Biswas, Bhargab B. Bhattacharya
Copyright: © 2010 |Volume: 1 |Issue: 3 |Pages: 26
ISSN: 1947-9077|EISSN: 1947-9085|EISBN13: 9781609609627|DOI: 10.4018/jdls.2010070101
Cite Article Cite Article

MLA

Pal, Shyamosree, et al. "Understanding Digital Documents Using Gestalt Properties of Isothetic Components." IJDLS vol.1, no.3 2010: pp.1-26. http://doi.org/10.4018/jdls.2010070101

APA

Pal, S., Bhowmick, P., Biswas, A., & Bhattacharya, B. B. (2010). Understanding Digital Documents Using Gestalt Properties of Isothetic Components. International Journal of Digital Library Systems (IJDLS), 1(3), 1-26. http://doi.org/10.4018/jdls.2010070101

Chicago

Pal, Shyamosree, et al. "Understanding Digital Documents Using Gestalt Properties of Isothetic Components," International Journal of Digital Library Systems (IJDLS) 1, no.3: 1-26. http://doi.org/10.4018/jdls.2010070101

Export Reference

Mendeley
Favorite Full-Issue Download

Abstract

This paper introduces how Gestalt properties can be used for identifying various components in a document image. That the human mind makes a holistic approach to vision rather than a disintegrated approach is shown to be useful for document analysis. Since the major constituent components (textual or non-textual) in a document page are arranged in a rectilinear fashion, rectilinear/isothetic decomposition of different components are made on a document page. After representing the page as a feature set of its polygonal covers corresponding to the distinct regions of interest, each polygon is iteratively decomposed into the sub-polygons tightly enclosing the corresponding sub-components to capture the overall information as well as the necessary details to the desired level of precision. Subsequently, these components and sub-components are analyzed using Gestalt laws/properties, which have been explained in detail in the context of this work. Text regions, tabular structures, and various graphic objects readily admit some of the Gestalt properties. We have tested our algorithm on several benchmark datasets, and some relevant results have been produced here to demonstrate the effectiveness and elegance of the proposed method.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.