Abstract
Understanding of graphic objects has become a problem of pertinence in today’s context of digital documentation and document digitization, since graphic information in a document image may be present in several forms, such as engineering drawings, architectural plans, musical scores, tables, charts, extended objects, hand-drawn sketches, etc. There exist quite a few approaches for segmentation of graphics from text, and also a separate set of techniques for recognizing a graphics and its characteristic features. This paper introduces a novel geometric algorithm that performs the task of segmenting out all the graphic objects in a document image and subsequently also works as a high-level tool to classify various graphic types. Given a document image, it performs the text-graphics segmentation by analyzing the geometric features of the minimum-area isothetic polygonal covers of all the objects for varying grid spacing, g. As the shape and size of a polygonal cover depends on g, and each isothetic polygon is represented by an ordered sequence of its vertices, the spatial relationship of the polygons corresponding to a higher grid spacing with those corresponding to a lower spacing, is used for graphics segmentation and subsequent classification. Experimental results demonstrate its efficiency, elegance, and versatility.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Antonacopoulos, A., Ritchings, R.T.: Representation and classification of complex-shaped printed regions using white tiles. In: Proc. ICDAR 1995, pp. 1132–1134 (1995)
Biswas, A., Bhowmick, P., Bhattacharya, B.B.: Construction of isothetic covers of a digital object: A combinatorial approach. JVCIR (in press, 2010)
Chen, J., Leung, M.K., Gao, Y.: Noisy logo recognition using line segment Hausdorff distance. Pattern Recognition 36(4), 943–955 (2003)
Futrelle, R.P., et al.: Extraction, layout analysis and classification of diagrams in PDF documents. In: ICDAR 2003, pp. 1007–1014 (2003)
Gonzalez, R.C., Woods, R.E.: Digital Image Processing. Addison-Wesley, California (1993)
Haralick, R.M.: Document image understanding: Geometric and logical layout. In: Proc. CVPR, pp. 385–390 (1994)
Hu, J., Kashi, R., Lopresti, D., Wilfong, G.: Evaluating the performance of table processing algorithms 4(3), 140–153 (2002)
Klette, R., Rosenfeld, A.: Digital Geometry: Geometric Methods for Digital Picture Analysis. Morgan Kaufmann, San Francisco (2004)
Kopec, G.E., Chou, P.A.: Document image decoding using Markov source models. IEEE TPAMI 16(6), 602–617 (1994)
Li, J., Najmi, A., Gray, R.M.: Image classification by a two-dimensional hidden Markov model. IEEE Trans. Signal Process 48(2), 517–533 (2000)
Pham, T.D.: Unconstrained logo detection in document images. Pattern Recognition 36(12), 3023–3025 (2003)
Ramel, J.-Y., Vincent, N.: Strategy for line drawing understanding. In: Lladós, J., Kwon, Y.-B. (eds.) GREC 2003. LNCS, vol. 3088, pp. 1–12. Springer, Heidelberg (2004)
Song, J., et al.: An object-oriented progresssive-simplification-based vectorization system for engineering drawings: Model, algorithm, and performance. IEEE TPAMI 24(8), 1048–1060 (2002)
Sun, Z., Wang, W., Zhang, L., Liu, J.: Sketch parameterization using curve approximation. In: Liu, W., Lladós, J. (eds.) GREC 2005. LNCS, vol. 3926, pp. 334–345. Springer, Heidelberg (2006)
Wang, Y., Phillips, I.T., Haralick, R.M.: Document zone content classification and its performance evaluation. Pattern Recognition 39, 57–73 (2006)
Wenyin, L.: On-line graphics recognition: State-of-the-art. In: Lladós, J., Kwon, Y.-B. (eds.) GREC 2003. LNCS, vol. 3088, pp. 291–304. Springer, Heidelberg (2004)
Xiao, Y., Yan, H.: Text region extraction in a document image based on the Delaunay tessellation. Pattern Recognition 36(3), 799–809 (2003)
Zanibbi, R., Blostein, D., Cordy, J.R.: Recognizing mathematical expressions using tree transformation. IEEE TPAMI 24(11), 1455–1467 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pal, S., Bhowmick, P., Biswas, A., Bhattacharya, B.B. (2010). GOAL: Towards Understanding of Graphic Objects from Architectural to Line Drawings. In: Ogier, JM., Liu, W., Lladós, J. (eds) Graphics Recognition. Achievements, Challenges, and Evolution. GREC 2009. Lecture Notes in Computer Science, vol 6020. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13728-0_8
Download citation
DOI: https://doi.org/10.1007/978-3-642-13728-0_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-13727-3
Online ISBN: 978-3-642-13728-0
eBook Packages: Computer ScienceComputer Science (R0)