Skip to main content

GOAL: Towards Understanding of Graphic Objects from Architectural to Line Drawings

  • Conference paper
Graphics Recognition. Achievements, Challenges, and Evolution (GREC 2009)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 6020))

Included in the following conference series:

  • 624 Accesses

Abstract

Understanding of graphic objects has become a problem of pertinence in today’s context of digital documentation and document digitization, since graphic information in a document image may be present in several forms, such as engineering drawings, architectural plans, musical scores, tables, charts, extended objects, hand-drawn sketches, etc. There exist quite a few approaches for segmentation of graphics from text, and also a separate set of techniques for recognizing a graphics and its characteristic features. This paper introduces a novel geometric algorithm that performs the task of segmenting out all the graphic objects in a document image and subsequently also works as a high-level tool to classify various graphic types. Given a document image, it performs the text-graphics segmentation by analyzing the geometric features of the minimum-area isothetic polygonal covers of all the objects for varying grid spacing, g. As the shape and size of a polygonal cover depends on g, and each isothetic polygon is represented by an ordered sequence of its vertices, the spatial relationship of the polygons corresponding to a higher grid spacing with those corresponding to a lower spacing, is used for graphics segmentation and subsequent classification. Experimental results demonstrate its efficiency, elegance, and versatility.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Antonacopoulos, A., Ritchings, R.T.: Representation and classification of complex-shaped printed regions using white tiles. In: Proc. ICDAR 1995, pp. 1132–1134 (1995)

    Google Scholar 

  2. Biswas, A., Bhowmick, P., Bhattacharya, B.B.: Construction of isothetic covers of a digital object: A combinatorial approach. JVCIR (in press, 2010)

    Google Scholar 

  3. Chen, J., Leung, M.K., Gao, Y.: Noisy logo recognition using line segment Hausdorff distance. Pattern Recognition 36(4), 943–955 (2003)

    Article  Google Scholar 

  4. Futrelle, R.P., et al.: Extraction, layout analysis and classification of diagrams in PDF documents. In: ICDAR 2003, pp. 1007–1014 (2003)

    Google Scholar 

  5. Gonzalez, R.C., Woods, R.E.: Digital Image Processing. Addison-Wesley, California (1993)

    Google Scholar 

  6. Haralick, R.M.: Document image understanding: Geometric and logical layout. In: Proc. CVPR, pp. 385–390 (1994)

    Google Scholar 

  7. Hu, J., Kashi, R., Lopresti, D., Wilfong, G.: Evaluating the performance of table processing algorithms 4(3), 140–153 (2002)

    Google Scholar 

  8. Klette, R., Rosenfeld, A.: Digital Geometry: Geometric Methods for Digital Picture Analysis. Morgan Kaufmann, San Francisco (2004)

    MATH  Google Scholar 

  9. Kopec, G.E., Chou, P.A.: Document image decoding using Markov source models. IEEE TPAMI 16(6), 602–617 (1994)

    Google Scholar 

  10. Li, J., Najmi, A., Gray, R.M.: Image classification by a two-dimensional hidden Markov model. IEEE Trans. Signal Process 48(2), 517–533 (2000)

    Article  Google Scholar 

  11. Pham, T.D.: Unconstrained logo detection in document images. Pattern Recognition 36(12), 3023–3025 (2003)

    Article  MATH  MathSciNet  Google Scholar 

  12. Ramel, J.-Y., Vincent, N.: Strategy for line drawing understanding. In: Lladós, J., Kwon, Y.-B. (eds.) GREC 2003. LNCS, vol. 3088, pp. 1–12. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  13. Song, J., et al.: An object-oriented progresssive-simplification-based vectorization system for engineering drawings: Model, algorithm, and performance. IEEE TPAMI 24(8), 1048–1060 (2002)

    Google Scholar 

  14. Sun, Z., Wang, W., Zhang, L., Liu, J.: Sketch parameterization using curve approximation. In: Liu, W., Lladós, J. (eds.) GREC 2005. LNCS, vol. 3926, pp. 334–345. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  15. Wang, Y., Phillips, I.T., Haralick, R.M.: Document zone content classification and its performance evaluation. Pattern Recognition 39, 57–73 (2006)

    Article  Google Scholar 

  16. Wenyin, L.: On-line graphics recognition: State-of-the-art. In: Lladós, J., Kwon, Y.-B. (eds.) GREC 2003. LNCS, vol. 3088, pp. 291–304. Springer, Heidelberg (2004)

    Google Scholar 

  17. Xiao, Y., Yan, H.: Text region extraction in a document image based on the Delaunay tessellation. Pattern Recognition 36(3), 799–809 (2003)

    Article  MATH  MathSciNet  Google Scholar 

  18. Zanibbi, R., Blostein, D., Cordy, J.R.: Recognizing mathematical expressions using tree transformation. IEEE TPAMI 24(11), 1455–1467 (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Pal, S., Bhowmick, P., Biswas, A., Bhattacharya, B.B. (2010). GOAL: Towards Understanding of Graphic Objects from Architectural to Line Drawings. In: Ogier, JM., Liu, W., Lladós, J. (eds) Graphics Recognition. Achievements, Challenges, and Evolution. GREC 2009. Lecture Notes in Computer Science, vol 6020. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13728-0_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-13728-0_8

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-13727-3

  • Online ISBN: 978-3-642-13728-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics