GOAL: Towards Understanding of Graphic Objects from Architectural to Line Drawings

Pal, Shyamosree; Bhowmick, Partha; Biswas, Arindam; Bhattacharya, Bhargab B.

doi:10.1007/978-3-642-13728-0_8

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 6020)

Included in the following conference series:

International Workshop on Graphics Recognition

Abstract

Understanding graphic objects has become a pertinent problem in today’s context of digital documentation and document digitization, since graphic information in a document image may appear in several forms, such as engineering drawings, architectural plans, musical scores, tables, charts, extended objects, and hand-drawn sketches. Quite a few approaches exist for segmenting graphics from text, along with a separate set of techniques for recognizing a graphic object and its characteristic features. This paper introduces a novel geometric algorithm that segments out all the graphic objects in a document image and subsequently also serves as a high-level tool for classifying various graphic types. Given a document image, it performs text-graphics segmentation by analyzing the geometric features of the minimum-area isothetic polygonal covers of all the objects for varying grid spacing, g. As the shape and size of a polygonal cover depend on g, and each isothetic polygon is represented by an ordered sequence of its vertices, the spatial relationship between the polygons corresponding to a higher grid spacing and those corresponding to a lower spacing is used for graphics segmentation and subsequent classification. Experimental results demonstrate its efficiency, elegance, and versatility.
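
To make the role of the grid spacing g concrete, the following is a minimal sketch, not the authors' construction. It assumes a binary image given as a list of rows and uses the set of g x g grid cells that contain at least one object pixel as a crude stand-in for a minimum-area isothetic cover. The names `occupied_cells`, `cover_components`, and `cover_area` are hypothetical and introduced only for illustration. The sketch shows how nearby objects that have separate covers at a small g merge into a single cover at a larger g, which is the kind of cross-scale relationship the abstract refers to.

```python
from collections import deque


def occupied_cells(image, g):
    """Return the set of (row, col) grid cells of size g x g that contain
    at least one object pixel (value 1) of the binary image."""
    cells = set()
    for y, row in enumerate(image):
        for x, value in enumerate(row):
            if value:
                cells.add((y // g, x // g))
    return cells


def cover_components(cells):
    """Group occupied cells into 4-connected components. Each component
    is the cell set enclosed by one grid-aligned (isothetic) outline."""
    remaining = set(cells)
    components = []
    while remaining:
        seed = remaining.pop()
        component = {seed}
        queue = deque([seed])
        while queue:
            r, c = queue.popleft()
            for nb in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
                if nb in remaining:
                    remaining.remove(nb)
                    component.add(nb)
                    queue.append(nb)
        components.append(component)
    return components


def cover_area(component, g):
    """Area, in pixels, of the region covered by one component."""
    return len(component) * g * g


# Toy image (an assumption for illustration): two blobs one pixel apart.
image = [
    [1, 1, 0, 1, 0, 0, 0, 0],
    [1, 1, 0, 1, 0, 0, 0, 0],
]

for g in (1, 2):
    comps = cover_components(occupied_cells(image, g))
    print(g, len(comps), sorted(cover_area(c, g) for c in comps))
```

In this toy image the two blobs have separate covers at g = 1 and share one cover at g = 2. Tracking which fine-scale covers fall inside which coarse-scale cover is one simple way to express the spatial relationship across grid spacings; the paper itself works with ordered vertex sequences of the polygons rather than cell sets.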



Author information

Authors and Affiliations

Computer Science and Engineering Department, Indian Institute of Technology, Kharagpur, India
Shyamosree Pal & Partha Bhowmick
Department of Information Technology, Bengal Engineering and Science University, Shibpur, India
Arindam Biswas
Advanced Computing and Microelectronics Unit, Indian Statistical Institute, Kolkata, India
Bhargab B. Bhattacharya

Editor information

Editors and Affiliations

Laboratoire d’Informatique, Image et Interactions, Université de La Rochelle, Avenue Crépeau, 17042, La Rochelle Cedex 1, France
Jean-Marc Ogier
Department of Computer Science, City University of Hong Kong, 83 Tat Chee Avenue, Kowloon Tong, Hong Kong, China
Wenyin Liu
Computer Science Department, Computer Vision Center, Edifici O Campus UAB, Bellaterra, Spain
Josep Lladós

About this paper

Cite this paper

Pal, S., Bhowmick, P., Biswas, A., Bhattacharya, B.B. (2010). GOAL: Towards Understanding of Graphic Objects from Architectural to Line Drawings. In: Ogier, JM., Liu, W., Lladós, J. (eds) Graphics Recognition. Achievements, Challenges, and Evolution. GREC 2009. Lecture Notes in Computer Science, vol 6020. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13728-0_8

DOI: https://doi.org/10.1007/978-3-642-13728-0_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-13727-3
Online ISBN: 978-3-642-13728-0
eBook Packages: Computer Science, Computer Science (R0)
