Abstract
The separation of overlapping text and graphics is a challenging problem in document image analysis. This paper proposes a specific method of detecting and extracting characters that are touching graphics. It is based on the observation that the constituent strokes of characters are usually short segments in comparison with those of graphics. It combines line continuation with the feature line width to decompose and reconstruct segments underlying the region of intersection. Experimental results showed that the proposed method improved the percentage of correctly detected text as well as the accuracy of character recognition significantly.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
G. Nagy, Twenty years of document image analysis in PAMI, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 22, No. 1, pp. 38–62, January 2000
D. S. Doermann, An introduction to vectorization and segmentation, in Graphics Recognition: Algorithms and Systems, K. Tombre and A. K. Chhabra (eds.), Lecture Notes in Computer Science 1389, Springer, pp. 1–8, 1998
C. L. Tan and P. O. Ng, Text extraction using pyramid, Pattern Recognition, Vol. 31, No. 1, pp. 63–72, 1998
D. Wang and S. N. Srihari, Analysis of form images, in Document Image Analysis, Bunke, P. S. P. Wang, H. Baird (eds.), World Scientific, pp. 1031–1051, 1994
S. Naoi, Y. Hotta, M. Yabuki, and A. Asakawa, Global interpolation in the segmentation of handwritten characters overlapping a border, Proceeding of 1st IEEE International Conference on Image Processing, pp. 149–153, 1994
J. Yoo, M. Kim, S. Y. Han, and Y. Kwon, Line removal and restoration of handwritten characters on the form documents, Proceeding of 4th International Conference on Document Analysis and Recognition, pp. 128–131, 1997
K. Lee, H. Byun, and Y. Lee, Robust reconstruction of damaged character images on the form documents. In Graphics Recognition: Algorithms and Systems, K. Tombre and A. K. Chhabra (eds.), Lecture Notes in Computer Science 1389, Springer, pp. 149–162, 1998
R. Kasturi, S. T. Bow, W. El-Masri, J. Shah, J. R. Gattiker, and U. B. Mokate, A system for interpretation of line drawings, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 12, No. 10, pp. 978–992, October 1990
D. Dori and Liu W., Vector-based segmentation of text connected to graphics in engineering drawings, in Advances in Structural and Syntactical Pattern Recognition, P. Perner, P. Wang, A. Rosenfeld (eds.), Springer, pp. 322–331, 1996
Z. Lu, Detection of text regions from digital engineering drawings, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 20, No. 4, pp. 431–439, April 1998
H. Luo, G. Agam, and I. Dinstein, Directional mathematical morphology approach for line thinning and extraction of character strings from maps and line drawings, Proceeding of 3rd International Conference on Document Analysis and Recognition, pp. 257–260, 1995
L. Li, G. Nagy, A. Samal, S. Seth, Y. Xu, Cooperative text and line-art extraction from a topographic map, Proceedings of 5th International Conference on Document Analysis and Recognition, pp. 467–470, 1999
D. Dori, Liu W. and M. Peleg, How to win a dashed line detection contest, in Graphics Recognition: methods and Applications, R. Kasturi and K. Tombre (eds.), Lecture Notes in Computer Science 1072, Springer, pp. 286–300, 1996
R. Jain, R. Kasturi, and B. G. Schunck, Machine Vision, MIT Press and McGraw-Hill, 1995
B. K. Jang and R. T. Chin, One-pass parallel thinning: analysis, properties, and quantitative evaluation, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 14, No. 11, pp. 1129–1140, November 1992
T. Pavlidis, Algorithms for graphics and image processing, Computer Science Press, 1982
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cao, R., Tan, C.L. (2002). Text/Graphics Separation in Maps. In: Blostein, D., Kwon, YB. (eds) Graphics Recognition Algorithms and Applications. GREC 2001. Lecture Notes in Computer Science, vol 2390. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45868-9_14
Download citation
DOI: https://doi.org/10.1007/3-540-45868-9_14
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44066-6
Online ISBN: 978-3-540-45868-5
eBook Packages: Springer Book Archive