Basic techniques and symbol-level recognition — An overview

O'Gorman, Lawrence

doi:10.1007/3-540-61226-2_1

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 1072)

Included in the following conference series:

International Workshop on Graphics Recognition

Abstract

This overview paper describes methods used in graphics recognition, from the initial scanned image through to the extracted graphics features. The objective is to give context and background to the papers in this section and, because it is introductory material, to the rest of the papers in this book.

Methods are described under three categories: pixel-level processing, line-level processing, and feature detection. The pixel-level processing section discusses thresholding, noise reduction, and compression. The line-level processing section discusses thinning, chain coding, region detection, and polygonalization. The final section, on feature detection, discusses critical point detection, line and curve fitting, and shape recognition.
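
As an illustration of the first pixel-level step, thresholding, the following is a minimal sketch of global threshold selection by maximizing between-class variance (Otsu's method). It is chosen here only as a familiar example of global thresholding; the abstract does not say which thresholding methods the paper treats in detail. The function names `otsu_threshold` and `binarize` and the flat list-of-intensities input are assumptions made for this sketch.

```python
def otsu_threshold(pixels, levels=256):
    """Return the gray level t that maximizes between-class variance.

    Pixels with value <= t form the low class; pixels > t form the high class.
    `pixels` is a flat sequence of integer intensities in [0, levels).
    """
    # Build the gray-level histogram.
    hist = [0] * levels
    for p in pixels:
        hist[p] += 1

    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w0 = 0    # number of pixels in the low class so far
    sum0 = 0  # sum of intensities in the low class so far
    for t in range(levels):
        w0 += hist[t]
        sum0 += t * hist[t]
        if w0 == 0:
            continue  # low class still empty
        w1 = total - w0
        if w1 == 0:
            break     # high class empty; no further split is possible
        mu0 = sum0 / w0
        mu1 = (sum_all - sum0) / w1
        # Between-class variance, up to a constant factor of 1 / total**2.
        var_between = w0 * w1 * (mu0 - mu1) ** 2
        if var_between > best_var:
            best_t, best_var = t, var_between
    return best_t


def binarize(pixels, t):
    """Map each pixel to 1 if it is brighter than t, else 0."""
    return [1 if p > t else 0 for p in pixels]


if __name__ == "__main__":
    # Hypothetical scan line: dark ink (about 10) on a bright page (about 200).
    row = [10, 12, 11, 200, 198, 202, 9, 201]
    t = otsu_threshold(row)
    print(t, binarize(row, t))
```

A single global threshold like this assumes a roughly bimodal histogram. Documents with uneven illumination or low contrast generally call for locally adaptive thresholding instead.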

Author information

Authors and Affiliations

AT&T Bell Laboratories, Murray Hill, NJ 07940-0636
Lawrence O'Gorman

Editor information

Rangachar Kasturi, Karl Tombre

About this paper

Cite this paper

O'Gorman, L. (1996). Basic techniques and symbol-level recognition — An overview. In: Kasturi, R., Tombre, K. (eds) Graphics Recognition Methods and Applications. GREC 1995. Lecture Notes in Computer Science, vol 1072. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-61226-2_1

DOI: https://doi.org/10.1007/3-540-61226-2_1
Published: 07 June 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-61226-1
Online ISBN: 978-3-540-68387-2
eBook Packages: Springer Book Archive