Abstract
Document analysis and graphics recognition algorithms are normally applied to the processing of images of 2D documents scanned when flattened against a planar surface. Technological advancements in recent years have led to a situation in which digital cameras with high resolution are widely available. Consequently, traditional graphics recognition tasks may be updated to accommodate document images captured through a camera in an uncontrolled environment. In this paper the problem of document image rectification is discussed. The rectification targets the correction of perspective and geometric distortions of document images taken from uncalibrated cameras, by synthesizing new views which are better suited for existing graphics recognition and document analysis techniques. The proposed approach targets cases in which the document is not necessarily flat, without relaying on specific modeling assumptions, and by utilizing one or more overlapping views of the document. Document image rectification results are provided for several cases.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
H. Baird, “Document image defect models”, In Proc. SSPR’90, pp. 38–46, 1990.
A. Amin, S. Fischer, A. F. Parkinson, R. Shiu, “Comparative study of skew algorithms”, Journal of Electronic Imaging, Vol. 5, Iss. 4, pp. 443–451, 1996.
M. Sawaki, H. Murase, and N. Hagita, “Character recognition in bookshelf images by automatic template selection,” Proc. ICPR’98, pp. 1117–1120, Aug. 1998.
H. Fujisawa, H. Sako, Y. Okada, and S. Lee, “Information capturing camera and developmental issues”, In Proc. ICDAR’99, pp. 205–208, 1999.
M. Shridhar, J. W. V. Miller, G. Houle, and L. Bijnagte, “Recognition of license plate images: issues and perspectives”, In Proc. ICDAR’99, pp. 17–20, 1999.
H. Li, D. Doermann, and O. Kia, “Automatic text detection and tracking in digital video”, IEEE Trans. Image Processing, Vol. 9, No. 1, pp. 147–156, 2000.
T. Kanungo, R. Haralick, and I. Philips, “Global and local document degradation models”, In Proc. ICDAR’93, pp. 730–734, 1993.
G. Agam, G. Michaud, J. S. Perrier, J. L. Houle, and P. Cohen, “A survey of image based view synthesis approaches for interactive 3D sensing”, Technical Report GRPR-RT-9901, The Perception and Robotics Laboratory, Ecole Polytechnique, Montreal, Canada, April, 1999.
J. S. Perrier, G. Agam, and P. Cohen, “Image-based view synthesis for enhanced perception in teleoperation”, in Enhanced and Synthetic Vision 2000, J. G. Verly, ed., Proc. SPIE 4023, pp. 213–224, 2000.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Agam, G., Wu, C. (2002). Structural Rectification of Non-planar Document Images: Application to Graphics Recognition. In: Blostein, D., Kwon, YB. (eds) Graphics Recognition Algorithms and Applications. GREC 2001. Lecture Notes in Computer Science, vol 2390. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45868-9_25
Download citation
DOI: https://doi.org/10.1007/3-540-45868-9_25
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44066-6
Online ISBN: 978-3-540-45868-5
eBook Packages: Springer Book Archive