Abstract
Geometric distortion often occurs when taking images of bound documents. This phenomenon greatly impairs recognition accuracy. In this paper, a new one-image based method is proposed to correct geometric distortion in bound document images. According to this method, the document image is binarized first. Next, curved text-line features are extracted. Thirdly, locally optimized text curves are detected using a graph model. Finally, the technique of texture warping is applied to correct the image. Experimental results show that images restored by our proposed method can achieve good perception and recognition results.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Brown, M.S., Seales, W.: Image restoration of arbitrarily warped documents. IEEE Transcations on Pattern Analysis and Machine Intelligence 26(2), 1295–1306 (2004)
Yamashita, A., Kawarago, A., Kaneko, T., Miura, K.T.: Shape reconstruction and image restoration for non-flat surfaces of documents with a stereo vision system. In: Proceedings of the Seventeenth International Conference on Pattern Recognition, vol. 1, pp. 482–485. IEEE, Los Alamitos (2004)
Wada, T., Ukida, H., Matsuyama, T.: Shape from shading with interreflections under a proximal light source: Distortion-free copying of an unfolded book. International Journal of Computer Vision 24(2), 125–135 (1997)
Tan, C., Zhang, L., Zhang, Z., Xia, T.: Restoring warped document images through 3d shape modeling. IEEE Transcations on Pattern Analysis and Machine Intelligence 28(2), 195–208 (2006)
Cao, H., Ding, X., Liu, C.: Rectifying the bound document image captured by the camera: a model based approach. In: Proceedings of the Seventh International Conference on Document Analysis and Recognition, vol. 1, pp. 71–75. IEEE, Los Alamitos (2003)
Wu, C., Agam, G.: Document image de-warping for text/graphics recognition. In: Caelli, T.M., Amin, A., Duin, R.P.W., Kamel, M.S., de Ridder, D. (eds.) SPR 2002 and SSPR 2002. LNCS, vol. 2396, pp. 348–357. Springer, Heidelberg (2002)
Zhang, Z., Tan, C.I.: Correcting document image warping based on regression of curved text lines. In: Proceedings of the Seventh International Conference on Document Analysis and Recognition, pp. 589–593 (2003)
Zhang, L., Tan, C.I.: Warped image restoration with applications to digital libraries. In: Proceedings of the Eighth International Conference on Document Analysis and Recognition, pp. 192–196 (2005)
Ulges, A., Lampert, C.H., Breuel, T.M.: Document image dewarping using robust estimation of curled text lines. In: Proceedings of the Eighth International Conference on Document Analysis and Recognition, pp. 1001–1005 (2005)
Ezaki, H., Uchida, S., Asano, A., Sakoe, H.: Dewarping of document image by global optimization. In: Proceedings of the Eighth International Conference on Document Analysis and Recognition, pp. 302–306 (2005)
Seeger, M., Dance, C.: Binarising camera images for ocr. In: Proceedings of the Sixth International Conference on Document Analysis and Recognition, pp. 54–58 (2001)
Chen, M., Cheng, Z., Liu, Y.: A robust algorithm of principal curve detection. In: Proceedings of the Seventeenth International Conference on Pattern Recognition (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ma, Y., Wang, C., Dai, R. (2006). Correcting Bound Document Images Based on Automatic and Robust Curved Text Lines Estimation. In: Matsumoto, Y., Sproat, R.W., Wong, KF., Zhang, M. (eds) Computer Processing of Oriental Languages. Beyond the Orient: The Research Challenges Ahead. ICCPOL 2006. Lecture Notes in Computer Science(), vol 4285. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11940098_21
Download citation
DOI: https://doi.org/10.1007/11940098_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-49667-0
Online ISBN: 978-3-540-49668-7
eBook Packages: Computer ScienceComputer Science (R0)