
Combined orientation and skew detection using geometric text-line modeling

  • Original Paper
International Journal on Document Analysis and Recognition (IJDAR)

Abstract

In large-scale document digitization, orientation detection plays an important role, especially when digitizing incoming mail. The heavy use of scanners with automatic document feeders, together with the automatic processing of facsimiles, results in many documents being scanned in the wrong orientation. These misoriented scans have to be corrected, as most subsequent processing steps assume that the document was scanned in the right orientation. Several existing methods for orientation detection exploit the fact that in Latin-script text, ascenders are more likely to occur than descenders. In this paper, we propose a one-step skew and orientation detection method using a well-established geometric text-line model. The advantage of our method is that it combines accurate skew estimation with robust, resolution-independent orientation detection. An interesting aspect of our method is that it incorporates orientation detection into a previously published skew detection method, making it possible to perform orientation detection, skew estimation, and, if necessary, text-line extraction in one step. The effectiveness of our orientation detection approach is demonstrated on the UW-I dataset and on publicly available test images from OCRopus. Our method achieves an accuracy of 99% on the UW-I dataset and 100% on the test images from OCRopus.
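
The ascender/descender cue mentioned in the abstract can be illustrated with a minimal sketch. This is not the geometric text-line model proposed in the paper; it only shows the counting heuristic that the abstract attributes to existing methods. It assumes that character bounding boxes and the two lines bounding the x-height band of a text line are already known. All names (`Box`, `count_extenders`, `guess_orientation`, `rotate_180`) and the tolerance value are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Box:
    """Vertical extent of one character box (image coordinates, y grows downward)."""
    top: float
    bottom: float


def count_extenders(boxes: List[Box], upper_line: float, lower_line: float,
                    tol: float = 2.0) -> Tuple[int, int]:
    """Count boxes that stick out above upper_line and below lower_line.

    upper_line and lower_line bound the x-height band as it appears in the
    image; tol (an assumed value, in pixels) absorbs small jitter.
    """
    above = sum(1 for b in boxes if b.top < upper_line - tol)
    below = sum(1 for b in boxes if b.bottom > lower_line + tol)
    return above, below


def guess_orientation(boxes: List[Box], upper_line: float, lower_line: float,
                      tol: float = 2.0) -> str:
    """Assume ascenders outnumber descenders in upright Latin-script text."""
    above, below = count_extenders(boxes, upper_line, lower_line, tol)
    if above == below:
        return "undecided"
    return "upright" if above > below else "upside-down"


def rotate_180(boxes: List[Box], page_height: float) -> List[Box]:
    """Vertical coordinates of the boxes after rotating the page by 180 degrees."""
    return [Box(page_height - b.bottom, page_height - b.top) for b in boxes]


if __name__ == "__main__":
    # Synthetic line: x-height band between y=20 and y=30,
    # four ascender boxes, two descender boxes, six x-height boxes.
    line = [Box(10, 30)] * 4 + [Box(20, 40)] * 2 + [Box(20, 30)] * 6
    print(guess_orientation(line, 20, 30))           # prints "upright"

    # The same line on a page of height 100, scanned upside down:
    # the band now lies between y=70 and y=80.
    flipped = rotate_180(line, 100)
    print(guess_orientation(flipped, 70, 80))        # prints "upside-down"
```

A real system would first have to estimate the text lines and their skew before such counts are meaningful, which is the step the proposed method combines with orientation detection.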



Author information

Corresponding author

Correspondence to Faisal Shafait.


About this article

Cite this article

van Beusekom, J., Shafait, F. & Breuel, T.M. Combined orientation and skew detection using geometric text-line modeling. IJDAR 13, 79–92 (2010). https://doi.org/10.1007/s10032-009-0109-5


  • DOI: https://doi.org/10.1007/s10032-009-0109-5
