Skip to main content

Imaging Techniques in Document Analysis Processes

  • Reference work entry
  • First Online:
Handbook of Document Image Processing and Recognition

Abstract

Imaging techniques are widely used in document image analysis in order to process, enhance, analyze, and recognize document images. In this chapter, we present an overview of basic image processing algorithms used in document image analysis and focus on the techniques used for document image binarization, enhancement, and normalization.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 549.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 549.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Akiyama T, Hagita N (1990) Automated entry system for printed documents. Pattern Recognit 23(11):1141–1154

    Article  Google Scholar 

  2. Amin A, Fischer S (2000) A document skew detection method using the Hough transform. Pattern Anal Appl 3(3):243–253

    Article  Google Scholar 

  3. Aradhye HB (2005) A generic method for determining up/down orientation of text in roman and non-roman scripts. Pattern Recognit 38:2114–2131

    Article  Google Scholar 

  4. Avila BT, Lins RD (2004) A new algorithm for removing noisy borders from monochromatic documents. In: ACM-SAC’2004, Cyprus, Mar 2004. ACM, pp 1219–1225

    Google Scholar 

  5. Avila BT, Lins RD (2004) Efficient removal of noisy borders from monochromatic documents. In: ICIAR 2004, Porto, Portugal. LNCS 3212, pp 249–256

    Google Scholar 

  6. Badekas E, Papamarkos N (2007) Optimal combination of document binarization techniques using a self-organizing map neural network. Eng Appl Artif Intell (Elsevier) 20:11–24

    Article  Google Scholar 

  7. Badekas E, Nikolaou N, Papamarkos N (2006) Text binarization in color documents. Int J Imaging Syst Technol 16(6):262–274

    Article  Google Scholar 

  8. Baird HS (1987) The skew angle of printed documents. In: SPSE 40th conference and symposium on hybrid imaging systems, Rochester, pp 21–24

    Google Scholar 

  9. Bozinovich A, Srihari A (1989) Off-line cursive script word recognition. Trans Pattern Anal Mach Intell II(1):69–82

    Article  Google Scholar 

  10. Brown MS, Tsoi YC (2006) Geometric and shading correction for images of printed materials using boundary. IEEE Trans Image Process 15(6):1544–1554

    Article  Google Scholar 

  11. Cao Y, Li H (2003) Skew detection and correction in document images based on straight-line fitting. Pattern Recognit Lett 24(12):1871—1879

    Article  Google Scholar 

  12. Cao H, Ding X, Liu C (2003) Rectifying the bound document image captured by the camera: a model based approach. In: 7th international conference on document analysis and recognition, Edinburgh, pp 71–75

    Google Scholar 

  13. Caprari RS (2000) Algorithm for text page up/down orientation determination. Pattern Recognit Lett 21:311–317

    Article  Google Scholar 

  14. Chen Y, Leedham G (2005) Decompose algorithm for thresholding degraded historical document images. IEE Vis Image Signal Process 152(6):702–714

    Article  Google Scholar 

  15. Chen YK, Wang JF (2001) Skew detection and reconstruction of color-printed document images. IEICE Trans Inf Syst E84-D(8):1018–1024

    Google Scholar 

  16. Cheriet M, Said JN, Suen CY (1998) A recursive thresholding technique for image segmentation. IEEE Trans Image Process 7(6):918–921

    Article  Google Scholar 

  17. Chou CH, Chu SY, Chang F (2007) Estimation of skew angles for scanned documents based on piecewise covering by parallelograms. Pattern Recognit 40:443–455

    Article  Google Scholar 

  18. Chou C-H, Lin W-H, Chang F (2010) A binarization method with learning-built rules for document images produced by cameras. Pattern Recognit 43(4):1518–1530

    Article  Google Scholar 

  19. Ding Y, Kimura F, Miyake Y (2000) Slant estimation for handwritten words by directionally refined chain code. In: 7th international workshop on Frontiers in handwriting recognition (IWFHR 2000), Amsterdam, pp 53–62

    Google Scholar 

  20. Ding Y, Ohyama W, Kimura F, Shridhar M (2004) Local Slant estimation for handwritten English words. In: 9th international workshop on Frontiers in handwriting recognition (IWFHR 2004), Tokyo, Japan, pp 328–333,

    Google Scholar 

  21. Drira F, LeBourgeois F, Emptoz H (2011) A new PDE-based approach for singularity-preserving regularization: application to degraded characters restoration. Int J Doc Anal Recognit. doi:10.1007/s10032-011-0165-5

    Article  Google Scholar 

  22. Dubois E, Pathak A (2001) Reduction of bleed-through in scanned manuscript documents. In: Proceedings of the image processing, image quality, image capture systems conference, Apr 2001, pp 177–180

    Google Scholar 

  23. Fadoua D, Le Bourgeois F, Emptoz H (2006) Restoring ink bleed-through degraded document images using a recursive unsupervised classification technique. In: 7th international workshop on document analysis systems, (DAS 2006), Nelson, New Zealand, pp 38–49

    Google Scholar 

  24. Fan KC, Wang YK, Lay TR (2002) Marginal noise removal of document images. Pattern Recognit 35(11):2593–2611

    Article  Google Scholar 

  25. Fan H, Zhu L, Tang Y (2010) Skew detection in document images based on rectangular active contour. Int J Doc Anal Recognit 13(4):261–269

    Article  Google Scholar 

  26. Gatos B, Papamarkos N, Chamzas C (1997) Skew detection and text line position determination in digitized documents. Pattern Recognit 30(9):1505–1519

    Article  Google Scholar 

  27. Gatos B, Pratikakis I, Perantonis SJ (2006) Adaptive degraded document image binarization. Pattern Recognit 39:317–327

    Article  Google Scholar 

  28. Gatos B, Pratikakis I, Perantonis SJ (2008) Efficient binarization of historical and degraded document images. In: 8th international workshop on document analysis systems (DAS’08), Nara, Sept 2008, pp 447–454

    Google Scholar 

  29. Haji MM, Bui TD, Suen CY (2009) Simultaneous document margin removal and skew correction based on corner detection in projection profiles. In: ICIAP 2009, Solerno, Italy. LNCS 5716, pp 1025–1034

    Chapter  Google Scholar 

  30. Huang S, Ahmadi M, Sid-Ahmed MA (2008) A hidden Markov model-based character extraction method. Pattern Recognit 41(9):2890–2900

    Article  Google Scholar 

  31. Hull JJ (1998) Document image skew detection: survey and annotated bibliography. In: Hull JJ, Taylor SL (eds) Document analysis systems II. World Scientific, Singapore/River Edge, pp 40–64

    Chapter  Google Scholar 

  32. Kamel M, Zhao A (1993) Extraction of binary character/graphics images from grayscale document images. Comput Vis Graph Image Process 55:203–217

    Google Scholar 

  33. Kavallieratou E, Fakotakis N, Kokkinakis G (2001) Slant estimation algorithm for OCR systems. Pattern Recognit 34:2515–2522

    Article  Google Scholar 

  34. Kim G, Govindaraju V (1997) A Lexicon driven approach to handwritten word recognition for real-time applications. IEEE Trans Pattern Anal Mach Intell 19(4):366–379

    Article  Google Scholar 

  35. Kim I-K, Jung D-W, Park R-H (2002) Document image binarization based on topographic analysis using a water flow model. Pattern Recognit 35:265–277

    Article  Google Scholar 

  36. Kimura F, Shridhar M, Chen Z (1993) Improvements of a lexicon directed algorithm for recognition of unconstrained handwritten words. In: 2nd international conference on document analysis and recognition (ICDAR 1993), Tsukuba City, Japan. Oct 1993, pp 18–22

    Google Scholar 

  37. Lavialle O, Molines X, Angella F, Baylou P (2001) Active contours network to straighten distorted text lines. In: International conference on image processing, Thessaloniki, pp 748–751

    Google Scholar 

  38. Le DX, Thoma GR (1996) Automated borders detection and adaptive segmentation for binary document images. In: International conference on pattern recognition, (ICPR 1996), Vienna, Austria, p III: 737–741

    Google Scholar 

  39. Le DS, Thoma GR, Wechsler H (1994) Automated page orientation and skew angle detection for binary document images. Pattern Recognit 27(10):1325–1344

    Article  Google Scholar 

  40. Leung CC, Chan KS, Chan HM, Tsui WK (2005) A new approach for image enhancement applied to low-contrast–low-illumination IC and document images. Pattern Recognit Lett 26:769–778

    Article  Google Scholar 

  41. Li S, Shen Q, Sun J (2007) Skew detection using wavelet decomposition and projection profile analysis. Pattern Recognit Lett 28:555–562

    Article  Google Scholar 

  42. Liang J, DeMenthon D, Doermann D (2008) Geometric rectification of camera-captured document images. IEEE Trans PAMI 30(4):591–605

    Article  Google Scholar 

  43. Likforman-Sulem L, Darbon J, Barney Smith EH (2011) Enhancement of historical printed document images by combining total variation regularization and Non-local Means filtering. Image Vis Comput J 29(5):351–363

    Article  Google Scholar 

  44. Liu H, Wua Q, Zha H, Liu X (2008) Skew detection for complex document images using robust borderlines in both text and non-text regions. Pattern Recognit Lett 29:1893–1900

    Article  Google Scholar 

  45. Lu Y, Tan CL (2003) A nearest-neighbor chain based approach to skew estimation in document images. Pattern Recognit Lett 24:2315–2323

    Article  Google Scholar 

  46. Lu SJ, Chen BM, Ko CC (2006) A partition approach for the restoration of camera images of planar and curled document. Image Vis Comput 24(8):837–848

    Article  Google Scholar 

  47. Makridis M, Nikolaou N, Papamarkos N (2010) An adaptive technique for global and local skew correction in color documents. Expert Syst Appl 37(10):6832–6843

    Article  Google Scholar 

  48. Masalovitch A, Mestetskiy L (2007) Usage of continuous skeletal image representation for document images de-warping. In: International workshop on camera-based document analysis and recognition, Curitiba, pp 45–53

    Google Scholar 

  49. Moghaddam RF, Cheriet M (2010) A variational approach to degraded document enhancement. IEEE Trans Pattern Anal Mach Intell 32(8):1347–1361

    Article  Google Scholar 

  50. Niblack W (1986) An introduction to digital image processing. Prentice-Hall, Englewood Cliffs, pp 115–116

    Google Scholar 

  51. Nomura S, Yamanaka K, Shiose T, Kawakami H, Katai O (2009) Morphological preprocessing method to thresholding degraded word images. Pattern Recognit Lett 30(8):729–744

    Article  Google Scholar 

  52. Obafemi-Ajayi T, Agam G, Frieder O (2010) Historical document enhancement using LUT classification. Int J Doc Anal Recognit 13:1–17

    Article  Google Scholar 

  53. O’Gorman L (1993) The document spectrum for page layout analysis. IEEE Trans Pattern Anal Mach Intell 15(11):1162–1173

    Article  Google Scholar 

  54. Otsu N (1979) A threshold selection method from Gray-level histograms. IEEE Trans Syst Man Cybern 9(1):377–393

    Article  Google Scholar 

  55. Pratt WK (2012) Digital image processing. Wiley, New York

    MATH  Google Scholar 

  56. Sauvola J, Pietikainen M (2000) Adaptive document image binarization. Pattern Recognit 33:225–236

    Article  Google Scholar 

  57. Sezgin M, Sankur B (2004) Survey over image thresholding techniques and quantitative performance evaluation. J Electron Imaging 13(1):146–165

    Article  Google Scholar 

  58. Shafait F, Breuel TM (2009) A simple and effective approach for border noise removal from document images. In: 13th IEEE international multi-topic conference, Islamabad, Dec 2009, pp 1–5

    Google Scholar 

  59. Shafait F, van Beusekom J, Keysers D, Breuel TM (2008) Document cleanup using page frame detection. Int J Doc Anal Recognit 11:81–96

    Article  Google Scholar 

  60. Singh C, Bhatia N, Kaur A (2008) Hough transform based fast skew detection and accurate skew correction methods. Pattern Recognit 41(12):3528–3546

    Article  Google Scholar 

  61. Solihin Y, Leedham C (1999) Integral ratio: a new class of global thresholding techniques for handwriting images. IEEE Trans Pattern Anal Mach Intell 21:761–768

    Article  Google Scholar 

  62. Stamatopoulos N, Gatos B, Kesidis A (2007) Automatic borders detection of camera document images. In: 2nd international workshop on camera-based document analysis and recognition (CBDAR’07), Curitiba, Sept 2007, pp 71–78

    Google Scholar 

  63. Stamatopoulos N, Gatos B, Pratikakis I, Perantonis SJ (2011) Goal-oriented rectification of camera-based document images. IEEE Trans Image Process 20(4):910–920

    Article  MathSciNet  Google Scholar 

  64. Su B, Lu S, Tan CL (2011) Combination of document image binarization techniques. In: 11th international conference on document analysis and recognition, Beijing, 18–21 Sept 2011

    Google Scholar 

  65. Tonazzini A (2010) Color space transformations for analysis and enhancement of ancient degraded manuscripts. Pattern Recognit Image Anal 20(3):404–417

    Article  Google Scholar 

  66. Trier ØD, Taxt T (1995) Evaluation of binarization methods for document images for document images. IEEE Trans Pattern Anal Mach Intell 17(3):312–315

    Article  Google Scholar 

  67. Tsai CM, Lee HJ (2002) Binarization of color document images via luminance and saturation color features. IEEE Trans Image Process 11(4):434–451

    Article  Google Scholar 

  68. Tseng YH, Lee HJ (2008) Document image binarization by two-stage block extraction and background intensity determination. Pattern Anal Appl 11:33–44

    Article  MathSciNet  Google Scholar 

  69. Ulges A, Lampert CH, Breuel TM (2005) Document image dewarping using robust estimation of curled text lines. In: 8th international conference on document analysis and recognition, Seoul, pp 1001–1005

    Google Scholar 

  70. van Beusekom J, Shafait F, Breuel TM (2010) Combined orientation and skew detection using geometric text-line modeling. Int J Doc Anal Recognit 13(2):79–92

    Article  Google Scholar 

  71. Vinciarelli A, Luettin J (2001) A new normalization technique for cursive handwritten words. Pattern Recognit Lett 22(9):1043–1050

    Article  Google Scholar 

  72. Vonikakis V, Andreadis I, Papamarkos N (2011) Robust document binarization with OFF center-surround cells. Pattern Anal Appl 14:219–234

    Article  MathSciNet  Google Scholar 

  73. Wahl FM, Wong KY, Casey RG (1982) Block segmentation and text extraction in mixed text/image documents. Comput Graph Image Process 20:375–390

    Article  Google Scholar 

  74. Wang B, Li XF, Liu F, Hu FQ (2005) Color text image binarization based on binary texture analysis. Pattern Recognit Lett 26:1650–1657

    Article  Google Scholar 

  75. Wu C, Agam G (2002) Document image De-warping for text/graphics recognition. In: Joint IAPR international workshop on structural, syntactic and statistical pattern recognition, Windsor, pp 348–357

    Google Scholar 

  76. Yan H (1993) Skew correction of document images using interline cross-correlation. Graph Models Image Process 55(6):538–543

    Article  Google Scholar 

  77. Yang Y, Yan H (2000) An adaptive logical method for binarization of degraded document images. Pattern Recognit 33:787–807

    Article  Google Scholar 

  78. Yin PY (2001) Skew detection and block classification of printed documents. Image Vis Comput 19(8):567–579

    Article  Google Scholar 

  79. Zhang L, Tan CL (2005) Warped image restoration with applications to digital libraries. In: 8th international conference on document analysis and recognition, Seoul, pp 192–196

    Google Scholar 

  80. Zhang L, Yip AM, Brown MS, Tan CL (2009) A unified framework for document restoration using in painting and shape-from-shading. Pattern Recognit J 42(11):2961–2978

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Basilis G. Gatos .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer-Verlag London

About this entry

Check for updates. Verify currency and authenticity via CrossMark

Cite this entry

Gatos, B.G. (2014). Imaging Techniques in Document Analysis Processes. In: Doermann, D., Tombre, K. (eds) Handbook of Document Image Processing and Recognition. Springer, London. https://doi.org/10.1007/978-0-85729-859-1_4

Download citation

Publish with us

Policies and ethics