A neuro-fuzzy technique for document binarisation

Papamarkos, Nikos

doi:10.1007/s00521-003-0382-z

Original Article
Published: 08 November 2003

Volume 12, pages 190–199 (2003)

Neural Computing & Applications

Abstract

This paper proposes a new neuro-fuzzy technique suitable for binarisation or, more generally, colour reduction of digital documents. The proposed approach uses the image colour values together with additional local spatial features extracted from each pixel's neighbourhood. Both the colour values and the local feature values feed a Kohonen self-organised feature map (SOFM) neural network classifier. After training, the neurons of the output competition layer of the SOFM define a first approximation of the final classes. From the content of these classes, fuzzy membership functions are obtained, which are then used by the fuzzy C-means (FCM) algorithm to obtain the colours of the final document. The method can be applied to greyscale and colour documents, is suitable for improving blurred and badly illuminated documents, and can be easily modified to accommodate any type of spatial characteristics.
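
The two-stage pipeline described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the single local feature (a 3x3 neighbourhood mean), the two-neuron SOFM without a neighbourhood function (so it reduces to plain competitive learning), the use of the trained SOFM weights as initial FCM centres, and all function names and parameter values are assumptions made only for illustration.

```python
import numpy as np

def local_mean(img):
    """Assumed local spatial feature: 3x3 neighbourhood mean (edges replicated)."""
    p = np.pad(img, 1, mode="edge")
    h, w = img.shape
    return sum(p[i:i + h, j:j + w] for i in range(3) for j in range(3)) / 9.0

def train_sofm(x, k=2, epochs=5, lr0=0.5, seed=0):
    """Winner-take-all Kohonen-style training of k output neurons.

    Assumption: no neighbourhood function is used, so only the winning
    neuron is updated at each step.
    """
    rng = np.random.default_rng(seed)
    # Spread the initial weights between the feature-wise minimum and maximum.
    w = np.linspace(x.min(axis=0), x.max(axis=0), k)
    n_steps = epochs * len(x)
    step = 0
    for _ in range(epochs):
        for idx in rng.permutation(len(x)):
            lr = lr0 * (1.0 - step / n_steps)  # linearly decaying learning rate
            winner = np.argmin(((w - x[idx]) ** 2).sum(axis=1))
            w[winner] += lr * (x[idx] - w[winner])
            step += 1
    return w

def fcm(x, centres, m=2.0, iters=50, tol=1e-5):
    """Standard fuzzy C-means updates, started from the given centres."""
    c = centres.copy()
    for _ in range(iters):
        # Distances of every sample to every centre (small offset avoids 0).
        d = np.linalg.norm(x[:, None, :] - c[None, :, :], axis=2) + 1e-12
        u = d ** (-2.0 / (m - 1.0))
        u /= u.sum(axis=1, keepdims=True)  # memberships sum to 1 per sample
        um = u ** m
        new_c = (um.T @ x) / um.sum(axis=0)[:, None]
        converged = np.abs(new_c - c).max() < tol
        c = new_c
        if converged:
            break
    return c, u  # u holds the memberships from the last iteration

def binarise(img):
    """Greyscale image -> uint8 image containing only 0 and 255."""
    img = img.astype(float)
    # Feature vector per pixel: grey value plus the local mean.
    x = np.stack([img.ravel(), local_mean(img).ravel()], axis=1)
    w = train_sofm(x, k=2)      # first approximation of the classes
    c, u = fcm(x, w)            # refinement by fuzzy C-means
    labels = u.argmax(axis=1).reshape(img.shape)
    dark = np.argmin(c[:, 0])   # class whose centre has the lower grey value
    return np.where(labels == dark, 0, 255).astype(np.uint8)
```

Calling `binarise` on a 2-D NumPy array of grey levels returns an array of the same shape in which pixels of the darker class are 0 and all others are 255. Extending the sketch to colour reduction would mean using more than two neurons and mapping each pixel to its class centre rather than to black or white.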


Author information

Authors and Affiliations

Department of Electrical & Computer Engineering, Democritus University of Thrace, 67100 Xanthi, Greece
Nikos Papamarkos

Corresponding author

Correspondence to Nikos Papamarkos.

About this article

Cite this article

Papamarkos, N. A neuro-fuzzy technique for document binarisation. Neural Comput & Applic 12, 190–199 (2003). https://doi.org/10.1007/s00521-003-0382-z

Received: 08 March 2002
Accepted: 03 July 2003
Published: 08 November 2003
Issue Date: December 2003
DOI: https://doi.org/10.1007/s00521-003-0382-z
