ABSTRACT
Whenever one finds documents written on both sides on translucent paper there is a "back-to-front interference". The direct binarization of such documents yields unreadable documents. This paper presents a fast segmentation-based method for generating high-quality binarized images of documents with back-to-front interference. The proposed segmentation algorithm is based on the entropy of the histogram of the image.
- N. Abramson. Information Theory and Coding. McGraw-Hill Book Company, 1963.Google Scholar
- R. D. Lins et al. An Environment for Processing Images of Historical Documents. Microproc. & Microprogramming, pp. 111--121, North-Holland, 1995. Google ScholarDigital Library
- R. D. Lins, B. T. Ávila, and A. A. Formiga, BigBatch: An Environment for Processing Monochromatic Documents. ICIAR2006, LNCS 4142, pp.886--896, SpringerVerlag 2006. Google ScholarDigital Library
- R. D. Lins and J. M. M da Silva. A Quantitative Method for Assessing Algorithms to Remove Back-to-front Interference in Documents. ACM-SAC 2007, ACM Press, March 2007. Google ScholarDigital Library
- J. M. M. da Silva, R. D. Lins and V. C. da Rocha Jr. Binarizing and Filtering Historical Documents with Back-to-Front Interference, ACM-SAC 2006, Nancy, April 2006. Google ScholarDigital Library
- FUNDAJ: http://www.fundaj.gov.br.Google Scholar
Index Terms
- A fast algorithm to binarize and filter documents with back-to-front interference
Recommendations
Enhancing the Quality of Color Documents with Back-to-Front Interference
ICIAR '09: Proceedings of the 6th International Conference on Image Analysis and RecognitionBack-to-front, show-through, or bleeding are the names given to the overlapping interference whenever a document is written (or printed) on both sides of a translucent paper. Such interference makes more difficult, if not impossible, document ...
HistDoc - a toolbox for processing images of historical documents
ICIAR'10: Proceedings of the 7th international conference on Image Analysis and Recognition - Volume Part IIHistDoc is a software tool designed to process images of historical documents. It has two operation modes: standalone mode - one can process one image a time; and batch mode - one can process thousands of documents automatically. This tool automatically ...
A quantitative method for assessing algorithms to remove back-to-front interference in documents
SAC '07: Proceedings of the 2007 ACM symposium on Applied computingDocuments written on both sides on translucent paper make visible the ink from one side on the other. This artifact is called "back-to-front interference", "bleeding" or "show-through". The direct binarization of documents with such interference yields ...
Comments