Abstract.
In this paper we propose a complete methodology of printed text characterization for document labeling using texture features that have been inspired by a psychovisual approach. This approach considers visual human-based predicates to describe and identify text units according to their visual saliency and their perceptual attraction power on the reader’s eye. It supports a quick and robust process of functional labeling used to characterize text regions of document pages. The test databases are the Finland MTDB Oulu base J. Sauvola and H. Kauniskangas (1999) MediaTeam Document Database II, a CD-ROM document image collection, Oulu University, Finland that provides a great panel of document layouts and contents and our laboratory corpus that contains a large variety of composite documents (about 200 pages). The performance of the method gives very promising results.
Similar content being viewed by others
References
Bergler S, Suen CY, Nadal C, Nobile N, Waked B, Bloch A Logical block labeling for diverse types of document images. In: Proceedings of the conference on document layout interpretation and its applications, pp 231-235
Bres S (1994) Contributions á la quantification des critéres de transparence et d’anisoptropie par une approche globale. Application au contrôle de qualité de matériaux composites. PhD thesis: INSA de Lyon
Bruce V, Green PR (193) Visual perception: Physiology, psychology and ecology. Presse universitaire de Grenoble, Grenoble, France
Chetverikov D, Liang J, Komuves J, Haralick RM (1996) Zone classification using texture features. In: Proceedings of the 13th international conference on pattern recognition, 3:676-680
Doermann D, Rosenfeld A, Rivlin E (1997) The function of documents. In: Proceedings of the 4th international conference on document analysis and recognition, Ulm, Germany, 2:1077-1081
Eglin V (1998) Contribution á la structuration fonctionnelle des documents. PhD thesis, INSA de Lyon
Eglin V, Bres S, Emptoz H (1998) Printed text featuring using visual criteria of legibility and complexity. In: Proceedings of the 14th international conference on pattern recognition, Brisbane, Australia, August 1998, pp 942-944
Jain AK, Zhong Y (1996) Page segmentation using texture analysis. Pattern Recog 29(5):743-770
Jain AK, Yu B (1997) Page segmentation using document models. In: Proceedings of the 4th international conference on document analysis and recognition, 1:34-39
Jain AK, Bhattacharjee S (1992) Text segmentation using Gabor filters for automatic document processing. Mach Vision Appl 5(3):169-184
Julesz B, Bergen JR (1983) Textons, the fundamental elements in preattentive vision and the perception of textures. Bell Sys Tech J 62(6):1619-1645
Jung MC, Shin YC, Srihari SN (1999) Multifont classification using typographical attributes. In: Proceedings of the 3rd international conference on document analysis and recognition, pp 353-356
Le DX, Kim J, Pearson G, Thom GR (1999) Automated labeling of zones from scanned documents. In: Proceedings of SDIUT’99, pp 219-226
Liang J, Haralick R, Phillips I (1996) Document zone classification using sizes of connected components. In: Proceedings of Document Recognition III, SPIE 96, pp 150-157
Maderlechner G, Schreyer A, Suda P (1999) Information extraction from document images using attention based layout segmentation. In: Proceedings of the conference on document layout interpretation and its applications, pp 216-219
Maderlechner G, Suda P, Brucker T (1997) Classification of documents by form and content. Patt Recog Lett 18:1225-1231
Marr D (1982) Vision. Freeman, San Francisco
Nagy, G.: Twenty years of Document Image Analysis in PAMI. IEEE Trans Patt Anal Mach Intell 22(1):38-62
Randen T, Husoy H (1994) Segmentation of text/image documents using texture approaches. In: Proceedings of NOBIM, pp 60-67
Schreyer A, Maderlechner G, Suda P (1998) Font style detection using textons. In: Proceedings of Document Analysis System, pp 99-108
Sivaramakrishnan R, Phillips I, Ha J, Subramanium S, Haralick R (1995) Zone classification in a document using the method of feature vector generation. In: Proceedings of the 3rd international conference on document analysis and recognition, pp 541-544
Spitz AL (1997) Determination of the script and language content of document images. IEEE Trans Patt Anal Mach Intell 3(19):235-245
Strouthopoulos C, Papamarkos N (1998) Text identification for document image analysis using a neural network. Image Vision Comput 16:879-896
Suen CY, Bergler S, Nobile N, Waked B, Nadal CP, Bloch A (1998) Categorizing document images into script and language classes. In: Proceedings of the international conference on advances in pattern recognition, pp 297-306
Wang Y, Phillips IT, Haralick RM (2002) A method for document zone content classification. In: Proceedings of the international conference on pattern recognition, 3:196-199
Wong FWK, Casey R (1982) Block segmentation and text extraction in mixed text/image documents. Comput Graph Image Process 20:375-390
Wood S, Yao X, Krishnamurthi K, Dang L (1995) Language identification for printed text independent of segmentation. In: Proceedings of the international conference on image processing, pp 428-431
Zhu Y, Tan T, Wang Y (1999) Font recognition based on global texture analysis. In: Proceedings of the 5th international conference on document analysis and recognition, pp 349-352
Author information
Authors and Affiliations
Corresponding author
Additional information
Received: 6 December 2003, Accepted: 22 December 2003, Published online: 12 August 2004
Rights and permissions
About this article
Cite this article
Eglin, V., Bres, S. Analysis and interpretation of visual saliency for document functional labeling. IJDAR 7, 28–43 (2004). https://doi.org/10.1007/s10032-004-0127-2
Issue Date:
DOI: https://doi.org/10.1007/s10032-004-0127-2