Skip to main content
Log in

Analysis and interpretation of visual saliency for document functional labeling

  • Published:
Document Analysis and Recognition Aims and scope Submit manuscript

Abstract.

In this paper we propose a complete methodology of printed text characterization for document labeling using texture features that have been inspired by a psychovisual approach. This approach considers visual human-based predicates to describe and identify text units according to their visual saliency and their perceptual attraction power on the reader’s eye. It supports a quick and robust process of functional labeling used to characterize text regions of document pages. The test databases are the Finland MTDB Oulu base J. Sauvola and H. Kauniskangas (1999) MediaTeam Document Database II, a CD-ROM document image collection, Oulu University, Finland that provides a great panel of document layouts and contents and our laboratory corpus that contains a large variety of composite documents (about 200 pages). The performance of the method gives very promising results.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  1. Bergler S, Suen CY, Nadal C, Nobile N, Waked B, Bloch A Logical block labeling for diverse types of document images. In: Proceedings of the conference on document layout interpretation and its applications, pp 231-235

  2. Bres S (1994) Contributions á la quantification des critéres de transparence et d’anisoptropie par une approche globale. Application au contrôle de qualité de matériaux composites. PhD thesis: INSA de Lyon

  3. Bruce V, Green PR (193) Visual perception: Physiology, psychology and ecology. Presse universitaire de Grenoble, Grenoble, France

  4. Chetverikov D, Liang J, Komuves J, Haralick RM (1996) Zone classification using texture features. In: Proceedings of the 13th international conference on pattern recognition, 3:676-680

  5. Doermann D, Rosenfeld A, Rivlin E (1997) The function of documents. In: Proceedings of the 4th international conference on document analysis and recognition, Ulm, Germany, 2:1077-1081

  6. Eglin V (1998) Contribution á la structuration fonctionnelle des documents. PhD thesis, INSA de Lyon

  7. Eglin V, Bres S, Emptoz H (1998) Printed text featuring using visual criteria of legibility and complexity. In: Proceedings of the 14th international conference on pattern recognition, Brisbane, Australia, August 1998, pp 942-944

  8. Jain AK, Zhong Y (1996) Page segmentation using texture analysis. Pattern Recog 29(5):743-770

    Google Scholar 

  9. Jain AK, Yu B (1997) Page segmentation using document models. In: Proceedings of the 4th international conference on document analysis and recognition, 1:34-39

  10. Jain AK, Bhattacharjee S (1992) Text segmentation using Gabor filters for automatic document processing. Mach Vision Appl 5(3):169-184

    Google Scholar 

  11. Julesz B, Bergen JR (1983) Textons, the fundamental elements in preattentive vision and the perception of textures. Bell Sys Tech J 62(6):1619-1645

    Google Scholar 

  12. Jung MC, Shin YC, Srihari SN (1999) Multifont classification using typographical attributes. In: Proceedings of the 3rd international conference on document analysis and recognition, pp 353-356

  13. Le DX, Kim J, Pearson G, Thom GR (1999) Automated labeling of zones from scanned documents. In: Proceedings of SDIUT’99, pp 219-226

  14. Liang J, Haralick R, Phillips I (1996) Document zone classification using sizes of connected components. In: Proceedings of Document Recognition III, SPIE 96, pp 150-157

  15. Maderlechner G, Schreyer A, Suda P (1999) Information extraction from document images using attention based layout segmentation. In: Proceedings of the conference on document layout interpretation and its applications, pp 216-219

  16. Maderlechner G, Suda P, Brucker T (1997) Classification of documents by form and content. Patt Recog Lett 18:1225-1231

    Google Scholar 

  17. Marr D (1982) Vision. Freeman, San Francisco

  18. Nagy, G.: Twenty years of Document Image Analysis in PAMI. IEEE Trans Patt Anal Mach Intell 22(1):38-62

  19. Randen T, Husoy H (1994) Segmentation of text/image documents using texture approaches. In: Proceedings of NOBIM, pp 60-67

  20. Schreyer A, Maderlechner G, Suda P (1998) Font style detection using textons. In: Proceedings of Document Analysis System, pp 99-108

  21. Sivaramakrishnan R, Phillips I, Ha J, Subramanium S, Haralick R (1995) Zone classification in a document using the method of feature vector generation. In: Proceedings of the 3rd international conference on document analysis and recognition, pp 541-544

  22. Spitz AL (1997) Determination of the script and language content of document images. IEEE Trans Patt Anal Mach Intell 3(19):235-245

    Google Scholar 

  23. Strouthopoulos C, Papamarkos N (1998) Text identification for document image analysis using a neural network. Image Vision Comput 16:879-896

    Google Scholar 

  24. Suen CY, Bergler S, Nobile N, Waked B, Nadal CP, Bloch A (1998) Categorizing document images into script and language classes. In: Proceedings of the international conference on advances in pattern recognition, pp 297-306

  25. Wang Y, Phillips IT, Haralick RM (2002) A method for document zone content classification. In: Proceedings of the international conference on pattern recognition, 3:196-199

  26. Wong FWK, Casey R (1982) Block segmentation and text extraction in mixed text/image documents. Comput Graph Image Process 20:375-390

    Google Scholar 

  27. Wood S, Yao X, Krishnamurthi K, Dang L (1995) Language identification for printed text independent of segmentation. In: Proceedings of the international conference on image processing, pp 428-431

  28. Zhu Y, Tan T, Wang Y (1999) Font recognition based on global texture analysis. In: Proceedings of the 5th international conference on document analysis and recognition, pp 349-352

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to V. Eglin.

Additional information

Received: 6 December 2003, Accepted: 22 December 2003, Published online: 12 August 2004

Rights and permissions

Reprints and permissions

About this article

Cite this article

Eglin, V., Bres, S. Analysis and interpretation of visual saliency for document functional labeling. IJDAR 7, 28–43 (2004). https://doi.org/10.1007/s10032-004-0127-2

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10032-004-0127-2

Keywords:

Navigation