Skip to main content
Log in

A generalised framework for script identification

  • ORIGINAL PAPER
  • Published:
International Journal of Document Analysis and Recognition (IJDAR) Aims and scope Submit manuscript

Abstract

Automatic identification of a script in a given document image facilitates many important applications such as automatic archiving of multilingual documents, searching online archives of document images and for the selection of script-specific OCR in a multi-lingual environment. In this paper, we model script identification as a texture classification problem and examine a global approach inspired by human visual perception. A generalised, hierarchical framework is proposed for script identification. A set of energy and intensity space features for this task is also presented. The framework serves to establish the utility of a global approach to the classification of scripts. The framework has been tested on two datasets: 10 Indian and 13 world scripts. The obtained accuracy of identification across the two datasets is above 94%. The results demonstrate that the framework can be used to develop solutions for script identification from document images across a large set of script classes.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Spitz A. (1997). Determination of the script and language content of document images. IEEE Trans. Pattern Anal. Mach. Intell. 19(3): 235–245

    Article  Google Scholar 

  2. Hochberg J., Kerns L., Kelly P., Thomas T. (1997). Automatic script identification from images using cluster-based templates. IEEE Trans. Pattern Anal. Mach. Intell. 19(2): 176–181

    Article  Google Scholar 

  3. Zhitao, X., Chengming, G., Ming, Y., Qiang, L.: Research on log Gabor wavelet and its application in image edge detection. In: Proceedings of 6th International Conference on Signal Processing 1, pp. 592–595 (2002)

  4. Wood S.L., Yao X., Krishnamurthi K., Dang L. (1995). Language identification for printed text independent of segmentation. Proc. Int. Conf. Image Process. 3: 428–431

    Article  Google Scholar 

  5. Tan T.N. (1998). Rotation invariant texture features and their use in automatic script identification. IEEE Trans. Pattern Anal. Mach. Intell. 20(7): 751–756

    Article  Google Scholar 

  6. Busch A., Boles W.W., Sridharan S. (2005). Texture for script identification. IEEE Trans. Pattern Anal. Mach. Intell. 27(11): 1720–1732

    Article  Google Scholar 

  7. Pal, U., Sinha, S., Chaudhuri, B.B.: Multi-script line identification from Indian document. In: 7th International Conference on Document Analysis and Recognition, vol. 2, pp. 880–884 (2003)

  8. Chaudhurym, S., Sheth, R.: Trainable script identification strategies for Indian languages. In: 5th International Conference on Document Analysis and Recognition, pp. 657–660 (1999)

  9. Chan, W., Sivaswamy, J.: Local energy analysis for text script classification. Proc. Image Vis. Comput. NZ (1999)

  10. Chan W., Coghill G.G. (2001). Text analysis using local energy. Pattern Recognit. 34(12): 2523–2532

    Article  MATH  Google Scholar 

  11. Digital Library of India. http://dli.iiit.ac.in/

  12. LIFI: Language identification from images. http://www.c3.lanl.gov/~kelly/LIFI/

  13. Samachar. http://www.samachar.com/

  14. Morrone M.C., Burr D.C. (1988). Feature detection in human vision: A phase-dependent energy model. Proc. R. Soc. Lond. Ser. B 235: 221–245

    Article  Google Scholar 

  15. PRTools: A Matlab toolbox for pattern recognition. http://www.prtools.org/

  16. Jain A.K., Zhong Y. (1996). Page segmentation using texture analysis. Pattern Recognit. 29: 743–770

    Article  Google Scholar 

  17. Duda R., Hart P., Stork D. (2001). Pattern classification. Wiley, New York

    MATH  Google Scholar 

  18. Randen, T., Husy, J.H.: Segmentation of text/image documents using texture approaches. In: Proceedings of NOBIM- Konferansen-94, pp. 60–67 (1994)

  19. Sun C., Si D. (1997). Skew and slant correction for document images using gradient direction. Proc. Doc. Anal. Recogn. 1: 142–146

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Gopal Datt Joshi.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Joshi, G.D., Garg, S. & Sivaswamy, J. A generalised framework for script identification. IJDAR 10, 55–68 (2007). https://doi.org/10.1007/s10032-007-0043-3

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10032-007-0043-3

Keywords

Navigation