Skip to main content

Word-Level Handwritten Script Identification from Multi-script Documents

  • Conference paper
  • First Online:
Recent Advances in Information Technology

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 266))

Abstract

In this paper, we present the directional discrete cosine transform (DCT) based rotation invariant features for word-level handwritten script identification. Our aim in this paper is two folds: one is to validate the effectiveness of the directional DCT (D-DCT) in extracting edge information of the studied word image and another is to provide rotation invariant property since conventional DCT (C-DCT) does not offer both issues. For each extracted word image, we compute DCT, its coefficient matrix and decompose into different directions such as horizontal, vertical, left and right diagonals plus mean and standard deviations of the decomposed components. These statistical features are then evaluated with hundreds of word images from six different scripts by using linear discriminant analysis (LDA) and achieved an accuracy of 97.35 % in average.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Belaid, A., Santhosh K.C., D’ Andeey, V.P.: Handwritten and printed text separation in real document, In: Proceedings of Machine Vision Applications, pp. 218–221 (2013)

    Google Scholar 

  2. D Ghosh, T.D., Shivaprasad, A.P.: Script recognition a review. IEEE Trans. Pattern Anal. Mach. Intell. 32(12), 2142–2161 (2010)

    Google Scholar 

  3. Fu, J., Zeng, B.: Directional discrete cosine transforms: A theoretical analysis. In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1, pp. 1105–1108 (2007)

    Google Scholar 

  4. Hangarge, M., Dhandra, B.V.: Offline handwritten script identification in document images. Int. J Comput. Appl. 4(6), 1–5 (2008)

    Google Scholar 

  5. Hangarge, M.,Santhos, K.C., Rajmohan, P.: Directional Discrete Cosine Transform for Handwritten Script Identification. In: Proceedings of International Conference on Document Analysis and Recognition, pp. 344–348 (2013)

    Google Scholar 

  6. Hochberg, J., Bowers, K., Cannon, M., Kelly, P.: Script and language identification for handwritten document images. Int. J. Doc. Anal. Recogn. 2(2–3), 45–52 (1999)

    Article  Google Scholar 

  7. Roy, K., Banerjee, A., Pal, U.: Word-wise hand-written script separation for Indian postal automation. In: Proceedings of International Workshop on Frontiers in Handwriting Recognition, pp. 521–526, (2006)

    Google Scholar 

  8. Zhou, L., Lu, Y., Tan, C.L.: Bangla/english script identification based on analysis of connected component profiles. In: Proceedings of International Conference on Document Analysis and Recognition, pp. 243–254 (2006)

    Google Scholar 

  9. Moussa, S.B., Zahour, A., BenAbdelhafid, A., Alimi, A.M.: Fractal-based system for Arabic/Latin, printed/handwritten script identification. In: Proceedings of International Conference on Pattern Recognition, pp.1–4 (2008)

    Google Scholar 

  10. Namboodiri, A., Jain, A.: Online handwritten script recognition. IEEE Trans. Pattern Anal. Mach. Intell. 26(1), 124–130 (2004)

    Article  Google Scholar 

  11. Pati, P.B., Ramakrishnan, A.G.: Word level multi-script identification. Phys. Rev. Lett. 29(9), 122–1218 (2008)

    Google Scholar 

  12. Pati, P.B., Ramakrishnan, A.G.: Hvs inspired system for script identification in Indian multi-script documents. In: Proceedings of International Conference on Document Analysis and Recognition, pp. 380–389 (2006)

    Google Scholar 

  13. Rajput, G.G., Anitha, B.H.: Handwritten script recognition using DCT and wavelet features at block level. Int. J Comput. Appl. 158–163 (2010)

    Google Scholar 

  14. Sarkar, R., Das, N., Basu, S., Kundu, M., Nasipuri, M., Basu, D.K.: Word level script identification from Bangla and Devanagri handwritten texts mixed with Roman script. J Comput 2(2), 103–108 (2010)

    Google Scholar 

  15. Tan, T.: Rotation invariant texture features and their use in automatic script identification. IEEE Trans. Pattern Anal. Mach. Intell. 20(7), 751–758 (1998)

    Article  Google Scholar 

  16. Zeng, B., Fu, J.: Directional discrete cosine transforms: a new framework for image coding. IEEE Trans. Circuits Syst. Video Technol. 18(3), 305–313 (2008)

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Mallikarjun Hangarge or K. C. Santosh .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer India

About this paper

Cite this paper

Hangarge, M., Santosh, K.C. (2014). Word-Level Handwritten Script Identification from Multi-script Documents. In: Biswas, G., Mukhopadhyay, S. (eds) Recent Advances in Information Technology. Advances in Intelligent Systems and Computing, vol 266. Springer, New Delhi. https://doi.org/10.1007/978-81-322-1856-2_6

Download citation

  • DOI: https://doi.org/10.1007/978-81-322-1856-2_6

  • Published:

  • Publisher Name: Springer, New Delhi

  • Print ISBN: 978-81-322-1855-5

  • Online ISBN: 978-81-322-1856-2

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics