Skip to main content

Handwritten Indic Script Identification – A Multi-level Approach

  • Conference paper
  • First Online:

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1031))

Abstract

Script identification is an emerging document analysis problem where we identify scripts type from multilingual documents. It is well known that there are 22 official languages in India and 11 scripts are used to write them. Traditional approaches for script identification consider all the scripts together and perform a classification at single level in brute force manner. In this paper, we propose a novel multi-level approach that separate 11 different scripts (Bangla, Devanagari, Gujarati, Gurumukhi, Kannada, Malayalam, Oriya, Roman, Tamil, Telugu & Urdu) from multi-script documents. A three-level hierarchy is followed during the grouping of different Indic scripts based on their structural similarities. The proposed approach not only performs well in terms of classification accuracy but also it shows more realistic way to separate multiple numbers of Indic scripts. We obtain an average script identification accuracy of 94.43% at individual script-level which is the encouraging observation of the current inherent complex problem.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Ghosh, D., Dube, T., Shivprasad, S.P.: Script recognition—a review. IEEE Trans. Pattern Anal. Mach. Intell. 32(12), 2142–2161 (2010)

    Article  Google Scholar 

  2. Singh, P.K., Sarkar, R., Nasipuri, M.: Offline script identification from multilingual indic-script documents: a state-of-the-art. Comput. Sci. Rev. 15–16, 1–28 (2015)

    Article  MathSciNet  Google Scholar 

  3. Obaidullah, S.M., Das, S.K., Roy, K.: A system for handwritten script identification from Indian document. J. Pattern Recogn. Res. 8, 1–12 (2013)

    Google Scholar 

  4. Hochberg, J., Bowers, K., Cannon, M., Kelly, P.: Script and language identication for handwritten document images. J. Doc. Anal. Recogn. 2(2/3), 45–52 (1999)

    Article  Google Scholar 

  5. Zhu, X., Li, Y.Y., Doermann, D.: Language identication for handwritten document images using a shape codebook. Pattern Recogn. 42, 3184–3191 (2009)

    Article  Google Scholar 

  6. Obaidullah S.M., Das, N., Roy, K.: Gabor filter based technique for offline indic script identification from handwritten document images. In: International Conference on Devices, Circuits and Communications, ICDCCom 2014, pp 1–6 (2014)

    Google Scholar 

  7. Hangarge, M., Santosh, K.C., Pardeshi, R.: Directional discrete cosine transform for handwritten script identification. In: Proceedings of the International Conference on Document Analysis and Recognition, ICDAR, pp 344–348 (2013)

    Google Scholar 

  8. Obaidullah, S.M., Karim, R., Shaikh, S., Halder, C., Das, N., Roy, K.: Transform based approach for Indic script identification from handwritten document images. In: 3rd International Conference on Signal Processing, Communications and Networking, pp 1–7 (2015)

    Google Scholar 

  9. Basu, S., Das, N., Sarkar, R., Kundu, M., Nasipuri, M., Basu, D.K.: A novel framework for automatic sorting of postal documents with multi-script address blocks. Pattern Recogn. 43(10), 3507–3521 (2010)

    Article  Google Scholar 

  10. Obaidullah, S.M., Halder, C., Das, N., Roy, K.: An approach for automatic Indic script identification from handwritten document images. In: 2nd Doctoral Symposium on Applied Computation and Security Systems, pp 37–51 (2015)

    Google Scholar 

  11. Singh, P.K., Mondal, A., Bhowmik, S., Sarkar, R., Nasipuri, M.: Word-Level Script Identification from Handwritten Multi-script Documents. In: Satapathy, S.C., Biswal, B.N., Udgata, Siba K., Mandal, J.K. (eds.) Proceedings of the 3rd International Conference on Frontiers of Intelligent Computing: Theory and Applications (FICTA) 2014. AISC, vol. 327, pp. 551–558. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-11933-5_62

    Chapter  Google Scholar 

  12. Pardeshi, R., Chaudhuri, B.B., Hangarge, M., Santosh, K.C.: Automatic handwritten Indian scripts identification. In: 14th International Conference on Frontiers in Handwriting Recognition, pp 375–380 (2014)

    Google Scholar 

  13. Obaidullah, S.M., Halder, C., Das, N., Roy, K.: Numeral script identification from handwritten document images. Proc. Comput. Sci. J. 54C, 585–594 (2015)

    Article  Google Scholar 

  14. Obaidullah, S.M., Roy, K., Das, N.: Comparison of different classifiers for script identification from handwritten document. In: IEEE International Conference on Signal Processing, Computing and Control, pp. 019–024 (2013)

    Google Scholar 

  15. Mandelbrot, B.B.: The Fractal Geometry of Nature. Freeman, NY (1982)

    MATH  Google Scholar 

  16. Jayara, M.A., Fleyeh, H.: Convex hulls in image processing: a scoping review. Sci. Acad. Publ. 6(2), 48–58 (2016)

    Google Scholar 

  17. Avis, D., Bremner, D., Seidel, R.: How good are convex hull algorithms? Comput. Geom. 7, 265–301 (1997)

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Subhasmita Ghosh .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ghosh, S. et al. (2019). Handwritten Indic Script Identification – A Multi-level Approach. In: Mandal, J., Mukhopadhyay, S., Dutta, P., Dasgupta, K. (eds) Computational Intelligence, Communications, and Business Analytics. CICBA 2018. Communications in Computer and Information Science, vol 1031. Springer, Singapore. https://doi.org/10.1007/978-981-13-8581-0_9

Download citation

  • DOI: https://doi.org/10.1007/978-981-13-8581-0_9

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-13-8580-3

  • Online ISBN: 978-981-13-8581-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics