Skip to main content

Reader System for Transliterate Handwritten Bilingual Documents

  • Conference paper
  • First Online:
  • 966 Accesses

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1037))

Abstract

India is a Multistate- Multilingual country. Most of the people in India used their state official language and English is treated as a binding language used for form filling or some official work. So there is a need to create a system which will convert the handwritten bilingual document into digitized form. This paper aims at development of reader system for handwritten bilingual (Marathi-English) documents by recognizing words. This facilitates many applications such as Natural language processing, School, Society, Banking, post office and Library automation. The proposed system is divided into two phases. The first phase focuses on recognition of handwritten bilingual words using two different feature extraction methods including combination of structural and statistical method and Histogram of Oriented Gradient Method. K-Nearest Neighbor classifier is used for recognition. This classifier gives 82.85% recognition accuracy using Histogram of Oriented Gradient method. The dataset containing 4390 words collected from more than 100 writers. The second phase focuses on digitization and transliteration of recognized words and conversion of transliterated text into speech, which is useful in the society for visually impaired people.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Balakrishnan, K., et al.: Offline handwritten recognition of Malayalam district name-a holistic approach. arXiv preprint arXiv:1705.00794 (2017)

  2. Belaïd, A., Santosh, K.C., d’Andecy, V.P.: Handwritten and printed text separation in real document. arXiv preprint arXiv:1303.4614 (2013)

  3. Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005, vol. 1, pp. 886–893. IEEE (2005)

    Google Scholar 

  4. Dhandra, B.V., Hangarge, M.: Morphological reconstruction for word level script identification. Int. J. Comput. Sci. Secur. (IJCSS) 1(1), 41–51 (2007)

    Google Scholar 

  5. Gatos, B., Pratikakis, I., Kesidis, A.L., Perantonis, S.J.: Efficient off-line cursive handwriting word recognition. In: Tenth International Workshop on Frontiers in Handwriting Recognition, Suvisoft (2006)

    Google Scholar 

  6. Kamble, P.M., Hegadi, R.S.: Handwritten Marathi basic character recognition using statistical method (2014)

    Google Scholar 

  7. Sandyal, K.S., Patel, M.S.: Offline handwritten Kannada word recognition. pp. 19–22 (2014)

    Google Scholar 

  8. Manoj Kumar, P., Chandran, S.: Handwritten Malayalam word recognition system using neural networks. Int. J. Eng. Res. Technol. 4, 90–99 (2015)

    Google Scholar 

  9. Obaidullah, S.M., Santosh, K.C., Das, N., Halder, C., Roy, K.: Handwritten Indic script identification in multi-script document images: a survey. Int. J. Pattern Recogn. Artif. Intell. 32(10), 1856012 (2018)

    Article  Google Scholar 

  10. Obaidullah, S.M., Santosh, K.C., Halder, C., Das, N., Roy, K.: Automatic Indic script identification from handwritten documents: page, block, line and word-level approach. Int. J. Mach. Learn. Cybern. 10, 1–20 (2017)

    Google Scholar 

  11. Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. 9(1), 62–66 (1979)

    Article  Google Scholar 

  12. Pal, U., Chaudhuri, B.B.: Automatic separation of words in multi-lingual multi-script Indian documents. In: ICDAR, p. 576. IEEE (1997)

    Google Scholar 

  13. Patel, M.S., Kumar, R., Linga Reddy, S.C.: Offline Kannada handwritten word recognition using locality preserving projection (LPP) for feature extraction. IJIRSET, 4(7) (2015)

    Google Scholar 

  14. Patel, M.S., Reddy, S.L., Naik, A.J.: An efficient way of handwritten English word recognition. In: Satapathy, S.C., Biswal, B.N., Udgata, S.K., Mandal, J.K. (eds.) Proceedings of the 3rd International Conference on Frontiers of Intelligent Computing: Theory and Applications (FICTA) 2014. AISC, vol. 328, pp. 563–571. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-12012-6_62

    Chapter  Google Scholar 

  15. Ansari, S., Patil, P.M.: A research survey of Devnagari handwritten word recognition. Int. J. Eng. Res. Technol. 2(10), 1010–1015 (2013)

    Google Scholar 

  16. Plamondon, R., Srihari, S.N.: Online and off-line handwriting recognition: a comprehensive survey. IEEE Trans. Pattern Anal. Mach. Intell. 22(1), 63–84 (2000)

    Article  Google Scholar 

  17. Shaikh, M.A., Dagade, M.R.: Offline recognition of handwritten devanagari words using hidden markov model. IJIRST, 1(11) (2015)

    Google Scholar 

  18. Shaw, B., Parui, S.K., Shridhar, M.: Offline handwritten Devanagari word recognition: a segmentation based approach. In: 2008 19th International Conference on Pattern Recognition, ICPR 2008, pp. 1–4. IEEE (2008)

    Google Scholar 

  19. Singh, B., Mittal, A., Ansari, M.A., Ghosh, D.: Handwritten Devanagari word recognition: a curvelet transform based approach. Int. J. Comput. Sci. Eng. 3(4), 1658–1665 (2011)

    Google Scholar 

  20. Student, R.V.: Off-line handwritten Kannada text recognition using support vector machine using zernike moments. IJCSNS 11(7), 128 (2011)

    Google Scholar 

  21. Zinjore, R.S., Ramteke, R.J., Pathak, V.M.: Segmentation of merged lines and script identification in handwritten bilingual documents. In: Proceedings of the 9th Annual Meeting of the Forum for Information Retrieval Evaluation, pp. 29–32. ACM (2017)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to R. J. Ramteke .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zinjore, R.S., Ramteke, R.J. (2019). Reader System for Transliterate Handwritten Bilingual Documents. In: Santosh, K., Hegadi, R. (eds) Recent Trends in Image Processing and Pattern Recognition. RTIP2R 2018. Communications in Computer and Information Science, vol 1037. Springer, Singapore. https://doi.org/10.1007/978-981-13-9187-3_16

Download citation

  • DOI: https://doi.org/10.1007/978-981-13-9187-3_16

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-13-9186-6

  • Online ISBN: 978-981-13-9187-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics