Skip to main content
Log in

Meitei Mayek handwritten dataset: compilation, segmentation, and character recognition

  • Original Article
  • Published:
The Visual Computer Aims and scope Submit manuscript

Abstract

A peculiar Indian Script Meitei Mayek has experienced a resurgence in the last few years and gets very little attention in handwriting research due to recently insurgence and limited sources. The objective of this paper is two folds; firstly, develop two different datasets: Mayek27 having 4900 isolated Meitei Mayek alphabets and MM (Meitei Mayek) dataset of 189 full-length handwritten text page. Secondly, develop a recognition system on the Mayek27 dataset using convolutional neural network and segmentation algorithms (text-lines, words, and characters) on the full-length Meitei Mayek handwritten text. A recognition rate of \(99.02\%\) is achieved using three layers of convolutional layers with a filter size of \(3 \times 3\) with 16, 32, and 96 kernels. In MM text dataset, the text-line and word segmentation are performed concurrently on 809 lines by tracking space between lines in a novel approach based on horizontal projection histogram and monitoring vertical projection histogram along the run-length of segmentation. Various constraints like skew, curve, close, and touching text-lines are incorporated, and the segmentation algorithm results are 91.84% and 88.96% for text-line and word, respectively. Furthermore, characters are segmented by headline removal, and connected component analysis achieves an accuracy of 91.12%.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18

Similar content being viewed by others

References

  1. Wang, D.-H., Liu, C.-L., Zhou, X.-D.: An approach for real-time recognition of online Chinese handwritten sentences. Pattern Recognit. 45(10), 3661–3675 (2012)

    Article  Google Scholar 

  2. Wang, Q.-F., Yin, F., Liu, C.-L.: Handwritten Chinese text recognition by integrating multiple contexts. IEEE Trans. Pattern Anal. Mach. Intell. 34(8), 1469–1481 (2011)

    Article  Google Scholar 

  3. Zhou, X.-D., Wang, D.-H., Tian, F., Liu, C.-L., Nakagawa, M.: Handwritten Chinese/Japanese text recognition using semi-Markov conditional random fields. IEEE Trans. Pattern Anal. Mach. Intell. 35(10), 2413–2426 (2013)

    Article  Google Scholar 

  4. Zhou, X.-D., Zhang, Y.-M., Tian, F., Wang, H.-A., Liu, C.-L.: Minimum-risk training for semi-Markov conditional random fields with application to handwritten Chinese/Japanese text recognition. Pattern Recognit. 47(5), 1904–1916 (2014)

    Article  Google Scholar 

  5. Wu, Y.-C., Yin, F., Liu, C.-L.: Improving handwritten Chinese text recognition using neural network language models and convolutional neural network shape models. Pattern Recognit. 65, 251–264 (2017)

    Article  Google Scholar 

  6. Laishram, R., Singh, A.U., Singh, N.C., Singh, A.S., James, H.: Simulation and modeling of handwritten Meitei Mayek digits using neural network approach. In: Proceedings from the International Conference Advances in Electronics, Electrical and Computer Science Engineering-EEC, pp. 355–358 (2012)

  7. Laishram, R., Singh, P.B., Singh, T.S.D., Anilkumar, S., Singh, A.U.: A neural network based handwritten Meitei Mayek alphabet optical character recognition system. In: 2014 IEEE International Conference on Computational Intelligence and Computing Research, pp. 1–5. IEEE (2014)

  8. Surinta, O., Schomaker, L., Wiering, M.: A comparison of feature and pixel-based methods for recognizing handwritten bangla digits. In: 2013 12th International Conference on Document Analysis and Recognition. IEEE (2013)

  9. Hassan, T., Khan, H.A.: Handwritten bangla numeral recognition using local binary pattern. In: 2015 International Conference on Electrical Engineering and Information Communication Technology (ICEEICT), pp. 1–4. IEEE (2015)

  10. Ilmi, N., Budi, W.T.A., Nur, R.K.: Handwriting digit recognition using local binary pattern variance and K-nearest neighbor classification. In: 2016 4th International Conference on Information and Communication Technology (ICoICT), pp. 1–5. IEEE (2016)

  11. Inunganbi, S., Seal, A., Khanna, P.: Classification of food images through interactive image segmentation. In: Asian Conference on Intelligent Information and Database Systems, pp. 519–528. Springer, Cham (2018)

  12. Kabbai, L., Abdellaoui, M., Douik, A.: Image classification by combining local and global features. Vis. Comput. 35(5), 679–693 (2019)

    Article  Google Scholar 

  13. Ghosh, S., Barman, U., Bora, P.K., Singh, T.H., Chaudhuri, B.B.: An OCR system for the Meetei Mayek script. In: 2013 Fourth National Conference on Computer Vision, Pattern Recognition, Image Processing and Graphics (NCVPRIPG), pp. 1–4. IEEE (2013)

  14. Kumar, C.J., Kalita, S.K.: Recognition of handwritten numerals of Manipuri script. Int. J. Comput. Appl. 84(17), 1–5 (2013)

    Google Scholar 

  15. Hammouda, G., Sellami, D., Hammouda, A.: Pattern recognition based on compound complex shape-invariant Radon transform. Vis. Comput. (2018). https://doi.org/10.1007/s00371-018-1604-9

    Article  Google Scholar 

  16. Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)

    Article  Google Scholar 

  17. Lim, J., Lee, K.: 3D object recognition using scale-invariant features. Vis. Comput. 35(1), 71–84 (2019)

    Article  Google Scholar 

  18. Mohammadi, S., Maleki, R.: Air-writing recognition system for Persian numbers with a novel classifier. Vis. Comput. (2019). https://doi.org/10.1007/s00371-019-01717-3

    Article  Google Scholar 

  19. Mahmoud, S.A.: Arabic character recognition using Fourier descriptors and character contour encoding. Pattern Recognit. 27(6), 815–824 (1994)

    Article  Google Scholar 

  20. Shridhar, M., Badreldin, A.: High accuracy character recognition algorithm using Fourier and topological descriptors. Pattern Recognit. 17(5), 515–524 (1984)

    Article  Google Scholar 

  21. Mowlaei, A., Faez, K., Haghighat, A.T.: Feature extraction with wavelet transform for recognition of isolated handwritten Farsi/Arabic characters and numerals. In: 2002 14th International Conference on Digital Signal Processing Proceedings. DSP 2002 (Cat. No. 02TH8628), vol. 2, pp. 923–926. IEEE (2002)

  22. Liu, C.-L., Koga, M., Fujisawa, H.: Gabor feature extraction for character recognition: comparison with gradient feature. In: Eighth International Conference on Document Analysis and Recognition (ICDAR’05), pp. 121–125. IEEE (2005)

  23. Maring, K.A., Dhir, R.: Recognition of cheising iyek/eeyek-Manipuri digits using support vector machines. Ijcsit 1(2) (2014)

  24. Stockwell, R.G., Mansinha, L., Lowe, R.P.: Localization of the complex spectrum: the S transform. IEEE Trans. Signal Process. 44(4), 998–1001 (1996)

    Article  Google Scholar 

  25. Mansinha, L., Stockwell, R.G., Lowe, R.P.: Pattern analysis with two-dimensional spectral localisation: applications of two-dimensional S transforms. Phys. A Stat. Mech. Appl. 239(1–3), 286–295 (1997)

    Article  Google Scholar 

  26. Badrinath, G.S., Gupta, P.: Stockwell transform based palm-print recognition. Appl. Soft Comput. 11(7), 4267–4281 (2011)

    Article  Google Scholar 

  27. Dash, K.S., Puhan, N.B., Panda, G.: Handwritten numeral recognition using non-redundant Stockwell transform and bio-inspired optimal zoning. IET Image Process. 9(10), 874–882 (2015)

    Article  Google Scholar 

  28. Drabycz, S., Stockwell, R.G., Mitchell, J.R.: Image texture characterization using the discrete orthonormal S-transform. J. Digit. Imaging 22(6), 696 (2009)

    Article  Google Scholar 

  29. Bianne-Bernard, A.-L., Menasri, F., Mohamad, R.A.-H., Mokbel, C., Kermorvant, C., Likforman-Sulem, L.: Dynamic and contextual information in HMM modeling for handwritten word recognition. IEEE Trans. Pattern Anal. Mach. Intell. 33(10), 2066–2080 (2011)

    Article  Google Scholar 

  30. Thokchom, T., Bansal, P.K., Vig, R., Bawa, S.: Recognition of handwritten character of manipuri script. JCP 5(10), 1570–1574 (2010)

    Google Scholar 

  31. Pirlo, G., Impedovo, D.: Fuzzy-zoning-based classification for handwritten characters. IEEE Trans. Fuzzy Syst. 19(4), 780–785 (2011)

    Article  Google Scholar 

  32. Pirlo, G., Impedovo, D.: Adaptive membership functions for handwritten character recognition by voronoi-based image zoning. IEEE Trans. Image Process. 21(9), 3827–3837 (2012)

    Article  MathSciNet  Google Scholar 

  33. Schmidhuber, J.: Deep learning in neural networks: an overview. Neural Netw. 61, 85–117 (2015)

    Article  Google Scholar 

  34. LeCun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature 521(7553), 436 (2015)

    Article  Google Scholar 

  35. Fukushima, K.: Neocognitron: a self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position. Biol. Cybern. 36(4), 193–202 (1980)

    Article  Google Scholar 

  36. LeCun, Y., et al.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)

    Article  Google Scholar 

  37. Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst. 25, 1106–1114 (2012)

    Google Scholar 

  38. Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)

  39. Kahan, S., Pavlidis, T., Baird, H.S.: On the recognition of printed characters of any font and size. IEEE Trans. Pattern Anal. Mach. Intell. 2, 274–288 (1987)

    Article  Google Scholar 

  40. dos Santos, R.P., Clemente, G.S., Ren, T.I., Cavalcanti, G.D.C.: Text line segmentation based on morphology and histogram projection. In: 2009 10th International Conference on Document Analysis and Recognition, pp. 651–655. IEEE (2009)

  41. Zahour, A., Taconet, B., Mercy, P., Ramdane, S.: Arabic hand-written text-line extraction. In: Proceedings of Sixth International Conference on Document Analysis and Recognition, pp. 281–285. IEEE (2001)

  42. Nguyen, K.C., Nakagawa, M.: Text-line and character segmentation for offline recognition of handwritten japanese text. IEICE Tech. Rep. 115(517), 53–58 (2016)

    Google Scholar 

  43. Zahour, A., Likforman-Sulem, L., Boussellaa, W., Taconet, B.: Text line segmentation of historical arabic documents. In: Ninth International Conference on Document Analysis and Recognition (ICDAR 2007), vol. 1, pp. 138–142. IEEE (2007)

  44. Nagy, G., Seth, S., Viswanathan, M.: A prototype document image analysis system for technical journals. Computer 25(7), 10–22 (1992)

    Article  Google Scholar 

  45. He, J., Downton, A.C.: User-assisted archive document image analysis for digital library construction. In: Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings, pp. 498–502. IEEE (2003)

  46. Pal, U., Datta, S.: Segmentation of Bangla unconstrained handwritten text. In: Null, p. 1128. IEEE (2003)

  47. Arivazhagan, M., Srinivasan, H., Srihari, S.: A statistical approach to line segmentation in handwritten documents. In: Document Recognition and Retrieval XIV, vol. 6500, p. 65000T. International Society for Optics and Photonics (2007)

  48. Su, T.-H., Zhang, T.-W., Huang, H.-J., Zhou, Y.: Skew detection for Chinese handwriting by horizontal stroke histogram. In: Ninth International Conference on Document Analysis and Recognition (ICDAR 2007), vol. 2, pp. 899–903. IEEE (2007)

  49. Weliwitage, C., Harvey, A.L., Jennings, A.B.: Handwritten document offline text line segmentation. In: Digital Image Computing: Techniques and Applications (DICTA’05), pp. 27–27. IEEE (2005)

  50. O’Gorman, L.: The document spectrum for page layout analysis. IEEE Trans. Pattern Anal. Mach. Intell. 15(11), 1162–1173 (1993)

    Article  Google Scholar 

  51. Kise, K., Sato, A., Iwata, M.: Segmentation of page images using the area Voronoi diagram. Comput. Vis. Image Underst. 70(3), 370–382 (1998)

    Article  Google Scholar 

  52. Marti, U.-V., Bunke, H.: Text line segmentation and word recognition in a system for general writer independent handwriting recognition. In: Proceedings of Sixth International Conference on Document Analysis and Recognition, pp. 159–163. IEEE (2001)

  53. Li, Y., Zheng, Y., Doermann, D., Jaeger, S.: Script-independent text line segmentation in freestyle handwritten documents. IEEE Trans. Pattern Anal. Mach. Intell. 30(8), 1313–1329 (2008)

    Article  Google Scholar 

  54. Jindal, P., Jindal, B.: Line and word segmentation of handwritten text documents written in Gurmukhi script using mid point detection technique. In: 2015 2nd International Conference on Recent Advances in Engineering & Computational Sciences (RAECS), pp. 1–6. IEEE (2015)

  55. Saha, S., Basu, S., Nasipuri, M., Basu, D.K.: A Hough transform based technique for text segmentation. arXiv preprint arXiv:1002.4048 (2010)

  56. Louloudis, G., Gatos, B., Pratikakis, I., Halatsis, C.: Text line and word segmentation of handwritten documents. Pattern Recognit. 42(12), 3169–3183 (2009)

    Article  Google Scholar 

  57. Ryu, J., Koo, H.I., Cho, N.I.: Word segmentation method for handwritten documents based on structured learning. IEEE Signal Process. Lett. 22(8), 1161–1165 (2015)

    Article  Google Scholar 

  58. Javed, M., Nagabhushan, P., Chaudhuri, B.B.: Extraction of line-word-character segments directly from run-length compressed printed text-documents. In: 2013 Fourth National Conference on Computer Vision, Pattern Recognition, Image Processing and Graphics (NCVPRIPG), pp. 1–4. IEEE (2013)

  59. Inunganbi, S., Choudhary, P.: Recognition of handwritten Meitei Mayek script based on texture feature. Int. J. Natural Lang. Comput. (IJNLC) 7(5), 99–108 (2018)

    Google Scholar 

  60. Inunganbi, S.C., Choudhary, P.: Recognition of handwritten Meitei Mayek and English alphabets using combination of spatial features. In: International Conference on Intelligent Systems Design and Applications. Springer, Cham (2018)

  61. Kumar, C.J., Kalita, S.K., Sharma, U.: Recognition of Meetei Mayek characters using hybrid feature generated from distance profile and background directional distribution with support vector machine classifier. In: 2015 Communication, Control and Intelligent Systems (CCIS), pp. 186–189. IEEE (2015)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sanasam Inunganbi.

Ethics declarations

Conflict of interest

All authors have participated in (a) conception and design, or analysis and interpretation of the data; (b) drafting the article or revising it critically for relevant intellectual content; and (c) approval of the final version. The stated authors have written the article, and the work is original which is not published elsewhere. This manuscript has not been submitted to, nor is under review at, another journal or other publishing venue. The authors have no affiliation with any organization with a direct or indirect financial interest in the subject matter discussed in the manuscript. The below authors have affiliations with organizations with a direct or indirect financial interest in the subject matter discussed in the manuscript: Sanasam Inunganbi (National Institute of Technology, Manipur) Prakash Choudhary (National Institute of Technology, Hamirpur) Khumanthem Manglem Singh (National Institute of Technology, Manipur)

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Inunganbi, S., Choudhary, P. & Manglem, K. Meitei Mayek handwritten dataset: compilation, segmentation, and character recognition. Vis Comput 37, 291–305 (2021). https://doi.org/10.1007/s00371-020-01799-4

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00371-020-01799-4

Keywords

Navigation