Skip to main content
Log in

Improved linear density technique for segmentation in Arabic handwritten text recognition

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

This article has been updated

Abstract

The challenge in handwriting recognition, especially in the segmentation process, took the researchers’ attention. These Arabic handwritten text processes are a challenging job because their characters are generally both cursive and unconstrained. In this paper, a new segmentation technique is proposed for solving the problem of Arabic handwritten scripts, called ILDT. The proposed technique’s main objective is to use the word image’s vertical linear density for clarifying character boundaries and districting between characters. In the proposed method, three pre-processing steps are applied: fill close and open holes (missing circle), remove punctuation to clarify the area of ligature points and avoid characters overlapping, and crop the word image to remove excess white space. The goal of filling close and open holes is to increase the character’s pixel density and then apply the vertical linear density. The proposed technique calculates the distance histogram of vertical linear, aiming to discover local minima points to precisely determine the segmentation points. Several experiments were conducted, including elapsed CPU times and accuracies values. All comparative techniques are examined on a local benchmark database. The proposed method (ILDT) got almost all the best segmentation and recognition accuracy compared with other comparative methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20
Fig. 21
Fig. 22
Fig. 23
Fig. 24
Fig. 25
Fig. 26
Fig. 27
Fig. 28
Fig. 29
Fig. 30
Fig. 31
Fig. 32
Fig. 33
Fig. 34

Similar content being viewed by others

Change history

  • 02 April 2022

    The original version of this paper was updated to present the correct affiliation of Khalil H. A. Al-Shqeerat and his email address.

References

  1. Abualigah LMQ (2019) Feature selection and enhanced krill herd algorithm for text document clustering. Springer, Berlin

    Book  Google Scholar 

  2. Abualigah L, Diabat A (2021) Advances in sine cosine algorithm: a comprehensive survey. Artif Intell Rev 54:1–42

    Article  Google Scholar 

  3. Abualigah LM, Khader AT, Hanandeh ES (2018) Hybrid clustering analysis using improved krill herd algorithm. Appl Intell 48(11):4047–4071

    Article  Google Scholar 

  4. Abualigah L, Shehab M, Alshinwan M, Alabool H (2019) Salp swarm algorithm: a comprehensive survey. Neural Comput & Applic:1–21

  5. Abualigah L, Diabat A, Mirjalili S, Abd Elaziz M, Gandomi AH (2021) The arithmetic optimization algorithm. Comput Methods Appl Mech Eng 376:113609

    Article  MathSciNet  Google Scholar 

  6. Ahmad I, Fink GA (2019) Handwritten Arabic text recognition using multi-stage sub-core-shape HMMs. International Journal on Document Analysis and Recognition (IJDAR) 22(3):329–349

    Article  Google Scholar 

  7. Akhtar MS, Qureshi HA, Al-Quhayz H (2019) High quality wavelets features extraction for handwritten Arabic numerals recognition. International Journal on Advanced Science, Engineering and Information Technology 9(2):700–710

    Article  Google Scholar 

  8. Al Hamad HA (2012) Over-segmentation of handwriting Arabic scripts using an efficient heuristic technique. IEEE International Conference on Wavelet Analysis and Pattern Recognition (ICWAPR) 180–185

  9. Al Hamad HA (2013) Use an efficient neural network to improve the Arabic handwriting recognition. In: IEEE international conference on signal and image processing applications (ICSIPA)

    Google Scholar 

  10. Al Hamad HA (2013) Neural-based segmentation technique for Arabic handwriting scripts. In: 21st international conference on computer graphics, visualization and computer vision (WSCG 2013), Czech

    Google Scholar 

  11. Al Hamad HA (2015) Skew detection/correction and local minima/maxima techniques for extracting a new Arabic benchmark database. International Journal of Advanced Computer Science and Applications (IJACSA) 6(9):1–10

    Google Scholar 

  12. Al Hamad HA, Abu Zitar R (2010) Development of an efficient neural-based segmentation technique for Arabic handwriting recognition. Pattern Recognition journal 8(43):2773–2798

    Article  Google Scholar 

  13. Al Hamad HA, Hamdi-Cherif A (2012) The Arabic Center for Document Analysis and Recognition (ACDAR) - structure and perspective. In: European conference of COMPUTER SCIENCE (ECCS '12), pp 85–91

    Google Scholar 

  14. Al-Dmour A, Fraij F (2014) Segmenting Arabic handwritten documents into text lines and words. International journal of Advancements in Computing technology 6(3):109

    Google Scholar 

  15. Ali AAA, Suresha M (2019) A novel features and classifiers fusion technique for recognition of Arabic handwritten character script. SN Applied Sciences 1(10):1286

    Article  Google Scholar 

  16. Ali AAA, Suresha M (2019) Efficient algorithms for text lines and words segmentation for recognition of Arabic handwritten script. In: Emerging research in computing, information, communication and applications. Springer, Singapore, pp 387–401

    Chapter  Google Scholar 

  17. Ali AAA, Suresha M (2019) An efficient character segmentation algorithm for recognition of Arabic handwritten script. In: 2019 international conference on data Science and communication (IconDSC). IEEE, pp 1–6

    Google Scholar 

  18. AlKhateeb JH, Jiang J, Ren J, Ipson SS (2008) Component-based segmentation of words from handwritten Arabic text. In: Proceedings of world academy of science, engineering and technology, ISSN, vol 31, pp 1307–6884

    Google Scholar 

  19. Ashiquzzaman A, Tushar AK, Rahman A, Mohsin F (2019) An efficient recognition method for handwritten Arabic numerals using CNN with data augmentation and dropout. In: Data management, analytics and innovation. Springer, Singapore, pp 299–309

    Chapter  Google Scholar 

  20. Daifallah, K., Zarka, N., & Jamous, H. (2009). Recognition-based segmentation algorithm for on-line arabic handwriting. In 2009 10th international conference on document analysis and recognition (pp. 886-890). IEEE.

  21. Hadjadji B, Chibani Y, Nemmour H (2019) Hybrid one-class classifier ensemble based on fuzzy integral for open-lexicon handwritten Arabic word recognition. Pattern Anal Applic 22(1):99–113

    Article  MathSciNet  Google Scholar 

  22. Hamad HA (2016) Effect of the classifier training set size on accuracy of pattern recognition. Int J Eng Res Dev 12(11):24–31

    Google Scholar 

  23. Hamid A, Haraty R (2001) A neuro-Heuristic approach for segmenting handwritten Arabic text. In: ACS/IEEE international conference on computer systems and applications, p 0110

    Chapter  Google Scholar 

  24. Athoillah M, Putri RK (2019) Handwritten arabic numeral character recognition using multi kernel support vector machine. KINETIK: Game technology, information system, computer network, computing, electronics, and control, 99–106

  25. Jemni SK, Kessentini Y, Kanoun S (2019) Out of vocabulary word detection and recovery in Arabic handwritten text recognition. Pattern Recogn 93:507–520

    Article  Google Scholar 

  26. Khorsheed MS (2003) Recognising handwritten Arabic manuscripts using a single hidden Markov model. Pattern Recogn Lett 24(14):2235–2242

    Article  Google Scholar 

  27. Lorigo LM, Govindaraju V (2006) Offline Arabic handwriting recognition: a survey. IEEE Trans Pattern Anal Mach Intell 28(5):712–724

    Article  Google Scholar 

  28. Maliki M, Jassim S, Al-Jawad N, Sellahewa H (2012) Arabic handwritten: pre-processing and segmentation. In: Mobile multimedia/image processing, security, and applications 2012 (Vol. 8406, p. 84060D). International society for optics and photonics

    Google Scholar 

  29. Parvez MT, Mahmoud SA (2013) Offline Arabic handwritten text recognition: a survey. ACM Computing Surveys (CSUR) 45(2):23–35

    Article  Google Scholar 

  30. Shatnawi M (2015) Off-line handwritten Arabic character recognition: a survey. In: Proceedings of the international conference on image processing, computer vision, and pattern recognition (IPCV) (p. 52). The Steering Committee of The World Congress in Computer Science, Computer Engineering and Applied Computing (WorldComp)

  31. Shehab M, Abualigah L, Al Hamad H, Alabool H, Alshinwan M, Khasawneh AM (2019) Moth–flame optimization algorithm: variants and applications. Neural Comput & Applic:1–26

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Laith Abualigah.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Al Hamad, H.A., Abualigah, L., Shehab, M. et al. Improved linear density technique for segmentation in Arabic handwritten text recognition. Multimed Tools Appl 81, 28531–28558 (2022). https://doi.org/10.1007/s11042-022-12717-2

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-022-12717-2

Keywords

Navigation