Abstract
The challenge in handwriting recognition, especially in the segmentation process, took the researchers’ attention. These Arabic handwritten text processes are a challenging job because their characters are generally both cursive and unconstrained. In this paper, a new segmentation technique is proposed for solving the problem of Arabic handwritten scripts, called ILDT. The proposed technique’s main objective is to use the word image’s vertical linear density for clarifying character boundaries and districting between characters. In the proposed method, three pre-processing steps are applied: fill close and open holes (missing circle), remove punctuation to clarify the area of ligature points and avoid characters overlapping, and crop the word image to remove excess white space. The goal of filling close and open holes is to increase the character’s pixel density and then apply the vertical linear density. The proposed technique calculates the distance histogram of vertical linear, aiming to discover local minima points to precisely determine the segmentation points. Several experiments were conducted, including elapsed CPU times and accuracies values. All comparative techniques are examined on a local benchmark database. The proposed method (ILDT) got almost all the best segmentation and recognition accuracy compared with other comparative methods.
Similar content being viewed by others
Change history
02 April 2022
The original version of this paper was updated to present the correct affiliation of Khalil H. A. Al-Shqeerat and his email address.
References
Abualigah LMQ (2019) Feature selection and enhanced krill herd algorithm for text document clustering. Springer, Berlin
Abualigah L, Diabat A (2021) Advances in sine cosine algorithm: a comprehensive survey. Artif Intell Rev 54:1–42
Abualigah LM, Khader AT, Hanandeh ES (2018) Hybrid clustering analysis using improved krill herd algorithm. Appl Intell 48(11):4047–4071
Abualigah L, Shehab M, Alshinwan M, Alabool H (2019) Salp swarm algorithm: a comprehensive survey. Neural Comput & Applic:1–21
Abualigah L, Diabat A, Mirjalili S, Abd Elaziz M, Gandomi AH (2021) The arithmetic optimization algorithm. Comput Methods Appl Mech Eng 376:113609
Ahmad I, Fink GA (2019) Handwritten Arabic text recognition using multi-stage sub-core-shape HMMs. International Journal on Document Analysis and Recognition (IJDAR) 22(3):329–349
Akhtar MS, Qureshi HA, Al-Quhayz H (2019) High quality wavelets features extraction for handwritten Arabic numerals recognition. International Journal on Advanced Science, Engineering and Information Technology 9(2):700–710
Al Hamad HA (2012) Over-segmentation of handwriting Arabic scripts using an efficient heuristic technique. IEEE International Conference on Wavelet Analysis and Pattern Recognition (ICWAPR) 180–185
Al Hamad HA (2013) Use an efficient neural network to improve the Arabic handwriting recognition. In: IEEE international conference on signal and image processing applications (ICSIPA)
Al Hamad HA (2013) Neural-based segmentation technique for Arabic handwriting scripts. In: 21st international conference on computer graphics, visualization and computer vision (WSCG 2013), Czech
Al Hamad HA (2015) Skew detection/correction and local minima/maxima techniques for extracting a new Arabic benchmark database. International Journal of Advanced Computer Science and Applications (IJACSA) 6(9):1–10
Al Hamad HA, Abu Zitar R (2010) Development of an efficient neural-based segmentation technique for Arabic handwriting recognition. Pattern Recognition journal 8(43):2773–2798
Al Hamad HA, Hamdi-Cherif A (2012) The Arabic Center for Document Analysis and Recognition (ACDAR) - structure and perspective. In: European conference of COMPUTER SCIENCE (ECCS '12), pp 85–91
Al-Dmour A, Fraij F (2014) Segmenting Arabic handwritten documents into text lines and words. International journal of Advancements in Computing technology 6(3):109
Ali AAA, Suresha M (2019) A novel features and classifiers fusion technique for recognition of Arabic handwritten character script. SN Applied Sciences 1(10):1286
Ali AAA, Suresha M (2019) Efficient algorithms for text lines and words segmentation for recognition of Arabic handwritten script. In: Emerging research in computing, information, communication and applications. Springer, Singapore, pp 387–401
Ali AAA, Suresha M (2019) An efficient character segmentation algorithm for recognition of Arabic handwritten script. In: 2019 international conference on data Science and communication (IconDSC). IEEE, pp 1–6
AlKhateeb JH, Jiang J, Ren J, Ipson SS (2008) Component-based segmentation of words from handwritten Arabic text. In: Proceedings of world academy of science, engineering and technology, ISSN, vol 31, pp 1307–6884
Ashiquzzaman A, Tushar AK, Rahman A, Mohsin F (2019) An efficient recognition method for handwritten Arabic numerals using CNN with data augmentation and dropout. In: Data management, analytics and innovation. Springer, Singapore, pp 299–309
Daifallah, K., Zarka, N., & Jamous, H. (2009). Recognition-based segmentation algorithm for on-line arabic handwriting. In 2009 10th international conference on document analysis and recognition (pp. 886-890). IEEE.
Hadjadji B, Chibani Y, Nemmour H (2019) Hybrid one-class classifier ensemble based on fuzzy integral for open-lexicon handwritten Arabic word recognition. Pattern Anal Applic 22(1):99–113
Hamad HA (2016) Effect of the classifier training set size on accuracy of pattern recognition. Int J Eng Res Dev 12(11):24–31
Hamid A, Haraty R (2001) A neuro-Heuristic approach for segmenting handwritten Arabic text. In: ACS/IEEE international conference on computer systems and applications, p 0110
Athoillah M, Putri RK (2019) Handwritten arabic numeral character recognition using multi kernel support vector machine. KINETIK: Game technology, information system, computer network, computing, electronics, and control, 99–106
Jemni SK, Kessentini Y, Kanoun S (2019) Out of vocabulary word detection and recovery in Arabic handwritten text recognition. Pattern Recogn 93:507–520
Khorsheed MS (2003) Recognising handwritten Arabic manuscripts using a single hidden Markov model. Pattern Recogn Lett 24(14):2235–2242
Lorigo LM, Govindaraju V (2006) Offline Arabic handwriting recognition: a survey. IEEE Trans Pattern Anal Mach Intell 28(5):712–724
Maliki M, Jassim S, Al-Jawad N, Sellahewa H (2012) Arabic handwritten: pre-processing and segmentation. In: Mobile multimedia/image processing, security, and applications 2012 (Vol. 8406, p. 84060D). International society for optics and photonics
Parvez MT, Mahmoud SA (2013) Offline Arabic handwritten text recognition: a survey. ACM Computing Surveys (CSUR) 45(2):23–35
Shatnawi M (2015) Off-line handwritten Arabic character recognition: a survey. In: Proceedings of the international conference on image processing, computer vision, and pattern recognition (IPCV) (p. 52). The Steering Committee of The World Congress in Computer Science, Computer Engineering and Applied Computing (WorldComp)
Shehab M, Abualigah L, Al Hamad H, Alabool H, Alshinwan M, Khasawneh AM (2019) Moth–flame optimization algorithm: variants and applications. Neural Comput & Applic:1–26
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Al Hamad, H.A., Abualigah, L., Shehab, M. et al. Improved linear density technique for segmentation in Arabic handwritten text recognition. Multimed Tools Appl 81, 28531–28558 (2022). https://doi.org/10.1007/s11042-022-12717-2
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-022-12717-2