Abstract
In this paper we present a novel approach for the recognition of offline Arabic handwritten text motivated by the Arabic letters’ conditional joining rules. A lexicon of Arabic words can be expressed in terms of a new alphabet of PAWs (Part of Arabic Word). PAWs can be expressed in terms of letters. The recognition problem is decomposed into two problems to solve simultaneously. To find the best matching word for an input image, a Two-Tier Beam search is performed. In Tier One, the search is constrained by a letter to PAW lexicon. In Tier Two, the search is constrained by a PAW to word lexicon. The searches are driven by a PAW recognizer.
Experiments conducted on the standard IFN/ENIT database [6] of handwritten Tunisian town names show word error rates of about 11%. This result compares to the results of the commonly used HMM based approaches.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Vinciarelli, A., Luettin, J.: Off-Line Cursive Script Recognition Based on Continuous Density HMM. In: International Workshop on Frontiers in Handwriting Recognition, IWFHR 2000 (2000)
Al-Badr, B., Mohmond, S.A.: Survey and bibliography of Arabic optical text recognition. Signal Processing 41, 49–77 (1995)
Ney, H., Mergel, D., Noll, A., Paesler., A.: Data driven search organization for continuous speech recognition. IEEE Transactions on Signal Processing 40(2), 272–281 (1992)
Bilmes, J.A.: A gentle tutorial of the EM algorithm and its applications to parameter estimation for Gaussian mixture and hidden Markov models, Technical Report TR-97-021, International Computer Science Institute, Berkeley, California (1998)
Versteegh, K.: The Arabic Language. Edinburgh University Press (1997)
Pechwitz, M., Maergner, V.: HMM based approach for hand- written Arabic word recognition using the IFN/ENIT database. In: Proc. 7th Int. Conf. on Document Analysis and Recognition, Edinburgh, Scotland (2003)
Pechwitz, M., Maddouri, S.S., Maergner, V., Ellouze, N., Amiri, H.: IFN/ENIT - database of handwritten Arabic words. In: Proc. of CIFED, pp. 129–136 (2002)
Margner, V., Pechwitz, M., Abed, H.E.: ICDAR 2005 Arabic handwriting recognition competition. In: Eighth International Conference on Document Analysis and Recognition. Proceeding, vol. 1, pp. 70–74 (2005)
Simard, P., Steinkraus, D., Platt, J.C.: Best Practices for Convolutional Neural Networks Applied to Visual Document Analysis. In: ICDAR 2003, pp. 958–962 (2003)
Haraty, R.A., El-Zabadani, H.M.: Hawwaz: An Offline Arabic Handwriting Recognition System. International Journal of Computers and Applications (2005)
Steinherz, T., Rivlin, E., Intrator, N.: Off-Line Cursive Script Word Recognition A Survey. International Journal on Document Analysis and Recognition 2, 90–110 (1999)
Wikipedia: Arabic Language, http://en.wikipedia.org/wiki/Arabic
Omniglot: Writing System and Languages of the World, http://www.omniglot.com/writing/arabic.htm
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
AbdulKader, A. (2008). A Two-Tier Arabic Offline Handwriting Recognition Based on Conditional Joining Rules. In: Doermann, D., Jaeger, S. (eds) Arabic and Chinese Handwriting Recognition. SACH 2006. Lecture Notes in Computer Science, vol 4768. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78199-8_5
Download citation
DOI: https://doi.org/10.1007/978-3-540-78199-8_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-78198-1
Online ISBN: 978-3-540-78199-8
eBook Packages: Computer ScienceComputer Science (R0)