Skip to main content

New Rules to Enhance the Performances of Histogram Projection for Segmenting Small-Sized Arabic Words

  • Conference paper
  • First Online:

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 420))

Abstract

Off-line Arabic segmentation has been a popular field of research. It still remains an open problem for discussion. In fact, the challenging nature of Arabic writing which increases the complexity of recognition and segmentation task has attracted the attention of many researchers. This paper proposes and investigates an enhanced algorithm based on the vertical histogram projection and some rules to segment Arabic words with small size. These rules are based on not only the structural characteristics of Arabic language, but also on the baselines positions and their relation with the characters. Our approach aims at cooperating together the segmentation method based on histogram projection and the contextual topographies of Arabic writing in order to improve the segmentation rate. Thus, we use the vertical histogram to detect the preliminary segmentation points and some other rules to find real segmentation points. The proposed approach has been tested with Arabic Printed Text Image Database (APTI). Actually, promising results have been obtained. Compared with the previously-proposed approach, our algorithm gives better result if applied on smaller size.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Amara, M., Zidi, K.: Feature selection using a neuro-genetic approach for arabic text recognition. In: Meta heuristics and Nature Inspired Computing (2012)

    Google Scholar 

  2. Amara, M., Zidi, K.: Arabic text recognition based on neuro-genetic feature selection approach. In: Advanced Machine Learning Technologies and Applications, pp. 3–10. Springer International Publishing (2014)

    Google Scholar 

  3. Alginahi, Y.M.: A survey on arabic character segmentation. Int. J. Doc. Anal. Recogn. 16(2), 105–126 (2013)

    Article  Google Scholar 

  4. Najoua, B.A., Noureddine, E.: A robust approach for arabic printed character segmentation. Doc. Anal. Recogn. 38(4), 420–433 (1995)

    Google Scholar 

  5. Parhami, B., Taraghi, M.: Automatic recognition of printed Farsi texts. Pattern Recogn. 14(1), 395–403 (1981)

    Article  Google Scholar 

  6. Amin, A., Masini, G.: Machine recognition of multifont printed arabic texts. In: Proceedings of International Conference on Pattern Recognition, pp 392–395 (1986)

    Google Scholar 

  7. Hamami, L., Berkani, D.: Recognition system for printed multi-font and multi-size arabic characters. Arab. J. Sci. Eng. 27(1), 57–72 (2002)

    Google Scholar 

  8. Zheng, L., Hassin, A.H., Tang, X.: A new algorithm for machine printed Arabic character segmentation. Pattern Recogn. Lett. 25(15), 1723–1729 (2004)

    Article  Google Scholar 

  9. Abuhaiba, I.S.: A discrete arabic script for better automatic document understanding. Arab. J. Sci. Eng. 28(1), 77–94 (2003)

    Google Scholar 

  10. Amara, M., Zidi, K., Zidi, S., Ghedira, K.: Arabic character recognition based M-SVM: review. In: Advanced Machine Learning Technologies and Applications, pp 18–25. Springer International Publishing (2014)

    Google Scholar 

  11. Amara, M., Ghedira, K., Zidi, K., Zidi, S.: A Comparative study of multi-class support vector machine methods for Arabic characters recognition. In International Conference on Computer Systems and Applications (2015)

    Google Scholar 

  12. Slimane, F., Ingold, R., Kanoun, S., Alimi, A. M., Hennebert, J.: A new arabic printed text image database and evaluation protocols. In: Document Analysis and Recognition, pp. 946–950 (2009)

    Google Scholar 

Download references

Acknowledgment

This research and innovation work is carried out within a MOBIDOC thesis funded by the EU under the PASRI project.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Marwa Amara .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Amara, M., Zidi, K., Ghedira, K., Zidi, S. (2016). New Rules to Enhance the Performances of Histogram Projection for Segmenting Small-Sized Arabic Words. In: Abraham, A., Han, S., Al-Sharhan, S., Liu, H. (eds) Hybrid Intelligent Systems. HIS 2016. Advances in Intelligent Systems and Computing, vol 420. Springer, Cham. https://doi.org/10.1007/978-3-319-27221-4_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-27221-4_14

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-27220-7

  • Online ISBN: 978-3-319-27221-4

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics