skip to main content
10.1145/2345396.2345529acmotherconferencesArticle/Chapter ViewAbstractPublication PagesacciciConference Proceedingsconference-collections
research-article

Morphar+: an Arabic morphosyntactic analyzer

Published:03 August 2012Publication History

ABSTRACT

We present in this paper an Arabic morpho-syntactic analyzer (Morphar+) built on top of the free Arabic Morphological analyzer (AraMorph). It is known that AraMorph produces a large number of morphological solutions, but little information to select the appropriate morphological solution for words in context. For this purpose, we start characterizing/describing all particles of the Arabic language, broken noun patterns, and most nominal and verbal sentence structures; next, we formulated dozens of rules associated with these descriptions and then programmed them in a simple and efficient manner to help deducing not only the appropriate solution but also both case and ending case marks. We divided the Arabic particles into groups according to their grammatical functions for extracting the exact and final morphological function of words. Appropriate contextual rules have been stated based on the above descriptions; after applying our contextual rules on the output produced by AraMorph, we obtained an improvement of about 6% in the number of correct words with an accurate morphological function. Our goal is to reduce the error rate to less than 5% in order to integrate this very fast and accurate morpho-syntactic analyzer into a system to translate Arabic written text into Arabic sign language; we believe that this will give enough information to achieve a quick and effective translation.

References

  1. Alansary S, Nagi M, Adly N. Towards Analyzing the International Corpus of Arabic (ICA): Progress of Morphological Stage. 8th International Conference on Language Engineering, Egypt, December 2008.Google ScholarGoogle Scholar
  2. Atwell E, Al-Sulaiti L, Al-Osaimi S, Abu-Shawar B. A Review of Arabic Corpus Analysis Tools. Proceedings of JEP-TALN'04 Arabic Language Processing, Fez, April 2004.Google ScholarGoogle Scholar
  3. Tim Buckwalter. Buckwalter Arabic Morphological Analyzer Version 2.0. LDC Catalog No. LDC2004L02, Linguistic Data Consortium, 2004, www.ldc.upenn.edu/Catalog.Google ScholarGoogle Scholar
  4. Beesley K. Xerox Arabic Morphological Analyzer Surface-Language (Unicode) documentation. Xerox Research Center Europe, 2003.Google ScholarGoogle Scholar
  5. Berri J, Zidom H, Atif Y. Web-based Arabic Morphological Analyzer. In Gelbukh, A (Ed): CICLing 2001, LNCS 2004, pp 216--225, Springer-Verlag Berlin Heidlberg. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Darwish K. Building a shallow Arabic Morphological Analyzer in One Day. ACL02 Workshop on Computer Processing of Semitic Languages, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Sakhr's Morphological Analyzer.Google ScholarGoogle Scholar
  8. RDI Arabic Morphological Analyzer.Google ScholarGoogle Scholar
  9. Al-Khalil morphological analyzer.Google ScholarGoogle Scholar
  10. Jaccarini A., Mourad G., Gaubert C, Dijioua B. Un logiciel pour la mise au point de grammaires pour le filtrage d'information en Arabe. TALN03, Batz-sur-Mer, 11--14 juin 2003.Google ScholarGoogle Scholar
  11. Habash, Nizar, Owen Rambow and Ryan Roth. MADA+TOKAN: A Toolkit for Arabic Tokenization, Diacritization, Morphological Disambiguation, POS Tagging, Stemming and Lemmatization. In Proceedings of the 2nd International Conference on Arabic Language Resources and Tools (MEDAR), Cairo, Egypt, 2009.Google ScholarGoogle Scholar
  12. Boudlal, R. Belahbib, A. Lakhouaja, A. Mazroui, A. Meziane, M. Bebah. A Markovian Approach for Arabic Root Extraction. The International Arab Journal of Information Technology, Vol. 8, No. 1, January 2011.Google ScholarGoogle Scholar
  13. Y.O.M. Elhadj, A. M. Alansari, LA AlSughayeir. Using Statistical Models for Automatic Recognition of Arabic Terms Properties (in Arabic). International Journal of Computer Science and Engineering in Arabic, Vol. 3, No 2, 2010.Google ScholarGoogle Scholar
  14. Y.O.M. Elhadj, Z. Zemirli. Virtual Translator from Arabic text to Saudi Sign-Language (A2SaL). Annual Technical Report of the Project Number: 08-INF432-8, KACST, KSA, 2011.Google ScholarGoogle Scholar

Index Terms

  1. Morphar+: an Arabic morphosyntactic analyzer

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      ICACCI '12: Proceedings of the International Conference on Advances in Computing, Communications and Informatics
      August 2012
      1307 pages
      ISBN:9781450311960
      DOI:10.1145/2345396

      Copyright © 2012 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 3 August 2012

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader