research-article

Formalization of the Arabic grammatical category (V-a) using the NooJ platform

Published: 02 May 2018

ABSTRACT

In this paper we present a broad-coverage morpho-syntactic description of the lexical entries of standard/classical Arabic. The work takes the form of an electronic dictionary named Al-Erfan, built on the operators of the NooJ platform and implemented with local grammars in the form of finite-state transducers (FSTs). It is inspired by the mathematical model of Z. Harris (transformations) and by the "lexicon-grammar" theoretical framework developed by Maurice Gross. The starting point of our approach is the fundamental fact that Arabic morphology rests on the fusion of two components, root and pattern. This contrasts with the set-theoretic approach represented by the formula Prefix-Lemma-Suffix, which is specific to the morpho-syntactic system of the Latin languages. Our approach consists in fusing 480 Arabic patterns with 12,400 common roots, which together form the base of every morpho-syntactic derivation in the language. Implementing this process with the computational-linguistic techniques of the NooJ platform enabled us to generate more than 120 million entries covering all morpholexical categories. These data are all contained in the Al-Erfan electronic dictionary, developed from a database built manually over the past 20 years in different research laboratories specialized in ANLP. We conclude the article by examining the category V-a extracted from Al-Erfan.
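The root/pattern fusion described in the abstract can be illustrated with a minimal Python sketch. This is not the NooJ implementation or the Al-Erfan data: the slot notation (digits 1 to 3 marking the radical positions), the function names `fuse` and `generate`, the Latin transliteration, and the two sample roots and three sample patterns are all assumptions chosen for illustration.

```python
from itertools import product


def fuse(root, pattern):
    """Interdigitate a triliteral root into a pattern.

    Hypothetical notation: the root is written 'k-t-b' and the digits
    1, 2, 3 in the pattern mark where each radical is inserted.
    """
    radicals = root.split("-")
    if len(radicals) != 3:
        raise ValueError("this sketch handles triliteral roots only")
    return "".join(
        radicals[int(ch) - 1] if ch in "123" else ch for ch in pattern
    )


# Illustrative samples only, in Latin transliteration.
ROOTS = ["k-t-b", "d-r-s"]
PATTERNS = {
    "1a2a3a": "perfective verb",
    "1aa2i3": "active participle",
    "ma12uu3": "passive participle",
}


def generate(roots, patterns):
    """Apply every pattern to every root (the raw cross product)."""
    return {(r, p): fuse(r, p) for r, p in product(roots, patterns)}


forms = generate(ROOTS, PATTERNS)

# Number of root/pattern pairs for the figures quoted in the abstract.
PAIR_COUNT = 480 * 12400

if __name__ == "__main__":
    for (root, pattern), form in forms.items():
        print(root, pattern, form, PATTERNS[pattern])
    print(PAIR_COUNT)
```

The sketch applies every pattern to every root without any filtering. With the figures quoted in the abstract, that raw cross product gives 480 × 12,400 = 5,952,000 root/pattern pairs. How the dictionary gets from such pairs to its more than 120 million entries is not detailed in the abstract.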

References

  1. Beesley, Kenneth R. & Karttunen, Lauri, Finite-state non-concatenative morphotactics, Proceedings of the 38th Annual Meeting of the Association for Computational Linguistics (ACL-00), 2000, 191--198.
  2. Boudchiche M., Mazroui A., Ould Abdallahi Ould Bebah M., Lakhouaja A., Boudlal A., AlKhalil Morpho Sys 2: A robust Arabic morpho-syntactic analyzer, Journal of King Saud University - Computer and Information Sciences, Volume 29, Issue 2 (2017), 141--146.
  3. Buckwalter, Tim, Buckwalter Arabic Morphological Analyzer, Version 1.0, Linguistic Data Consortium, Philadelphia, 2002.
  4. Diab, Mona et al., Automatic processing of Modern Standard Arabic text, Soudi Abdelhadi (editor), 2007, Springer.
  5. Dichy J., Linguistic knowledge integration in optical Arabic word and text recognition process, Linguistica Communicatio journal, Special issue, 2013.
  6. Elghamry, Khaled, A constraint-based algorithm for the identification of Arabic roots, Proceedings of the Midwest Computational Linguistics Colloquium, 2004.
  7. El Hannach, Mohamed, Syntaxe des verbes psychologiques de l'arabe, Thèse de doctorat d'Etat, Université Paris VII, 1988.
  8. El Hannach, Mohamed, Syntaxe des verbes qualitatifs de l'arabe, Synergie monde arabe, Vol. I, 2001.
  9. Farghaly, Ali, Handbook for language engineers, CSLI Publications, 2003.
  10. Goldsmith, John A., An algorithm for the unsupervised learning of morphology, Natural Language Engineering, 2006, 12 (4): 353--371.
  11. Gross, Maurice, Méthodes en syntaxe, Hermann, Paris, 1975.
  12. Harris, Zellig S., Structure mathématique du langage, Dunod, Paris, 1972.
  13. Isabelle T., Apprentissage automatique pour le TAL, inria-00541535, 2010.
  14. Beesley, Kenneth R., Arabic finite-state morphological analysis and generation, Rank Xerox Research Centre, Grenoble, 2009.
  15. Shaalan, Khaled, Allam, Amin & Gomah, Abdallah, Towards Automatic Spell Checking for Arabic, Conference on Language Engineering, ELSE, Cairo, Egypt, 2003, 36.
  16. Habash, Nizar & Roth, Ryan M., CATiB: The Columbia Arabic Treebank, Proceedings of the ACL-IJCNLP Conference Short Papers, 2009, 221--224.
  17. Soudi, Abdelhadi et al., Arabic Computational Morphology: Knowledge-Based and Empirical Methods, 2007, Springer.
  18. Saleh, Najim, Inheritance-based Approach to Arabic Verbal Root-and-Pattern Morphology, Soudi A., 2007, Springer.
  19. Silberztein, Max et al., Automatic Processing of Natural-Language Electronic Texts with NooJ, 2015.
  20. Silberztein, Max, La formalisation des langues: l'approche NooJ, ISTE Editions, London, 2015.
  21. Introduction to Computational Linguistics, coordinated by Abdullah bin Yahya Al-Faifi, King Abdullah Center for the Arabic Language, Riyadh, 2017 (collective volume, in Arabic).
  22. The Disagreement between the Basran and Kufan Grammarians, Abu al-Barakat Ibn al-Anbari (in Arabic).
  23. Corpus Linguistics: Method, Theory and Practice, Tony McEnery and Andrew Hardie, translated by Dr. Sultan bin Nasser Al-Mujaiwel, King Saud University Press, 2016 (in Arabic).
  24. Automatic Processing of the Arabic Language: Problems and Solutions, Dr. Salwa Hamada, Dar Gharib, Cairo, 2009 (in Arabic).
  25. Corpus Linguistics: Analytical Applications to Natural Arabic, Dr. Sultan Al-Mujaiwel, King Abdullah Center for the Arabic Language, 2016 (in Arabic).

    • Published in

      LOPAL '18: Proceedings of the International Conference on Learning and Optimization Algorithms: Theory and Applications
      May 2018
      357 pages
      ISBN:9781450353045
      DOI:10.1145/3230905

      Copyright © 2018 ACM

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Qualifiers

      • research-article
      • Research
      • Refereed limited

      Acceptance Rates

      LOPAL '18 Paper Acceptance Rate: 61 of 141 submissions, 43%. Overall Acceptance Rate: 61 of 141 submissions, 43%.