Skip to main content

CAIV: Corpus of Arabic Infinitive Verbs

  • Conference paper
  • First Online:
Arabic Language Processing: From Theory to Practice (ICALP 2024)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 2340))

Included in the following conference series:

  • 4 Accesses

Abstract

The task of morphological analysis for verbs may seem deceptively simple, but in practice, it can pose considerable challenges, especially when the requisite information regarding the verb’s infinitive form is lacking. In our research, we introduce an extensive Arabic infinitive verb corpus, thoughtfully enriched with a comprehensive set of accompanying details. These details serve as the foundation for various essential processes in Arabic verb analysis, including conjugation, morphological dissection, syntactic parsing, and even delving into the nuanced realm of semantic analysis. Our corpus comprises nearly 17,000 infinitive verbs, each meticulously matched with several variables. These variables offer a rich source of contextual and linguistic data, presenting a valuable resource for researchers, linguists, and NLP practitioners seeking deep insights into the intricacies of the Arabic language’s verb system. In sum, this meticulously crafted Arabic infinitive verb corpus represents a significant contribution to the field of Arabic language processing. It not only facilitates research and innovation but also plays a crucial role in preserving and advancing the understanding of the Arabic language, ultimately benefiting both scholars and language enthusiasts alike.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    محمد بن مكرم بن منظور، “لسان العرب”.

  2. 2.

    محمّد بن محمّد بن عبد الرزّاق الحسيني، “تاج العروس”.

  3. 3.

    محمد الحيدري، معجم الأفعال المتداولة ومواطن استعمالها، 2002.

  4. 4.

    إسماعيل بن عباد، “المحيط في اللغة”، 360 هـ.

  5. 5.

    أحمد رضا، “معجم متن اللغة”، 1960.

  6. 6.

    إبراهيم مصطفى، أحمد الزيات، حامد عبد القادر، محمد النجار، “ القاموس الوسيط”.

  7. 7.

    http://arabic.emi.ac.ma:8080/SafarWeb/faces/safar/morphology/analyzer.xhtml.

  8. 8.

    We have developed an automatic conjugator to generate conjugated verbs.

  9. 9.

    This information is extracted from the LACAL corpus [2].

References

  1. El Hadj, Y., Al-Sughayeir, I., Al-Ansari, A.: Arabic part-of-speech tagging using the sentence structure. In: Proceedings of the Second International Conference on Arabic Language Resources and Tools, Cairo, Egypt, pp. 241–245 (2009)

    Google Scholar 

  2. Yousfi, A., et al.: The large annotated corpus for the Arabic language (LACAL). In: Recent Innovations in Artificial Intelligence and Smart Applications, pp. 205–219. Cham: Springer International Publishing (2022)

    Google Scholar 

  3. Ibn Mandour, M.: Lissan Alaarab

    Google Scholar 

  4. Alhossayni, M.: Taj Alaarouss

    Google Scholar 

  5. Alhaydra, M.: Moajam Alafaal Almotadawala (2002)

    Google Scholar 

  6. Ibn Abbad, I.: Almohit Fi Allogha (980)

    Google Scholar 

  7. Ahmad, R.: Moajam Matn Allogha (1960)

    Google Scholar 

  8. Ibrahim, M., Azyat, A., AbdAlkader, H., Alnajjar, M.: Alkamouss Alwassit

    Google Scholar 

  9. Alasmari, J., Watson, J., Atwell, E.: A contrastive study of the Arabic and English verb tense and aspect a corpus-based approach. Int. J. Soc. Sci. 1604–1615 (2018)

    Google Scholar 

  10. Gadalla, H.A.: Arabic imperfect verbs in translation: a corpus study of English renderings. Meta 51(1), 51–71 (2006)

    Article  MATH  Google Scholar 

  11. Azman, B.: Root identification tool for Arabic verbs. IEEE Access 7, 45866–45871 (2019)

    Article  MATH  Google Scholar 

  12. Bielický, V., Smrž, O.: Building the valency lexicon of Arabic verbs. In: Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC 2008), Marrakech, Morocco. ELRA (2008)

    Google Scholar 

  13. Khalifa, S., Hassan, S., Habash, N.: A morphological analyzer for gulf Arabic verbs. In: Proceedings of the Third Arabic Natural Language Processing Workshop, pp. 35–45, Valencia, Spain. Association for Computational Linguistics (2017)

    Google Scholar 

  14. Samy, D., Moreno-Sandoval, A., Guirao, J.M.: Construction of a bilingual Arabic-Spanish lexicon of verbs based on a parallel corpus. In: Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004), Lisbon, ELRA (2004)

    Google Scholar 

  15. Wightwick, J., Gaafar, M.: Arabic Verbs & Essentials of Grammar, 3rd edn. McGraw-Hill, New York, USA (2018)

    MATH  Google Scholar 

  16. Falkert, A.: Valeur sémantique et comportement syntaxique de l’infinitif substitut dans quelques corpus oraux. Linx 57, 69–78 (2007)

    Article  Google Scholar 

  17. Alexis, N.: A lexicon of Arabic verbs constructed on the basis of Semitic taxonomy and using finite-state transducers. In: WoLeR 2011 at ESSLLI International Workshop on Lexical Resources – Ljubljana, Slovenia (2011)

    Google Scholar 

  18. Blanchete, I., Mourchid, M., Mbarki, S., Mouloudi, A.: Formalizing Arabic inflectional and derivational verbs based on root and pattern approach using nooj platform. In: 11th International Conference, NooJ 2017. Kenitra and Rabat, Morocco, May 18–20, 2017, Revised Selected Papers 11, pp. 52–65. Springer International Publishing (2018)

    Google Scholar 

  19. Farwaneh, S.: Non-sound verb inflection in Arabic: allomorphic variation and paradigmatic uniformity. Morphology 30(1), 61–89 (2020)

    Article  Google Scholar 

  20. Mirad, A., Bensaou, N.: Al-Lamy an automatic conjugation system of Arabic verbs. The International Workshop on Computers and Information Sciences (2015)

    Google Scholar 

  21. Modhaffer, M., Sivaramakrishna, C.V.: Prepositional verbs in Arabic: a corpus-based study. Lang. India. 17(10), 154–174 (2017)

    MATH  Google Scholar 

  22. Othman, M.T.B., Al-Hagery, M.A., Hashemi, Y.M.: Arabic text processing model: verbs roots and conjugation automation. In: IEEE Access, 8, pp. 103913–103923 (2020)

    Google Scholar 

  23. Guiraud, C.: Le verbe est-il centre de phrase? L’information grammaticale 2(1), 35–38 (1979)

    Article  MATH  Google Scholar 

  24. Bahloul, M.: Structure and Function of the Arabic Verb. Routledge, (2007)

    Google Scholar 

  25. El-Yasin, M.K.: Basic word order in classical Arabic and Jordanian Arabic. Lingua 65(1–2), 107–122 (1985)

    Article  MATH  Google Scholar 

  26. Saeed, J.I.: Semantics (4th edn.). Wiley-Blackwell (2016)

    Google Scholar 

  27. Comrie, B.: Tense 17. Cambridge university press (1985)

    Google Scholar 

  28. Ryding, K.C.: A Reference Grammar of Modern Standard Arabic. Cambridge university press (2005)

    Google Scholar 

  29. محمد الأمين بو علي، موازين الأفعال وأنواعها، دراسة وتحقيق

    Google Scholar 

  30. Yousfi, A.: The morphological analysis of Arabic verbs by using the surface patterns. Int. J. Comput. Sci. Issues IJCSI 7(3), 33–36 (2010)

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Saida Laaroussi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2025 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Laaroussi, S., Yousfi, A., Aouragh, S.L., Boumehdi, A., Iazzi, S. (2025). CAIV: Corpus of Arabic Infinitive Verbs. In: Hdioud, B., Aouragh, S.L. (eds) Arabic Language Processing: From Theory to Practice. ICALP 2024. Communications in Computer and Information Science, vol 2340. Springer, Cham. https://doi.org/10.1007/978-3-031-80438-0_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-80438-0_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-80437-3

  • Online ISBN: 978-3-031-80438-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics