Abstract
The task of morphological analysis for verbs may seem deceptively simple, but in practice, it can pose considerable challenges, especially when the requisite information regarding the verb’s infinitive form is lacking. In our research, we introduce an extensive Arabic infinitive verb corpus, thoughtfully enriched with a comprehensive set of accompanying details. These details serve as the foundation for various essential processes in Arabic verb analysis, including conjugation, morphological dissection, syntactic parsing, and even delving into the nuanced realm of semantic analysis. Our corpus comprises nearly 17,000 infinitive verbs, each meticulously matched with several variables. These variables offer a rich source of contextual and linguistic data, presenting a valuable resource for researchers, linguists, and NLP practitioners seeking deep insights into the intricacies of the Arabic language’s verb system. In sum, this meticulously crafted Arabic infinitive verb corpus represents a significant contribution to the field of Arabic language processing. It not only facilitates research and innovation but also plays a crucial role in preserving and advancing the understanding of the Arabic language, ultimately benefiting both scholars and language enthusiasts alike.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
محمد بن مكرم بن منظور، “لسان العرب”.
- 2.
محمّد بن محمّد بن عبد الرزّاق الحسيني، “تاج العروس”.
- 3.
محمد الحيدري، معجم الأفعال المتداولة ومواطن استعمالها، 2002.
- 4.
إسماعيل بن عباد، “المحيط في اللغة”، 360 هـ.
- 5.
أحمد رضا، “معجم متن اللغة”، 1960.
- 6.
إبراهيم مصطفى، أحمد الزيات، حامد عبد القادر، محمد النجار، “ القاموس الوسيط”.
- 7.
- 8.
We have developed an automatic conjugator to generate conjugated verbs.
- 9.
This information is extracted from the LACAL corpus [2].
References
El Hadj, Y., Al-Sughayeir, I., Al-Ansari, A.: Arabic part-of-speech tagging using the sentence structure. In: Proceedings of the Second International Conference on Arabic Language Resources and Tools, Cairo, Egypt, pp. 241–245 (2009)
Yousfi, A., et al.: The large annotated corpus for the Arabic language (LACAL). In: Recent Innovations in Artificial Intelligence and Smart Applications, pp. 205–219. Cham: Springer International Publishing (2022)
Ibn Mandour, M.: Lissan Alaarab
Alhossayni, M.: Taj Alaarouss
Alhaydra, M.: Moajam Alafaal Almotadawala (2002)
Ibn Abbad, I.: Almohit Fi Allogha (980)
Ahmad, R.: Moajam Matn Allogha (1960)
Ibrahim, M., Azyat, A., AbdAlkader, H., Alnajjar, M.: Alkamouss Alwassit
Alasmari, J., Watson, J., Atwell, E.: A contrastive study of the Arabic and English verb tense and aspect a corpus-based approach. Int. J. Soc. Sci. 1604–1615 (2018)
Gadalla, H.A.: Arabic imperfect verbs in translation: a corpus study of English renderings. Meta 51(1), 51–71 (2006)
Azman, B.: Root identification tool for Arabic verbs. IEEE Access 7, 45866–45871 (2019)
Bielický, V., Smrž, O.: Building the valency lexicon of Arabic verbs. In: Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC 2008), Marrakech, Morocco. ELRA (2008)
Khalifa, S., Hassan, S., Habash, N.: A morphological analyzer for gulf Arabic verbs. In: Proceedings of the Third Arabic Natural Language Processing Workshop, pp. 35–45, Valencia, Spain. Association for Computational Linguistics (2017)
Samy, D., Moreno-Sandoval, A., Guirao, J.M.: Construction of a bilingual Arabic-Spanish lexicon of verbs based on a parallel corpus. In: Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004), Lisbon, ELRA (2004)
Wightwick, J., Gaafar, M.: Arabic Verbs & Essentials of Grammar, 3rd edn. McGraw-Hill, New York, USA (2018)
Falkert, A.: Valeur sémantique et comportement syntaxique de l’infinitif substitut dans quelques corpus oraux. Linx 57, 69–78 (2007)
Alexis, N.: A lexicon of Arabic verbs constructed on the basis of Semitic taxonomy and using finite-state transducers. In: WoLeR 2011 at ESSLLI International Workshop on Lexical Resources – Ljubljana, Slovenia (2011)
Blanchete, I., Mourchid, M., Mbarki, S., Mouloudi, A.: Formalizing Arabic inflectional and derivational verbs based on root and pattern approach using nooj platform. In: 11th International Conference, NooJ 2017. Kenitra and Rabat, Morocco, May 18–20, 2017, Revised Selected Papers 11, pp. 52–65. Springer International Publishing (2018)
Farwaneh, S.: Non-sound verb inflection in Arabic: allomorphic variation and paradigmatic uniformity. Morphology 30(1), 61–89 (2020)
Mirad, A., Bensaou, N.: Al-Lamy an automatic conjugation system of Arabic verbs. The International Workshop on Computers and Information Sciences (2015)
Modhaffer, M., Sivaramakrishna, C.V.: Prepositional verbs in Arabic: a corpus-based study. Lang. India. 17(10), 154–174 (2017)
Othman, M.T.B., Al-Hagery, M.A., Hashemi, Y.M.: Arabic text processing model: verbs roots and conjugation automation. In: IEEE Access, 8, pp. 103913–103923 (2020)
Guiraud, C.: Le verbe est-il centre de phrase? L’information grammaticale 2(1), 35–38 (1979)
Bahloul, M.: Structure and Function of the Arabic Verb. Routledge, (2007)
El-Yasin, M.K.: Basic word order in classical Arabic and Jordanian Arabic. Lingua 65(1–2), 107–122 (1985)
Saeed, J.I.: Semantics (4th edn.). Wiley-Blackwell (2016)
Comrie, B.: Tense 17. Cambridge university press (1985)
Ryding, K.C.: A Reference Grammar of Modern Standard Arabic. Cambridge university press (2005)
محمد الأمين بو علي، موازين الأفعال وأنواعها، دراسة وتحقيق
Yousfi, A.: The morphological analysis of Arabic verbs by using the surface patterns. Int. J. Comput. Sci. Issues IJCSI 7(3), 33–36 (2010)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2025 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Laaroussi, S., Yousfi, A., Aouragh, S.L., Boumehdi, A., Iazzi, S. (2025). CAIV: Corpus of Arabic Infinitive Verbs. In: Hdioud, B., Aouragh, S.L. (eds) Arabic Language Processing: From Theory to Practice. ICALP 2024. Communications in Computer and Information Science, vol 2340. Springer, Cham. https://doi.org/10.1007/978-3-031-80438-0_1
Download citation
DOI: https://doi.org/10.1007/978-3-031-80438-0_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-80437-3
Online ISBN: 978-3-031-80438-0
eBook Packages: Computer ScienceComputer Science (R0)