skip to main content
10.1145/2802612.2802637acmotherconferencesArticle/Chapter ViewAbstractPublication PagesaiucdConference Proceedingsconference-collections
research-article

Computational Linguistics and Language Physiology: Insights from Arabic NLP and Cooperative Editing

Authors Info & Claims
Published:18 September 2014Publication History

ABSTRACT

Computer processing of written Arabic raises a number of challenges to traditional parsing architectures on many levels of linguistic analysis. In this contribution, we review some of these core issues and the demands they make, to suggest different strategies to successfully tackle them. In the end, we assess these issues in connection with the behaviour of neuro-biologically inspired lexical architectures known as Temporal Self-Organising Maps. We show that, far from being language-specific problems, issues in Arabic processing can shed light on some fundamental characteristics of the human language processor, such as structure-based lexical recoding, concurrent, competitive activation of output candidates and dynamic selection of optimal solutions.

References

  1. Tsarfaty, R., Seddah D., Kubler S., and Nivre J. 2013. Parsing Morphologically Rich Languages: Introduction to the Special Issue. Computational Linguistics 39, 1, 15--22. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Dichy, J. 1997. Pour une lexicomatique de l'arabe: l'unité lexicale simple et l'inventaire fini des spécificateurs du domaine du mot. Meta 42, 2, 291--306.Google ScholarGoogle ScholarCross RefCross Ref
  3. Jackendoff, R. 2002. Foundations of language. Brain, Meaning, Grammar, Evolution. Oxford University Press, New York.Google ScholarGoogle Scholar
  4. Nahli, O. 2013. Computational contributions for Arabic language processing. The automatic morphologic analysis of Arabic texts. In Studia graeco-arabica 3, 195--206.Google ScholarGoogle Scholar
  5. Internet Archive: http://archive.org/.Google ScholarGoogle Scholar
  6. Google Books: http://books.google.com/.Google ScholarGoogle Scholar
  7. Dichy, J. and Kanoun S., Eds. 2013. Linguistic Knowledge integration in optical Arabic word and text recognition process. Linguistica Communicatio 15, 1--2.Google ScholarGoogle Scholar
  8. Märgner, V. and El Abed H. 2012. Guide to OCR for Arabic scripts. Springer, London. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Boschetti, F., Romanello M., Babeu A., Bamman D., and Crane G. 2009. Improving OCR accuracy for classical critical editions. In Proceedings of the 13th European conference on Research and advanced technology for digital libraries (ECDL'09), M. Agosti, J. Borbinha, S. Kapidakis, C. Papatheodorou and G. Tsakonas, Eds. Springer-Verlag, Berlin, Heidelberg, 156--167. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Tesseract/Cube: http://code.google.com/p/tesseract-ocr/.Google ScholarGoogle Scholar
  11. Lasri, Y. 2014. Contribution à la reconnaissance optique (OCR) du texte arabe imprimé, Fès: Université "Sidi Mohamed Ben Abdellah" de Fès, MA Thesis.Google ScholarGoogle Scholar
  12. Boschetti, F. 2013. Acquisizione e Creazione di Risorse Plurilingui per gli Studi di Filologia Classica in Ambienti Collaborativi. In. Collaborative Research Practices and Shared Infrastructures for Humanities Computing, M. Agosti and F. Tomasi, Eds. Proceedings of Revised Papers AIUCD 2013 (Padua, Italy, December 11--12, 2013), 55--67.Google ScholarGoogle Scholar
  13. Del Gratta, R. and Nahli, O. 2014. Enhancing Arabic WordNet with the use of Princeton WordNet and a bilingual dictionary. IEEE - CiST14 Colloquium on Information Science and Technology - ANLP Invited Session.Google ScholarGoogle Scholar
  14. Fellbaum, C., Ed. 1998. WordNet: An Electronic Lexical Database (Language, Speech, and Communication). The MIT Press, Cambridge, MA.Google ScholarGoogle Scholar
  15. Sagot, B. and Fišer D. 2011. Extending Wordnets by learning from multiple resources In LTC'11: 5th Language and Technology Conference, Poznań, Poland.Google ScholarGoogle Scholar
  16. Rodríguez, H., Farwell, D., Farreres, J., Bertran, M., Martí, M.A., Black, W., Elkateb, S., Kirk, J., Vossen, P., and Fellbaum, C. 2008. Arabic Wordnet: Current State and Future Extensions. In Proceedings of the Fourth International Global WordNet - Conference, 387--406.Google ScholarGoogle Scholar
  17. Vossen, P., Ed., 1998. EuroWordNet: A Multilingual Database with Lexical Semantic Networks. Kluwer Academic Publishers, Norwell, MA, USA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Fellbaum, C., Alkhalifa, M., Black, W. J., Elkateb, S., Pease, A., Rodríguez, H., and Vossen, P. 2006. Building a WordNet for Arabic. In Proceedings of the 5th Conference on Language Resources and Evaluation (ELRA - LREC 2006, Genova), 29--34.Google ScholarGoogle Scholar
  19. Boschetti, F., Del Gratta, R., and Lamè, M. 2014. Computer Assisted Annotation of Themes and Motifs in Ancient Greek Epigrams: First Steps. In Proceedings of CLIC, Computational Linguistics Italian Conference. Pisa, Italy.Google ScholarGoogle Scholar
  20. Blevins, J.P. 2006. Word-based morphology. Journal of Linguistics 42, 531--573.Google ScholarGoogle ScholarCross RefCross Ref
  21. Ferro, M., Pezzulo, G., and Pirrelli, V. 2010. Morphology, Memory and the Mental Lexicon. In Lingue e Linguaggio, vol. IX(2), Interdisciplinary aspects to understanding word processing and storage, V. Pirrelli, Ed. Il Mulino, Bologna, 199--238.Google ScholarGoogle Scholar
  22. Pirrelli, V., Ferro, M., and Calderone, B. 2011. Learning paradigms in time and space. Computational evidence from Romance languages. In Morphological Autonomy: Perspectives from Romance Inflectional Morphology, M. Goldbach, M. O. Hinzelin, M. Maiden, and J.C. Smith, Eds. Oxford University Press, Oxford, 135--157.Google ScholarGoogle Scholar
  23. Marzi, C., Ferro, M., and Pirrelli, V. 2012. Word alignment and paradigm induction. Lingue e Linguaggio XI, 2, 251--274.Google ScholarGoogle Scholar
  24. Marzi, C., Ferro, M., and Pirrelli, V. 2014. Morphological structure through lexical parsability. Lingue e Linguaggio XIII, 2, 263--290.Google ScholarGoogle Scholar
  25. Henson, R. N. A. 1999. Coding position in short-term memory. International Journal of Psychology 34, 5--6, 403--409.Google ScholarGoogle ScholarCross RefCross Ref
  26. Davis, C. J. 2010. The spatial coding model of visual word identification. Psychological Review 117, 3, 713--758.Google ScholarGoogle ScholarCross RefCross Ref
  27. Davis, C. J. and Bowers, J. S. 2004. What do letter migration errors reveal about letter position coding in visual word recognition? Journal of Experimental Psychology: Human Perception and Performance 30, 923--941.Google ScholarGoogle ScholarCross RefCross Ref
  28. Halle, M. and Marantz, A. 1993. Distributed Morphology and the pieces of inflection. In The view from building 20, K. Hale and S. J. Keyser, Eds. MIT Press, Cambridge, MA, 111--176.Google ScholarGoogle Scholar
  29. Embick, D. and Halle, M. 2005. On the Status of stems in morphological theory. In Romance Languages and Linguistics Theory 2003, T. Geerts, I. van Ginneken, and H. Jacobs, Eds. John Benjamins, Amsterdam, 37--62.Google ScholarGoogle Scholar
  30. Marzi, C. 2014. Models and dynamics of the morphological lexicon in mono- and bilingual acquisition. PhD unpublished dissertation. University of Pavia.Google ScholarGoogle Scholar
  31. Marzi, C., Ferro, M., Caudai, C. and Pirrelli, V. 2012. Evaluating Hebbian Self-Organizing Memories for Lexical representation and Access. Proceedings of 8th International Conference on Language Resources and Evaluation, (ELRA - LREC 2012, Malta), 886--893.Google ScholarGoogle Scholar
  32. Marzi, C., Nahli, O., and Ferro, M. 2014. Word Processing for Arabic Language. IEEE - CiST14 Colloquium on Information Science and Technology - ANLP Invited Session.Google ScholarGoogle Scholar
  33. Hickok, G.M., and Poeppel, D. 2004. Dorsal and ventral streams: a framework for understanding aspects of the functional anatomy of language. Cognition 92, 67--99.Google ScholarGoogle ScholarCross RefCross Ref
  34. D'Esposito, M. 2007. From cognitive to neural models of working memory. Philosophical Transactions of the Royal Society B: Biological Sciences 362, 761--772.Google ScholarGoogle ScholarCross RefCross Ref
  35. Saur, D., Kreher, B.W., Schnell, S., Kümmerer, D., Kellmeyer, P., Vry, M.-S., Umarova, R., Musso, M., Glauche, V., Abel, S., Huber, W., Rijntjes, M., Hennig, J., and Weiller, C. 2008. Ventral and dorsal pathways for language. Proc. Nat. Academy of Sciences 105, 46, 18035--18046.Google ScholarGoogle ScholarCross RefCross Ref
  36. Forkel, S. J., Thiebaut de Schotten, M., Dell'Acqua, F., Kalra, L., Murphy, D.G.M., Williams, S.C.R., and Catani, M. 2014. Anatomical predictors of aphasia recovery: a tractography study of bilateral perisylvian language networks. Brain 137, 2027--2039.Google ScholarGoogle ScholarCross RefCross Ref
  37. Ma, W. J., Husain, M., and Bays, P.M. 2014. Changing concepts of working memory. Nature Neuroscience 17, 3, 347--356.Google ScholarGoogle ScholarCross RefCross Ref
  38. Libben, G. 2005. Everything is psycholinguistics: Material and methodological considerations in the study of compound processing. Canadian Journal of Linguistics 50, 267--283.Google ScholarGoogle ScholarCross RefCross Ref
  39. Baayen, R.H. 2007. Storage and computation in the mental lexicon. In The Mental Lexicon: Core Perspectives, G. Jarema, G. Libben, Eds. Elsevier, 81--104.Google ScholarGoogle Scholar
  40. Luce, P., Pisoni, D., and Goldinger, S.D. 1990. Similarity neighborhoods of spoken words. In Cognitive models of speech pro-cessing: Psycholinguistic and computational perspectives, G.T.M. Altmann, Ed. MIT Press, Cambridge, MA, 122--147. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Huntsman, L.A. and Lima, S.D. 2002. Orthographic Neighbors and Visual Word Recognition. Journal of Psycholinguistic Research 31, 289--306.Google ScholarGoogle ScholarCross RefCross Ref
  42. Goldrick, M., Folk, J. R., and Rapp, B. 2010. Mrs. Malaprop's neighborhood: Using word errors to reveal neighborhood structure. Journal of Memory and Language 62, 2, 113--13.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Computational Linguistics and Language Physiology: Insights from Arabic NLP and Cooperative Editing

                Recommendations

                Comments

                Login options

                Check if you have access through your login credentials or your institution to get full access on this article.

                Sign in
                • Published in

                  cover image ACM Other conferences
                  AIUCD '14: Proceedings of the Third AIUCD Annual Conference on Humanities and Their Methods in the Digital Ecosystem
                  September 2014
                  119 pages
                  ISBN:9781450332958
                  DOI:10.1145/2802612

                  Copyright © 2014 ACM

                  Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

                  Publisher

                  Association for Computing Machinery

                  New York, NY, United States

                  Publication History

                  • Published: 18 September 2014

                  Permissions

                  Request permissions about this article.

                  Request Permissions

                  Check for updates

                  Qualifiers

                  • research-article
                  • Research
                  • Refereed limited
                • Article Metrics

                  • Downloads (Last 12 months)8
                  • Downloads (Last 6 weeks)0

                  Other Metrics

                PDF Format

                View or Download as a PDF file.

                PDF

                eReader

                View online with eReader.

                eReader