Skip to main content

Arabic Morphological Analysis and Disambiguation Using a Possibilistic Classifier

  • Conference paper
Book cover Intelligent Computing Theories and Applications (ICIC 2012)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7390))

Included in the following conference series:

Abstract

This paper proposes and experiments a new approach for morphological feature disambiguation of non-vocalized Arabic texts using a possibilistic classifier. The main idea is to learn contextual dependencies between features from vocalized texts and exploit this knowledge to disambiguate non-vocalized ones. We use possibility theory as a means to model imprecision in the training and testing steps, since the context is itself ambiguous. We also investigate the dependency between various features focusing on the Part-Of-Speech (POS).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Buckwalter, T.: Buckwalter Arabic Morphological Analyzer Version 2.0. Linguistic Data Consortium (LDC) catalogue number LDC2004L02 (2004) ISBN 1-58563-324-0

    Google Scholar 

  2. Haouari, B., Ben Amor, N., Elouedi, Z., Mellouli, K.: Naïve Possibilistic Network Classifiers. Fuzzy Sets and Systems 160, 3224–3238 (2009)

    Article  MathSciNet  MATH  Google Scholar 

  3. Harrag, F., Hamdi-Cherif, A., Al-Salman, A.M.S., Elyas El-Qawasmeh, E.: Experiments in Improvement of Arabic Information Retrieval. In: 3rd International Conference on Arabic Language Processing (CITALA), Rabat, Morocco, pp. 71–81 (2009)

    Google Scholar 

  4. Bounhas, I., Elayeb, B., Evrard, F., Slimani, Y.: Toward a Computer Study of The Reliability of Arabic Stories. Journal of the American Society for Information Science and Technology 61, 1686–1705 (2010)

    Google Scholar 

  5. Al-Echikh, A.A.: Encyclopedia of The Six Major Citation Collections, Dar-esselem, Ryadh, KSA (1998)

    Google Scholar 

  6. Roth, R., Rambow, O., Habash, N., Diab, M., Rudin, C.: Arabic Morphological Tagging, Diacritization, and Lemmatization Using Lexeme Models and Feature Ranking. In: Proceedings of Association for Computational Linguistics (ACL), Columbus, Ohio, USA, pp. 573–580 (2008)

    Google Scholar 

  7. Debili, F., Achour, H., Souissi, E.: La langue arabe et l’ordinateur: de l’étiquetage grammatical à la voyellation automatique. In: IRMC (2002) (in French)

    Google Scholar 

  8. Dubois, D., Prade, H.: Possibility Theory: An Approach to Computerized Processing of Uncertainty. Plenum Press, New York (1994)

    Google Scholar 

  9. Merialdo, B.: Tagging English Text with A Probabilistic Model. Computational Linguistics 20, 155–171 (1994)

    Google Scholar 

  10. Daoud, D.: Synchronized Morphological and Syntactic Disambiguation for Arabic. Advances in Computational Linguistics 41, 73–86 (2009)

    Google Scholar 

  11. Khoja, S.: APT: Arabic Part-of-speech Tagger. In: Proc. Student Workshop at the Second Meeting of the North American Association for Computational Linguistics, Carnegie Mellon University, Pennsylvania, USA (2001)

    Google Scholar 

  12. Diab, M., Hacioglu, K., Jurafsky, D.: Automatic Tagging of Arabic Text: From Raw Text to Base Phrase Chunks. In: Proc. Human Language Technology Conference / North American Chapter of the Association for Computational Linguistics Annual Meeting (HLT-NAACL 2004), Boston, USA, pp. 149–152 (2004)

    Google Scholar 

  13. Habash, N., Rambow, O.: Arabic Diacritization Through Full Morphological Tagging. In: Proc. Human Language Technology Conference / North American Chapter of the Association for Computational Linguistics Annual Meeting (HLT-NAACL 2007), Companion Volume, Rochester, NY, USA, pp. 53–56 (2007)

    Google Scholar 

  14. Habash, N., Rambow, O., Roth, R.: MADA+TOKAN: A Toolkit for Arabic Tokenization, Diacritization, Morphological Disambiguation, POS Tagging, Stemming and Lemmatization. In: Proceedings of the 2nd International Conference on Arabic Language Resources and Tools (MEDAR), Cairo, Egypt, pp. 102–109 (2009)

    Google Scholar 

  15. Hoceini, Y., Cheragui, M.A., Abbas, M.: Towards a New Approach for Disambiguation in NLP by Multiple Criterian Decision-Aid. The Prague Bulletin of Mathematical Linguistics 95, 19–32 (2011)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ayed, R., Bounhas, I., Elayeb, B., Evrard, F., Saoud, N.B.B. (2012). Arabic Morphological Analysis and Disambiguation Using a Possibilistic Classifier. In: Huang, DS., Ma, J., Jo, KH., Gromiha, M.M. (eds) Intelligent Computing Theories and Applications. ICIC 2012. Lecture Notes in Computer Science(), vol 7390. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31576-3_36

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-31576-3_36

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-31575-6

  • Online ISBN: 978-3-642-31576-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics