Abstract
This paper proposes a stemmer for the Arabic language that is based largely on pattern matching and pattern strength techniques. Stemmers are algorithms that extract the root of a word by removing its affixes. Stemming has been applied in a large number of applications, such as indexing, information retrieval systems, and web search engines. The paper also proposes applying stemming as a pre-processing stage in a dialogue system (DS). The proposed stemmer was compared with three other well-known stemmers and achieved favourable accuracy.
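To make the idea of pattern-matching stemming concrete, the following is a minimal sketch, not the authors' algorithm. It assumes the common convention in which the letters ف, ع and ل in a morphological pattern stand for the three root consonants, and it uses tiny, hypothetical affix and pattern lists (`PREFIXES`, `SUFFIXES`, `PATTERNS`) chosen only for illustration. It does not attempt to model the pattern strength technique mentioned in the abstract, whose definition is not given here.

```python
# Letters that mark root-consonant slots in an Arabic morphological pattern.
ROOT_SLOTS = "فعل"

# Hypothetical, deliberately small lists; a real stemmer would use far more.
PATTERNS = ["مفعول", "فاعل", "فعال", "فعل"]
PREFIXES = ["ال", "و"]
SUFFIXES = ["ات", "ون", "ة"]


def match_pattern(word, pattern):
    """Return the root letters if `word` fits `pattern`, otherwise None."""
    if len(word) != len(pattern):
        return None
    root = []
    for w, p in zip(word, pattern):
        if p in ROOT_SLOTS:
            root.append(w)   # slot position: this letter belongs to the root
        elif w != p:
            return None      # fixed pattern letter does not match the word
    return "".join(root)


def stem(word):
    """Try the word itself, then affix-stripped variants, against each pattern."""
    candidates = [word]
    for pre in PREFIXES:
        if word.startswith(pre) and len(word) - len(pre) >= 3:
            candidates.append(word[len(pre):])
    # Also try removing one suffix from every candidate collected so far.
    for cand in list(candidates):
        for suf in SUFFIXES:
            if cand.endswith(suf) and len(cand) - len(suf) >= 3:
                candidates.append(cand[:-len(suf)])
    for cand in candidates:
        for pattern in PATTERNS:
            root = match_pattern(cand, pattern)
            if root is not None:
                return root
    return word  # no pattern matched: return the word unchanged


if __name__ == "__main__":
    for w in ["الكاتب", "مكتوب", "كتاب"]:
        print(w, "->", stem(w))
```

In this sketch, the first pattern that fits wins, so the order of `PATTERNS` acts as a crude priority. A production stemmer would need a principled way to rank competing matches and to handle weak letters and irregular forms.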
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hijjawi, M., Bandar, Z., Crockett, K., Mclean, D. (2011). An Application of Pattern Matching Stemmer in Arabic Dialogue System. In: O’Shea, J., Nguyen, N.T., Crockett, K., Howlett, R.J., Jain, L.C. (eds) Agent and Multi-Agent Systems: Technologies and Applications. KES-AMSTA 2011. Lecture Notes in Computer Science, vol 6682. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22000-5_5
DOI: https://doi.org/10.1007/978-3-642-22000-5_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21999-3
Online ISBN: 978-3-642-22000-5
eBook Packages: Computer Science, Computer Science (R0)