Abstract
The Arabic language exhibits a complex but very regular morphological structure that greatly affects its automated processing. Currently available morphological analysis techniques for Arabic rely on heavy computation and/or large amounts of associated data. Using existing morphological techniques greatly degrades the efficiency of some natural language applications, such as information retrieval systems. This paper proposes a new Arabic morphological analysis technique. The technique is based on the pattern similarity of words derived from different roots. Unique patterns are extended and coded as rules that encode morphological characteristics. The technique requires neither complex computation nor associated data, yet it is adjustable to maintain sufficient accuracy. It uses a very simple parser to scan the coded rules and decompose a given Arabic word into its morphological components.
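The abstract does not give the rule encoding, so the following is only a minimal sketch of the general idea of scanning coded pattern rules to decompose a word. Everything in it is an assumption made for illustration: the `Rule` layout (prefix, pattern, suffix), the use of `C` to mark a root-letter slot, the Latin transliteration of the example words, and the tiny hand-written rule list. It should not be read as the paper's actual rule format or parser.

```python
from typing import List, NamedTuple, Optional


class Rule(NamedTuple):
    """Hypothetical coded rule: literal prefix, stem pattern, literal suffix."""
    prefix: str
    pattern: str  # 'C' marks a root-letter slot; any other character is a literal
    suffix: str


class Analysis(NamedTuple):
    prefix: str
    stem: str
    suffix: str
    root: str
    pattern: str


# Illustrative rules over transliterated words (assumed, not from the paper).
RULES: List[Rule] = [
    Rule("Al", "CCAC", ""),   # e.g. AlktAb  -> Al + ktAb
    Rule("", "mCCwC", ""),    # e.g. mktwb
    Rule("", "CACC", "wn"),   # e.g. kAtbwn  -> kAtb + wn
    Rule("", "CACC", ""),     # e.g. kAtb
]


def match_pattern(stem: str, pattern: str) -> Optional[str]:
    """Return the root letters if the stem fits the pattern, else None."""
    if len(stem) != len(pattern):
        return None
    root = []
    for ch, slot in zip(stem, pattern):
        if slot == "C":
            root.append(ch)      # slot position: collect a root letter
        elif ch != slot:
            return None          # literal position: must match exactly
    return "".join(root)


def decompose(word: str, rules: List[Rule] = RULES) -> Optional[Analysis]:
    """Scan the rules in order and return the first decomposition that fits."""
    for rule in rules:
        if not (word.startswith(rule.prefix) and word.endswith(rule.suffix)):
            continue
        end = len(word) - len(rule.suffix)
        if end < len(rule.prefix):
            continue             # prefix and suffix would overlap
        stem = word[len(rule.prefix):end]
        root = match_pattern(stem, rule.pattern)
        if root is not None:
            return Analysis(rule.prefix, stem, rule.suffix, root, rule.pattern)
    return None
```

In this sketch, `decompose("kAtbwn")` yields the stem `kAtb`, the suffix `wn`, and the root `ktb`, while a word that fits no rule returns `None`. Rule order determines which analysis wins when several rules fit, which is one simple place where such a scheme could be adjusted for accuracy.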
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Al-Sughaiyer, I.A., Al-Kharashi, I.A. (2002). Rule Parser for Arabic Stemmer. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2002. Lecture Notes in Computer Science, vol 2448. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46154-X_2
DOI: https://doi.org/10.1007/3-540-46154-X_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44129-8
Online ISBN: 978-3-540-46154-8
eBook Packages: Springer Book Archive