Skip to main content

Rule Parser for Arabic Stemmer

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2448))

Abstract

Arabic language exhibits a complex but very regular morphological structure that greatly affects its automation. Current available morphological analysis techniques for the Arabic language are based on heavy computational processes and/or the need for large amount of associated data. Utilizing existed morphological techniques greatly degrade the efficiency of some natural language applications such as information retrieval system. This paper proposed a new Arabic morphological analysis technique. The technique is based on the pattern similarity of words derived from different roots. Unique patterns are extended and coded as rules that encode morphological characteristics. The technique does not require either complex computation or associated data yet adjustable to maintain enough accuracy. This technique utilizes a very simple parser to scan coded rules and decompose a given Arabic word into its morphological components.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. J. Lovins. Development of a stemming algorithm. Mechanical Translation and Computational Linguistics, No. 11, pages 22–31, March 1968.

    Google Scholar 

  2. J. Dawson. Suffix removal and word conflation. ALLC Bulletin, 2(3): 33–46, 1974.

    Google Scholar 

  3. N. Ali. Arabic Language and Computer. Ta’reeb, 1988. (in Arabic).

    Google Scholar 

  4. A. Alsuwaynea. Information Retrieval in Arabic language. King Fahad National Library, 1995 (in Arabic).

    Google Scholar 

  5. M. Al-Atram. Effectiveness of Natural Language in Indexing and Retrieving Arabic Documents. KACST, AR-8-47. 1990. (in Arabic).

    Google Scholar 

  6. M. El-Affendi. An algebraic algorithm for Arabic morphological analysis. The Arabian Journal for Science and Engineering. 16(4B):605–611, Oct 1991.

    Google Scholar 

  7. S. Al-Fadaghi and F. Al-Anzi. A new algorithm to generate root-pattern forms. Proceedings of the 11th National Computer Conference, KFUPM, pages 391–400, March 1989.

    Google Scholar 

  8. W. Frakes and R. Baeza-Yates. Editors. Information Retrieval: Data Structures & Algorithms. Prentice Hall, 1992.

    Google Scholar 

  9. B. Thalouth and A Al-Dannan. A comprehensive Arabic morphological analyzer/generator. IBM Kuwait Scientific Center, February 1987.

    Google Scholar 

  10. T. El-Sadany and M. Hashish. An Arabic morphological system. IBM Systems Journal, 28(4):600–612, 1989.

    Article  Google Scholar 

  11. G. Kiraz. Computational analysis of Arabic morphology. Computer Laboratory, University of Cambridge, March 1995.

    Google Scholar 

  12. N. Hegazi and A. Elsharkawi. Natural Arabic language processing. Proceedings of the 9th National Computer Conference, Vol. 2, Pages (10–5–1)–(10–5–17), Riyadh. October 1986.

    Google Scholar 

  13. Y. Hlal. Morphology and syntax of the Arabic language. Proceedings of the Arab School of Science and Technology, pages 201–207, 1990.

    Google Scholar 

  14. M. Gheith and T. El-Sadany. Arabic morphological analyzer on a personal computer. Proceedings of the 1st KSU Symposium on Computer Arabization, pages 55–65, April 1987.

    Google Scholar 

  15. A. Aluthman. A Morphological Analyzer for Arabic. M. S. Thesis, KFUPM, Dhahran, 1990.

    Google Scholar 

  16. K. Beesley. Finite state morphological analysis and generation of Arabic at Xerox research: status and plans in 2001. 2001. http://www.elsnet.org/arabic2001/beesley.pdf.

  17. M. Aref. Object-oriented approach for morphological analysis. Proceedings of the 15th National Computer Conference. pages 5–11, KFUPM, Dhahran 1997.

    Google Scholar 

  18. M. Albawab and M. Altabban. Morphological computer processing for Arabic. Arabian Journal for Sciences, No. 32, pages 6–13, 1998. (in Arabic).

    Google Scholar 

  19. R. Al-shalabi. Design and implementation of an Arabic morphological system to support natural language processing. Ph.D. Dissertation. Computer Science Department, Illinois Institute of Technology. Chicago, 1996.

    Google Scholar 

  20. M. El-Affindi. Performing Arabic morphological search on the internet: a sliding window approximate matching (SWAM) algorithm and its performance. Dept. of Computer Science. CCIS, KSU. Saudi Arabia.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Al-Sughaiyer, I.A., Al-Kharashi, I.A. (2002). Rule Parser for Arabic Stemmer. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2002. Lecture Notes in Computer Science(), vol 2448. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46154-X_2

Download citation

  • DOI: https://doi.org/10.1007/3-540-46154-X_2

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44129-8

  • Online ISBN: 978-3-540-46154-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics