Skip to main content

Towards the Refinement of the Arabic Soundex

  • Conference paper
Natural Language Processing and Information Systems (NLDB 2013)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7934))

Abstract

In this paper, we present phonetic encoding functions that play the role of hash functions in the indexation of an Arabic dictionary. They allow us to answer approximate queries that, given a query word, ask for all the words that are phonetically similar to it. They consider the phonetic features of the standard Arabic language and involve some possible phonetic alterations induced by specific habits in the pronunciation of Arabic.

We propose two functions, the first one is called the ”Algerian Dialect Refinement” and it takes into account phonetic confusions usually known to the Algerian people while speaking Arabic; and the second one is named the ”Speech Therapy Refinement” and it examines some mispronunciations common to children.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Maniez, D.: Cours sur les Soundex, http://www-info.univ-lemans.fr/~carlier/recherche/soundex.html

  2. National Archives: The Soundex Indexing System, http://www.archives.gov/research/census/soundex.html

  3. Aqeel, S.U., et al.: On the Development of Name Search Techniques for Arabic. J. Am. Soc. Inf. Sci. Technol. 57(6), 728–739 (2006)

    Article  Google Scholar 

  4. Ben Hamadou, A.: Vérification et correction automatiques par analyse affixale des textes écrits en langage naturel: le cas de l’arabe non voyellé. PhD thesis, University of Sciences, Technology and Medicine of Tunis (2003)

    Google Scholar 

  5. Al Husseiny, A.: Dirassat Qur’aniya-2- Ahkam At-Tajweed Bee Riwayet Arsh An Nafia An Tariq Al’azraq. Maktabat Arradwan (2005)

    Google Scholar 

  6. Hall, P.A.V., Dowling, G.R.: Approximate String Matching. Computing Surveys 12(4) (1980)

    Google Scholar 

  7. Lait, A., Randell, B.: An Assessment of Name Matching Algorithms. Technical Report, University of Newcastle upon Tyne (1993)

    Google Scholar 

  8. Navarro, G.: A Guided Tour to Approximate String Matching. ACM Comput. Surv. 33(1), 31–88 (2001), doi:10.1145/375360.375365

    Article  Google Scholar 

  9. Navarro, G., Baeza-Yates, R.: Very Fast and Simple Approximate String Matching. Information Processing Letters (1999)

    Google Scholar 

  10. Ousidhoum, N.D., Bensalah, A., Bensaou, N.: A New Classical Arabic Soundex algorithm. In: Proceedings of the Second Conference on Advances in Communication and Information Technologies (2012), http://doi.searchdl.org/03.CSS.2012.3.28

  11. Philips, L.: Hanging on the Metaphone. Computer Language 7(12) (December 1990)

    Google Scholar 

  12. Philips, L.: The Double Metaphone Search Algorithm. Dr Dobb’s (2003)

    Google Scholar 

  13. Precision Indexing Staff: The Daitch-Mokoto Soundex Reference Guide. Heritage Quest (1994)

    Google Scholar 

  14. Rytting, C.A., et al.: Error Correction for Arabic Dictionary Lookup. In: Proceedings of the Seventh International Conference on Language Resources and Evaluation, LREC 2010 (2010)

    Google Scholar 

  15. Shaalan, K., Allam, A., Gomah, A.: Towards Automatic Spell Checking for Arabic. In: Proceedings of the Fourth Conference on Language Engineering, Egyptian Society of Language Engineering, ELSE (2003)

    Google Scholar 

  16. Shaalan, K., et al.: Arabic Word Generation and Modelling for Spell Checking. In: Proceedings of the Eight International Conference on Language Resources and Evaluation, LREC 2012 (2012)

    Google Scholar 

  17. Shaalan, K., Aref, R., Fahmy, A.: An Approach for Analyzing and Correcting Spelling Errors for Non-native Arabic learners. In: Proceedings of the 7th International Conference on Informatics and Systems, INFOS 2010. Cairo University (2010)

    Google Scholar 

  18. Taft, R.L.: Name Searching Techniques. Technical Report, New York State Identification and Intelligence System, Albany, N.Y. (1970)

    Google Scholar 

  19. Yahia, M.E., Saeed, M.E., Salih, A.M.: An Intelligent Algorithm For Arabic Soundex Function Using Intuitionistic Fuzzy Logic. In: International IEEE Conference on Intelligent Systems, IS (2006)

    Google Scholar 

  20. Watson, J.C.E.: The Phonology and Morphology of Arabic. OUP Oxford (2007)

    Google Scholar 

  21. Ben Othmane Zribi, C., Ben Ahmed, M.: Efficient Automatic Correction of Misspelled Arabic Words Based on Contextual Information. In: Palade, V., Howlett, R.J., Jain, L. (eds.) KES 2003. LNCS, vol. 2773, pp. 770–777. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ousidhoum, N.D., Bensaou, N. (2013). Towards the Refinement of the Arabic Soundex. In: Métais, E., Meziane, F., Saraee, M., Sugumaran, V., Vadera, S. (eds) Natural Language Processing and Information Systems. NLDB 2013. Lecture Notes in Computer Science, vol 7934. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38824-8_30

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-38824-8_30

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-38823-1

  • Online ISBN: 978-3-642-38824-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics