Abstract
In this paper, we present phonetic encoding functions that play the role of hash functions in the indexation of an Arabic dictionary. They allow us to answer approximate queries that, given a query word, ask for all the words that are phonetically similar to it. They consider the phonetic features of the standard Arabic language and involve some possible phonetic alterations induced by specific habits in the pronunciation of Arabic.
We propose two functions, the first one is called the ”Algerian Dialect Refinement” and it takes into account phonetic confusions usually known to the Algerian people while speaking Arabic; and the second one is named the ”Speech Therapy Refinement” and it examines some mispronunciations common to children.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Maniez, D.: Cours sur les Soundex, http://www-info.univ-lemans.fr/~carlier/recherche/soundex.html
National Archives: The Soundex Indexing System, http://www.archives.gov/research/census/soundex.html
Aqeel, S.U., et al.: On the Development of Name Search Techniques for Arabic. J. Am. Soc. Inf. Sci. Technol. 57(6), 728–739 (2006)
Ben Hamadou, A.: Vérification et correction automatiques par analyse affixale des textes écrits en langage naturel: le cas de l’arabe non voyellé. PhD thesis, University of Sciences, Technology and Medicine of Tunis (2003)
Al Husseiny, A.: Dirassat Qur’aniya-2- Ahkam At-Tajweed Bee Riwayet Arsh An Nafia An Tariq Al’azraq. Maktabat Arradwan (2005)
Hall, P.A.V., Dowling, G.R.: Approximate String Matching. Computing Surveys 12(4) (1980)
Lait, A., Randell, B.: An Assessment of Name Matching Algorithms. Technical Report, University of Newcastle upon Tyne (1993)
Navarro, G.: A Guided Tour to Approximate String Matching. ACM Comput. Surv. 33(1), 31–88 (2001), doi:10.1145/375360.375365
Navarro, G., Baeza-Yates, R.: Very Fast and Simple Approximate String Matching. Information Processing Letters (1999)
Ousidhoum, N.D., Bensalah, A., Bensaou, N.: A New Classical Arabic Soundex algorithm. In: Proceedings of the Second Conference on Advances in Communication and Information Technologies (2012), http://doi.searchdl.org/03.CSS.2012.3.28
Philips, L.: Hanging on the Metaphone. Computer Language 7(12) (December 1990)
Philips, L.: The Double Metaphone Search Algorithm. Dr Dobb’s (2003)
Precision Indexing Staff: The Daitch-Mokoto Soundex Reference Guide. Heritage Quest (1994)
Rytting, C.A., et al.: Error Correction for Arabic Dictionary Lookup. In: Proceedings of the Seventh International Conference on Language Resources and Evaluation, LREC 2010 (2010)
Shaalan, K., Allam, A., Gomah, A.: Towards Automatic Spell Checking for Arabic. In: Proceedings of the Fourth Conference on Language Engineering, Egyptian Society of Language Engineering, ELSE (2003)
Shaalan, K., et al.: Arabic Word Generation and Modelling for Spell Checking. In: Proceedings of the Eight International Conference on Language Resources and Evaluation, LREC 2012 (2012)
Shaalan, K., Aref, R., Fahmy, A.: An Approach for Analyzing and Correcting Spelling Errors for Non-native Arabic learners. In: Proceedings of the 7th International Conference on Informatics and Systems, INFOS 2010. Cairo University (2010)
Taft, R.L.: Name Searching Techniques. Technical Report, New York State Identification and Intelligence System, Albany, N.Y. (1970)
Yahia, M.E., Saeed, M.E., Salih, A.M.: An Intelligent Algorithm For Arabic Soundex Function Using Intuitionistic Fuzzy Logic. In: International IEEE Conference on Intelligent Systems, IS (2006)
Watson, J.C.E.: The Phonology and Morphology of Arabic. OUP Oxford (2007)
Ben Othmane Zribi, C., Ben Ahmed, M.: Efficient Automatic Correction of Misspelled Arabic Words Based on Contextual Information. In: Palade, V., Howlett, R.J., Jain, L. (eds.) KES 2003. LNCS, vol. 2773, pp. 770–777. Springer, Heidelberg (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ousidhoum, N.D., Bensaou, N. (2013). Towards the Refinement of the Arabic Soundex. In: Métais, E., Meziane, F., Saraee, M., Sugumaran, V., Vadera, S. (eds) Natural Language Processing and Information Systems. NLDB 2013. Lecture Notes in Computer Science, vol 7934. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38824-8_30
Download citation
DOI: https://doi.org/10.1007/978-3-642-38824-8_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38823-1
Online ISBN: 978-3-642-38824-8
eBook Packages: Computer ScienceComputer Science (R0)