Skip to main content
Log in

An Overview of Phonetic Encoding Algorithms

  • large scale systems control
  • Published:
Automation and Remote Control Aims and scope Submit manuscript

Abstract

This paper presents an overview of the phonetic encoding algorithms designed to determine the similarity of words in sound (pronunciation). Phonetic encoding algorithms are divided into the algorithms for comparing words and the algorithms for determining the distance between words. Word comparison algorithms, such as SoundEx, NYSIIS, Daitch–Mokotoff, Metaphone, and Polyphone, as well as algorithms for determining the distance between words, such as Levenshtein, Jaro, and N-grams, are described. For each algorithm, the advantages and shortcomings are discussed, and an analog for the Russian language is given. For eliminating the common shortcomings of phonetic encoding algorithms, the idea suggested in this paper is to use not the letter sequences of words, but the sequences of their elementary sounds. In this case, word recognition, record linkage, and word indexing by sounds are expected to improve.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Notes

  1. The symbol Θ denotes the asymptotic lower and upper bounds, f(n) ∈ Θ(g(n)) ↔ ∃ (CDN > 0) ∀ (n > N)∣C ⋅ g(n)∣≤f(n)≤∣D ⋅ g(n)∣. 

References

  1. Kankovski, P. What Is Your Surname? Or Russian MetaPhone. Programmist no. 8, 36–39 (2002).

    Google Scholar 

  2. Levenshtein, V. I. Binary Codes Capable of Correcting Deletions, Insertions, and Reversals. Dokl. Akad. Nauk SSSR 163(no. 4), 845–848 (1965).

    MathSciNet  MATH  Google Scholar 

  3. Abdulhayoglu, M. A. & Thijs, B. Use of ResearchGate and Google CSE for Author Name Disambiguation. Scientometrics, Budapest: Akad. Kiado 111, 1965–1985 (2017).

    Article  Google Scholar 

  4. Adeel, Z.M., Iqbal, R.N., and Masood, S.A.English to Urdu Transliteration: An Application of Soundex Algorithm, IEEE Int. Conf. on Information and Emerging Technologies, Karachi, Pakistan, June 14–16, 2010, pp. 1–5.

  5. Almeida, G., Avanco, L., Duran, M.S., et al.Evaluating Phonetic Spellers for User-Generated Content in Brazilian Portuguese, Proc. Int. Conf. on Computational Processing of the Portuguese Language, Tomar, Portugal, June 13–15, 2016, Springer International Publishing, 2016, pp. 361–373.

  6. Angeles, M.P. and Perez-Franko, L.F.Analysis of String Encoding Functions During De-Duplication Process, Proc. Int. Conf. on Informatics, Electronics & Vision, Fukuaka, Japan, June 15–18, 2015, pp. 1–6.

  7. Angeles, M.P., Espino-Gamez, A., and Gil-Moncada, J.Comparison of a Modified Spanish Phonetic, Soundex, and Phonex Coding Functions During Data Matching Process, Int. Conf. on Informatics, Electronics & Vision, Fukuaka, Japan, June 15–18, 2015, pp. 1–5.

  8. Anonthanasap, O., Ketna, M., and Leelanupab, T.Automated English Mnemonic Keyword Suggestion for Learning Japanese Vocabulary, Proc. IEEE Int. Conf. on Information Technology and Electrical Engineering (ICITEE), Chiang Mai, Thailand, October 29–30, 2015, pp. 638–643.

  9. Bilal, A., Lexical Normalization of Twitter Data, Proc. Science and Information Conf., London, July 28–30, 2015, pp. 326–328.

  10. Binstock, A. & Rex, J. Practical Algorithms for Programmers. (Addison-Wesley, Boston, 1995).

    MATH  Google Scholar 

  11. Calculate Levenshtein Distance between Two Strings. Available at: http://php.net/manual/en/function. levenshtein.php (Accessed September 8, 2017).

  12. Chheda, P., Faruqui, M., and Mitra, P.Handling OOV Words in Indian-Language–English CLIR, Proc. European Conf. on Information, Barcelona, Spain, April 1–5, 2012, Berlin: Springer-Verlag, 2012, pp. 476–479.

  13. Cherichi, S. and Faiz, R.Upgrading Event and Pattern Detection to Big Data, Proc. Int. Conf. on Computational Collective Intelligence, Halkidiki, Greece, September 28–30, 2016, Springer International Publishing, 2016, pp. 377–386.

  14. Choudhury, M., Saraf, R., Jain, V., Mukherjee, A., Sarkar, S. & Basu, A. Investigation and Modeling of the Structure of Texting Language. Int. J. Document Analys. Recognition 10, 157–174 (2007).

    Article  Google Scholar 

  15. Christen, P. and Pudjijono, A.Accurate Synthetic Generation of Realistic Personal Information, Proc. Pacific-Asia Conf. on Knowledge Discovery and Data Mining, Bangkok, Thailand, April 27–30, 2009, Berlin: Springer-Verlag, 2009, pp. 507–514.

  16. Christen, P., Geocode Matching and Privacy Preservation Proc. Int. Conf. on Privacy, Security and Trust in KDD, Las Vegas, USA, August 24, 2008, Berlin: Springer-Verlag, 2009, pp. 7–24.

  17. Chung, J.M., Lu, C.Y., Lee, H.M., and Ho, J.M.Automatic English-Chinese Name Translation by Using Web-Mining and Phonetic Similarity, Proc. IEEE Int. Conf. on Information Reuse & Integration, Las Vegas, USA, August 3–5, 2011, pp. 283–287.

  18. Damerau, F. J. A Technique for Computer Detection and Correction of Spelling Errors. ACM no. 7(3), 171–176 (1964).

    Article  Google Scholar 

  19. Denk, M. Framework for Statistical Entity Identification in R, in Data Analysis, Machine Learning and Applications (pp. 335–342. Springer-Verlag, Berlin, 2008).

    Google Scholar 

  20. Donghui, L. and Dewei, P.Spelling Correction for Chinese Language Based on Pinyin–Soundex Algorithm, Proc. Int. Conf. on Internet Technology and Applications, Wuhan, China, August 16–18, 2011, pp. 1–3.

  21. Ernst-Gerlach, A. and Fuhr, N.Advanced Training Set Construction for Retrieval in Historic Documents, Proc. Asia Information Retrieval Symp., Taipei, Taiwan, December 1–3, 2010, Berlin: Springer-Verlag, 2010, pp. 131–140.

  22. Gaona, M.A., Gelbukh, A., and Bandyopadhyay, S.Recognizing Textual Entailment Using a Machine Learning Approach, Proc. Mexican Int. Conf. on Artificial Intelligence, Pachuca, Mexico, November 8–13, 2010, Berlin: Springer-Verlag, 2010, pp. 177–185.

  23. Giraud-Carrier, C., Goodliffe, J., Jones, B. M. & Cueva, S. Effective Record Linkage for Mining Campaign Contribution Data, in Knowledge and Information Systems 45, (389–416. Springer-Verlag, London, 2015).

    Google Scholar 

  24. Grzebala, P. and Cheatham, M.Private Record Linkage: Comparison of Selected Techniques for Name Matching, Proc. Int. Semantic Web Conf., Heraklion, Crete, Greece, May 29–June 2, 2016, Springer International Publishing, 2016, pp. 593–606.

  25. Jian, H.-L.Speech Driven by Artificial Larynx: Potential Advancement Using Synthetic Pitch Contours, Proc. Int. Conf. on Universal Access in Human-Computer Interaction, Los Angeles, USA, August 2–7, 2015, Springer International Publishing, 2015, pp. 312–321.

  26. Han, Y., Min, L., Zou, Y., et al.LRC Sousou: A Lyrics Retrieval System, Proc. Int. Conf. of Young Computer Scientists, Engineers and Educators, Harbin, China, January 10–12, 2015, Berlin: Springer-Verlag, 2015, pp. 464–467.

  27. Herzog, T. N., Scheuren, F. J. & Winkler, W. E. Data Quality and Record Linkage Techniques (pp. 115–121. Springer, New York, 2007).

    Book  Google Scholar 

  28. Hong, S.G., Jang, S., Chung, Y.H., et al.News Media Analysis Using Focused Crawl and Natural Language Processing: Case of Lithuanian News Websites, Proc. Int. Conf. on Information and Software Technologies, Kaunas, Lithuania, September 13–14, 2012, Berlin: Springer-Verlag, 2012, pp. 48–61.

  29. Hu, B. & Hu, B. On Capturing Semantics in Ontology Mapping. World Wide Web 11, 361–385 (2008).

    Article  Google Scholar 

  30. Jaro, M.A.UNIMATCH—A Computer System for Generalized Record Linkage under Conditions of Uncertainty, Proc. Spring Joint Computer Conf., Anaheim, USA December 5–17, 1972, pp. 523–530.

  31. Jaro, M. A. Advances in Record-Linkage Methodology as Applied to Matching the 1985 Census of Tampa, Florida. J. Am. Statist. Ass. no. 84(406), 414–420 (1989).

    Article  Google Scholar 

  32. Jordao, C.C. and Rosa, J.L.Metaphone-pt_BR: The Phonetic Importance on Search and Correction of Textual Information, Proc. Int. Conf. on Intelligent Text Processing and Computational Linguistics, New Delhi, India, March 11–17, 2012, Berlin: Springer-Verlag, 2012, pp. 297–305.

  33. Jurafsky, D. & Martin, J. H. Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. (Prentice Hall, Pearson, 2009).

    Google Scholar 

  34. Karakasidis, A. and Verykios, V.S., Privacy Preserving Record Linkage Using Phonetic Codes, Proc. Balkan Conf. in Informatics, Thessaloniki, Greece, September 17–19, 2009, pp. 101–106.

  35. Karakasidis, A., Koloniari, G., and Verykios, V.S.Privacy Preserving Blocking and Meta-Blocking, Proc. Joint European Conf. on Machine Learning and Knowledge Discovery in Databases, Porto, Portugal, September 7–11, 2015, Springer International Publishing, 2015, pp. 232–236.

  36. Kaveh-Yazdy, F., Zareh-Bidoki, A.M.Aleph or Aleph-Maddah, That Is the Question! Spelling Correction for Search Engine Autocomplete Service, Proc. Int. Conf. on Computer and Knowledge, Mashhad, Iran, October 29–30, 2014, pp. 273–278.

  37. Knuth, D.E.The Art of Computer Programming, Boston: Addison-Wesley, 1998, vol. 3.

  38. Kocoń, J. and Piasecki, M.Named Entity Matching Method Based on the Context-Free Morphological Generator, Proc. Int. Conf. on Natural Language Processing, Warsaw, Poland, September 17–19, 2014, Springer International Publishing, 2014, pp. 34–44.

  39. Kroll, M. and Steinmetzer, S.Who Is 1011011111...1110110010? Automated Cryptanalysis of Bloom Filter Encryptions of Databases with Several Personal Identifiers, Proc. Int. Joint Conf. on Biomedical Engineering Systems and Technologies, Lisbon, Portugal, January 12–15, 2015, Springer International Publishing, 2015, pp. 341–356.

  40. Lawrence, P. Hanging on the Metaphone. Comput. Language 7(no. 12), 39–44 (1990).

    Google Scholar 

  41. Lawrence, P. The Double Metaphone Search Algorithm. C/C++ Users J. no. 18(6), 38–43 (2000).

    Google Scholar 

  42. Lawrence, P.Metaphone 3: Version 2.1.3. https://searchcode.com/codesearch/view/2366000/ (Accessed February 18, 2012).

  43. Li, J., Ouazzane, K., Jing, Y., et al.Evolutionary Ranking on Multiple Word Correction Algorithms Using Neural Network Approach, Proc. Int. Conf. on Engineering Applications of Neural Networks, London, August 27–29, 2009, Berlin: Springer-Verlag, 2009, pp. 409–418.

  44. Li, L., Quazzane, K., Kazemian, H., Jing, Y. & Boyd, R. A Neural Network Based Solution for Automatic Typing Errors Correction. Neural Comput. Appl. 20, 889–896 (2011).

    Article  Google Scholar 

  45. Lisbach, B. & Meyer, M. Linguistic Identity Matching (pp. 118–120. Springer Fachmedien Wiesbaden, Wiesbaden, 2013).

    Book  Google Scholar 

  46. Louppe, G., Al-Natsheh, H.T., Susik, M., and Maguire, E.J.Ethnicity Sensitive Author Disambiguation Using Semi-Supervised Learning, Proc. Int. Conf. on Knowledge Engineering and the Semantic Web, Prague, Czech Republic, September 21–23, 2016, Springer International Publishing, 2016, pp. 272–287.

  47. Maarif, H.A., Akmeliawati, R., Htike, Z.Z., and Gunawan, T.S.Complexity Algorithm Analysis for Edit Distance, Proc. Int. Conf. on Computer and Communication Engineering, Kuala L umpur, Malaysia, September 23–24, 2014, pp. 135 –137.

  48. Mandal, A.K., Hossan, D., and Nadim, M.Developing an Efficient Search Suggestion Generator, Ignoring Spelling Error for High Speed Data Retrieval Using Double Metaphone Algorithm, Proc. Int. Conf. on Computer and Information Technology, Dhaka, Bangladesh, December 23–25, 2010, pp. 317–320.

  49. Martins, B., ASupervisedMachine Learning Approach for Duplicate Detection over Gazetteer Records, Proc. Int. Conf. on GeoSpatial Semantics, Brest, France, May 12–13, 2011, Berlin: Springer-Verlag, 2011, pp. 34–51.

  50. Mason-Blakley, F., Lu, L., Price, M., and Roudsari, A.An RCT Simulation Study on Performance and Accuracy of Inexact Matching Algorithms for Patient Identity in Ambulatory Care Settings, Proc. Int. Conf. on Healthcare Informatics, Dallas, USA, October 21–23, 2015, pp. 8–17.

  51. Meintanis, K. and Shipman, F.M.Visual Expression for Organizing and Accessing Music Collections in MusicWiz, Proc. Int. Conf. on Theory and Practice of Digital Libraries, Glasgow, United Kingdom, September 6–10, 2010, Berlin Heidelberg: Springer-Verlag, 2010, pp. 80–91.

  52. Mosquera, A. and Moreda, P.The Study of Informality as a Framework for Evaluating the Normalisation of Web 2.0 Texts, Proc. Int. Conf. on Application of Natural Language to Information Systems, Groningen, The Netherlands, June 26–28, 2012, Berlin Heidelberg: Springer-Verlag, 2012, pp. 241–246.

  53. Mutalib, A. and Noah, S.A.Phonetic Coding Methods for Malay Names Retrieval, Proc. IEEE Int. Conf. on Semantic Technology and Information, Putrajaya, Malaysia, June 28–29, 2011, pp. 125–129.

  54. Muthmann, K. and Loser, A.Detecting Near-Duplicate Relations in User Generated Forum Content, Proc. OTM Confederated Int. Conf. “On the Move to Meaningful Internet Systems,” Hersonissos, Crete, Greece, October 25–29, 2010, Berlin: Springer-Verlag, 2010, pp. 698–707.

  55. Ouazzane, K., Li, J., and Kazemian, H.B.An Intelligent Keyboard Framework for Improving Disabled People Computer Accessibility, Proc. Int. Conf. on Engineering Applications of Neural Networks, International Federation for Information Processing, Corfu, Greece, September 15–18, 2011, pp. 382–391.

  56. Ousidhoum, N.D. and Bensaou, N.Towards the Refinement of the Arabic Soundex, Proc. Int. Conf. on Application of Natural Language to Information Systems, Salford, United Kingdom, June 19–21, 2013, Berlin: Springer-Verlag, 2013, pp. 309–314.

  57. Owonibi, M. and Koenig-Ries, B.A Quality Management Workflow Proposal for a Biodiversity Data Repository, Proc. Int. Conf. on Conceptual Modeling, Atlanta, USA, October 27–29, 2014, in Advances in Conceptual Modeling, Lecture Notes in Computer Science, vol. 8823, Switzerland: Springer International Publishing, 2014, pp. 157–167.

  58. Pande, B. P. & Dhami, H. S. Application of Natural Language Processing Tools in Stemming. Int. J. Computer Appl. no. 27(6), 14–19 (2011).

    Google Scholar 

  59. Paramonov, V.V., Shigarov, A.O., Ruzhnikov, G.M., et al.Polyphon: An Algorithm for Phonetic String Matching in Russian Language, Proc. Int. Conf. on Information and Software Technologies, Druskininkai, Lithuania, October 13–15, 2016, Springer International Publishing, 2016, pp. 568–579.

  60. Parmar, V.P., Pandya, A.K., and Kumbharana, C.K.Determining the Character Replacement Rules and Implementing Them for Phonetic Identification of Given Words to Identify Similar Pronunciation Words, Proc. IEEE Int. Conf. on Futuristic Trends in Computational Analysis and Knowledge Management (ABLAZE), New Delhi, India, February 25–27, 2015, pp. 272–277.

  61. Parvathy, A.G, Vasudevan, G.B., Kumar, A., and Balakrishnan, R.Leveraging Call Center Logs for Customer Behavior Prediction, Proc. Int. Symp. on Intelligent Data Analysis, New Delhi, India, February 25–27, 2009, Berlin: Springer-Verlag, 2009, pp. 143–154.

  62. Pfeiffer, A., Kuraeva, A., Foulonneau, M., et al.Automatically Generated Metrics for Multilingual Inline Choice Questions on Reading Comprehension, Proc. Int. Computer Assisted Assessment Conf., Zeist, The Netherlands, June 22–23, 2015, Springer International Publishing, 2015, pp. 80–95.

  63. Pinto, D., Vilarino, D., Aleman, Y., et al.The Soundex Phonetic Algorithm Revisited for SMS Text Representation, Proc. Int. Conf. on Text, Speech and Dialogue, Brno, Czech Republic, September 3–7, 2012, Berlin: Springer-Verlag, 2012, pp. 47–55.

  64. Purao, S., Storey, V.C., Sugumaran, V., et al.Repurposing Social Tagging Data for Extraction of Domain-Level Concepts, Proc. Int. Conf. on Application of Natural Language to Information Systems, Alicante, Spain, June 28–30, 2011. Berlin: Springer-Verlag, 2011, pp. 185–192.

  65. Rakibul, H. and Reaz, A.FM-Chord: Fault-tolerant Chord Supporting Misspelled Queries, Proc. Int. Conf. on Computers and Information Technology, Dhaka, Bangladesh, December 21–23, 2009, pp. 651–656.

  66. Reyes-Barragan, M., Villasenor-Pineda, L., Montes-Y-Gomez, M.A Soundex-Based Approach for Spoken Document Retrieval, Proc. Mexican Int. Conf. on Artificial Intelligence, Atizapan de Zaragoza, Mexico, October 27–31, 2008, Berlin: Springer-Verlag, 2008, pp. 204–211.

  67. Reyes-Barragan, A., Montes-Y-Gomez, M., Villasenor-Pineda, L.Combining Word and Phonetic-Code Representations for Spoken Document Retrieval, Proc. Int. Conf. on Intelligent Text Processing and Computational Linguistics, Tokyo, Japan, February 20–26, 2011, Berlin: Springer-Verlag, 2011, pp. 458–466.

  68. Roedler, R., Kergl, D., and Rodosek, G.D.Profile Matching Across Online Social Networks Based on Geo-Tags, in Advances in Nature and Biologically Inspired Computing, Springer International Publishing, 2016, pp. 417–428.

  69. Russell, R.C. and Margaret, K.O.US Patent 1262167, 1435663, 1918, 1922.

  70. Sanchez-Vilas, F., Lama, M., Vidal, J.C., et al.Combining Uncorrelated Similarity Measures for Service Discovery Proc. Int. Workshop on Resource Discovery, Paris, France, November 5, 2010, Berlin: Springer-Verlag, 2012, pp. 160–180.

  71. Schierle, M., Schulz, S., and Ackermann, M.From Spelling Correction to Text Cleaning—Using Context Information, in Data Analysis, Machine Learning and Applications, Berlin: Springer-Verlag, 2008, pp. 397–404

  72. Schneider, J.M., Fernandez, J., and Martinez, P.A Proof-of-Concept for Orthographic Named Entity Correction in Spanish Voice Queries, Proc. Int. Workshop on Adaptive Multimedia Retrieval, Copenhagen, Denmark, October 24–25, 2012, Springer International Publishing, 2014, pp. 181–190.

  73. Smart, P.D., Jones, C.B., and Twaroch, F.A.Multi-source Toponymal Data Integration and Mediation for a Meta-Gazetteer Service, Proc. Int. Conf.on Geographic Information Science, Zurich, Switzerland, April 30, 2010, Berlin: Springer-Verlag, 2010, pp. 234–248.

  74. Soman, S., Srivastava, P., and Murthy, B.K.Unique Health Identifier for India: An Algorithm an Feasibility Analysis on Patient Data, Proc. Int. Conf. on E-health Networking, Application & Services, Boston, USA, October 13–17, 2015, pp. 250–255.

  75. Soundexing and Genealogy by Gary Mokotoff. www.avotaynu.com/soundex.htm (Accessed September 8, 2017).

  76. Souza, M. and Vieira, R.Sentiment Analysis on Twitter Data for Portuguese Language Proc. 10th Int. Conf. on Computational Processing of the Portuguese Language (PROPOR’12), Berlin: Springer-Verlag, 2012, pp. 241–247.

  77. Taft, R.L.Name Search Techniques, New York State Identification and Intelligence System: Special Report no. 1, New York: Albany, 1970.

  78. Souza, J., Botega, L.C., Segundo, S., et al.Conceptual Framework to Enrich Situation Awareness of Emergency Dispatchers, Proc. Int. Conf. on Human Interface and the Management of Information, Los Angeles, USA, August 2–7, 2015, Springer International Publishing, 2015, pp. 33–44.

  79. Tissot, H., Peschl, G., and Fabro, M.D.Fast Phonetic Similarity Search over Large Repositories, Proc. Int. Conf. on Database and Expert Systems Applications, Munich, Germany, September 1–4, 2014, Springer International Publishing, 2014, pp. 74–81.

  80. Wagner, R. A. & Fischer, M. J. The String-to-String Correction Problem. ACM no. 21(1), 168–173 (1974).

    MathSciNet  MATH  Google Scholar 

  81. Winkler, W.E.String Comparator Metrics and Enhanced Decision Rules in the Fellegi–Sunter Model of Record, Proc. Section on Survey Research Methods, American Statistical Association, 1990, pp. 354–359.

  82. Zampieri, M. and Amorim, R.C., Between Sound and Spelling: Combining Phonetics and Clustering Algorithms to Improve Target Word Recovery, Proc. Int. Conf. on Natural Language Processing, Warsaw, Poland, September 17–19, 2014, Springer International Publishing, 2014, pp. 438–449.

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Vykhovanets, V., Du, J. & Sakulin, S. An Overview of Phonetic Encoding Algorithms. Autom Remote Control 81, 1896–1910 (2020). https://doi.org/10.1134/S0005117920100082

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1134/S0005117920100082

Keywords

Navigation