Automatic Korean Phoneme Generation Via Input-Text Preprocessing and Disambiguation

  • Conference paper
Text, Speech and Dialogue (TSD 2006)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4188)

Abstract

This paper proposes an Automatic Korean Phoneme Generator (AKPG) that can be adapted to natural language processing (NLP) systems that handle raw input text from users, such as Korean pronunciation education systems. Resolving noise and ambiguity is a precondition for correct natural language processing. To satisfy this condition, the AKPG, as a module of an NLP system, combines linguistic and information retrieval (IR) methods. Preprocessing modules in the AKPG handle spelling errors that would otherwise make correct phoneme generation impossible, and they also convert alphanumeric symbols into Korean characters. Finally, to resolve part-of-speech (POS) ambiguities and ambiguities among homographs that share a POS, homograph collocations are collected from a large corpus using the IR method, and those homographs are integrated into dependency rules for partial parsing.
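
The abstract does not specify how alphanumeric symbols are converted, so the following is only a minimal sketch of one small part of such a step: spelling out Arabic numerals in Sino-Korean before phoneme generation. The function names (`read_group`, `sino_korean`, `replace_numbers`) and the reading conventions (dropping 일 before 십, 백, 천, and reading 10000 as 만) are assumptions made for illustration, not the AKPG's actual rules.

```python
import re

# Sino-Korean digit names; index 0 is unused inside a group.
DIGITS = ["", "일", "이", "삼", "사", "오", "육", "칠", "팔", "구"]
SMALL_UNITS = ["", "십", "백", "천"]  # positions inside a 4-digit group
LARGE_UNITS = ["", "만", "억", "조"]  # one unit per 4-digit group


def read_group(n: int) -> str:
    """Read 1..9999 in Sino-Korean, dropping 일 before 십/백/천 (assumed convention)."""
    parts = []
    for pos in range(3, -1, -1):
        digit = (n // 10 ** pos) % 10
        if digit == 0:
            continue
        if digit == 1 and pos > 0:
            parts.append(SMALL_UNITS[pos])  # 10 -> 십, not 일십
        else:
            parts.append(DIGITS[digit] + SMALL_UNITS[pos])
    return "".join(parts)


def sino_korean(n: int) -> str:
    """Spell out an integer in the range 0 <= n < 10**16 in Sino-Korean."""
    if n < 0 or n >= 10 ** 16:
        raise ValueError("supported range is 0 <= n < 10**16")
    if n == 0:
        return "영"
    groups = []  # 4-digit groups, least significant first
    while n > 0:
        groups.append(n % 10000)
        n //= 10000
    parts = []
    for idx in range(len(groups) - 1, -1, -1):
        value = groups[idx]
        if value == 0:
            continue
        if value == 1 and idx == 1:
            parts.append("만")  # 10000 -> 만 (assumed convention)
        else:
            parts.append(read_group(value) + LARGE_UNITS[idx])
    return "".join(parts)


def replace_numbers(text: str) -> str:
    """Replace every run of ASCII digits in text with its Sino-Korean reading."""
    return re.sub(r"[0-9]+", lambda m: sino_korean(int(m.group())), text)


print(replace_numbers("2006년"))
```

A real converter would also need context to choose between Sino-Korean and native Korean readings (for example before counters) and to handle irregular forms such as month names; this sketch ignores all of that and raises `ValueError` for digit runs outside its supported range.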



Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kang, My., Jung, Sw., Kwon, Hc., Yoon, A. (2006). Automatic Korean Phoneme Generation Via Input-Text Preprocessing and Disambiguation. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2006. Lecture Notes in Computer Science, vol 4188. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11846406_56

  • DOI: https://doi.org/10.1007/11846406_56

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-39090-9

  • Online ISBN: 978-3-540-39091-6

  • eBook Packages: Computer Science, Computer Science (R0)
