Abstract
This paper proposes an Automatic Korean Phoneme Generator (AKPG) that can be adapted to various natural language processing systems that handle raw input-text from users such as the Korean pronunciation education system. Resolving noise and ambiguity is a precondition for correct natural language processing. In order to satisfy this condition, the AKPG, as a module of an NLP system, combines linguistic and IR methods. Preprocessing modules are incorporated into the AKPG to handle spelling-errors that render correct phoneme generation impossible. In addition, the preprocessing modules convert alphanumeric symbols into Korean characters. Finally, in order to remove part-of-speech (POS) ambiguities and those of homographs with the same POS, homograph collocations are collected from a large corpus using the IR method. In addition, those homographs are integrated into dependency rules for partial parsing.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Belew, R.K.: Finding Out About. Cambridge University Press, Cambridge (2000)
CoreVoice, http://corevoice.com/
Ingram, C.L., Park, S.G.: Cross-language vowel perception and production by Japanese and Korean learners of English. Journal of Phonetics 25, 343–370 (1997)
Jung, Y.I., Lee, D.H., Nam, H.S., Yoon, A., Kwon, H.C.: Learning for Transliteration of Arabic-Numeral Expressions Using Decision Tree for Korean TTS. In: Proc. InterSpeech 2004-ICSLP, vol. 3, pp. 1937–1940 (2004)
Kang, M.Y., Yoon, A., Kwon, H.C.: Improving Partial Parsing Based on Error-Pattern Analysis for Korean Grammar-Checker. In: TALIP, vol. 2-4, pp. 301–323. ACM, New York (2003)
Resnik, P., Yarowsky, D.: Distinguishing Systems and Distinguishing Senses: New Evaluation Methods for Word Sense Disambiguation. Natural Language Engineering 5(2), 113–133 (1997)
Taylor, I., Taylor, M.: Writing and Literacy in Chinese, Korean and Japanese. John Benjamins Publishing Company, Amsterdam (1995)
VoiceWare, http://www.voiceware.co.kr
Yarowsky, D.: Homograph Disambiguation in Text-to-speech Synthesis, pp. 157–172. Springer, Heidelberg (1996)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kang, My., Jung, Sw., Kwon, Hc., Yoon, A. (2006). Automatic Korean Phoneme Generation Via Input-Text Preprocessing and Disambiguation. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2006. Lecture Notes in Computer Science(), vol 4188. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11846406_56
Download citation
DOI: https://doi.org/10.1007/11846406_56
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-39090-9
Online ISBN: 978-3-540-39091-6
eBook Packages: Computer ScienceComputer Science (R0)