Abstract
This paper presents a method for automatically recognizing pronunciation-translated names (P-Names) in Chinese text using support vector machines (SVMs). A training set is built from feature vectors whose attributes are the character itself, its character-based part-of-speech (POS) tag, the frequency of the character in a P-Name table, and context information. The SVM models for P-Name identification are then trained with polynomial kernel functions. Test results show that the method is efficient at identifying pronunciation-translated names.
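As a rough illustration of the four attribute types named in the abstract, the following minimal sketch builds a sparse feature vector for one character and evaluates a polynomial kernel between two such vectors. It is not the authors' implementation: the function names, the `<PAD>` boundary marker, the one-character context window, and the kernel parameters `degree` and `coef0` are all assumptions made for the example.

```python
from typing import Dict, Sequence


def char_features(
    chars: Sequence[str],
    pos_tags: Sequence[str],
    pname_freq: Dict[str, float],
    index: int,
    window: int = 1,  # assumed context size; the abstract does not specify one
) -> Dict[str, float]:
    """Build a sparse feature dict for the character at `index`."""
    feats: Dict[str, float] = {}
    # Attribute 1: the character itself (one-hot style indicator).
    feats[f"char={chars[index]}"] = 1.0
    # Attribute 2: the character-based POS tag.
    feats[f"pos={pos_tags[index]}"] = 1.0
    # Attribute 3: frequency of the character in a P-Name table (0 if absent).
    feats["pname_freq"] = pname_freq.get(chars[index], 0.0)
    # Attribute 4: context, here the neighbouring characters within the window.
    for offset in range(-window, window + 1):
        if offset == 0:
            continue
        j = index + offset
        neighbour = chars[j] if 0 <= j < len(chars) else "<PAD>"
        feats[f"ctx[{offset:+d}]={neighbour}"] = 1.0
    return feats


def polynomial_kernel(
    x: Dict[str, float],
    y: Dict[str, float],
    degree: int = 2,      # assumed; the abstract gives no kernel parameters
    coef0: float = 1.0,   # assumed
) -> float:
    """Polynomial kernel K(x, y) = (x . y + coef0) ** degree on sparse dicts."""
    dot = sum(value * y.get(key, 0.0) for key, value in x.items())
    return (dot + coef0) ** degree
```

In practice, vectors of this kind would be passed to an SVM trainer with a polynomial kernel; the hand-written kernel above only makes the similarity computation explicit.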
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Li, L., Chen, C., Huang, D., Yang, Y. (2004). Identifying Pronunciation-Translated Names from Chinese Texts Based on Support Vector Machines. In: Yin, FL., Wang, J., Guo, C. (eds) Advances in Neural Networks – ISNN 2004. ISNN 2004. Lecture Notes in Computer Science, vol 3173. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-28647-9_162
DOI: https://doi.org/10.1007/978-3-540-28647-9_162
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22841-7
Online ISBN: 978-3-540-28647-9
eBook Packages: Springer Book Archive