Identifying Pronunciation-Translated Names from Chinese Texts Based on Support Vector Machines

Li, Lishuang; Chen, Chunrong; Huang, Degen; Yang, Yuansheng

doi:10.1007/978-3-540-28647-9_162

Lishuang Li¹⁹,
Chunrong Chen¹⁹,
Degen Huang¹⁹ &
…
Yuansheng Yang¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3173))

Included in the following conference series:

International Symposium on Neural Networks

1371 Accesses

Abstract

This paper presents a method of automatic recognition of pronunciation-translated names (P-Names) based on support vector machines (SVMs): extracting the character itself, character-based part-of-speech (POS) tag, frequency information of a character in P-Name table and context information as the attributes of feature vectors, a training set is established. The machine learning models of automatic identification of P-Names based on support vector machines are obtained using polynomial kernel functions. The testing results show that this method is efficient for identifying pronunciation-translated names.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Automatic language identification: a case study of Pahari languages

Article 12 May 2023

String Kernel-Based Techniques for Native Language Identification

Article Open access 14 June 2023

Italian Text Categorization with Lemmatization and Support Vector Machines

References

Vapnik, V.N.: The Nature of Statistical Learning Theory. Springer, Berlin (1995)
MATH Google Scholar
Joachims, T.: Text Categorization with Support Vector Machines: Learning with Many Relevant Features. In: Nedellec, C., Rouveirol, C. (eds.) ECML 1998. LNCS, vol. 1398, pp. 137–142. Springer, Heidelberg (1998)
Chapter Google Scholar
Li, H., Zhu, J., Yao, T.: SVM Based Chinese Text Chunking. Journal of Chinese Information Processing 2, 1–7 (2004)
Google Scholar
Goh, C.L., Asahara, M., Matsumoto, Y.: Chinese Unknown Word Identification Using Character-Based Tagging and Chunking. In: ACL 2003: 41st Annual Meeting of the Association for Computational Linguistics, Interactive Poster/Demo Sessions, Companion Volume of the Proceesings, pp. 197–200 (2003)
Google Scholar
Vapnik, V.N.: Statistical Learning Theory. John Wiley & Sons, New York (1998)
MATH Google Scholar
Xi, C., Sun, M.: Automatic Prediction of Chinese Phrase Boundary Location with Neural Networks. Journal of Chinese Information Processing 2, 20–26 (2002)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Dalian University of Technology, 116024, Dalian, China
Lishuang Li, Chunrong Chen, Degen Huang & Yuansheng Yang

Authors

Lishuang Li
View author publications
You can also search for this author in PubMed Google Scholar
Chunrong Chen
View author publications
You can also search for this author in PubMed Google Scholar
Degen Huang
View author publications
You can also search for this author in PubMed Google Scholar
Yuansheng Yang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Electronic and Information Engineering, Dalian University of Technology, 116023, Dalian, China
Fu-Liang Yin
Department of Mechanical and Automation Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong, China
Jun Wang
School of Electronic and Information Engineering, Dalian University of Technology, 116023, Dalian, Liaoning, China
Chengan Guo

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, L., Chen, C., Huang, D., Yang, Y. (2004). Identifying Pronunciation-Translated Names from Chinese Texts Based on Support Vector Machines. In: Yin, FL., Wang, J., Guo, C. (eds) Advances in Neural Networks – ISNN 2004. ISNN 2004. Lecture Notes in Computer Science, vol 3173. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-28647-9_162

Download citation

DOI: https://doi.org/10.1007/978-3-540-28647-9_162
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22841-7
Online ISBN: 978-3-540-28647-9
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics