Abstract
The paper presents a method of Chinese chunk recognition based on Support Vector Machines (SVMs) plus Sigmoid. It is well known that SVMs are binary classifiers which achieve the best performance in many tasks. However, directly applying binary classifiers in the task of Chinese chunking will face the dilemmas that either two or more different class labels are given to a single unlabeled constituent, or no class labels are given for some unlabeled constituents. Employing sigmoid functions is a method of extracting probabilities (class/input) from SVMs outputs, which is helpful to post-processing of classification. These probabilities are then used to resolve the dilemmas. We compare our method based on SVMs plus Sigmoid with methods based only on SVMs. The experiments show that significant improvements have been achieved.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Zhang, Y., Zhou, Q.: Automatic identification of chinese base phrases. Journal of Chinese Information Processing 16 (2002)
Zhou, Q., song Sun, M., ning Huang, C.: Chunking parsing scheme for chinese sentences. Chinese J. Computers 22, 1158–1165 (1999)
Zhao, J., Huang, C.: A transformation-based model for chinese basenp recognition. Journal of Chinese Information Processing 13, 1–7 (1998)
Kudo, T., Matsumoto, Y.: Use of support vector learning for chunk identification. In: Proc. of CoNLL 2000 and LLL 2000 (2000)
Kudo, T., Matsumoto, Y.: Chunking with support vector machines. In: NAACL 2001 (2001)
Nakagawa, T., Kudoh, T., Matsumoto, Y.: Unknown word guessig and part-ofspeech tagging using support vector machines. In: Proc. of the 6th NLPRS, pp. 325–331 (2001)
Yamada, H., Kudoh, T., Matsumoto, Y.: Japanese named entity extraction using support vector machines. IPSJSIG (1999)
Joachims, T.: Learning to classify text using support vector machines. Kluwer, Dordrecht (2002)
Abney, S.: Parsing by chunks. In: Berwick, R.C., Abney, S.P., Tenny, C. (eds.) Principle-Based Parsing: Computation and Psycholinguistics, pp. 257–278. Kluwer Academic Publishers, Boston (1991)
Cortes, C., Vapnik, V.: Support-vector networks. Machine Learning 20, 273–297 (1995)
N.Vapnik, V.: The nature of statistical learning theory. Springer, Heidelberg (1995)
Boswell, D.: Introduction to support vector machines (2002)
Kazama, J., Makino, T., Ohta, Y., Tsujii, J.: Tuning support vector machines for biomedical named entity recognition (2002)
Platt, J.: Probabilistic outputs for support vector machines and comparison to regularized likelihood methods, pp. 61–74 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tan, Y., Yao, T., Chen, Q., Zhu, J. (2005). Chinese Chunk Identification Using SVMs Plus Sigmoid. In: Su, KY., Tsujii, J., Lee, JH., Kwong, O.Y. (eds) Natural Language Processing – IJCNLP 2004. IJCNLP 2004. Lecture Notes in Computer Science(), vol 3248. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30211-7_56
Download citation
DOI: https://doi.org/10.1007/978-3-540-30211-7_56
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24475-2
Online ISBN: 978-3-540-30211-7
eBook Packages: Computer ScienceComputer Science (R0)