Discrete and Neural Models for Chinese POS Tagging: Comparison and Combination

Zhang, Meishan; Yu, Nan; Fu, Guohong

doi:10.1007/978-3-319-50496-4_37

Meishan Zhang¹⁸,
Nan Yu¹⁸ &
Guohong Fu¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10102))

Included in the following conference series:

4634 Accesses

Abstract

Discrete and Neural models are two mainstream methods for Chinese POS tagging nowadays. Both have achieved state-of-the-art performances. In this paper, we compare the two kinds of models empirically, and further investigate the combination methods of them. In particular, as the pre-trained word embeddings are exploited under the neural setting, one can regard neural models as semi-supervised setting. To make a fairer comparison of the discrete and the neural models, we incorporate word clusters for both models as well as their combination, since it has been generally accepted that word clusters can encode similar information as pre-trained word embeddings.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
https://sourceforge.net/projects/zpar, version 0.7.
2.
http://word2vec.googlecode.com/.
3.
We use the tool available at https://github.com/percyliang/brown-cluster to produce word clusters, using the same corpus as the training of word embeddings.

References

Brants, T.: TnT: a statistical part-of-speech tagger. In: Proceedings of the Sixth Conference on Applied Natural Language Processing, pp. 224–231 (2000)
Google Scholar
Collins, M.: Discriminative training methods for hidden Markov models: theory and experiments with perceptron algorithms. In: EMNLP (2002)
Google Scholar
Collobert, R., Weston, J., Bottou, L., Karlen, M., Kavukcuoglu, K., Kuksa, P.: Natural language processing (almost) from scratch. JMLR 12, 2493–2537 (2011)
MATH Google Scholar
Duchi, J., Hazan, E., Singer, Y.: Adaptive subgradient methods for online learning and stochastic optimization. JMLR 12, 2121–2159 (2011)
MathSciNet MATH Google Scholar
Giménez, J., Marquez, L.: SVMTool: a general POS tagger generator based on support vector machines. In: Proceedings of the 4th LREC (2004)
Google Scholar
Hatori, J., Matsuzaki, T., Miyao, Y., Tsujii, J.: Incremental joint POS tagging and dependency parsing in Chinese. In: IJCNLP, pp. 1216–1224 (2011)
Google Scholar
Lafferty, J.D., McCallum, A., Pereira, F.C.N.: Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: ICML, pp. 282–289 (2001)
Google Scholar
Li, Z., Chao, J., Zhang, M., Chen, W.: Coupled sequence labeling on heterogeneous annotations: POS tagging as a case study. In: ACL, pp. 1783–1792 (2015)
Google Scholar
Li, Z., Che, W., Liu, T.: Improving Chinese POS tagging with dependency parsing. In: IJCNLP, pp. 1447–1451 (2011)
Google Scholar
Li, Z., Liu, T., Che, W.: Exploiting multiple treebanks for parsing with quasi-synchronous grammars. In: Proceedings of the 50th ACL, pp. 675–684 (2012)
Google Scholar
Li, Z., Zhang, M., Che, W., Liu, T.: A separately passive-aggressive training algorithm for joint POS tagging and dependency parsing. In: COLING 2012, pp. 1681–1698 (2012)
Google Scholar
Ma, X., Hovy, E.: End-to-end sequence labeling via bi-directional LSTM-CNNS-CRF. In: ACL (2016)
Google Scholar
Manning, C.D.: Part-of-speech tagging from 97% to 100%: is it time for some linguistics? In: Gelbukh, A.F. (ed.) CICLing 2011. LNCS, vol. 6608, pp. 171–189. Springer, Heidelberg (2011). doi:10.1007/978-3-642-19400-9_14
Chapter Google Scholar
Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)
dos Santos, C.N., Zadrozny, B.: Learning character-level representations for part-of-speech tagging. In: ICML, pp. 1818–1826 (2014)
Google Scholar
Sun, W.: Word-based and character-based word segmentation models: comparison and combination. In: COLING 2010: Posters, pp. 1211–1219 (2010)
Google Scholar
Sun, W., Uszkoreit, H.: Capturing paradigmatic and syntagmatic lexical relations: towards accurate Chinese part-of-speech tagging. In: ACL 2012, pp. 242–252 (2012)
Google Scholar
Sun, W., Wan, X.: Data-driven, PCFG-based and pseudo-PCFG-based models for Chinese dependency parsing. TACL 1(1), 301–314 (2013)
MathSciNet Google Scholar
Toutanova, K., Klein, D., Manning, C., Singer, Y.: Feature-rich part-of-speech tagging with a cyclic dependency network. In: HLT-NAACL 2003 (2003)
Google Scholar
Wang, Y., Kazama, J., Tsuruoka, Y., Chen, W., Zhang, Y., Torisawa, K.: Improving Chinese word segmentation and POS tagging with semi-supervised methods using large auto-analyzed data. In: IJCNLP, pp. 309–317 (2011)
Google Scholar
Wolpert, D.H.: Stacked generalization. Neural Netw. 5, 241–259 (1992)
Article Google Scholar
Zhang, M., Che, W., Liu, T., Li, Z.: Stacking heterogeneous joint models of Chinese POS tagging and dependency parsing. In: COLING 2012, pp. 3071–3088 (2012)
Google Scholar
Zhang, M., Yang, J., Teng, Z., Zhang, Y.: LibN3L: a lightweight package for neural NLP. In: LREC (2016)
Google Scholar
Zhang, M., Zhang, Y.: Combining discrete and continuous features for deterministic transition-based dependency parsing. In: EMNLP, pp. 1316–1321 (2015)
Google Scholar
Zhang, M., Zhang, Y., Fu, G.: Transition-based neural word segmentation. In: Proceedings of the 54nd ACL (2016)
Google Scholar
Zhang, M., Zhang, Y., Vo, D.T.: Neural networks for open domain targeted sentiment. In: Proceedings of the EMNLP, pp. 612–621 (2015)
Google Scholar

Download references

Acknowledgments

This work is supported by National Natural Science Foundation of China (NSFC) under grant 61602160 and 61672211, Natural Science Foundation of Heilongjiang Province (China) under grant No. F2016036.

Author information

Authors and Affiliations

School of Computer Science and Technology, Heilongjiang University, Harbin, China
Meishan Zhang, Nan Yu & Guohong Fu

Authors

Meishan Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Nan Yu
View author publications
You can also search for this author in PubMed Google Scholar
Guohong Fu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Meishan Zhang .

Editor information

Editors and Affiliations

Microsoft Research Asia, Beijing, China
Chin-Yew Lin
Brandeis University, Waltham, Massachusetts, USA
Nianwen Xue
Peking University, Beijing, China
Dongyan Zhao
Fudan University, Shanghai, China
Xuanjing Huang
Peking University, Beijing, China
Yansong Feng

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhang, M., Yu, N., Fu, G. (2016). Discrete and Neural Models for Chinese POS Tagging: Comparison and Combination. In: Lin, CY., Xue, N., Zhao, D., Huang, X., Feng, Y. (eds) Natural Language Understanding and Intelligent Applications. ICCPOL NLPCC 2016 2016. Lecture Notes in Computer Science(), vol 10102. Springer, Cham. https://doi.org/10.1007/978-3-319-50496-4_37

Download citation

DOI: https://doi.org/10.1007/978-3-319-50496-4_37
Published: 02 December 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-50495-7
Online ISBN: 978-3-319-50496-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics