Abstract
As natural language processing technology is applied more widely, parsing plays an increasingly significant role, and the size and quality of treebanks have become a focus of related research. However, parsers trained on a treebank suffer from data sparseness. Using semantic information from Cilin and the contextual information of words, this paper proposes a context-based lexical semantic disambiguation method. Applying this method to CTB (Chinese Treebank) 5.0 and TCT (Tsinghua Chinese Treebank) and parsing with the Berkeley Parser gave relatively good results. On the Penn Chinese Treebank, precision and recall reached 85.35% and 84.34% respectively, and the F value reached 84.84%. Compared with parsing on the original corpus, precision increased by 1.86%, recall by 1.02%, and the F value by 1.35%. As a consequence, the overall parsing error rate dropped by 8.17%.
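The reported figures can be cross-checked with a little arithmetic. The sketch below is not taken from the paper. It assumes the F value is the balanced harmonic mean of precision and recall, that the stated gains are absolute percentage points, and that the error rate means 100 minus F. The function names are hypothetical.

```python
def f_measure(precision: float, recall: float) -> float:
    """Balanced F value (harmonic mean) of precision and recall, in percent."""
    return 2 * precision * recall / (precision + recall)


def relative_error_reduction(f_before: float, f_after: float) -> float:
    """Percentage by which the error (100 - F) shrinks from before to after."""
    error_before = 100.0 - f_before
    error_after = 100.0 - f_after
    return 100.0 * (error_before - error_after) / error_before


# Figures quoted in the abstract for the Penn Chinese Treebank.
f_after = f_measure(85.35, 84.34)

# Assumption: the 1.35 gain in F is in absolute points, so the baseline is 84.84 - 1.35.
f_before = 84.84 - 1.35
reduction = relative_error_reduction(f_before, 84.84)

print(f"F = {f_after:.2f}")
print(f"relative error reduction = {reduction:.2f}%")
```

Under these assumptions the computed F value matches the reported 84.84%, and the relative error reduction comes out at roughly 8.2%, in line with the reported 8.17% up to rounding of the published figures.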
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Miao, L., Lv, X., Wu, Y., Wang, Y. (2015). Research on Semantic Disambiguation in Treebank. In: Cheng, R., Cui, B., Zhang, Z., Cai, R., Xu, J. (eds) Web Technologies and Applications. APWeb 2015. Lecture Notes in Computer Science, vol 9313. Springer, Cham. https://doi.org/10.1007/978-3-319-25255-1_54
DOI: https://doi.org/10.1007/978-3-319-25255-1_54
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25254-4
Online ISBN: 978-3-319-25255-1
eBook Packages: Computer Science, Computer Science (R0)