Skip to main content

Research on Semantic Disambiguation in Treebank

  • Conference paper
  • First Online:
Book cover Web Technologies and Applications (APWeb 2015)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9313))

Included in the following conference series:

  • 2782 Accesses

Abstract

The increasingly widespread application of natural language processing technology leads parsing to play a significant role. As a result, the size and quality of treebank have become the focus of relevant research. However, there exists data sparseness when we use the treebank to parse. With the help of Cilin semantic information and words contextual information, this paper proposes a context-based lexical semantics disambiguation method. After applying this method on CTB (Chinese Treebank) 5.0 and TCT (Tsinghua Chinese Treebank), using Berkeley Parser achieved relatively good results. In Penn Chinese Treebank, the precision and recall rates reached 85.35% and 84.34% respectively, and the F value reached 84.84%. Comparing with the parsing results of using the original corpus, the correct rate increased by 1.86% and the recall rate increased by 1.02% and the comprehensive index F value increased by 1.35%. As consequence, the overall parsing error rate dropped by 8.17%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Manning, C.D., Schütze, H.: Foundations of statistical natural language processing. MIT Press (1999)

    Google Scholar 

  2. Wang, Y., Ji, D.: Summary of Chinese Treebank. Contemporary Linguistics 11(1), 47–55 (2009)

    Google Scholar 

  3. Zhang, M., Zhang, Y., Che, W., et al.: Chinese Parsing Exploiting Characters. In: ACL (1), pp. 125–134 (2013)

    Google Scholar 

  4. Hatori, J., Matsuzaki, T., Miyao, Y., et al.: Incremental joint approach to word segmentation, pos tagging, and dependency parsing in chinese. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics. Long Papers, vol. 1, pp. 1045–1053. Association for Computational Linguistics (2012)

    Google Scholar 

  5. Charniak, E.: Statistical parsing with a context-free grammar and word statistics. In: AAAI/IAAI 2005, vol. 18, pp. 598–603 (1997)

    Google Scholar 

  6. Jones, B.K., Johnson, M., Goldwater, S.: Semantic parsing with bayesian tree transducers. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers, vol. 1, pp. 488–496. Association for Computational Linguistics (2012)

    Google Scholar 

  7. Chen, W., Zhang, M., Li, H.: Utilizing dependency language models for graph-based dependency parsing models. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers, vol. 1. Association for Computational Linguistics (2012)

    Google Scholar 

  8. Feng, V.W., Hirst, G.: Text-level discourse parsing with rich linguistic features. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers, vol. 1. Association for Computational Linguistics (2012)

    Google Scholar 

  9. Carreras, X.: Experiments with a High-order Projective Dependency Parser. In: Proceedings of the CoNLL 2007 Shared Task Session of EMNLP-CoNLL, pp. 957–961. CoNLL, Prague (2007)

    Google Scholar 

  10. McDonald, R., Lerman, K., Pereira, F.: Multilingual dependency analysis with a two-stage discriminative parser. In: Proceedings of the Tenth Conference on Computational Natural Language Learning, pp. 216–220. Association for Computational Linguistics (2006)

    Google Scholar 

  11. Petrov, S., Barrett, L., Thibaux, R., et al.: Learning accurate, compact, and interpretable tree annotation. In: Proceedings of the 21st International Conference on Computational Linguistics and the 44th Annual meeting of the Association for Computational Linguistics, pp. 433–440. Association for Computational Linguistics (2006)

    Google Scholar 

  12. Agirre, E., Baldwin, T., Martinez, D.: Improving Parsing and PP Attachment Performance with Sense Information. In: ACL (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Miao, L., Lv, X., Wu, Y., Wang, Y. (2015). Research on Semantic Disambiguation in Treebank. In: Cheng, R., Cui, B., Zhang, Z., Cai, R., Xu, J. (eds) Web Technologies and Applications. APWeb 2015. Lecture Notes in Computer Science(), vol 9313. Springer, Cham. https://doi.org/10.1007/978-3-319-25255-1_54

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-25255-1_54

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-25254-4

  • Online ISBN: 978-3-319-25255-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics