Abstract
In recent years, research on machine translation has made great progress. However, much remains to be done before machine translation reaches the semantic level. This paper summarizes the features of case grammar that make it well suited to describing the semantic relationships in sentences. It analyzes 23 thousand annotation errors that occurred during treebank construction, summarizes 13 typical problems, and proposes corresponding resolutions. The application of case grammar may offer a new way of thinking about machine translation.
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Zan, H., Chen, W., Zhang, K., Jia, Y. (2015). Analysis of Typical Annotation Problems in Bilingual Case Grammar Treebank Construction. In: Lu, Q., Gao, H. (eds) Chinese Lexical Semantics. CLSW 2015. Lecture Notes in Computer Science, vol 9332. Springer, Cham. https://doi.org/10.1007/978-3-319-27194-1_53
DOI: https://doi.org/10.1007/978-3-319-27194-1_53
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-27193-4
Online ISBN: 978-3-319-27194-1
eBook Packages: Computer Science (R0)