Abstract
This paper proposed a flattened syntactical phrase-based translation model for Statistical Machine Translation (SMT) learned from bilingual parallel parsed texts. The flattened syntactical phrases are sets of ordered leaf nodes with their father nodes of single syntax trees or forests ignoring the inner structure, containing lexicalized terminals and non-terminals as variable nodes. Constraints over the variable nodes in target side guarantee correct syntactical structures of translations in accordant to the syntactical knowledge learned from parallel texts. The experiments based on Chinese-to-English translation show us a predictable result that our model achieves 1.87% and 4.76% relative improvements, over Pharaoh, the state-of-art phrase-based translation system, and the system of traditional tree-to-tree model based on STSG.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Brown, P.F., Cocke, J., Della Pietra, S.A., Della Pietra, V.J., Jelinek, F., Lafferty, J.D., Mercer, R.L., Roossin, P.S.: A statistical approach to machine translation. Computational Linguistics 16(2), 79–85 (1990)
Brown, P.F., Cocke, J., Della Pietra, S.A., Della Pietra, V.J., Mercer, R.L.: The mathematics of machine translation: Parameter estimation. Computational Linguistics 19, 263–312 (1993)
Och, F.J., Ney, H.: Discriminative training and maximum entropy models for statistical machine translation. In: Proceedings of the 40th Annual Meeting of the ACL, pp. 295–302 (2002)
Wu, D.: Stochastic inversion transduction grammars and bilingual parsing of parallel corpora. Computational Linguistics 23, 377–404 (1997)
Chiang, D.: A hierarchical phrase-based model for statistical machine translation. In: Proceedings of the 43rd Annual Meeting of the ACL, pp. 263–270 (2005)
Gildea, D.: Loosely tree-based alignment for machine translation. In: Proceedings of the 41st Annual Meeting of the ACL, companion volume (2003)
Cowan, B., Kucerova, I., Collins, M.: A discriminative model for tree-to-tree translation. In: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, pp. 232–241 (2006)
Quirk, C., Menezes, A., Cherry, C.: Dependency Treelet Translation: Syntactically Informed Phrasal SMT. In: Proceedings of the 43rd Annual Meeting of the ACL, pp. 271–279 (2005)
Liu, Y., Liu, Q., Lin, S.: Tree-to-string Alignment Template for Statistical Machine Translation. In: Proceedings of the 44th Annual Meeting of the ACL, pp. 609–616 (2006)
Huang, L., Knight, K., Joshi, A.: Statistical syntax-directed translation with extended domain of locality. In: Proceedings of the 7th Biennial Conference of the Association for Machine Translation in the Americas (AMTA), pp. 66–73 (2006)
Yamada, K., Knight, K.: A syntax-based statistical translation model. In: Proceedings of the 39th Annual Meeting of the ACL (2001)
Marcu, D., Wang, W., Echihabi, A., Knight, K.: SPMT: Statistical Machine Translation with syntactified target language phrases. In: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, pp. 44–52 (2006)
Koehn, P., Och, F.J., Marcu, D.: Statistical Phrase-based Translation. In: HLT/NAACL 2003, pp. 127–133 (2003)
Och, F.J., Ney, H.: The alignment template approach to statistical machine translation. Computational linguistics 30, 417–449 (2004)
Koehn, P., Och, F.J., Marcu, D.: Statistical phrase-based translation. In: Proceeding of HLT-MAACL, pp. 127–133 (2003)
Kumar, S., Byrne, W.: Local phrase reordering models for statistical machine translation. In: Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language processing (HLT/EMNLP), Vancouver, Canada, pp. 161–168 (2005)
Kumar, S., Byrne, W.: Local phrase reordering models for statistical machine translation. In: HLT-EMNLP 2005 (2005)
Koehn, P., Axelrod, A., Birch, A., Mayne, Callison-Burch, C.: Edinburgh system description. In: IWSLT 2005 Speech Translation Evaluation (2005)
Xiong, D., Liu, Q., Lin, S.: Maximum Entropy Based Phrase Reordering Model for Statistical Machine Translation. In: Proceedings of the 21st International Conference on Computational Linguistics and 44th ACL, Sydney, Australia, pp. 521–528 (2006)
Li, C.-H., Zhang, D., Li, M., Zhou, M., Guan, Y.: A Probabilistic Approach to Syntax-based Reordering for Statistical Machine Translation. In: Proceedings of the 45th Annual Meeting of the ACL (2007)
Wang, C., Collins, M., Koehn, P.: Chinese Syntactic Reordering for Statistical Machine Translation. In: EMNLP-CNLL 2007, pp. 737–745 (2007)
Lin, D.: A path-based transfer model for machine translation. In: Proceedings of COLING (2004)
Ding, Y., Palmer, M.: Machine Translation Using Probabilistic Synchronous Dependency Insertion Grammars. In: Proceedings of the 43rd Annual Meeting of the ACL (2005)
Zhang, M., Jiang, H., Aw, A., Li, H., Tan, C.L., Li, S.: A Tree Sequence Alignment-based Tree-to-Tree Translation Model. In: Proceedings of the 46th Annual Meeting of the ACL (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chen, Q., Yao, T. (2009). Flattened Syntactical Phrase-Based Translation Model for SMT. In: Li, W., Mollá-Aliod, D. (eds) Computer Processing of Oriental Languages. Language Technology for the Knowledge-based Economy. ICCPOL 2009. Lecture Notes in Computer Science(), vol 5459. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00831-3_34
Download citation
DOI: https://doi.org/10.1007/978-3-642-00831-3_34
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-00830-6
Online ISBN: 978-3-642-00831-3
eBook Packages: Computer ScienceComputer Science (R0)