ABSTRACT
Syntactic heterogeneity between source and target languages has an important impact on the performance of Statistical Machine Translation (SMT). On the basis of phrase-based Chinese-English SMT, a method of source language pre-ordering based on N-best syntactic knowledge enhancement is proposed. First, the source language input sentences are analyzed by N-best Syntax, and the high reliability sub-tree structure is obtained by calculating statistical probability. Two optimization strategies are used to optimize the initial rule set: rule deduction and rule probability threshold control mechanism. Second, the source language phrase translation table is used as a constraint to control the sequence between phrases. Finally, the syntax analysis tree of the source-side sentences is pre-ordered. The experimental results of Chinese-English SMT based on the NIST 2005 and 2008 test data sets show that comparing to the baseline system, the BLEU score of automatic evaluation criterion of the N-best syntactic knowledge-enhanced SMT pre-ordering method increased by 0.68 and 0.83 respectively.
- Schwarts R, Chow Y L. The N-Best Algorithm: An Efficient and Exact Procedures for Finding the N Most Likely Sentences Hypotheses. ICASSP. 1994: 81∼84 .Google Scholar
- Wang C, Collins M, Koehn P. Chinese syntactic reordering for statistical machine translation [C]// Proceedings of the 2007 Joint Meeting of the Conference on Empirical Methods on Natural Language Processing and on Computational Natural Language Learning Prague: ACL, 2007: 869-876.Google Scholar
- Patel R N, Gupta R, Pimpale P B, Reordering rules for English-Hindi SMT[C]//Proceedings of the Second Workshop on Hybrid Approaches to Translation of Association for Computational Linguistics. Sofia: ACL, 2013: 88-92.Google Scholar
- Mennon A, Mehrotra K, Mohan C K, Characterization of a class of sigmoid functions with applications to neural networks [J]. Neural Networks, 1996, 9(5): 819-835.Google ScholarDigital Library
- Liu X, Zhu Y, Jin Y. Local phrase reordering model for Chinese-English patent machine translation[C]// Proceedings of the Third CIPS-SIGHAN Joint Conference on Chinese Language Processing. Wu-han: ACL, 2014: 95-107.Google Scholar
- Zhang Yang, Yu Zhengtao, Zhou Ke. A Study on the Method of Hierarchical Phrase Translation Fusion of. Language. Characteristics Based on Lexical Ordering. Model [J]. Computers and Numbers Engineering, 2017 pr. 45 (12): 2389-2392.Google Scholar
- Chen J K, Soong F K, Lee L S. Large Vocabulary Word Recognition Based on Tree-Trellis Search. ICASSP. 1994: 137-140.Google Scholar
- Lei Xianghua, Jin Yu. Intelligent English Automatic Translation System Based on Phrase Translation Combination. [J]. Automation and instrumentation, 2018, 15(5): 1.Google Scholar
Recommendations
Syntactic discriminative language model rerankers for statistical machine translation
This article describes a method that successfully exploits syntactic features for n-best translation candidate reranking using perceptrons. We motivate the utility of syntax by demonstrating the superior performance of parsers over n-gram language ...
Syntactic parsing of clause constituents for statistical machine translation
The clause is considered as the basic unit of grammar in linguistics, which is a structure between a chunk and a sentence. Clause constituents, therefore, are one important kind of linguistically valid syntactic phrases. This paper adopts the CRFs model ...
Inter-, Intra-, and Extra-Chunk Pre-Ordering for Statistical Japanese-to-English Machine Translation
A rule-based pre-ordering approach is proposed for statistical Japanese-to-English machine translation using the dependency structure of source-side sentences. A Japanese sentence is pre-ordered to an English-like order at the morpheme level for a ...
Comments