research-article

The Pre-ordering Model for Statistical Machine Translation of Enhancing the N-best Syntactic Knowledge

Author:
Junyan Liu

The Department of Foreign Language and Literature, Wuhan Donghu University, China

ORCID: 0000-0002-0421-9419

EITCE '22: Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering, October 2022, Pages 109–113
https://doi.org/10.1145/3573428.3573448

Published: 15 March 2023


ABSTRACT

Syntactic heterogeneity between source and target languages substantially affects the performance of Statistical Machine Translation (SMT). Building on phrase-based Chinese-English SMT, this paper proposes a source-language pre-ordering method enhanced with N-best syntactic knowledge. First, each source-language input sentence is parsed into its N-best syntactic analyses, and highly reliable sub-tree structures are selected by computing their statistical probability. The initial rule set is then refined with two optimization strategies: rule deduction and a rule probability threshold control mechanism. Second, the source-language phrase translation table is used as a constraint to control the ordering between phrases. Finally, the syntactic parse tree of each source-side sentence is pre-ordered. Experiments on Chinese-English SMT with the NIST 2005 and 2008 test sets show that, compared with the baseline system, the N-best syntactic knowledge-enhanced pre-ordering method raises the BLEU score by 0.68 and 0.83, respectively.
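The pipeline outlined in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation: the abstract does not give the scoring formula, the rule format, or the threshold values, so everything here is an assumption made for illustration. In particular, the sketch assumes that sub-tree reliability is the normalized probability mass of the N-best parses containing a constituent, that a reordering rule is a permutation of a node's children with an attached probability, and that the function names `subtree_confidence`, `filter_rules`, and `preorder` are hypothetical. The phrase-table constraint and the rule deduction step are omitted.

```python
from collections import defaultdict


def subtree_confidence(nbest):
    """Score each constituent by the share of N-best probability mass that contains it.

    nbest: list of (probability, set of (label, start, end)) pairs (assumed format).
    """
    total = sum(prob for prob, _ in nbest)
    confidence = defaultdict(float)
    for prob, constituents in nbest:
        for constituent in constituents:
            confidence[constituent] += prob / total
    return dict(confidence)


def filter_rules(rules, min_prob):
    """Keep only reordering rules whose probability reaches the threshold.

    rules: {(parent_label, child_labels): (permutation, probability)} (assumed format).
    """
    return {key: perm for key, (perm, prob) in rules.items() if prob >= min_prob}


def preorder(tree, rules):
    """Return the words of a tree after permuting children wherever a rule matches.

    tree: (label, word) for a leaf, or (label, [child, ...]) for an internal node.
    """
    label, children = tree
    if isinstance(children, str):  # leaf node: (POS tag, word)
        return [children]
    key = (label, tuple(child[0] for child in children))
    order = rules.get(key, range(len(children)))  # default: keep source order
    words = []
    for index in order:
        words.extend(preorder(children[index], rules))
    return words


# Toy 3-best list for a three-token sentence (made-up probabilities).
nbest = [
    (0.6, {("VP", 0, 3), ("PP", 0, 2)}),
    (0.3, {("VP", 0, 3), ("NP", 1, 3)}),
    (0.1, {("NP", 1, 3)}),
]
confidence = subtree_confidence(nbest)
reliable = {c for c, score in confidence.items() if score >= 0.5}

# Toy candidate rules; only the first survives a 0.5 probability threshold.
candidate_rules = {
    ("VP", ("PP", "VP")): ((1, 0), 0.8),
    ("PP", ("P", "NP")): ((1, 0), 0.1),
}
rules = filter_rules(candidate_rules, min_prob=0.5)

# "zai Beijing gongzuo" (work in Beijing): the PP precedes the verb in Chinese.
tree = ("VP", [
    ("PP", [("P", "zai"), ("NP", [("NR", "Beijing")])]),
    ("VP", [("VV", "gongzuo")]),
])
reordered = preorder(tree, rules)
print(reordered)
```

In this toy case the surviving rule moves the verb phrase ahead of the prepositional phrase, giving an English-like order. A fuller version would apply a rule at a node only when that node's span is in the `reliable` set; the two steps are kept separate here for brevity.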



Published in

EITCE '22: Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering
October 2022
1999 pages
ISBN: 9781450397148
DOI: 10.1145/3573428

Copyright © 2022 ACM
Publisher
Association for Computing Machinery
New York, NY, United States
Qualifiers
- research-article
- Research
- Refereed limited
Acceptance Rates
Overall Acceptance Rate: 508 of 972 submissions, 52%
