Parameter Learning for Statistical Machine Translation Using CMA-ES

Tran, Viet-Hong; Pham, Anh-Tuan; Nguyen, Vinh-Van; Nguyen, Hoai-Xuan; Nguyen, Huy-Quang

doi:10.1007/978-3-319-11680-8_34

Conference paper

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 326)

Abstract

Minimum error rate training (MERT) is probably still the most widely used parameter learning algorithm in statistical machine translation (SMT) [1]. However, it does not support the use of a large number of features (e.g., 30 or more). Moreover, operating directly on the parameter space, MERT is only a local optimization algorithm. In this paper, we investigate for the first time the use of metaheuristics and global optimization techniques for parameter learning in SMT. In particular, we replace MERT with a well-known metaheuristic for global optimization, the Covariance Matrix Adaptation Evolution Strategy (CMA-ES) [2]. We test the effectiveness of CMA-ES in SMT experiments on an English-Vietnamese corpus. The results show that the SMT system tuned with CMA-ES achieves higher BLEU scores than the baseline system tuned with MERT on both the development and test sets.
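The abstract does not say how the tuning loop is implemented, so the following is only a minimal sketch of the general idea: treat the feature weights of a log-linear model as a real-valued vector and let a population-based search maximize a translation-quality score. Everything in it is an assumption made for illustration. The n-best lists, feature values and quality scores are invented, `corpus_quality` is a hypothetical stand-in for corpus-level BLEU, and `evolve` is a simplified (mu, lambda) evolution strategy with an isotropic Gaussian and a fixed step-size decay. It is not CMA-ES itself, which additionally adapts a full covariance matrix and the step size from the search history.

```python
import random

# Toy n-best lists: one list per source sentence, each entry is
# (feature_vector, quality). Quality stands in for a sentence-level
# metric such as BLEU. All numbers are invented for illustration.
NBEST = [
    [([-2.0, -1.0, 0.5], 0.30), ([-1.5, -2.5, 0.2], 0.55), ([-3.0, -0.5, 0.9], 0.10)],
    [([-1.0, -3.0, 0.1], 0.60), ([-2.5, -1.0, 0.7], 0.20), ([-1.8, -1.9, 0.4], 0.45)],
    [([-0.5, -2.0, 0.3], 0.70), ([-2.0, -0.8, 0.8], 0.25), ([-1.2, -1.5, 0.6], 0.40)],
]


def corpus_quality(weights, nbest=NBEST):
    """Average quality of the hypotheses selected by a log-linear model."""
    total = 0.0
    for hyps in nbest:
        # The model picks the hypothesis with the highest weighted feature sum.
        best = max(hyps, key=lambda h: sum(w * f for w, f in zip(weights, h[0])))
        total += best[1]
    return total / len(nbest)


def evolve(objective, start, sigma=0.5, pop_size=12, parents=4,
           generations=30, seed=0):
    """Simplified (mu, lambda) evolution strategy; maximizes `objective`."""
    rng = random.Random(seed)
    mean = list(start)
    best_w, best_score = list(start), objective(start)
    for _ in range(generations):
        # Sample candidate weight vectors around the current mean.
        population = [[m + sigma * rng.gauss(0.0, 1.0) for m in mean]
                      for _ in range(pop_size)]
        scored = sorted(((objective(w), w) for w in population),
                        key=lambda pair: pair[0], reverse=True)
        # Recombine: the new mean is the average of the best candidates.
        elite = [w for _, w in scored[:parents]]
        mean = [sum(col) / parents for col in zip(*elite)]
        # Remember the best weight vector seen so far.
        if scored[0][0] > best_score:
            best_score, best_w = scored[0]
        sigma *= 0.95  # fixed decay, where CMA-ES would adapt the step size
    return best_w, best_score


if __name__ == "__main__":
    start = [1.0, 1.0, 1.0]
    weights, score = evolve(corpus_quality, start)
    print("initial quality:", corpus_quality(start))
    print("tuned quality:  ", score)
```

Because the search only needs the objective value of each candidate, nothing in the loop depends on the number of features or on the objective being smooth. In practice one would probably use an existing CMA-ES implementation rather than a hand-written strategy; DEAP, which appears in the reference list below, provides one.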

References

Och, F.J.: Minimum error rate training in statistical machine translation. In: Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, pp. 160–167. Association for Computational Linguistics, Sapporo (2003)
Hansen, N.: The CMA evolution strategy: A comparing review. In: Lozano, J.A., Larrañaga, P., Inza, I., Bengoetxea, E. (eds.) Towards a New Evolutionary Computation. STUDFUZZ, vol. 192, pp. 75–102. Springer, Heidelberg (2006)
Koehn, P., Hoang, H., Birch, A., Callison-Burch, C., Federico, M., Bertoldi, N., Cowan, B., Shen, W., Moran, C., Zens, R., Dyer, C., Bojar, O., Constantin, A., Herbst, E.: Moses: Open source toolkit for statistical machine translation. In: Proceedings of ACL, Demonstration Session (2007)
Koehn, P.: Statistical Machine Translation. Cambridge University Press (2010)
Smith, D.A., Eisner, J.: Minimum risk annealing for training log-linear models. In: Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions, pp. 787–794. Association for Computational Linguistics, Sydney (2006)
Huang, L., Mi, H.: Efficient incremental decoding for tree-to-string translation. In: Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pp. 273–283. Association for Computational Linguistics, Cambridge (2010)
Suzuki, J., Tsukada, H., Watanabe, T., Isozaki, H.: Online large-margin training for statistical machine translation. In: Proceedings of EMNLP-CoNLL, Prague, pp. 764–773 (June 2007)
Knight, K., Wang, W., Chiang, D.: 11,001 new features for statistical machine translation. In: Proceedings of Human Language Technologies: The 2009 Annual Conference of the NAACL, Stroudsburg, PA, USA (June 2009)
Arun, A., Koehn, P.: Online learning methods for discriminative training of phrase based statistical machine translation. In: MT Summit XI, Copenhagen (September 2007)
Crammer, K., McDonald, R., Pereira, F.: Online large-margin training of dependency parsers. In: Proceedings of ACL (2005)
Teukolsky, S.A., Flannery, B.P., Press, W.H., Vetterling, W.T.: Numerical Recipes in C++: the art of scientific computing, 2nd edn. Cambridge University Press, New York (2002)
Akimoto, Y., Nagata, Y., Ono, I., Kobayashi, S.: Bidirectional relation between CMA evolution strategies and natural evolution strategies. In: Schaefer, R., Cotta, C., Kołodziej, J., Rudolph, G. (eds.) PPSN XI. LNCS, vol. 6238, pp. 154–163. Springer, Heidelberg (2010)
Ho, T.B., Nguyen, M.L., Nguyen, T.P., Shimazu, A., Van Nguyen, V.: A tree-to-string phrase-based model for statistical machine translation. In: Proceedings of the Twelfth Conference on Computational Natural Language Learning (CoNLL 2008), Manchester, England, pp. 143–150. Coling 2008 Organizing Committee (August 2008)
Birch, A., Callison-Burch, C., Federico, M., Bertoldi, N., Cowan, B., Shen, W., Moran, C., Zens, R., Dyer, C., Bojar, O., Constantin, A., Koehn, P., Hoang, H., Herbst, E.: Moses: Open source toolkit for statistical machine translation. In: Proceedings of ACL, Demonstration Session (2007)
Stolcke, A.: Srilm - an extensible language modeling toolkit. In: Proceedings of International Conference on Spoken Language Processing, Cambridge, MA, vol. 9, pp. 901–904 (2002)
Roukos, S., Ward, T., Papineni, K., Zhu, W.J.: Bleu: A method for automatic evaluation of machine translation. In: ACL (2002)
Fortin, F.A., De Rainville, F.M., Gardner, M.A., Parizeau, M., Gagné, C.: DEAP: Evolutionary algorithms made easy. Journal of Machine Learning Research 13, 2171–2175 (2012)

Author information

Authors and Affiliations

University of Economic and Technical Industries, Hanoi, Vietnam
Viet-Hong Tran
University of Engineering and Technology, Vietnam National University, Hanoi, Vietnam
Viet-Hong Tran & Vinh-Van Nguyen
IT Center, Military Academy of Logistics, Hanoi, Vietnam
Anh-Tuan Pham
IT R&D Center, Hanoi University, Hanoi, Vietnam
Anh-Tuan Pham & Hoai-Xuan Nguyen
Faculty of IT, Vietnam - Germany Vocational College of Vinh Phuc, Vinh Phuc, Vietnam
Huy-Quang Nguyen

Corresponding author

Correspondence to Viet-Hong Tran.

Editor information

Editors and Affiliations

Faculty of Information Technology, VNU University of Engineering and Technology, Hanoi, Vietnam
Viet-Ha Nguyen
Faculty of Information Technology, VNU University of Engineering and Technology, Hanoi, Vietnam
Anh-Cuong Le
School of Knowledge Science, Japan Advanced Institute of Science and Technology, Nomi, Ishikawa, Japan
Van-Nam Huynh

Cite this paper

Tran, VH., Pham, AT., Nguyen, VV., Nguyen, HX., Nguyen, HQ. (2015). Parameter Learning for Statistical Machine Translation Using CMA-ES. In: Nguyen, VH., Le, AC., Huynh, VN. (eds) Knowledge and Systems Engineering. Advances in Intelligent Systems and Computing, vol 326. Springer, Cham. https://doi.org/10.1007/978-3-319-11680-8_34

DOI: https://doi.org/10.1007/978-3-319-11680-8_34
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11679-2
Online ISBN: 978-3-319-11680-8
eBook Packages: Engineering, Engineering (R0)
