Discriminative ridge regression algorithm for adaptation in statistical machine translation

Chinea-Rios, Mara; Sanchis-Trilles, Germán; Casacuberta, Francisco

doi:10.1007/s10044-018-0720-5

Discriminative ridge regression algorithm for adaptation in statistical machine translation

Original Article
Published: 25 May 2018

Volume 22, pages 1293–1305, (2019)
Cite this article

Pattern Analysis and Applications Aims and scope Submit manuscript

Mara Chinea-Rios ORCID: orcid.org/0000-0002-2313-9633¹,
Germán Sanchis-Trilles² &
Francisco Casacuberta¹

251 Accesses
5 Citations
Explore all metrics

Abstract

We present a simple and reliable method for estimating the log-linear weights of a state-of-the-art machine translation system, which takes advantage of the method known as discriminative ridge regression (DRR). Since inappropriate weight estimations lead to a wide variability of translation quality results, reaching a reliable estimate for such weights is critical for machine translation research. For this reason, a variety of methods have been proposed to reach reasonable estimates. In this paper, we present an algorithmic description and empirical results proving that DRR is able to provide comparable translation quality when compared to state-of-the-art estimation methods [i.e. MERT (Och in Proceedings of the annual meeting of the association for computational linguistics, 2003) and MIRA (Cherry and Foster in Proceedings of the North American chapter of the association for computational linguistics, 2012)], with a reduction in computational cost. Moreover, the empirical results reported are coherent across different corpora and language pairs.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Log-Linear Weight Optimization Using Discriminative Ridge Regression Method in Statistical Machine Translation

Adaptive Tuning for Statistical Machine Translation (AdapT)

A survey of domain adaptation for statistical machine translation

Article 01 December 2017

Hoang Cuong & Khalil Sima’an

Notes

www.aclweb.org/anthology/P/P16/.

References

Barrachina S, Bender O, Casacuberta F, Civera J, Cubel E, Khadivi S, Lagarda A, Ney H, Tomás J, Vidal E et al (2009) Statistical approaches to computer-assisted translation. Comput Ling 35(1):3–28
Article MathSciNet Google Scholar
Bojar O, Buck C, Federmann C, Haddow B, Koehn P, Monz C, Post M, Specia L (eds) (2014) Proceedings of the ninth workshop on statistical machine translation. Association for Computational Linguistics
Brown PF, Pietra VJD, Pietra SAD, Mercer RL (1993) The mathematics of statistical machine translation: parameter estimation. Comput Ling 19:263–311
Google Scholar
Callison-Burch C, Koehn P, Monz C, Peterson K, Przybocki M, Zaidan OF (2010) Findings of the 2010 joint workshop on statistical machine translation and metrics for machine translation. In: Proceedings of the annual meeting of the association for computational linguistics, pp 17–53
Chen B, Cherry C (2014) A systematic comparison of smoothing techniques for sentence-level bleu. In: Proceedings of the workshop on statistical machine translation, pp 362–367
Cherry C, Foster G (2012) Batch tuning strategies for statistical machine translation. In: Proceedings of the North American chapter of the association for computational linguistics, pp 427–436
Clark JH, Dyer C, Lavie A, Smith NA (2011) Better hypothesis testing for statistical machine translation: controlling for optimizer instability. In: Proceedings of the annual meeting of the association for computational linguistics, pp 176–181
Crammer K, Dekel O, Keshet J, Shalev-Shwartz S, Singer Y (2006) Online passive-aggressive algorithms. J Mach Learn Res 7:551–585
MathSciNet MATH Google Scholar
Hasler E, Haddow B, Koehn P (2011) Margin infused relaxed algorithm for moses. Prague Bull Math Ling 96:69–78
Article Google Scholar
Hopkins M, May J (2011) Tuning as ranking. In: Proceedings of the conference on empirical methods in natural language processing, pp 1352–1362
Kneser R, Ney H (1995) Improved backing-off for m-gram language modeling. In: Proceedings of the international conference on acoustics, speech and signal processing, pp 181–184
Koehn P (2005) Europarl: a parallel corpus for statistical machine translation. In: Proceedings of the machine translation summit, pp 79–86
Koehn P (2010) Statistical machine translation. Cambridge University Press, Cambridge
MATH Google Scholar
Koehn P, Hoang H, Birch A, Callison-Burch C, Federico M, Bertoldi N, Cowan B, Shen W, Moran C, Zens R, Dyer C, Bojar O, Constantin A, Herbst E (2007) Moses: open source toolkit for statistical machine translation. In: Proceedings of the annual meeting of the association for computational linguistics, pp 177–180
Lavie MDA (2014) Meteor universal: language specific translation evaluation for any target language. In: Proceedings of the annual meeting of the association for computational linguistics, pp 376–387
Marie B, Max A (2015) Multi-pass decoding with complex feature guidance for statistical machine translation. In: Proceedings of the annual meeting of the association for computational linguistics, pp 554–559
Martínez-Gómez P, Sanchis-Trilles G, Casacuberta F (2012) Online adaptation strategies for statistical machine translation in post-editing scenarios. Pattern Recogn 45(9):3193–3203
Article MATH Google Scholar
Nakov P, Vogel S (2017) Robust tuning datasets for statistical machine translation. arXiv:1710.00346
Neubig G, Watanabe T (2016) Optimization for statistical machine translation: a survey. Comput Ling 42(1):1–54
Article MathSciNet Google Scholar
Och FJ (2003) Minimum error rate training in statistical machine translation. In: Proceedings of the annual meeting of the association for computational linguistics, pp 160–167
Och FJ, Ney H (2003) A systematic comparison of various statistical alignment models. Comput Ling 29:19–51
Article MATH Google Scholar
Papineni K, Roukos S, Ward T, Zhu WJ (2002) Bleu: a method for automatic evaluation of machine translation. In: Proceedings of the international conference on acoustics, speech and signal processing, pp 311–318
Sanchis-Trilles G, Casacuberta F (2010) Log-linear weight optimisation via Bayesian adaptation in statistical machine translation. In: Proceedings of the annual meeting of the association for computational linguistics, pp 1077–1085
Sanchis-Trilles G, Casacuberta F (2015) Improving translation quality stability using Bayesian predictive adaptation. Comput Speech Lang 34(1):1–17
Article Google Scholar
Snover M, Dorr B, Schwartz R, Micciulla L, Makhoul J (2006) A study of translation edit rate with targeted human annotation. In: Proceedings of the annual meeting of the association for machine translation in the Americas, pp 223–231
Sokolov A, Yvon F (2011) Minimum error rate training semiring. In: Proceedings of the annual conference of the European association for machine translation, pp 241–248
Stauffer C, Grimson WEL (2000) Learning patterns of activity using real-time tracking. Pattern Anal Mach Intell 22(8):747–757
Article Google Scholar
Stolcke A (2002) Srilm—an extensible language modeling toolkit. In: Proceedings of the international conference on spoken language processing, pp 901–904
Tiedemann J (2009) News from opus—a collection of multilingual parallel corpora with tools and interfaces. In: Proceedings of the recent advances in natural language processing, pp 237–248
Tiedemann J (2012) Parallel data, tools and interfaces in opus. In: Proceedings of the language resources and evaluation conference, pp 2214–2218

Download references

Author information

Authors and Affiliations

Pattern Recognition and Human Language Technology Research Center, Universitat Politècnica de València, Valencia, Spain
Mara Chinea-Rios & Francisco Casacuberta
Sciling, Valencia, Spain
Germán Sanchis-Trilles

Authors

Mara Chinea-Rios
View author publications
You can also search for this author in PubMed Google Scholar
Germán Sanchis-Trilles
View author publications
You can also search for this author in PubMed Google Scholar
Francisco Casacuberta
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mara Chinea-Rios.

Additional information

The research leading to these results were partially supported by projects CoMUN-HaT-TIN2015-70924-C2-1-R (MINECO/FEDER) and PROMETEO/2018/004. We also acknowledge NVIDIA for the donation of a GPU used in this work.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chinea-Rios, M., Sanchis-Trilles, G. & Casacuberta, F. Discriminative ridge regression algorithm for adaptation in statistical machine translation. Pattern Anal Applic 22, 1293–1305 (2019). https://doi.org/10.1007/s10044-018-0720-5

Download citation

Received: 02 October 2017
Accepted: 08 May 2018
Published: 25 May 2018
Issue Date: November 2019
DOI: https://doi.org/10.1007/s10044-018-0720-5

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Discriminative ridge regression algorithm for adaptation in statistical machine translation

Abstract

Access this article

Similar content being viewed by others

Log-Linear Weight Optimization Using Discriminative Ridge Regression Method in Statistical Machine Translation

Adaptive Tuning for Statistical Machine Translation (AdapT)

A survey of domain adaptation for statistical machine translation

Notes

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Discriminative ridge regression algorithm for adaptation in statistical machine translation

Abstract

Access this article

Similar content being viewed by others

Log-Linear Weight Optimization Using Discriminative Ridge Regression Method in Statistical Machine Translation

Adaptive Tuning for Statistical Machine Translation (AdapT)

A survey of domain adaptation for statistical machine translation

Notes

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation