Enhancing the Assessment of (Polish) Translation in PROMIS Using Statistical, Semantic, and Neural Network Metrics

Wołk, Krzysztof; Glinkowski, Wojciech; Żukowska, Agnieszka

doi:10.1007/978-3-319-77712-2_34

Krzysztof Wołk¹⁸,
Wojciech Glinkowski¹⁹ &
Agnieszka Żukowska¹⁹

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 746))

Included in the following conference series:

World Conference on Information Systems and Technologies

4145 Accesses
2 Citations

Abstract

Differences in culture and language create the need for translators to convert text from one language into another. In order to preserve meaning, context must be analyzed in detail in translation. This study aims to develop accurate evaluation metrics for translations within the PROMIS (Patient-Reported Outcomes Measurement Information System) process, particularly in the reconciliation step, by providing reviews by experts as additional information following backward translation. The result is a semi-automatic semantic evaluation metric for Polish based on the concept of the human-aided translation evaluation metric (HMEANT). We assessed the proposed metrics using a statistics-based support vector machine classifier and applied deep neural networks to replicate the operation of the human brain. We compared the results of the proposed metrics with human judgment and well-known machine translation metrics, such as BLEU (Bilingual Evaluation Understudy), NIST, TER (Translation Error Rate), and METEOR (Metric for Evaluation of Translation with Explicit Ordering). We found that a few of the proposed metrics were highly correlated with human judgment and provided additional semantic information independent of human experience. This showed that the proposed metrics can help assess translations in PROMIS.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
LTSM is a sequence learning technique that uses memory cells to preserve states over a long period of time, which allows distributed representations of sentences through distributed representations of words
2.
www.nihpromis.org

References

Birch, A., Haddow, B., Germann, U., Nadejde, M., Buck, C., Koehn, P.: The feasibility of HMEANT as a human MT evaluation metric. In: Proceedings of the Eighth Workshop on Statistical Machine Translation, pp. 52–61 (2013)
Google Scholar
Furey, T.S., Cristianini, N., Duffy, N., Bednarski, D.W., Schummer, M., Haussler, D.: Support vector machine classification and validation of cancer tissue samples using microarray expression data. Bioinformatics 16(10), 906–914 (2000)
Article Google Scholar
Cettolo, M., Girardi, C., Federico, M.: Wit3: web inventory of transcribed and translated talks. In: Proceedings of the 16th Conference of the European Association for Machine Translation (EAMT), pp. 261–268 (2012)
Google Scholar
Graves, A., Schmidhuber, J.: Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Netw. 18(5), 602–610 (2005)
Article Google Scholar
Tai, K.S., Socher, R., Manning, C.D.: Improved semantic representations from tree-structured long short-term memory networks. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, pp. 1556–1566 (2015)
Google Scholar
Papineni, K., Roukos, S., Ward, T., Zhu, W.J.: BLEU: a method for automatic evaluation of machine translation. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp. 311–318 (2002)
Google Scholar
Zhang, Y., Vogel, S., Waibel, A.: Interpreting BLEU/NIST scores: how much improvement do we need to have a better system? In: LREC (2004)
Google Scholar
Snover, M., Dorr, B., Schwartz, R., Micciulla, L., Makhoul, J.: A study of translation edit rate with targeted human annotation. In: Proceedings of Association for Machine Translation in the Americas, pp. 223–231 (2006)
Google Scholar
Banerjee, S., Lavie, A.: METEOR: an automatic metric for MT evaluation with improved correlation with human judgments. In: Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization, vol. 29, pp. 65–72 (2005)
Google Scholar
Koehn, P.: Statistical Machine Translation. Cambridge University Press, Cambridge (2009)
Book Google Scholar
US Department of Health and Human Services. PROMIS: Instrument Development and Psychometric Evaluation Scientific Standards. Bethesda: National Institutes of Health (2012). http://www.nihpromis.com/Documents/PROMIS_Standards_050212.pdf. Accessed 8 Mar 2017
Wołk, K., Marasek, K.: Unsupervised comparable corpora preparation and exploration for bi-lingual translation equivalents. In: Proceedings of the 12th IWSLT, pp. 118–125 (2015)
Google Scholar
Bonomi, A.E., Cella, D.F., Hahn, E.A., Bjordal, K., Sperner-Unterweger, B., Gangeri, L., Bergman, B., Willems-Groot, J., Hanquet, P., Zittoun, R.: Multilingual translation of the functional assessment of cancer therapy (FACT) quality of life measurement system. Qual. Life Res. 5(3), 309–320 (1996)
Article Google Scholar
Wild, D., Eremenco, S., Mear, I., Martin, M., Houchin, C., Gawlicki, M., Hareendran, A., Wiklund, I., Chong, L.Y., von Maltzahn, R., Cohen, L., Molsen, E.: Multinational trials-recommendation on the translations required, approaches to using the same language in different countries, and the approaches to support pooling the data: the ISPOR patient reported outcome translation and linguistic validation good practice task force report. Value Health 12(4), 430–440 (2009)
Article Google Scholar
Wołk, K., Marasek, K.: Polish-English speech statistical machine translation systems for the IWSLT 2013. In: Proceedings of the 10th International Workshop on Spoken Language Translation, pp. 113–119 (2015)
Google Scholar
Wołk, K., Marasek, K.: Polish – English speech statistical machine translation systems for the IWSLT 2014. In: Proceedings of the 11th International Workshop on Spoken Language Translation, pp. 143–149 (2014)
Google Scholar
Axelrod, A., He, X., Gao, J.: Domain adaptation via pseudo in-domain data selection. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 355–362 (2011)
Google Scholar
Wang, L., Wong, D.F., Chao, L.S., Lu, Y., Xing, J.: A systematic comparison of data selection criteria for SMT domain adaptation. Sci. World J. 2014, 745485 (2014)
Google Scholar
Berrotarán, G., Carrascosa, R., Vine, A.: Yalign documentation (2013). https://yalign.readthedocs.io/en/latest/. Accessed 01 June 2016
Junczys-Dowmunt, M., Szał, A.: Symgiza ++: symmetrized word alignment models for statistical machine translation. In: Bouvry, P., Kłopotek, M.A., Leprévost, F., Marciniak, M., Mykowiecka, A., Rybiński, H. (eds.) Security and Intelligent Information Systems, pp. 379–390. Springer, Heidelberg (2012)
Chapter Google Scholar
Moses Statistical Machine Translation, “OOVs” (2015). http://www.statmt.org/moses/?n=Advanced.OOVs#ntoc2. Accessed 27 Sept 2015
Durrani, N., Hoang, H., Koehn, P., Sajjad, H.: Integrating an unsupervised transliteration model into statistical machine translation. In: Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, pp. 148–153 (2014)
Google Scholar
Heafield, K.: KenLM: faster and smaller language model queries. In: Proceedings of the Sixth Workshop on Statistical Machine Translation, pp. 187–197 (2011)
Google Scholar
Costa-Jussà, M.R., Fonollosa, J.R.: Using linear interpolation and weighted reordering hypotheses in the Moses system. In: A: International Conference on Language Resources and Evaluation. Seventh Conference on International Language Resources and Evaluation, pp. 1712–1718 (2010)
Google Scholar
Moses statistical machine translation, “Build reordering model” (2013). http://www.statmt.org/moses/?n=FactoredTraining.BuildReorderingModel. Accessed 10 Oct 2015
Wołk, K., Marasek, K.: A sentence meaning-based alignment method for parallel text corpora preparation. In: Rocha, Á., Correia, A.M., Tan, F.B., Stroetmann, K.A. (eds.) New Perspectives in Information Systems and Technologies, pp. 229–237. Springer, Cham (2014)
Chapter Google Scholar
Moore, R.C., Lewis, W.: Intelligent selection of language model training data. In: Proceedings of the ACL 2010 Conference Short Papers, pp. 220–224 (2010)
Google Scholar
Dieny, R., Thevenon, J., Martinez-Del-Rincon, J., Nebel, J.-C.: Bioinformatics inspired algorithm for stereo correspondence. In: International Conference on Computer Vision Theory and Application, pp. 465–473 (2011)
Google Scholar
Doddington, G.: Automatic evaluation of machine translation quality using n-gram co-occurrence statistics. In: Proceedings of the Second International Conference on Human Language Technology Research, pp. 138–145 (2002)
Google Scholar
Koehn, P., Federico, M., Shen, W., Bertoldi, N., Callison-Burch, C., Bojar, O., Cowan, B., Dyer, C., Hoang, H., Zens, R., Constantin, A., Herbst, E., Moran, C.: Open-source toolkit for statistical machine translation. In: Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions, Factored Translation Models and Confusion Network Decoding, pp. 177–180 (2007)
Google Scholar
Lo, C.-K., Wu, D.: MEANT: an inexpensive, high-accuracy, semi-automatic metric for evaluating translation utility via semantic frames. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, vol. 1, pp. 220–229 (2011)
Google Scholar
Robertson, M.J., Kendall, P.C., Ritchie, S., Mcilroy, P.W., Adams, M.J.: The weighted index method: a new technique for analyzing planar optical waveguides. J. Lightwave Technol. 7(12), 2105–2111 (1989)
Article Google Scholar
Derksen, S., Keselman, H.J.: Backward, forward and stepwise automated subset selection algorithms: frequency of obtaining authentic and noise variables. Br. J. Math. Stat. Psychol. 45(2), 265–282 (1992)
Article Google Scholar
Lino, A., Rocha, Á., Sizo, A.: Virtual teaching and learning environments: automatic evaluation with artificial neural networks. Cluster Comput, 1–11 (2017)
Google Scholar

Download references

Author information

Authors and Affiliations

Polish-Japanese Academy of Information Technology, Warsaw, Poland
Krzysztof Wołk
Medical University of Warsaw, Warsaw, Poland
Wojciech Glinkowski & Agnieszka Żukowska

Authors

Krzysztof Wołk
View author publications
You can also search for this author in PubMed Google Scholar
Wojciech Glinkowski
View author publications
You can also search for this author in PubMed Google Scholar
Agnieszka Żukowska
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Krzysztof Wołk .

Editor information

Editors and Affiliations

Departamento de Engenharia Informática, Universidade de Coimbra, Coimbra, Portugal
Álvaro Rocha
College of Engineering, The Ohio State University, Columbus, Ohio, USA
Hojjat Adeli
DSI/EEUM, Universidade do Minho, Guimarães, Portugal
Luís Paulo Reis
DIMES, Universita della Calabria, Arcavacata di Rende, Italy
Sandra Costanzo

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wołk, K., Glinkowski, W., Żukowska, A. (2018). Enhancing the Assessment of (Polish) Translation in PROMIS Using Statistical, Semantic, and Neural Network Metrics. In: Rocha, Á., Adeli, H., Reis, L., Costanzo, S. (eds) Trends and Advances in Information Systems and Technologies. WorldCIST'18 2018. Advances in Intelligent Systems and Computing, vol 746. Springer, Cham. https://doi.org/10.1007/978-3-319-77712-2_34

Download citation

DOI: https://doi.org/10.1007/978-3-319-77712-2_34
Published: 17 May 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-77711-5
Online ISBN: 978-3-319-77712-2
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics