Skip to main content

Enhancing the Assessment of (Polish) Translation in PROMIS Using Statistical, Semantic, and Neural Network Metrics

  • Conference paper
  • First Online:
Book cover Trends and Advances in Information Systems and Technologies (WorldCIST'18 2018)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 746))

Included in the following conference series:

Abstract

Differences in culture and language create the need for translators to convert text from one language into another. In order to preserve meaning, context must be analyzed in detail in translation. This study aims to develop accurate evaluation metrics for translations within the PROMIS (Patient-Reported Outcomes Measurement Information System) process, particularly in the reconciliation step, by providing reviews by experts as additional information following backward translation. The result is a semi-automatic semantic evaluation metric for Polish based on the concept of the human-aided translation evaluation metric (HMEANT). We assessed the proposed metrics using a statistics-based support vector machine classifier and applied deep neural networks to replicate the operation of the human brain. We compared the results of the proposed metrics with human judgment and well-known machine translation metrics, such as BLEU (Bilingual Evaluation Understudy), NIST, TER (Translation Error Rate), and METEOR (Metric for Evaluation of Translation with Explicit Ordering). We found that a few of the proposed metrics were highly correlated with human judgment and provided additional semantic information independent of human experience. This showed that the proposed metrics can help assess translations in PROMIS.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    LTSM is a sequence learning technique that uses memory cells to preserve states over a long period of time, which allows distributed representations of sentences through distributed representations of words

  2. 2.

    www.nihpromis.org

References

  1. Birch, A., Haddow, B., Germann, U., Nadejde, M., Buck, C., Koehn, P.: The feasibility of HMEANT as a human MT evaluation metric. In: Proceedings of the Eighth Workshop on Statistical Machine Translation, pp. 52–61 (2013)

    Google Scholar 

  2. Furey, T.S., Cristianini, N., Duffy, N., Bednarski, D.W., Schummer, M., Haussler, D.: Support vector machine classification and validation of cancer tissue samples using microarray expression data. Bioinformatics 16(10), 906–914 (2000)

    Article  Google Scholar 

  3. Cettolo, M., Girardi, C., Federico, M.: Wit3: web inventory of transcribed and translated talks. In: Proceedings of the 16th Conference of the European Association for Machine Translation (EAMT), pp. 261–268 (2012)

    Google Scholar 

  4. Graves, A., Schmidhuber, J.: Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Netw. 18(5), 602–610 (2005)

    Article  Google Scholar 

  5. Tai, K.S., Socher, R., Manning, C.D.: Improved semantic representations from tree-structured long short-term memory networks. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, pp. 1556–1566 (2015)

    Google Scholar 

  6. Papineni, K., Roukos, S., Ward, T., Zhu, W.J.: BLEU: a method for automatic evaluation of machine translation. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp. 311–318 (2002)

    Google Scholar 

  7. Zhang, Y., Vogel, S., Waibel, A.: Interpreting BLEU/NIST scores: how much improvement do we need to have a better system? In: LREC (2004)

    Google Scholar 

  8. Snover, M., Dorr, B., Schwartz, R., Micciulla, L., Makhoul, J.: A study of translation edit rate with targeted human annotation. In: Proceedings of Association for Machine Translation in the Americas, pp. 223–231 (2006)

    Google Scholar 

  9. Banerjee, S., Lavie, A.: METEOR: an automatic metric for MT evaluation with improved correlation with human judgments. In: Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization, vol. 29, pp. 65–72 (2005)

    Google Scholar 

  10. Koehn, P.: Statistical Machine Translation. Cambridge University Press, Cambridge (2009)

    Book  Google Scholar 

  11. US Department of Health and Human Services. PROMIS: Instrument Development and Psychometric Evaluation Scientific Standards. Bethesda: National Institutes of Health (2012). http://www.nihpromis.com/Documents/PROMIS_Standards_050212.pdf. Accessed 8 Mar 2017

  12. Wołk, K., Marasek, K.: Unsupervised comparable corpora preparation and exploration for bi-lingual translation equivalents. In: Proceedings of the 12th IWSLT, pp. 118–125 (2015)

    Google Scholar 

  13. Bonomi, A.E., Cella, D.F., Hahn, E.A., Bjordal, K., Sperner-Unterweger, B., Gangeri, L., Bergman, B., Willems-Groot, J., Hanquet, P., Zittoun, R.: Multilingual translation of the functional assessment of cancer therapy (FACT) quality of life measurement system. Qual. Life Res. 5(3), 309–320 (1996)

    Article  Google Scholar 

  14. Wild, D., Eremenco, S., Mear, I., Martin, M., Houchin, C., Gawlicki, M., Hareendran, A., Wiklund, I., Chong, L.Y., von Maltzahn, R., Cohen, L., Molsen, E.: Multinational trials-recommendation on the translations required, approaches to using the same language in different countries, and the approaches to support pooling the data: the ISPOR patient reported outcome translation and linguistic validation good practice task force report. Value Health 12(4), 430–440 (2009)

    Article  Google Scholar 

  15. Wołk, K., Marasek, K.: Polish-English speech statistical machine translation systems for the IWSLT 2013. In: Proceedings of the 10th International Workshop on Spoken Language Translation, pp. 113–119 (2015)

    Google Scholar 

  16. Wołk, K., Marasek, K.: Polish – English speech statistical machine translation systems for the IWSLT 2014. In: Proceedings of the 11th International Workshop on Spoken Language Translation, pp. 143–149 (2014)

    Google Scholar 

  17. Axelrod, A., He, X., Gao, J.: Domain adaptation via pseudo in-domain data selection. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 355–362 (2011)

    Google Scholar 

  18. Wang, L., Wong, D.F., Chao, L.S., Lu, Y., Xing, J.: A systematic comparison of data selection criteria for SMT domain adaptation. Sci. World J. 2014, 745485 (2014)

    Google Scholar 

  19. Berrotarán, G., Carrascosa, R., Vine, A.: Yalign documentation (2013). https://yalign.readthedocs.io/en/latest/. Accessed 01 June 2016

  20. Junczys-Dowmunt, M., Szał, A.: Symgiza ++: symmetrized word alignment models for statistical machine translation. In: Bouvry, P., Kłopotek, M.A., Leprévost, F., Marciniak, M., Mykowiecka, A., Rybiński, H. (eds.) Security and Intelligent Information Systems, pp. 379–390. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  21. Moses Statistical Machine Translation, “OOVs” (2015). http://www.statmt.org/moses/?n=Advanced.OOVs#ntoc2. Accessed 27 Sept 2015

  22. Durrani, N., Hoang, H., Koehn, P., Sajjad, H.: Integrating an unsupervised transliteration model into statistical machine translation. In: Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, pp. 148–153 (2014)

    Google Scholar 

  23. Heafield, K.: KenLM: faster and smaller language model queries. In: Proceedings of the Sixth Workshop on Statistical Machine Translation, pp. 187–197 (2011)

    Google Scholar 

  24. Costa-Jussà, M.R., Fonollosa, J.R.: Using linear interpolation and weighted reordering hypotheses in the Moses system. In: A: International Conference on Language Resources and Evaluation. Seventh Conference on International Language Resources and Evaluation, pp. 1712–1718 (2010)

    Google Scholar 

  25. Moses statistical machine translation, “Build reordering model” (2013). http://www.statmt.org/moses/?n=FactoredTraining.BuildReorderingModel. Accessed 10 Oct 2015

  26. Wołk, K., Marasek, K.: A sentence meaning-based alignment method for parallel text corpora preparation. In: Rocha, Á., Correia, A.M., Tan, F.B., Stroetmann, K.A. (eds.) New Perspectives in Information Systems and Technologies, pp. 229–237. Springer, Cham (2014)

    Chapter  Google Scholar 

  27. Moore, R.C., Lewis, W.: Intelligent selection of language model training data. In: Proceedings of the ACL 2010 Conference Short Papers, pp. 220–224 (2010)

    Google Scholar 

  28. Dieny, R., Thevenon, J., Martinez-Del-Rincon, J., Nebel, J.-C.: Bioinformatics inspired algorithm for stereo correspondence. In: International Conference on Computer Vision Theory and Application, pp. 465–473 (2011)

    Google Scholar 

  29. Doddington, G.: Automatic evaluation of machine translation quality using n-gram co-occurrence statistics. In: Proceedings of the Second International Conference on Human Language Technology Research, pp. 138–145 (2002)

    Google Scholar 

  30. Koehn, P., Federico, M., Shen, W., Bertoldi, N., Callison-Burch, C., Bojar, O., Cowan, B., Dyer, C., Hoang, H., Zens, R., Constantin, A., Herbst, E., Moran, C.: Open-source toolkit for statistical machine translation. In: Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions, Factored Translation Models and Confusion Network Decoding, pp. 177–180 (2007)

    Google Scholar 

  31. Lo, C.-K., Wu, D.: MEANT: an inexpensive, high-accuracy, semi-automatic metric for evaluating translation utility via semantic frames. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, vol. 1, pp. 220–229 (2011)

    Google Scholar 

  32. Robertson, M.J., Kendall, P.C., Ritchie, S., Mcilroy, P.W., Adams, M.J.: The weighted index method: a new technique for analyzing planar optical waveguides. J. Lightwave Technol. 7(12), 2105–2111 (1989)

    Article  Google Scholar 

  33. Derksen, S., Keselman, H.J.: Backward, forward and stepwise automated subset selection algorithms: frequency of obtaining authentic and noise variables. Br. J. Math. Stat. Psychol. 45(2), 265–282 (1992)

    Article  Google Scholar 

  34. Lino, A., Rocha, Á., Sizo, A.: Virtual teaching and learning environments: automatic evaluation with artificial neural networks. Cluster Comput, 1–11 (2017)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Krzysztof Wołk .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wołk, K., Glinkowski, W., Żukowska, A. (2018). Enhancing the Assessment of (Polish) Translation in PROMIS Using Statistical, Semantic, and Neural Network Metrics. In: Rocha, Á., Adeli, H., Reis, L., Costanzo, S. (eds) Trends and Advances in Information Systems and Technologies. WorldCIST'18 2018. Advances in Intelligent Systems and Computing, vol 746. Springer, Cham. https://doi.org/10.1007/978-3-319-77712-2_34

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-77712-2_34

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-77711-5

  • Online ISBN: 978-3-319-77712-2

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics