Abstract
This paper describes an experiment comparing results of machine translation between two pairs of related Slavic languages. Two language pairs on three different translation platforms were observed in the experiment. One pair represents really very close languages (Czech and Slovak), the other pair are slightly less similar languages (Slovenian and Croatian). The comparison is performed by means of three MT systems, one for each pair representing rule-based approach, the other one representing statistical (same system for both language pairs) approach to the task. Both sets of results are manually evaluated by native speakers of the target languages.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Corbi-Bellot, A.M., Forcada, M.L., Ortiz-Rojas, S.: An open-source shallow-transfer machine translation engine for the Romance languages of Spain. In: Proceedings of the EAMT Conference, HITEC e.V., pp. 79–86 (2005)
Hajič, J., Hric, J., Kuboň, V.: Machine translation of very close languages. In: Proceedings of ANLP, pp. 7–12 (2000)
Vičič, J.: A fast implementation of rules based machine translation systems for similar natural language. Informatica 37(4), 455–456 (2013)
Koehn, P., et al.: Moses: open source toolkit for statistical machine translation. In: Proceedings of ACL 2007, pp. 177–180 (2007)
Kuboň, V., Vičič, J.: A comparison of MT methods for closely related languages : a case study on Czech - Slovak language pair. In: EMNLP, pp. 92–98 (2014)
NIST: NIST 2009 Open Machine Translation Evaluation (MT09). Technical report, NIST (2009)
Bojar, O., et al.: Findings of the 2013 workshop on SMT. In: 2013 Workshop on SMT, pp. 1–44 (2013)
Weijnitz, P., Forsbom, E., Gustavii, E., Pettersson, E., Tiedemann, J.: MT goes farming: comparing two machine translation approaches on a new domain. In: LREC, pp. 1–4 (2004)
Kolovratnik, D., Klyueva, N., Bojar, O.: Statistical machine translation between related and unrelated languages. In: Proceedings of TPIT, pp. 31–36 (2009)
Callison-Burch, C., Osborne, M., Koehn, P.: Re-evaluating the role of BLEU in machine translation research. In: Proceedings of EACL, pp. 249–256 (2006)
Apertium: Apertium: machine translation toolbox (2014)
Kruskal, W.H., Wallis, W.A.: Use of Ranks in One-Criterion Variance Analysis. Journal of the American Statistical Association 47(260), 583–621 (1952)
Fleiss, J.L.: Measuring nominal scale agreement among many raters (1971)
Landis, J.R., Koch, G.G.: The measurement of observer agreement for categorical data. Biometrics 33(1), 159–174 (1977)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Vičič, J., Kuboň, V. (2015). A Comparison of MT Methods for Closely Related Languages: A Case Study on Czech – Slovak and Croatian – Slovenian Language Pairs. In: Král, P., Matoušek, V. (eds) Text, Speech, and Dialogue. TSD 2015. Lecture Notes in Computer Science(), vol 9302. Springer, Cham. https://doi.org/10.1007/978-3-319-24033-6_25
Download citation
DOI: https://doi.org/10.1007/978-3-319-24033-6_25
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24032-9
Online ISBN: 978-3-319-24033-6
eBook Packages: Computer ScienceComputer Science (R0)