Abstract
A statistical machine translation (SMT) capability would be very useful in augmented reality (AR) systems. For example, translating and displaying text in a smart phone camera image would be useful to a traveler needing to read signs and restaurant menus, or reading medical documents when a medical problem arises when visiting a foreign country. Such system would also be useful for foreign students to translate lectures in real time on their mobile devices. However, SMT quality has been neglected in AR systems research, which has focused on other aspects, such as image processing, optical character recognition (OCR), distributed architectures, and user interaction. In addition, general-purpose translation services, such as Google Translate, used in some AR systems are not well-tuned to produce high-quality translations in specific domains and are Internet connection dependent. This research devised SMT methods and evaluated their performance for potential use in AR systems. We give particular attention to domain-adapted SMT systems, in which an SMT capability is tuned to a particular domain of text to increase translation quality. We focus on translation between the Polish and English languages, which presents a number of challenges due to fundamental linguistic differences. However, the SMT systems used are readily extensible to other language pairs. SMT techniques are applied to two domains in translation experiments: European Medicines Agency (EMEA) medical leaflets and the Technology, Entertainment, Design (TED) lectures. In addition, field experiments are conducted on random samples of Polish text found in city signs, posters, restaurant menus, lectures on biology and computer science, and medical leaflets. Texts from these domains are translated by a number of SMT system variants, and the systems’ performance is evaluated by standard translation performance metrics and compared. The results appear very promising and encourage future applications of SMT to AR systems.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Carmigniani, J.: Augmented Reality Methods and Algorithms for Hearing Augmentation. Florida Atlantic University (2011)
Fragoso, V., et al.: Translatar: a mobile augmented reality translator. In: 2011 IEEE Workshop on Applications of Computer Vision (WACV), pp. 497–502. IEEE (2011)
Gschwandtner, M., Kritz, M., Boyer, C.: Requirements of the health professional research. Technical Report (2011)
NeWo LLC.: EnWo—English cam translator. https://play.google.com/store/apps/details?id=com.newo.enwo. Accessed 12 Jan 2014
Cui, L., et al.: Multi-domain adaptation for SMT using multi-task learning. In: EMNLP, pp. 1055–1065 (2013)
Google.: Translate API. https://cloud.google.com/translate/v2/pricing. Accessed 6 April 2015
Wołk, K., Marasek, K.: A sentence meaning based alignment method for parallel text corpora preparation. New Perspectives in Information Systems and Technologies, vol. 1, pp. 229–237. Springer International Publishing (2014)
Martedi, S., Uchiyama, H., Saito, H.: Clickable augmented documents. In: 2010 IEEE International Workshop on Multimedia Signal Processing (MMSP), pp. 162–66. IEEE (2010)
Abbyy.: TextGrabber + Translator. http://abbyy-textgrabber.android.informer.com/. Accessed 16 Feb 2015
Du, J., et al.: Snap and translate using windows phone. In: 2011 International Conference on Document Analysis and Recognition (ICDAR), pp. 809–813. IEEE (2011)
The Economist. Word Lens: This changes everything. http://www.economist.com/blogs/gulliver/2010/12/instant_translation. Accessed 18 Dec 2010
DATAMARK, Inc.: OCR as a real-time language translator. https://www.datamark.net/blog/ocr-as-a-real-time-language-translator. Accessed 30 July 2012
Khan, T., et al.: Augmented reality based word translator. Int. J. Innov. Res. Comput. Sci. Technol. (IJIRCST) 2(2) (2014)
Mahbub-Uz-Zaman, S., Islam, T.: Application of augmented reality: mobile camera based bangla text detection and translation. 2012. PhD Thesis. BRAC University (2012)
Emmanuel, A.S., Nithyanandam, S.: An optimal text recognition and translation system for smart phones using genetic programming and cloud. Int. J. Eng. Sci. Innov. Technol. (IJESIT) 3(2), 437–443 (2014)
Toyama, T., et al.: A mixed reality head-mounted text translation system using eye gaze input. In: Proceedings of the 19th International Conference on Intelligent User Interfaces, pp. 329–334. ACM (2014)
Koehn, P., Schroeder, J.: Experiments in domain adaptation for statistical machine translation. In: Proceedings of the Second Workshop on Statistical Machine Translation. Association for Computational Linguistics, pp. 224–227 (2007)
Koehn, P., et al.: Moses: open source toolkit for statistical machine translation. In: Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions. Association for Computational Linguistics, pp. 177–180 (2007)
Wołk, K., Marasek, K.: Polish–English speech statistical machine translation systems for the IWSLT 2013. In: Proceedings of the 10th International Workshop on Spoken Language Translation, Heidelberg, Germany, pp. 113–119 (2013)
Wołk, K., Marasek, K.: A sentence meaning based alignment method for parallel text corpora preparation. New Perspectives in Information Systems and Technologies, vol. 1, pp. 229–237. Springer International Publishing (2014)
Koehn, P. What is a better translation? Reflections on six years of running evaluation campaigns. Tralogy (2011)
Wołk, K., Marasek, K.: Enhanced bilingual evaluation understudy. Lect. Notes Inf. Theory 2(2) (2014)
Banerjee, S., Lavie, A.: Meteor: an automatic metric for MT evaluation with improved correlation with human judgments. In: Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization, pp. 65–72 (2005)
Snover, M., et al.: A study of translation edit rate with targeted human annotation. In: Proceedings of Association for Machine Translation in the Americas, pp. 223–231 (2006)
Smith, R.: An overview of the tesseract OCR engine. ICDAR, pp. 629–633. IEEE (2007)
Heafield, K.: KenLM: faster and smaller language model queries. In: Proceedings of the Sixth Workshop on Statistical Machine Translation. Association for Computational Linguistics, pp. 187–197 (2011)
Information Systems Laboratory, Adam Mickiewicz University.: SyMGIZA++. http://psi.amu.edu.pl/en/index.php?title=SyMGIZA. Accessed 27 April 2011
Gao, Q., Vogel, S.: Parallel implementations of word alignment tool. Software Engineering, Testing, and Quality Assurance for Natural Language Processing. Association for Computational Linguistics, pp. 49–57 (2008)
Durrani, N., et al.: Integrating an unsupervised transliteration model into statistical machine translation. EACL 2014, 148 (2014)
Johnson, J.H., et al.: Improving translation quality by discarding most of the phrase table. Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL) (2007)
Lavie, A.: Evaluating the output of machine translation systems. AMTA Tutorial (2010)
KantanMT—a sophisticated and powerful machine translation solution in an easy-to-use package. http://www.kantanmt.com
Acknowledgments
This research was supported by Polish-Japanese Academy of Information Technology statutory resources (ST/MUL/2016) and resources for young researchers.
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing Switzerland
About this paper
Cite this paper
Wołk, K., Wołk, A., Marasek, K. (2017). Implementing Statistical Machine Translation into Mobile Augmented Reality Systems. In: Zgrzywa, A., Choroś, K., Siemiński, A. (eds) Multimedia and Network Information Systems. Advances in Intelligent Systems and Computing, vol 506. Springer, Cham. https://doi.org/10.1007/978-3-319-43982-2_6
Download citation
DOI: https://doi.org/10.1007/978-3-319-43982-2_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-43981-5
Online ISBN: 978-3-319-43982-2
eBook Packages: EngineeringEngineering (R0)