Japanese-to-English translations of tense, aspect, and modality using machine-learning methods and comparison with machine-translation systems on market

Murata, Masaki; Ma, Qing; Uchimoto, Kiyotaka; Kanamaru, Toshiyuki; Isahara, Hitoshi

doi:10.1007/s10579-007-9022-z

Japanese-to-English translations of tense, aspect, and modality using machine-learning methods and comparison with machine-translation systems on market

Published: 19 July 2007

Volume 40, pages 233–242, (2006)
Cite this article

Language Resources and Evaluation Aims and scope Submit manuscript

Masaki Murata¹,
Qing Ma^1,2,
Kiyotaka Uchimoto¹,
Toshiyuki Kanamaru¹ &
…
Hitoshi Isahara¹

371 Accesses
3 Citations
Explore all metrics

Abstract

This paper describes experiments carried out utilizing a variety of machine-learning methods (the k-nearest neighborhood, decision list, maximum entropy, and support vector machine), and using six machine-translation (MT) systems available on the market for translating tense, aspect, and modality. We found that all these, including the simple string-matching-based k-nearest neighborhood used in a previous study, obtained higher accuracy rates than the MT systems currently available on the market. We also found that the support vector machine obtained the best accuracy rates (98.8%) of these methods. Finally, we analyzed errors against the machine-learning methods and commercially available MT systems and obtained error patterns that should be useful for making future improvements.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Slavic languages in phrase-based statistical machine translation: a survey

Article 06 May 2017

Multi-classifier Combination for Translation Error Detection

Learning sign language machine translation based on elastic net regularization and latent semantic analysis

Article 14 January 2016

Notes

The gold standard data were prepared by using system outputs and the gold standard was then biased using these system outputs. However, the bias was small because we used a tag in the corpus and also tense/aspect/modality expressions generated by three translators.
There might be a small number of cases where the tense/aspect/modality categories of a MT system are judged to be incorrect when a tense/aspect/modality selection inside the MT engine is correct but the generation module produces a wrong output string.

References

Cristianini, N., & Shawe-Taylor, J. (2000). An introduction to support vector machines and other kernel-based learning methods. Cambridge University Press.
Daelemans, W., Zavrel, J., van der Sloot, K., & van den Bosch, A. (1995). TiMBL: Tilburg Memory Based Learner version 3.0 Reference Guide. Technical report. ILK Technical Report-ILK 00-01 (http://www.ilk.kub.nl/ilk/papers/ilk0001.ps.gz).
Kudoh, T., & Matsumoto, Y. (2000). Use of support vector learning for chunk identification. In Proceedings of the 4th Conference on Computational Natural Language Learning and of the Second Learning Language in Logic Workshop (CoNLL-2000 and LLL-2000), in Lisbon, Portugal on September 13–14 (pp.142–144).
Murata, M., Ma, Q., Uchimoto, K., & Isahara, H. (1999). An example-based approach to Japanese-to-English translation of tense, aspect, and modality. In Proceedings of the 8th International Conference on Theoretical and Methodological Issues in Machine Translation (TMI-99), in Chester, England on August 23–25 (pp. 66–76).
Murata, M., Utiyama, M., Uchimoto, K., Ma, Q., & Isahara, H. (2005). Correction of errors in a verb modality corpus used for machine translation with a machine-learning method. ACM Transactions on Asian Language Information Processing, 4(1), 18–37.
Google Scholar
Ristad, E. S. (1997). Maximum entropy modeling for natural language. Madrid: ACL/EACL Tutorial Program.
Shirai, S., Yokoo, A., & Bond, F. (1990). Generation of tense in newspaper translation. In Proceedings of 1990 Fall Institute of Electronics, Information and Communication Engineers (IEICE) Meeting, Vol. 6, D-69, in Hiroshima, Japan on October 1–4 (pp. 69) (in Japanese).
Weller, S. C., & Romney, A. K. (1990). Metric scaling: correspondence analysis (quantitative applications in the social sciences). SAGE Publications.
Yarowsky, D. (1994). Decision lists for lexical ambiguity resolution: application to accent restoration in Spanish and French. In Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics (ACL), in Las Cruces, New Mexico on June 27–30 (pp. 88–95).

Download references

Author information

Authors and Affiliations

National Institute of Information and Communications Technology, 3-5 Hikaridai, Seika-cho, Soraku-gun, Kyoto, 619-0289, Japan
Masaki Murata, Qing Ma, Kiyotaka Uchimoto, Toshiyuki Kanamaru & Hitoshi Isahara
Ryukoku University, Otsu, Shiga, 520-2194, Japan
Qing Ma

Authors

Masaki Murata
View author publications
You can also search for this author in PubMed Google Scholar
Qing Ma
View author publications
You can also search for this author in PubMed Google Scholar
Kiyotaka Uchimoto
View author publications
You can also search for this author in PubMed Google Scholar
Toshiyuki Kanamaru
View author publications
You can also search for this author in PubMed Google Scholar
Hitoshi Isahara
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Masaki Murata.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Murata, M., Ma, Q., Uchimoto, K. et al. Japanese-to-English translations of tense, aspect, and modality using machine-learning methods and comparison with machine-translation systems on market. Lang Resources & Evaluation 40, 233–242 (2006). https://doi.org/10.1007/s10579-007-9022-z

Download citation

Received: 21 August 2006
Accepted: 14 May 2007
Published: 19 July 2007
Issue Date: December 2006
DOI: https://doi.org/10.1007/s10579-007-9022-z

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Japanese-to-English translations of tense, aspect, and modality using machine-learning methods and comparison with machine-translation systems on market

Abstract

Access this article

Similar content being viewed by others

Slavic languages in phrase-based statistical machine translation: a survey

Multi-classifier Combination for Translation Error Detection

Learning sign language machine translation based on elastic net regularization and latent semantic analysis

Notes

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Japanese-to-English translations of tense, aspect, and modality using machine-learning methods and comparison with machine-translation systems on market

Abstract

Access this article

Similar content being viewed by others

Slavic languages in phrase-based statistical machine translation: a survey

Multi-classifier Combination for Translation Error Detection

Learning sign language machine translation based on elastic net regularization and latent semantic analysis

Notes

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation