skip to main content
note

Improving Chinese-Vietnamese Neural Machine Translation with Linguistic Differences

Authors Info & Claims
Published:25 March 2022Publication History
Skip Abstract Section

Abstract

We present a simple, efficient data augmentation approach for boosting Chinese-Vietnamese neural machine translation performance by leveraging the linguistic difference between the two languages. We first define the formalized representation of modifier symmetry, which is one of the most representative linguistic differences between Chinese and Vietnamese. We then propose and test two data augmentation strategies for leveraging the linguistic difference, which can be integrated naturally with different translation models. Results indicate that both strategies can introduce linguistic rules to boost translation accuracy. Tests on Chinese-Vietnamese benchmarks show significant accuracy improvements. To facilitate studies in this domain, we also release an open-source toolkit1 with flexible implementation for Chinese-Vietnamese linguistic difference tagging.

REFERENCES

  1. [1] Sutskever I., Vinyals O., and Le Q. V.. 2014. Sequence to sequence learning with neural networks. In Proceedings of the 27th International Conference on Neural Information Processing Systems -Volume 2 (NIPS'14). MIT Press, Cambridge, MA, USA, 31043112.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. [2] Bahdanau D., Cho K., and Bengio Y.. 2015. Neural machine translation by jointly learning to align and translate. arXiv.2014. 1409.0473.Google ScholarGoogle Scholar
  3. [3] Vaswani A., Shazeer N., Parmar N., Uszkoreit J., Jones L., Gomez A. N., Kaiser Ł., and Polosukhin I.. 2017. Attention is all you need. In Advances in Neural Information Processing Systems 30, 59986008.Google ScholarGoogle Scholar
  4. [4] Koehn P. and Knowles R.. 2017. Six challenges for neural machine translation. In Proceedings of the 1st Workshop on Neural Machine Translation. Vancouver, Canada, 2017, 2839.Google ScholarGoogle ScholarCross RefCross Ref
  5. [5] Tan Z., Wang S., Yang Z., Chen G., Huang X., Sun M., and Liu Y.. 2021. Neural machine translation: A review of methods, resources, and tools. arXiv:2012.15515.Google ScholarGoogle Scholar
  6. [6] Yu Z., Yu Z., Guo J., Huang Y., and Wen Y.. 2019. Efficient low-resource neural machine translation with reread and feedback mechanism. ACM Transactions on Asian and Low-Resource Language Information Processing 19, 3, Article 34 (2019), 13 pages. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. [7] Xia M., Kong X., Anastasopoulos A., and Neubig G.. 2019. Generalized data augmentation for low-resource translation. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Florence, Italy, July 28–August 2, 2019, 57865796.Google ScholarGoogle ScholarCross RefCross Ref
  8. [8] Burlot F. and Yvon F.. 2019. Using monolingual data in neural machine translation: A systematic study. arxiv:cs.CL/1903.11437.Google ScholarGoogle Scholar
  9. [9] Song K., Zhang Y., Yu H., Luo W., Wang K., and Zhang M.. 2019. Code-Switching for Enhancing NMT with Pre-Specified Translation. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Minneapolis, Minnesota. Association for Computational Linguistics, 449459.Google ScholarGoogle Scholar
  10. [10] Georgiana D., Prashant M., Marcello F., and Yaser A.. 2019. Training neural machine translation to apply terminology constraints. arxiv:cs.CL/1906.01105.Google ScholarGoogle Scholar
  11. [11] Gao S., Huang J., Xue M., Yu Z., Wang Z., and Zhang Y.. 2019. Syntax-based chinese-vietnamese tree-to-tree statistical machine translation with bilingual features. ACM Transactions on Asian and Low-Resource Language Information Processing 18, 4, 36 (2019), 20. DOI: Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. [12] Che W., Yu Z., Yu Z., Wen Y., and Guo J.. 2020. Towards integrated classification lexicon for handling unknown words in Chinese-Vietnamese neural machine translation. ACM Transactions on Asian and Low-Resource Language Information Processing 19, 3 (2020), 42 (2020), 17 pages. DOI: Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. [13] Huu A. T., Huang H., and Shi S.. 2018. Integrating pronunciation into Chinese-Vietnamese statistical machine translation. Tsinghua Science and Technology 23, 6 (2018), 715723.Google ScholarGoogle Scholar
  14. [14] Huu A. T., Huang H., and Shi S.. 2019. Preordering for Chinese-Vietnam statistical machine translation. IEICE Transactions on Information and Systems E102-D, 2, 375382.Google ScholarGoogle Scholar
  15. [15] Huu A. T., Tran P., Dinh D., Vu V. V., and Le T.. 2018. Dependency-based pre-ordering of preposition phrases in Chinese-Vietnamese machine translation. ICIC Express Letters, Part B: Applications 9 (2018), 265272.Google ScholarGoogle Scholar
  16. [16] He J., Yu Z., Lv C., Lai H., Gao S., and Zhang Y.. 2017. Language post positioned characteristic based Chinese-Vietnamese statistical machine translation method. In Proceedings of the International Conference on Asian Language Processing (IALP’17), Singapore. IEEE, 2017.Google ScholarGoogle Scholar
  17. [17] Song K., Zhang Y., and Yu H.. 2019. Code-switching for enhancing NMT with pre-specified translation[C]. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL’19). Minneapolis, Minnesota, 2019.Google ScholarGoogle Scholar
  18. [18] Dinu G., Mathur P., Federico M., and Al-Onaizan Y.. Training neural machine translation to apply terminology constraints. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy. Association for Computational Linguistics, 30633068.Google ScholarGoogle Scholar
  19. [19] Sennrich R. and Zhang B.. 2019. Revisiting low-resource neural machine translation: A case study[C]. In 57th Annual Meeting of the Association for Computational Linguistics (ACL’19). Florence, Italy. Association for Computational Linguistics, 211--221.Google ScholarGoogle ScholarCross RefCross Ref
  20. [20] Vaswani A., Shazeer N., Parmar N., Uszkoreit J., Jones L., Gomez A. N., Kaiser L., and Polosukhin I.. 2017. Attention is all you need. In Advances in Neural Information Processing Systems. Guyon I., Luxburg U. V., Bengio S., Wallach H., Fergus R., Vishwanathan S., and Garnett R. (Eds.). 30. 59986008.Google ScholarGoogle Scholar
  21. [21] Kingma D. P. and Ba J.. 2015. Adam: A method for stochastic optimization. arXiv:1412.6980V5.Google ScholarGoogle Scholar
  22. [22] Papineni K., Roukos S., Ward T., and Zhu W.. 2002. Bleu: A method for automatic evaluation of machine translation. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, Philadelphia, Pennsylvania, USA. Association for Computational Linguistics, 311--318.Google ScholarGoogle Scholar
  23. [23] Philipp K.. 2004. Statistical significance tests for machine translation evaluation. In Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing.Google ScholarGoogle Scholar
  24. [24] Isozaki H., Hirao T., and Duh K.. 2010. Automatic evaluation of translation quality for distant language pairs. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing Cambridge, MA. Association for Computational Linguistics, 944--952.Google ScholarGoogle Scholar
  25. [25] Snover M., Madnani N., Dorr B. J., and Schwartz R.. 2009. Fluency, adequacy, or HTER?: Exploring different human judgments with a tunable MT metric. In Proceedings of the 4th Workshop on Statistical Machine Translation (StatMT’09). Association for Computational Linguistics, Stroudsburg, PA, 259268. http://dl.acm.org/citation.cfm?id=1626431.1626480.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Improving Chinese-Vietnamese Neural Machine Translation with Linguistic Differences

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Asian and Low-Resource Language Information Processing
      ACM Transactions on Asian and Low-Resource Language Information Processing  Volume 21, Issue 2
      March 2022
      413 pages
      ISSN:2375-4699
      EISSN:2375-4702
      DOI:10.1145/3494070
      Issue’s Table of Contents

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 25 March 2022
      • Revised: 1 July 2021
      • Accepted: 1 July 2021
      • Received: 1 April 2021
      Published in tallip Volume 21, Issue 2

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • note
      • Refereed
    • Article Metrics

      • Downloads (Last 12 months)59
      • Downloads (Last 6 weeks)3

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    View Full Text

    HTML Format

    View this article in HTML Format .

    View HTML Format