Skip to main content

Chinese Historical Term Translation Pairs Extraction Using Modern Chinese as a Pivot Language

  • Conference paper
  • First Online:
  • 4145 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11856))

Abstract

Term translation of Chinese historical classics is very difficult and time-consuming work, and using term alignment methods to extract term translation pairs is of great help for historical term translation. However, the limited bilingual corpora resources of historical classics and special morphology of the ancient Chinese result in poor performance of term alignment. To this end, this paper proposes a historical term alignment method using modern Chinese as a pivot language. The method first identifies English terms by rules, then aligns them from English to modern Chinese and then from modern Chinese to ancient Chinese. The use of English-modern Chinese corpus and modern-ancient Chinese corpus instead of English-ancient Chinese corpus solves the shortage problem of the parallel corpus. Moreover, using modern Chinese as a pivot language effectively reduces the alignment errors caused by the abbreviations and the interchangeable characters of ancient Chinese. In the term alignment experiment on Shiji, our method outperformed the direct alignment method significantly, which proves the validity of our method.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    https://github.com/fxsjy/jieba.

  2. 2.

    https://github.com/supercar101/Word-Segmentation-Method-of-Ancient-Chinese/tree/master.

  3. 3.

    https://codeload.github.com/moses-smt/giza-pp/zip/master.

References

  1. Huang, Z.: English translation of cultural classics and postgraduate teaching of translation in Suzhou University. Shanghai J. Translators 1, 56–58 (2007). (In Chinese)

    Google Scholar 

  2. Li, X., Che, C., Liu, X., Lin, H., Wang, R.: Corpus-based extraction of Chinese historical term translation equivalents. Int. J. Asian Lang. Process. 20(2), 63–74 (2010)

    Google Scholar 

  3. Che, C., Zheng, X.: The extraction of term translation pairs for chinese historical classics based on sub-words. J. Chin. Inf. Process. 30(3), 46–51 (2016). (In Chinese)

    Google Scholar 

  4. Yout, G.-W., Hwangt, S.-W., Song, Y.-I., et al.: Mining name translations from entity graph mapping. In: Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 430–439. ACL, Stroudsburg (2010)

    Google Scholar 

  5. Feng, D., Lv, Y., Zhou, M.: A new approach for English-Chinese named entity alignment. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 372–379. ACL, Stroudsburg (2004)

    Google Scholar 

  6. Lee, C.-J., Chang, J.S., Jang, J.-S.R.: Alignment of bilingual named entities in parallel corpora using statistical models and multiple knowledge sources. ACM Trans. Asian Lang. Inf. Process. (TALIP) 5(2), 121–145 (2006)

    Article  Google Scholar 

  7. Zhang, Y., Wang, Y., Cen, L., et al.: Fusion of multiple features and ranking SVM for web-based English-Chinese OOV term translation. In: 23rd International Conference on Computational Linguistics (COLING), pp. 1435–1443. ACM, New York (2010)

    Google Scholar 

  8. Chen, Y., Zong, C., Su, K.-Y.: On jointly recognizing and aligning bilingual named entities. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp. 631–639. ACL, Stroudsburg (2010)

    Google Scholar 

  9. Chen, Y., Zong, C., Su, K.-Y.: A joint model to identify and align bilingual named entities. Comput. Linguist. 39(2), 229–266 (2013)

    Article  Google Scholar 

  10. Wang, M., Che, W., Manning, C.D.: Joint word alignment and bilin-gual named entity recognition using dual de-composition. In: Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, pp. 1073–1082. ACL, Stroudsburg (2013)

    Google Scholar 

  11. Wu, H., Wang, H.: Pivot language approach for phrase-based statistical machine translation. Mach. Transl. 21(3), 165–181 (2007)

    Article  Google Scholar 

  12. Wu, H., Wang, H.: Revisiting pivot language approach for machine translation. In: Proceedings of the Meeting of the Association for Computational Linguistics and the International Joint Conference on Natural Language Processing of the AFNLP, pp. 154–162. ACL, Stroudsburg (2009)

    Google Scholar 

  13. Durrani, N., Koehn, P.: Improving machine translation via triangulation and transliteration. In: Conference of the European Association for Machine Translation, Dubrovnik, Croatia, pp. 71–78 (2014)

    Google Scholar 

  14. Zhu, X., He, Z., Wu, H., Zhu, C., Wang, H., Zhao, T.: Improving pivot-based statistical machine translation by pivoting the co-occurrence count of phrase pairs. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1665–1675. ACL, Stroudsburg (2014)

    Google Scholar 

  15. Brown, P.E., Pietra, S.A.D., Pietra, V.J.D., Mercer, R.L.: The mathematics of statistical machine translation: parameter estimation. Comput. Linguis. 19(2), 263–311 (1993)

    Google Scholar 

  16. Watson, B.: Records of the Grand Historian of China. Columbia University Press, New York (1961)

    Google Scholar 

  17. Watson, B.: Records of the Grand Historian: Qin Dynasty. Chinese University of Hong Kong/Columbia University Press, Hong Kong/New York (1993)

    Google Scholar 

  18. Och, F.J., Ney, H.: A systematic comparison of various statistical alignment models. Comput. Linguis. 29(1), 19–51 (2003)

    Article  Google Scholar 

Download references

Acknowledgements

This work is supported by the National Natural Science Foundation of China (No. 61402068).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Chao Che .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wu, X., Zhao, H., Jing, L., Che, C. (2019). Chinese Historical Term Translation Pairs Extraction Using Modern Chinese as a Pivot Language. In: Sun, M., Huang, X., Ji, H., Liu, Z., Liu, Y. (eds) Chinese Computational Linguistics. CCL 2019. Lecture Notes in Computer Science(), vol 11856. Springer, Cham. https://doi.org/10.1007/978-3-030-32381-3_29

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-32381-3_29

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-32380-6

  • Online ISBN: 978-3-030-32381-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics