Comparing and Integrating Alignment Template and Standard Phrase-Based Statistical Machine Translation

  • Conference paper
Computational Linguistics and Intelligent Text Processing (CICLing 2007)

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 4394)

Abstract

In statistical machine translation (SMT) research, phrase-based methods have attracted growing interest in recent years. In this paper, we first give a brief survey of the phrase-based SMT framework and then compare in detail two typical implementations: the alignment template approach and the standard phrase-based approach. Finally, we propose an improved model that integrates alignment templates into standard phrase-based SMT as a new feature in a log-linear model. Experimental results show that our method outperforms the baseline method.
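As a rough illustration of what adding a feature to a log-linear model means, the sketch below scores each translation candidate as a weighted sum of feature values and picks the highest-scoring one. It is not the paper's implementation: the feature names (`phrase_tm`, `lm`, `alignment_template`), the weights, and all probabilities are hypothetical values chosen only to show how a non-zero weight on an extra feature can change the ranking.

```python
import math


def loglinear_score(features, weights):
    """Weighted sum of feature values: sum over m of lambda_m * h_m(e, f)."""
    return sum(weights[name] * value for name, value in features.items())


def best_hypothesis(hypotheses, weights):
    """Return the candidate with the highest log-linear score."""
    return max(hypotheses, key=lambda h: loglinear_score(h["features"], weights))


# Hypothetical candidates; each feature value is a log-probability.
hypotheses = [
    {
        "text": "candidate A",
        "features": {
            "phrase_tm": math.log(0.5),
            "lm": math.log(0.2),
            "alignment_template": math.log(0.1),
        },
    },
    {
        "text": "candidate B",
        "features": {
            "phrase_tm": math.log(0.4),
            "lm": math.log(0.2),
            "alignment_template": math.log(0.6),
        },
    },
]

# Baseline: the extra feature has weight 0, so it does not affect the score.
baseline_weights = {"phrase_tm": 1.0, "lm": 1.0, "alignment_template": 0.0}
# Extended model: the extra feature receives a non-zero (assumed) weight.
extended_weights = {"phrase_tm": 1.0, "lm": 1.0, "alignment_template": 1.0}

baseline_best = best_hypothesis(hypotheses, baseline_weights)["text"]
extended_best = best_hypothesis(hypotheses, extended_weights)["text"]
print(baseline_best, extended_best)
```

In a real system the weights would be tuned on held-out data rather than set by hand, and the candidates would come from a decoder's search rather than a fixed list.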

Editor information

Alexander Gelbukh

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Xu, L., Cao, X., Zhang, B., Li, M. (2007). Comparing and Integrating Alignment Template and Standard Phrase-Based Statistical Machine Translation. In: Gelbukh, A. (ed.) Computational Linguistics and Intelligent Text Processing. CICLing 2007. Lecture Notes in Computer Science, vol 4394. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-70939-8_37

  • DOI: https://doi.org/10.1007/978-3-540-70939-8_37

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-70938-1

  • Online ISBN: 978-3-540-70939-8

  • eBook Packages: Computer Science, Computer Science (R0)
