Abstract
We present a phrase-based statistical machine translation (SMT) system for Bangla to English that incorporates a novel transliteration module, and a specialized component for handling prepositions and Bangla compound words. We evaluate our components through their impact on the BLEU score for the phrase-based SMT system. According to the experimental results, the transliteration component has the most significant impact on the BLEU score. We also provide a new test set with multiple references between Bangla and English for MT evaluation purposes. Finally we propose a new manual evaluation approach for the MT community and evaluate our components using the new manual evaluation approach.
This research was partially supported by a discovery grant from NSERC, Canada.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
AbdulJaleel, N., Larkey, L.S.: Statistical Transliteration for English-Arabic Cross Language Information Retrieval. In: Proc. of CIKM 2003 (2003)
Koehn, P., Knight, K.: Empirical Methods for Compound Splitting. In: EACL 2003, pp. 187–194 (2003)
Koehn, P., Hoang, H., Birch, A., Callison-Burch, C., Federico, M., Bertoldi, N., Cowan, B., Shen, W., Moran, C., Zens, R., Dyer, C., Bojar, O., Constantin, A., Herbst, E.: Moses: Open source toolkit for statistical machine translation. In: Proceedings of the Annual Meeting of the Association for Computational Linguistics (2007)
Naskar, S.K., Bandyopadhyay, S.: A Phrasal EBMT System for Translating English to Bangla. In MT Summit X (2005)
Rahman, A., Islam, S., Alim, A., Hasan, K.: A Rule Based English-Bangla MT System for Compound Sentences. In: Proceedings of NCCPB (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Roy, M., Popowich, F. (2010). Phrase-Based Statistical Machine Translation for a Low-Density Language Pair. In: Farzindar, A., Kešelj, V. (eds) Advances in Artificial Intelligence. Canadian AI 2010. Lecture Notes in Computer Science(), vol 6085. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13059-5_27
Download citation
DOI: https://doi.org/10.1007/978-3-642-13059-5_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-13058-8
Online ISBN: 978-3-642-13059-5
eBook Packages: Computer ScienceComputer Science (R0)