A Corpus for Amharic-English Speech Translation: The Case of Tourism Domain

Woldeyohannis, Michael Melese; Besacier, Laurent; Meshesha, Million

doi:10.1007/978-3-319-95153-9_12

Michael Melese Woldeyohannis²⁰,
Laurent Besacier²¹ &
Million Meshesha²⁰

Part of the book series: Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering ((LNICST,volume 244))

Included in the following conference series:

International Conference on Information and Communication Technology for Develoment for Africa

1214 Accesses
1 Citations

Abstract

Speech translation research for the major languages like English, Japanese and Spanish has been conducted since the 1980’s. But no attempt were made in speech translation to/from the under-resourced language like Amharic. These activities suffered from the lack of Amharic speech and Amharic-English text corpus suited for the development of speech translation between the two languages. In this paper, therefore, an attempt has been made to collect, translate and record speech data from resourced language (English) to under-resourced language (Amharic) taking a Basic Traveler Expression Corpus (BTEC) as domain. Since there is no any Amharic text and speech corpus readily available for speech translation purposes, first, 7.43 h of Amharic read-speech has been prepared from 8,112 sentences, and second, 19,972 parallel Amharic-English corpus has been prepared taking tourism as an application domain. The Amharic speech data is recorded using smart-phone based application tool, LIG-Aikuma under a normal working environment. With the availability of such standard speech and text corpus, researcher will find a ground to further explore speech translation to/from under resourced languages.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
http://data.worldbank.org/indicator/ST.INT.ARVL.
2.
http://www.waltainfo.com/news/detail/32016.
3.
Normalization in Amharic refers to converting the common graphemes of writing to one while in English it represent lowering the case.

References

Gao, Y., Gu, L., Zhou, B.: Speech-to-Speech Translation. World Scientific, Singapore (2007)
Google Scholar
Honda, M.: Human speech production mechanisms, NTT Commun. Sci. Lab. 1(2), 24–29 (2003)
Google Scholar
Karematsu, A., Morimoto, T.: Automatic Speech Translation: Fundamental Technology for Future Cross-Language Communication, vol. 11. Gordon and Breach Publishers, Philadelphia (1996)
Google Scholar
Gao, Y., Gu, L., Zhou, B., Sarikaya, R., Afify, M., Kuo, H.-K., Zhu, W., Deng, Y., Prosser, C., Zhang, W., Besacier, L.: IBM mastor: multilingual automatic speech-to-speech translator. In: Proceedings of 2006 IEEE International Conference Acoustics Speech Signal Processing, vol. 5, p. 5, December 2006
Google Scholar
Jurafsky, D., Martin, J.H.: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, Second edn. Pearson
Google Scholar
Philipp, K.: Statistical Machine Translation. Cambridge University Press, Cambridge (2009)
Google Scholar
Lemmetty, S.: Review of Speech Synthesis Technology. Helsinki University of Technology, Helsinki (1999)
Google Scholar
Woldeyohannis, M.M., Besacier, L., Meshesha, M.: Amharic speech recognition for speech translation. In: JEP-TALN-RECITAL 2016. Paris, France, vol. 11, p. 114 (2016)
Google Scholar
Teshome, M.G., Besacier, L., Taye, G., Teferi, D.: Phoneme-based English-Amharic statistical machine translation. Presented at the AFRICON, 2015, Addis Ababa, Ethiopia, pp. 1–5 (2015)
Google Scholar
Anberbir, T., Takara, T.: Development of an Amharic text-to-speech system using cepstral method. In: Proceedings of the EACL 2009 Workshop on Language Technologies for African Languages (AfLaT 2009), Athens, Greece, pp. 46-52 (2009)
Google Scholar
Ethiopia: A Preferred Location for Foreign Direct Investment in Africa, Ethiopian Investment Commision (2014). http://www.investethiopia.gov.et/images/pdf/Investment_Brochure_to_Ethiopia.pdf
Ethiopian Embassy: Investing in ethiopia: Briefing for tour operators. http://www.ethiopianembassy.org/PDF/InvestingTourism.pdf. Accessed 10 Aug 2017
UNWTO: World Tourism Organization annual report 2015. Technical report, United Nation, Madrid, Spain (2016)
Google Scholar
Simons, G.F., Fennig, C.D.: Ethnologue: Languages of the World. SIL, Dallas (2017)
Google Scholar
Teklehaimanot, T.: Ethiopian Languages. http://www.ethiopiantreasures.co.uk/pages/language.htm. Accessed 10 Aug 2017
Abate, S.T., Menzel, W., Tafila, B.: An Amharic speech corpus for large vocabulary continuous speech recognition. In: Proceedings of the XVth International Conference of Ethiopian Studies, Hamburg, Germany (2005)
Google Scholar
Yimam, B.: Yeamarigna sewasew (Amharic version). Addis Ababa, Ethiopia, EMPDA (1986)
Google Scholar
Fondazione Bruno Kessler: International Workshop on Spoken Language Translation, Paris, France, pp. 2–3 (2010)
Google Scholar
Davel, M.H., Badenhorst, J., Basson, W.D., De Wet, F., Barnard, E., De Waal, A., De Vries, N.J.: A smartphone-based ASR data collection tool for under-resourced languages, vol. 56, pp. 119–131 (2014)
Google Scholar
Blachon, D., Gauthier, E., Besacier, L., Kouarata, G.-N., Adda-Decker, M., Rialland, A.: Parallel speech collection for under-resourced language studies using the Lig-Aikuma mobile device app. In: SLTU (2016)
Google Scholar

Download references

Author information

Authors and Affiliations

Addis Ababa University, Addis Ababa, Ethiopia
Michael Melese Woldeyohannis & Million Meshesha
LIG Laboratory, UJF, BP53, 38041, Grenoble Cedex 9, France
Laurent Besacier

Authors

Michael Melese Woldeyohannis
View author publications
You can also search for this author in PubMed Google Scholar
Laurent Besacier
View author publications
You can also search for this author in PubMed Google Scholar
Million Meshesha
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Michael Melese Woldeyohannis .

Editor information

Editors and Affiliations

Council for Scientific and Industrial Research, Pretoria, South Africa
Fisseha Mekuria
Information Technology, University of Turku, Turku, Finland
Ethiopia Enideg Nigussie
Dresden University of Technology, Dresden, Germany
Waltenegus Dargie
Aalto University, Helsinki, Finland
Mutafugwa Edward
Bahir Dar Institute of Technology, Bahir Dar, Ethiopia
Tesfa Tegegne

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Woldeyohannis, M.M., Besacier, L., Meshesha, M. (2018). A Corpus for Amharic-English Speech Translation: The Case of Tourism Domain. In: Mekuria, F., Nigussie, E., Dargie, W., Edward, M., Tegegne, T. (eds) Information and Communication Technology for Development for Africa. ICT4DA 2017. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 244. Springer, Cham. https://doi.org/10.1007/978-3-319-95153-9_12

Download citation

DOI: https://doi.org/10.1007/978-3-319-95153-9_12
Published: 05 July 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-95152-2
Online ISBN: 978-3-319-95153-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics