UmobiTalk: Ubiquitous Mobile Speech Based Translator for Sesotho Language

Nyetanyane, John; Masinde, Muthoni

doi:10.1007/978-3-030-05198-3_9

Conference paper
First Online: 14 December 2018

Part of the book series: Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering (LNICST, volume 260)

Abstract

The need to conserve under-resourced languages is becoming more urgent as some of them are becoming extinct; natural language processing can help redress this. Currently, most language processing initiatives focus on Western languages such as English and French, even though resources for these languages are already available. Sesotho is one of the under-resourced Bantu languages; it is mostly spoken in the Free State province of South Africa and in Lesotho. Like other parts of South Africa, the Free State has received a high number of non-Sesotho-speaking migrants from neighboring provinces and countries. These migrants face serious language barriers, especially in informal settlements where residents tend to speak only Sesotho. As a solution, we developed a parallel corpus with English as the source language and Sesotho as the target language and packaged it in UmobiTalk, a ubiquitous mobile speech-based learning translator. UmobiTalk is a mobile tool that helps English speakers learn Sesotho. Its development combined automatic speech recognition, machine translation and speech synthesis. The application will also serve as an analysis tool for testing the accuracy and speed of the corpus. In this paper, we present the development, testing and evaluation of UmobiTalk.
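
The three-stage combination named in the abstract (speech recognition, corpus-based translation, speech synthesis) can be pictured with a minimal sketch. It is not the authors' implementation: the function names, the exact-match lookup, and the two sample corpus entries are assumptions made purely for illustration, and the recognizer and synthesizer are passed in as placeholder callables rather than tied to any real speech engine.

```python
from typing import Callable, Dict, Optional

# Illustrative sample entries only; NOT taken from the UmobiTalk corpus.
PARALLEL_CORPUS: Dict[str, str] = {
    "hello": "lumela",
    "thank you": "kea leboha",
}


def normalize(text: str) -> str:
    """Lower-case, trim, and collapse whitespace so lookups tolerate ASR noise."""
    return " ".join(text.lower().split())


def translate(english: str,
              corpus: Dict[str, str] = PARALLEL_CORPUS) -> Optional[str]:
    """Return the Sesotho target for an English source phrase, or None if absent."""
    return corpus.get(normalize(english))


def speech_to_speech(audio: bytes,
                     recognize: Callable[[bytes], str],
                     synthesize: Callable[[str], bytes],
                     corpus: Dict[str, str] = PARALLEL_CORPUS) -> Optional[bytes]:
    """Chain the three stages: ASR -> corpus lookup -> speech synthesis.

    `recognize` and `synthesize` are hypothetical stand-ins for whatever
    speech engines a real application would plug in.
    """
    english = recognize(audio)              # stage 1: speech to English text
    sesotho = translate(english, corpus)    # stage 2: English to Sesotho
    if sesotho is None:                     # phrase not covered by the corpus
        return None
    return synthesize(sesotho)              # stage 3: Sesotho text to speech
```

Keeping the lookup separate from the speech stages means the corpus can be tested for accuracy and speed on text alone, which matches the abstract's stated goal of using the application as an analysis tool for the corpus.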

Author information

Authors and Affiliations

Unit for Research on Informatics for Drought, Central University of Technology, Free State, Bloemfontein, South Africa
John Nyetanyane & Muthoni Masinde

Authors

John Nyetanyane
Muthoni Masinde

Corresponding author

Correspondence to Muthoni Masinde.

Editor information

Editors and Affiliations

ECE Paris, VEDECOM, Paris, France
Rafik Zitouni
ECE Paris, Paris, France
Max Agueh

About this paper

Cite this paper

Nyetanyane, J., Masinde, M. (2019). UmobiTalk: Ubiquitous Mobile Speech Based Translator for Sesotho Language. In: Zitouni, R., Agueh, M. (eds) Emerging Technologies for Developing Countries. AFRICATEK 2018. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 260. Springer, Cham. https://doi.org/10.1007/978-3-030-05198-3_9

DOI: https://doi.org/10.1007/978-3-030-05198-3_9
Published: 14 December 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-05197-6
Online ISBN: 978-3-030-05198-3
eBook Packages: Computer Science (R0)
