Abstract
The need to conserve under-resourced languages is becoming more urgent as some of them are becoming extinct; natural language processing can help redress this. Currently, most language processing initiatives focus on Western languages such as English and French, even though resources for these languages are already available. Sesotho is one of the under-resourced Bantu languages; it is spoken mainly in the Free State province of South Africa and in Lesotho. Like other parts of South Africa, the Free State has received a large number of non-Sesotho-speaking migrants from neighboring provinces and countries. These migrants face serious language barriers, especially in informal settlements, where residents tend to speak only Sesotho. To address this, we developed a parallel corpus with English as the source language and Sesotho as the target language, and packaged it in UmobiTalk (Ubiquitous mobile speech based learning translator), a mobile tool that helps English speakers learn Sesotho. The tool combines automatic speech recognition, machine translation and speech synthesis. The application will also be used as an analysis tool for testing the accuracy and speed of the corpus. In this paper, we present the development, testing and evaluation of UmobiTalk.
Copyright information
© 2019 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
About this paper
Cite this paper
Nyetanyane, J., Masinde, M. (2019). UmobiTalk: Ubiquitous Mobile Speech Based Translator for Sesotho Language. In: Zitouni, R., Agueh, M. (eds) Emerging Technologies for Developing Countries. AFRICATEK 2018. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 260. Springer, Cham. https://doi.org/10.1007/978-3-030-05198-3_9
DOI: https://doi.org/10.1007/978-3-030-05198-3_9
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-05197-6
Online ISBN: 978-3-030-05198-3
eBook Packages: Computer Science, Computer Science (R0)