Skip to main content

UmobiTalk: Ubiquitous Mobile Speech Based Translator for Sesotho Language

  • Conference paper
  • First Online:
  • 467 Accesses

Abstract

The need to conserve the under-resourced languages is becoming more urgent as some of them are becoming extinct; natural language processing can be used to redress this. Currently, most initiatives around language processing technologies are focusing on western languages such as English and French, yet resources for such languages are already available. Sesotho language is one of the under-resourced Bantu languages; it is mostly spoken in Free State province of South Africa and in Lesotho. Like other parts of South Africa, Free State has experienced a high number of non-Sesotho speaking migrants from neighboring provinces and countries. Such people are faced with serious language barrier problems especially in the informal settlements where everyone tends to speak only Sesotho. As a solution to this, we developed a parallel corpus that has English as a source and Sesotho as a target language and packaged it in UmobiTalk - Ubiquitous mobile speech based learning translator. UmobiTalk is a mobile-based tool for learning Sesotho for English speakers. The development of this tool was based on the combination of automatic speech recognition, machine translation and speech synthesis. This application will be used as an analysis tool for testing accuracy and speed of the corpus. We present the development, testing and evaluation of UmobiTalk in this paper.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   44.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   60.00
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Alumäe, T., Kalijurnd, K.: Open and extendable speech recognition application architecture for mobile environments. In: Spoken Language Technologies for Under-Resourced Languages (2012)

    Google Scholar 

  2. Anthony, L.: AntConc: design and development of a freeware corpus analysis toolkit for the technical writing classroom. In: Proceedings International Professional Communication Conference. IEEE (2005)

    Google Scholar 

  3. Anodo, T.: Open source spelling checker for Kimiiru language. Doctoral dissertation, University of Nairobi (2013)

    Google Scholar 

  4. Anusuya, M., Katti, S.: Speech recognition by machine, a review. arXiv preprint arXiv:1001.2267 (2010)

  5. Barnard, E., Davel, M., Van Heerden, C.: ASR corpus design for resource-scarce languages. In: ISCA (2009)

    Google Scholar 

  6. Besacier, L., et al.: Automatic speech recognition for under-resourced languages: a survey. Speech Commun. 56, 85–100 (2014)

    Article  Google Scholar 

  7. Chen, L., et al.: VACE multimodal meeting corpus. In: Renals, S., Bengio, S. (eds.) MLMI 2005. LNCS, vol. 3869, pp. 40–51. Springer, Heidelberg (2006). https://doi.org/10.1007/11677482_4

    Chapter  Google Scholar 

  8. Collins, N.: The UbuWeb electronic music corpus: an MIR investigation of a historical database. Organised Sound 20(1), 122–134 (2015)

    Article  Google Scholar 

  9. Demuth, K.: Accessing functional categories in Sesotho: interactions at the morpho-syntax interface. In: Meisel, J.M. (ed.) The Acquisition of Verb Placement. SITP, vol. 16, pp. 83–107. Springer, Dordrecht (1992). https://doi.org/10.1007/978-94-011-2803-2_4

    Chapter  Google Scholar 

  10. Devlin, J., et al.: Fast and robust neural network joint models for statistical machine translation. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), vol. 1 (2014)

    Google Scholar 

  11. Duarte, T., et al.: Speech recognition for voice-based machine translation. IEEE Softw. 31(1), 26–31 (2014)

    Article  Google Scholar 

  12. Engelbrecht, H., Schultz, T.: Rapid development of an Afrikaans-English speech-to-speech translator. In: International Workshop on Spoken Language Translation (IWSLT) (2005)

    Google Scholar 

  13. Arshavskaya, E.: The Routledge Handbook of Language Learning and Technology, pp. 103–106 (2018)

    Article  Google Scholar 

  14. Graën, J., Batinic, D., Volk, M.: Cleaning the Europarl corpus for linguistic applications (2014)

    Google Scholar 

  15. Green, S., Heer, J., Manning, C.: The efficacy of human post-editing for language translation. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. ACM (2013)

    Google Scholar 

  16. Gries, S.T., Berez, A.L.: Linguistic annotation in/for corpus linguistics. In: Ide, N., Pustejovsky, J. (eds.) Handbook of Linguistic Annotation, pp. 379–409. Springer, Dordrecht (2017). https://doi.org/10.1007/978-94-024-0881-2_15

    Chapter  Google Scholar 

  17. Guma, S.: An Outline Structure of Southern Sotho. Shuter and Shooter, South Africa (1971)

    Google Scholar 

  18. Hammo, B., et al.: Exploring and exploiting a historical corpus for Arabic. Lang. Resour. Eval. 50(4), 839–861 (2016)

    Article  Google Scholar 

  19. Hutchins, J.: From first conception to first demonstration: the nascent years of machine translation: 1947–1954. A chronology. Mach. Transl. 12(3), 195–252 (1997)

    Google Scholar 

  20. Hyman, P.: Speech-to-speech translations stutter, but researchers see mellifluous future. Commun. ACM 57(4), 16–19 (2014)

    Article  Google Scholar 

  21. Imseng, D., et al.: Impact of deep MLP architecture on different acoustic modeling techniques for under-resourced speech recognition. In: 2013 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU). IEEE (2013)

    Google Scholar 

  22. Jakubíček, M., et al.: The ten ten corpus family. In: 7th International Corpus Linguistics Conference CL (2013)

    Google Scholar 

  23. Kennedy, G.: An Introduction to Corpus Linguistics. Routledge, Abingdon (2014)

    Book  Google Scholar 

  24. Kumar, A., et al.: Rethinking speech recognition on mobile devices (2011)

    Google Scholar 

  25. Latham, B.: Sampling: What is it: Quantitative research methods 5377 (2007)

    Google Scholar 

  26. Nakazawa, T., Kurohashi, S., Kobayashi, H., Ishikawa, H., Sassano, M.: 3-step parallel corpus cleaning using monolingual crowd workers. In: Hasida, K., Purwarianti, A. (eds.) Computational Linguistics. CCIS, vol. 593, pp. 79–93. Springer, Singapore (2016). https://doi.org/10.1007/978-981-10-0515-2_6

    Chapter  Google Scholar 

  27. Niño, A.: Machine translation in foreign language learning: language learners’ and tutors’ perceptions of its advantages and disadvantages. ReCALL 21(2), 241–258 (2009)

    Article  Google Scholar 

  28. Paulussen, H., et al.: Dutch parallel corpus: a balanced parallel corpus for Dutch-English and Dutch-French. In: Spyns, P., Odijk, J. (eds.) Essential Speech and Language Technology for Dutch. NLP, pp. 185–199. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-30910-6_11

    Chapter  Google Scholar 

  29. Reddy, R., Mahender, E.: Speech to text conversion using android platform. Int. J. Eng. Res. Appl. (IJERA) 3(1), 253–258 (2013)

    Google Scholar 

  30. Sibanda, O.: Social ties and the dynamics of integration in the city of Johannesburg among Zimbabwe migrants. J. Sociol. Soc. Anthropol. 1(1-2), 47–57 (2010)

    Article  Google Scholar 

  31. Statistics South Africa. Population Census (2011). [dataset]. http://www.statssa.gov.za/census2011/Products/Census_2011_Census_in_brief.pdf

  32. Sundermeyer, M., et al.: Translation modeling with bidirectional recurrent neural networks. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP) (2014)

    Google Scholar 

  33. Thenthiruperai, B., Kates, J., Miller, K.: Use of speech recognition engine to track and manage live call center calls: U.S. Patent No. 8,130,937, 6 March (2012)

    Google Scholar 

  34. Tian, L., et al.: UM-corpus: a large English-Chinese parallel corpus for statistical machine translation. In: LREC (2014)

    Google Scholar 

  35. Yang, L., Zhang, D., Tang, Y.: The realization of food corpus based on database technology. J. Simul. 2(6), 327–330 (2014)

    Google Scholar 

  36. Vaughan, E., O’Keefe, A.: Corpus analysis. In: The International Encyclopedia of Language and Social Interaction (2015)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Muthoni Masinde .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Nyetanyane, J., Masinde, M. (2019). UmobiTalk: Ubiquitous Mobile Speech Based Translator for Sesotho Language. In: Zitouni, R., Agueh, M. (eds) Emerging Technologies for Developing Countries. AFRICATEK 2018. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 260. Springer, Cham. https://doi.org/10.1007/978-3-030-05198-3_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-05198-3_9

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-05197-6

  • Online ISBN: 978-3-030-05198-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics