ABSTRACT
This paper presents a rule-based automatic syllabification algorithm for Amharic, designed on linguistic grounds following the Maximal Onset and Sonority Hierarchy principles. Amharic is written in a syllabic script in which every grapheme represents a consonant-vowel (CV) combination. When Amharic text is read aloud, however, not every CV syllable is uttered as written, so the spoken syllables do not match the CV sequence suggested by the graphemes. Epenthesis and gemination are further major challenges in Amharic grapheme-to-phoneme conversion, because Amharic orthography marks neither epenthetic vowels nor geminated consonants. These gaps limit the performance of many Amharic speech systems (such as text-to-speech and automatic speech recognition) and of other natural language applications. Following a thorough study of Amharic syllable structure and a survey of the relevant literature, a set of syllabification rules was identified and used to design the algorithm. The system was implemented and tested on carefully selected Amharic words, where it achieved a 98.1% word accuracy rate with very high sensitivity to epenthesis.
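As a rough illustration of the two principles named in the abstract, the following Python sketch splits a phoneme sequence at syllable boundaries. It is not the paper's rule set: the vowel inventory `VOWELS`, the four-level `sonority` scale, the `max_onset` limit, and all function names are assumptions made for the example. The input is assumed to be a phoneme list in which epenthetic vowels and geminates have already been resolved. The `word_accuracy` helper likewise assumes that a word counts as correct only when its whole syllabification matches the reference.

```python
# Minimal sketch of Maximal Onset + Sonority Hierarchy syllabification.
# All inventories and limits below are illustrative assumptions.

VOWELS = {"a", "e", "i", "o", "u"}      # hypothetical vowel inventory
SONORANT_GLIDES = {"r", "l", "w", "y"}  # hypothetical liquids and glides
NASALS = {"m", "n"}                     # hypothetical nasals


def sonority(phoneme):
    """Return an assumed sonority rank (higher means more sonorous)."""
    if phoneme in VOWELS:
        return 4
    if phoneme in SONORANT_GLIDES:
        return 3
    if phoneme in NASALS:
        return 2
    return 1  # everything else is treated as an obstruent


def rising(segment):
    """True if sonority strictly rises across the segment."""
    return all(sonority(a) < sonority(b) for a, b in zip(segment, segment[1:]))


def syllabify(phonemes, max_onset=1):
    """Split a phoneme list into syllables (each a list of phonemes)."""
    nuclei = [i for i, p in enumerate(phonemes) if p in VOWELS]
    if not nuclei:
        return [list(phonemes)]
    boundaries = []
    for prev, nxt in zip(nuclei, nuclei[1:]):
        # Maximal Onset: give the next syllable as many of the
        # intervening consonants as the onset limit allows.
        onset_len = min(max_onset, nxt - (prev + 1))
        # Sonority Hierarchy: shrink the onset until sonority
        # rises toward the nucleus.
        while onset_len > 1 and not rising(phonemes[nxt - onset_len:nxt + 1]):
            onset_len -= 1
        boundaries.append(nxt - onset_len)
    starts = [0] + boundaries
    ends = boundaries + [len(phonemes)]
    return [list(phonemes[s:e]) for s, e in zip(starts, ends)]


def word_accuracy(predicted, gold):
    """Fraction of words whose syllabification matches the reference exactly."""
    correct = sum(1 for p, g in zip(predicted, gold) if p == g)
    return correct / len(gold)
```

With the default single-consonant onset, `syllabify(list("samba"))` returns `[['s', 'a', 'm'], ['b', 'a']]` ("samba" is a toy input, not an Amharic word). Raising `max_onset` to 2 shows the sonority check at work: `"apra"` keeps the rising cluster `pr` as an onset, while `"arpa"` splits between `r` and `p`.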
Index Terms
- Modeling improved syllabification algorithm for Amharic