skip to main content
10.1145/2457276.2457280acmotherconferencesArticle/Chapter ViewAbstractPublication PagesmedesConference Proceedingsconference-collections
research-article

Modeling improved syllabification algorithm for Amharic

Published:28 October 2012Publication History

ABSTRACT

In this paper, a rule-based automatic syllabification Algorithm for Amharic language using linguistic implementation notions is designed following the Maximal Onset and Sonority Hierarchy principles. Amharic is a syllabic language in which every grapheme represents consonant-vowel assimilation. However, while reading a text in Amharic, all the CV syllables are not uttered as expected and hence the syllables in the text are not the CV sequence seen in the grapheme sequence. Epenthesis and gemination are also major challenges in Amharic grapheme-to-phoneme conversion because of the failure of Amharic orthography to show epenthetic vowel and geminated consonants. This limits the performance of many Amharic speech systems (such as Text-To-Speech and Automatic Speech Recognition) and other natural language applications. After a thorough study of the syllable structure, identification of linguistic syllabification rules and a survey of the relevant literature, a set of rules were identified and used to design a syllabification algorithm. The system was implemented and tested. The experiment was conducted using carefully selected Amharic words. The system exhibited a 98.1% word accuracy rate with very high sensitivity to epenthesis.

References

  1. Tian, J. 2004. Data-Driven Approaches for Automatic Detection of Syllable Boundaries. Nokia Research Center, Tampere, Finland.Google ScholarGoogle Scholar
  2. Bartlett, S., Kondrak, G., and Cherry, C. 2009. On the syllabication of phonemes. In Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics. Colorado, USA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Jurafsky, D., and Martin, J. H. 2006. Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. New Jersey: Prentice Hall. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Marchand, Y., Adsett, C. R., and Damper, R. I. 2009. Automatic Syllabification in English: A Comparison of Different Algorithms. Language and Speech, 52, (1), 1--27.Google ScholarGoogle ScholarCross RefCross Ref
  5. Sebsbie, H., Kishore, S. P., Black, A. W., Kumar, R., and Sangal, R. 2004. Unit Selection, IPA Equivalence Voice for Amharic Using Festvox, In 5th Speech Synthesis Workshop. Pittsburgh, USA.Google ScholarGoogle Scholar
  6. Mulugeta, S. 2001. The syllable Structure and Syllablification in Amharic. Masters of philosophy in general linguistic thesis. Department of Linguistics, Trondheim, Norway.Google ScholarGoogle Scholar
  7. Aster, T. 1981. The syllable structure of Amharic and syllabification of Medial Consonant Clusters and Geminates. B. A thesis in Linguistics, Addis Ababa University, Addis Ababa, Ethiopia.Google ScholarGoogle Scholar
  8. Amsalu, A. 2004. Amharic-English Dictionary, Second edition. Mega Publishing Enterprise, Addis Ababa, Ethiopia.Google ScholarGoogle Scholar
  9. Jeppe, B., Daniela, B., João, N., Miguel Sales, D., and Luis, C. 2009. Automatic syllabification for Danish Text-to-speech Systems. In proceedings of ISCA. Brighton, UK.Google ScholarGoogle Scholar
  10. Bigi, B., Meunier, C., Nesterenko, I., and Bertrand, R. 2009. Automatic detection of syllable boundaries in spontaneous speech. In proceedings of the International Conference on Language Resources. Valletta, Malta.Google ScholarGoogle Scholar
  11. Bartlett, S., Kondrak, G., and Cherry, C. 2008. Automatic syllabification with structured SVMs for letter-to-phoneme conversion. In Proceedings of Human Language Technologies: The 2008 Annual Conference of the North American Chapter of the Association for Computational Linguistics. Columbus, Ohio, USA.Google ScholarGoogle Scholar
  12. Bartlett, S., Kondrak, G., and Cherry, C. 2009. On the syllabification of phonemes. In Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics. Colorado, USA. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Modeling improved syllabification algorithm for Amharic

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        MEDES '12: Proceedings of the International Conference on Management of Emergent Digital EcoSystems
        October 2012
        199 pages
        ISBN:9781450317559
        DOI:10.1145/2457276
        • General Chair:
        • Janusz Kacprzyk,
        • Program Chair:
        • Dominique Laurent,
        • Publications Chair:
        • Richard Chbeir

        Copyright © 2012 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 28 October 2012

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        MEDES '12 Paper Acceptance Rate16of50submissions,32%Overall Acceptance Rate267of682submissions,39%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader