skip to main content
10.1145/3193063.3193071acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiciitConference Proceedingsconference-collections
research-article

Myanmar Continuous Speech Recognition System Using Fuzzy Logic Classification in Speech Segmentation

Authors Info & Claims
Published:26 February 2018Publication History

ABSTRACT

Speech recognition is one of the next generation technologies for human-computer interaction. Automatic Speech Recognition (ASR) is a technology that allows a computer to recognize the words spoken by a person through telephone, microphone or other devices. The various stages of the speech recognition system are pre-processing, segmentation of speech signal, feature extraction of speech and recognition of word. Among many speech recognition systems, continuous speech recognition system is very important and most popular system. This paper proposes the time-domain features and frequency-domain features based on fuzzy knowledge for continuous speech segmentation task via a nonlinear speech analysis. Short-time Energy and Zero-crossing Rate are time-domain features, and Spectral Centroid is frequency-domain feature that the system will calculate in each point of speech signal in order to exploit relevant information for generating the significant segments. Fuzzy Logic technique will be used not only to fuzzify the calculated features into three complementary sets namely: low, middle, high but also to perform a matching phase using a set of fuzzy rules. The outputs of the Fuzzy Logic are phonemes, syllables and disyllables of Myanmar Language. The result of the system will recognize the continuous words of input speech.

References

  1. T. T. Thet, J. Na and W. K. Ko, "Word Segmentation for the Myanmar Language", Journal of Information Science, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. T. M. Tun and K. T. Lynn, "Myanmar Continuous Speech to Isolated Word Segmentation", Engineering and Technology, IJSRSET, Issue 2, Volume 1, 2015.Google ScholarGoogle Scholar
  3. Haykin, S (2001), Minimum mean square error adaptive filter. In Adaptive Filter Theory, 4th ed. Prentice Hall, Upper Saddle River, 183--228.Google ScholarGoogle Scholar
  4. M. M. Rahman and M. A. Bhuiyan, "Continuous Bangla Speech Segmentation using short time Speech Features Extraction Approaches., IJACSA, Volume 3, No. 11, 2012.Google ScholarGoogle Scholar
  5. T. Zhang and J. C. C. Kuo, "Hierarchical classification of audio data for archiving and retrieving", In International Conference on Acoustics, Speech and Signal Processing, volume VI, pages 3001--3004. IEEE, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. L R Rabiner and M R Sambur, "An Algorithm for determining the endpoints of Isolated Utterances", The Bell System Technical Journal, February 1975, pp 298--315.Google ScholarGoogle Scholar
  7. T Giannakopoulos, "Study and application of acoustic information for the detection of harmful content and fusion with visual information" Ph.D. dissertation, Dept. of Informatics and Telecommunications, University of Athens, Greece, 2009.Google ScholarGoogle Scholar
  8. Vimala, C., Radha, V., A review on speech recognition challenges and approaches", World Computer. Sci. Inf. Technol., 2012, 2, (1), pp. 1--7.Google ScholarGoogle Scholar
  9. Mr. Sridhar Chandramohan Iyer, Speaker Recognition System using Coefficients and Correlation Approaches in MATLAB, IJERT, Vol. 3 Issue 5, May 2014.Google ScholarGoogle Scholar

Index Terms

  1. Myanmar Continuous Speech Recognition System Using Fuzzy Logic Classification in Speech Segmentation

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      ICIIT '18: Proceedings of the 2018 International Conference on Intelligent Information Technology
      February 2018
      76 pages
      ISBN:9781450363785
      DOI:10.1145/3193063

      Copyright © 2018 ACM

      © 2018 Association for Computing Machinery. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of the United States government. As such, the United States Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 26 February 2018

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader