An accurate HSMM-based system for Arabic phonemes recognition | IEEE Conference Publication | IEEE Xplore

An accurate HSMM-based system for Arabic phonemes recognition


Abstract:

The majority of successful automatic speech recognition (ASR) systems utilize a probabilistic modeling of the speech signal via hidden Markov models (HMMs). In a standard...Show More

Abstract:

The majority of successful automatic speech recognition (ASR) systems utilize a probabilistic modeling of the speech signal via hidden Markov models (HMMs). In a standard HMM model, state duration probabilities decrease exponentially with time, which fails to satisfactorily describe the temporal structure of speech. Incorporating explicit state durational probability distribution functions (pdf) into the HMM is a famous solution to overcome this feebleness. This way is well-known as a hidden semi-Markov model (HSMM). Previous papers have confirmed that using HSMM models instead of the standard HMMs have enhanced the recognition accuracy in many targeted languages. This paper addresses an important stage of our on-going work which aims to construct an accurate Arabic recognizer for teaching and learning purposes. It presents an implementation of an HSMM model whose principal goal is improving the classical HMM's durational behavior. In this implementation, the Gaussian distribution is used for modeling state durations. Experiments have been carried out on a particular Arabic speech corpus collected from recitations of the Holy Quran. Results show an increase in recognition accuracy by around 1% We confirmed via these results that such a system outperforms the baseline HTK when the Gaussian distribution is integrated into the HTK's recognizer back-end.
Date of Conference: 04-06 February 2017
Date Added to IEEE Xplore: 13 July 2017
ISBN Information:
Conference Location: Doha, Qatar

Contact IEEE to Subscribe

References

References is not available for this document.