Conferences >2017 Ninth International Conf...

An accurate HSMM-based system for Arabic phonemes recognition

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

The majority of successful automatic speech recognition (ASR) systems utilize a probabilistic modeling of the speech signal via hidden Markov models (HMMs). In a standard...Show More

Metadata

Abstract:

The majority of successful automatic speech recognition (ASR) systems utilize a probabilistic modeling of the speech signal via hidden Markov models (HMMs). In a standard HMM model, state duration probabilities decrease exponentially with time, which fails to satisfactorily describe the temporal structure of speech. Incorporating explicit state durational probability distribution functions (pdf) into the HMM is a famous solution to overcome this feebleness. This way is well-known as a hidden semi-Markov model (HSMM). Previous papers have confirmed that using HSMM models instead of the standard HMMs have enhanced the recognition accuracy in many targeted languages. This paper addresses an important stage of our on-going work which aims to construct an accurate Arabic recognizer for teaching and learning purposes. It presents an implementation of an HSMM model whose principal goal is improving the classical HMM's durational behavior. In this implementation, the Gaussian distribution is used for modeling state durations. Experiments have been carried out on a particular Arabic speech corpus collected from recitations of the Holy Quran. Results show an increase in recognition accuracy by around 1% We confirmed via these results that such a system outperforms the baseline HTK when the Gaussian distribution is integrated into the HTK's recognizer back-end.

Published in: 2017 Ninth International Conference on Advanced Computational Intelligence (ICACI)

Date of Conference: 04-06 February 2017

Date Added to IEEE Xplore: 13 July 2017

ISBN Information:

DOI: 10.1109/ICACI.2017.7974511

Conference Location: Doha, Qatar