Phoneme-Based Recognizer to Assist Reading the Holy Quran

Elhadj, Yahya Ould Mohamed; Alghamdi, Mansour; Alkanhal, Mohammad

doi:10.1007/978-3-319-01778-5_15

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 235)

Abstract

This paper presents a new phase of our ongoing effort to build a high-performance, speaker-independent recognizer for Quran recitation. An in-house developed and annotated sound database of about eight hours is used for this purpose. Since this database is segmented and annotated at both the allophone and phoneme levels, we are developing two separate baseline recognizers, one for allophones and one for phonemes. We used the same approach to develop both recognizers so that they can be compared. The Cambridge HTK tools are used for their development. In this paper we present the development of the phoneme-based recognizer and assess its suitability for our ultimate goal of building a high-performance, speaker-independent recognizer to assist in reading and memorizing the Holy Quran; the details of the allophonic recognizer are being published separately. Each Quranic phoneme is modeled by an acoustic Hidden Markov Model (HMM) with three emitting states. A continuous probability distribution, a mixture of 16 Gaussians, is used for each emitting state. The phoneme recognizer achieves an average recognition rate of 92%, which is very promising, compared with 88% for the allophonic recognizer.
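
The model structure described in the abstract (three emitting states per phoneme, each with a 16-component Gaussian mixture) can be illustrated with a minimal sketch. This is not the authors' HTK configuration: the function names, the self-loop probability of 0.6, the 39-dimensional feature vector, the diagonal covariances, and the random parameters are all assumptions made for illustration. The non-emitting entry and exit states follow the usual HTK convention.

```python
import math
import random

NUM_EMITTING_STATES = 3   # from the abstract
NUM_MIXTURES = 16         # from the abstract
FEATURE_DIM = 39          # assumption: the abstract does not state the feature size


def left_to_right_transitions(num_emitting, self_loop=0.6):
    """Transition matrix of a left-to-right HMM with non-emitting entry
    and exit states (num_emitting + 2 states in total)."""
    n = num_emitting + 2
    trans = [[0.0] * n for _ in range(n)]
    trans[0][1] = 1.0                      # entry state moves to the first emitting state
    for s in range(1, n - 1):
        trans[s][s] = self_loop            # stay in the same state
        trans[s][s + 1] = 1.0 - self_loop  # advance to the next state
    return trans                           # last row stays all zeros (exit state)


def log_gmm_density(x, weights, means, variances):
    """Log density of vector x under a diagonal-covariance Gaussian mixture."""
    log_terms = []
    for w, mu, var in zip(weights, means, variances):
        log_norm = -0.5 * sum(math.log(2.0 * math.pi * v) for v in var)
        log_exp = -0.5 * sum((xi - m) ** 2 / v for xi, m, v in zip(x, mu, var))
        log_terms.append(math.log(w) + log_norm + log_exp)
    peak = max(log_terms)                  # log-sum-exp for numerical stability
    return peak + math.log(sum(math.exp(t - peak) for t in log_terms))


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical, untrained parameters for one emitting state.
    weights = [1.0 / NUM_MIXTURES] * NUM_MIXTURES
    means = [[rng.gauss(0.0, 1.0) for _ in range(FEATURE_DIM)]
             for _ in range(NUM_MIXTURES)]
    variances = [[1.0] * FEATURE_DIM for _ in range(NUM_MIXTURES)]
    frame = [rng.gauss(0.0, 1.0) for _ in range(FEATURE_DIM)]

    for row in left_to_right_transitions(NUM_EMITTING_STATES):
        print(row)
    print(log_gmm_density(frame, weights, means, variances))
```

In a trained recognizer the mixture weights, means, and variances would be estimated from the annotated recordings rather than drawn at random; the sketch only shows the shape of the per-state emission model and the left-to-right topology.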

Author information

Authors and Affiliations

Center for Islamic and Arabic Computing, Al-Imam Mohammad Ibn Saud Islamic University, P.O. Box 5701, Riyadh, 11432, Kingdom of Saudi Arabia
Yahya Ould Mohamed Elhadj
Computers and Electronics Research Institute, KACST, Riyadh, Saudi Arabia
Mansour Alghamdi & Mohammad Alkanhal

Corresponding author

Correspondence to Yahya Ould Mohamed Elhadj.

Editor information

Editors and Affiliations

Technopark Campus Trivandrum, Indian Inst. of Information Technology and Management – Kerala (IIITM-K), Kerala, India
Sabu M. Thampi
Machine Intelligence Research Labs (MIR Labs), Auburn, USA
Ajith Abraham
Indian Statistical Institute, Kolkata, India
Sankar Kumar Pal
Department of Computer Science School of Science, University of Salamanca, Salamanca, Spain
Juan Manuel Corchado Rodriguez

Cite this paper

Elhadj, Y.O.M., Alghamdi, M., Alkanhal, M. (2014). Phoneme-Based Recognizer to Assist Reading the Holy Quran. In: Thampi, S., Abraham, A., Pal, S., Rodriguez, J. (eds) Recent Advances in Intelligent Informatics. Advances in Intelligent Systems and Computing, vol 235. Springer, Cham. https://doi.org/10.1007/978-3-319-01778-5_15

DOI: https://doi.org/10.1007/978-3-319-01778-5_15
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-01777-8
Online ISBN: 978-3-319-01778-5
eBook Packages: Engineering, Engineering (R0)
