Abstract:
Most of information retrieval (IR) systems for Qur'an use text as their input query, whether they use the Alphabetic script or the Arabic script to represent the query. T...Show MoreMetadata
Abstract:
Most of information retrieval (IR) systems for Qur'an use text as their input query, whether they use the Alphabetic script or the Arabic script to represent the query. Thus, required IR user to know how to write the query. For searching the Qur'an verses, it is possible that IR user knows how to pronounce the query, but does not have enough knowledge about how to write Arabic letters to represent the query when search for a Qur'an verse. In this case, speech can be an alternative as the input to the IR system. In this work, we develop a spoken query IR based on the Hidden Markov Model acoustic models and the n- gram language model for its automatic speech recognition system. Both models are trained by using all verses of the Qur'an. The Inference Network Model and the well-known Vector Space Model are employed for its IR system. For the speech recognition system, average of word error rate are 7.41% for closed speakers, and 18.53% for open speakers. For the IR system, the best query formulation for the Inference Network is achieved by using input queries consisting of phrase of 2 words with the average value of Mean Reciprocal Rank is 0,922475, while for the Vector Space Model is achieved by using input query consisting of one word with the average value of Mean Reciprocal Rank is 0,9308.
Date of Conference: 01-03 November 2017
Date Added to IEEE Xplore: 14 June 2018
ISBN Information:
Electronic ISSN: 2472-7695