Abstract
This research is part of the automatic speech synthesis (ASS) field; it addresses a study on the voice production based on a text written in the Arabic language. Our principal purpose is the design of a new hybrid approach that integrates the advantages of artificial intelligence in the field of ASS using expert systems (ES). We describe the methodology tackled for the approach design, and we present its principal realization steps, which are summarized as follows; (1) the sound base creation based on the elaborated corpus; (2) the linguistic processing, which is responsible for the conversion of the written form of the text to its spoken form; and (3) the acoustic generation corresponding to the pre-acquired Text. The adopted approach is based on a conceptual analysis of the principal steps needed for the design of our speech synthesis ES. Finally, we present the system evaluation report and we explain the obtained results.
Similar content being viewed by others
References
Biletskiy, Y., & Girish, R. (2010). Identification and resolution of conflicts during ontological integration using rules. Expert Systems, 27, 75–89.
Boersma, P., & Weenink, D. (2013) PRAAT: Doing phonetics by computer. Amsterdam: Phonetic Sciences, University of Amsterdam.
Cadic, D. (2011). Optimization of the process of creation of voices in synthesis by selection. Doctoral Thesis, Physical Specialty—Doctoral School, Sciences and Technologies of Information of Telecommunications and Systems, University of Paris SUD11.
Demeke, Y. (2011). Duration modeling of phonemes for Amharic text to speech system. M.Sc. thesis, Faculty of Informatics, Addis Ababa University, Ethiopia.
Demner-Fushman, D., Chapman, W., & Mcdonald, C. (2009). What can natural language processing do for clinical decision support? Journal of Biomedical Informatics, 42, 760–772.
Dragicevic, P. (2004). A model of interaction in input for interactive systems multi-devices highly configurable. A Ph.D. thesis from the University of Nantes, the National College of Industrial Technology and Mines of Nantes, France.
Eide, E., Aaron, A., Bakis, R., Hamza, W., Picheny, M., & Pitrelli, J. (2004). A corpus-based approach to expressive speech synthesis. In Proceedings of ISCA SSW5.
Elfaki, A. O. (2016). A rule-based approach to detect and prevent inconsistency in the domain-engineering process. Expert Systems, 33, 3–13. https://doi.org/10.1111/exsy.12116.
Ferrat, K., & Guerti, M. (2017). An experimental study of the gemination in the Arabic language. Archives of Acoustics, 42(4), 571–578. https://doi.org/10.1515/aoa-2017-0061.
George, L. (2008). Artificial intelligence: Structures and strategies for complex problem solving. Harlow: Addison Wesley Longman.
Hazem, M., El-Bakry, M. Z., Rashad, & Islam, R., & Isma’il. (2011). Diphone-based concatenative speech synthesis systems for Arabic language. In Proceeding CSECS’11/MECHANICS’11. Proceedings of the 10th WSEAS International Conference on Circuits, Systems, Electronics, Control & Signal Processing, and Proceedings of the 7th WSEAS International Conference on Applied and Theoretical Mechanics, Montreux, Switzerland (pp. 81–86).
Hussein, O., Monzer, Q., & Hazim, F. (2009). Framework model for shell expert system. International Journal of Computer Science and Network Security, 9(11), 56 68.
Jafri, A., Sobh, I. & Alkhairy, A. (2015). Statistical formant speech synthesis for Arabic. Arabian Journal for Science and Engineering, 40(11), 3151–3159.
Jalabneh, A. (2009). Multiple functions of the nunations in Arabic syntax: A minimalist perspective. Dirasat, Human and Social Sciences, 36(3), 687–707.
Karima, A., Zakaria, E., & Yamina, T. G. (2005). Arabic text categorization: A Comparative study of different representation modes. Journal of Theoretical and Applied Information Technology, 38, 1–5.
Khalil, K. M., & Adnan, C. (2013) Arabic HMM-based speech synthesis. In International Conference on Electrical Engineering and Software Applications (ICEESA), Hammamet, Tunisia (pp 1–5).
Kiflu, A., & Beshah, T. (2012). Unit selection based text-to-speech synthesizer for Tigrinya language. HiLCoE Journal of Computer Science and Technology, 1(1), 13–21.
Kong, G., Xu, D. L., Liu, X., & Yang, J. B. (2009). Applying a belief rule-based inference methodology to a guideline-based clinical decision support system. Expert System, 26(5), 391–408.
Liao, S. H. (2005). Expert system methodologies and applications—a decade review from 1995 to 2004. Expert Systems with Applications, 28(1), 93–103.
Monedero, I., Leo´, C., Denda, N. R., & Luque, J. (2009). Datacab: A geographical-information-system-based expert system for the design of cable networks. Expert Systems, 25, 335–348.
Norlela, I., & Amiruddin, I., & Riza, A. (2009). An overview of expert systems in pavement management. European Journal of Scientific Research, 30(1), 99–111.
Rebai, I. & Benayed, Y. (2015). Text-to-speech synthesis system with Arabic diacritic recognition system. Computer Speech & Language, 34(1), 43–60.
Sebsibe, H., Mariam, S. P., Kishore, A. W., Black, R., Kumar, & Sangal, R. (2005). Unit selection voice for amharic using festivox. In 5th ISCA Speech Synthesis Workshop, Pittsburgh (pp. 103–107).
Sproat, R. (1996) Multilingual text analysis for text-to-speech synthesis. In Proceedings of ICSLP ’96 (Vol. 3).
Tatham, M., & Morton, K.(2005). Developments in speech synthesis. Chichester: Wiley.
Wielemaker, J. (2013) SWI-Prolog (Version 6.6.1), free software, Amsterdam.
Yang, D., Miao, R., Wu, H., & Zhou, Y. (2009). Product configuration knowledge modeling using ontology web language. Expert Systems with Applications, 36, 4399–4411.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Tebbi, H., Hamadouche, M. & Azzoune, H. A new hybrid approach for speech synthesis: application to the Arabic language. Int J Speech Technol 22, 629–637 (2019). https://doi.org/10.1007/s10772-018-9499-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10772-018-9499-4