Skip to main content
Log in

A new hybrid approach for speech synthesis: application to the Arabic language

  • Published:
International Journal of Speech Technology Aims and scope Submit manuscript

Abstract

This research is part of the automatic speech synthesis (ASS) field; it addresses a study on the voice production based on a text written in the Arabic language. Our principal purpose is the design of a new hybrid approach that integrates the advantages of artificial intelligence in the field of ASS using expert systems (ES). We describe the methodology tackled for the approach design, and we present its principal realization steps, which are summarized as follows; (1) the sound base creation based on the elaborated corpus; (2) the linguistic processing, which is responsible for the conversion of the written form of the text to its spoken form; and (3) the acoustic generation corresponding to the pre-acquired Text. The adopted approach is based on a conceptual analysis of the principal steps needed for the design of our speech synthesis ES. Finally, we present the system evaluation report and we explain the obtained results.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

References

  • Biletskiy, Y., & Girish, R. (2010). Identification and resolution of conflicts during ontological integration using rules. Expert Systems, 27, 75–89.

    Article  Google Scholar 

  • Boersma, P., & Weenink, D. (2013) PRAAT: Doing phonetics by computer. Amsterdam: Phonetic Sciences, University of Amsterdam.

    Google Scholar 

  • Cadic, D. (2011). Optimization of the process of creation of voices in synthesis by selection. Doctoral Thesis, Physical Specialty—Doctoral School, Sciences and Technologies of Information of Telecommunications and Systems, University of Paris SUD11.

  • Demeke, Y. (2011). Duration modeling of phonemes for Amharic text to speech system. M.Sc. thesis, Faculty of Informatics, Addis Ababa University, Ethiopia.

  • Demner-Fushman, D., Chapman, W., & Mcdonald, C. (2009). What can natural language processing do for clinical decision support? Journal of Biomedical Informatics, 42, 760–772.

    Article  Google Scholar 

  • Dragicevic, P. (2004). A model of interaction in input for interactive systems multi-devices highly configurable. A Ph.D. thesis from the University of Nantes, the National College of Industrial Technology and Mines of Nantes, France.

  • Eide, E., Aaron, A., Bakis, R., Hamza, W., Picheny, M., & Pitrelli, J. (2004). A corpus-based approach to expressive speech synthesis. In Proceedings of ISCA SSW5.

  • Elfaki, A. O. (2016). A rule-based approach to detect and prevent inconsistency in the domain-engineering process. Expert Systems, 33, 3–13. https://doi.org/10.1111/exsy.12116.

    Article  Google Scholar 

  • Ferrat, K., & Guerti, M. (2017). An experimental study of the gemination in the Arabic language. Archives of Acoustics, 42(4), 571–578. https://doi.org/10.1515/aoa-2017-0061.

    Article  Google Scholar 

  • George, L. (2008). Artificial intelligence: Structures and strategies for complex problem solving. Harlow: Addison Wesley Longman.

    Google Scholar 

  • Hazem, M., El-Bakry, M. Z., Rashad, & Islam, R., & Isma’il. (2011). Diphone-based concatenative speech synthesis systems for Arabic language. In Proceeding CSECS’11/MECHANICS’11. Proceedings of the 10th WSEAS International Conference on Circuits, Systems, Electronics, Control & Signal Processing, and Proceedings of the 7th WSEAS International Conference on Applied and Theoretical Mechanics, Montreux, Switzerland (pp. 81–86).

  • Hussein, O., Monzer, Q., & Hazim, F. (2009). Framework model for shell expert system. International Journal of Computer Science and Network Security, 9(11), 56 68.

    Google Scholar 

  • Jafri, A., Sobh, I. & Alkhairy, A. (2015). Statistical formant speech synthesis for Arabic. Arabian Journal for Science and Engineering, 40(11), 3151–3159.

    Article  Google Scholar 

  • Jalabneh, A. (2009). Multiple functions of the nunations in Arabic syntax: A minimalist perspective. Dirasat, Human and Social Sciences, 36(3), 687–707.

    Google Scholar 

  • Karima, A., Zakaria, E., & Yamina, T. G. (2005). Arabic text categorization: A Comparative study of different representation modes. Journal of Theoretical and Applied Information Technology, 38, 1–5.

    Google Scholar 

  • Khalil, K. M., & Adnan, C. (2013) Arabic HMM-based speech synthesis. In International Conference on Electrical Engineering and Software Applications (ICEESA), Hammamet, Tunisia (pp 1–5).

  • Kiflu, A., & Beshah, T. (2012). Unit selection based text-to-speech synthesizer for Tigrinya language. HiLCoE Journal of Computer Science and Technology, 1(1), 13–21.

    Google Scholar 

  • Kong, G., Xu, D. L., Liu, X., & Yang, J. B. (2009). Applying a belief rule-based inference methodology to a guideline-based clinical decision support system. Expert System, 26(5), 391–408.

    Article  Google Scholar 

  • Liao, S. H. (2005). Expert system methodologies and applications—a decade review from 1995 to 2004. Expert Systems with Applications, 28(1), 93–103.

    Article  Google Scholar 

  • Monedero, I., Leo´, C., Denda, N. R., & Luque, J. (2009). Datacab: A geographical-information-system-based expert system for the design of cable networks. Expert Systems, 25, 335–348.

    Article  Google Scholar 

  • Norlela, I., & Amiruddin, I., & Riza, A. (2009). An overview of expert systems in pavement management. European Journal of Scientific Research, 30(1), 99–111.

    Google Scholar 

  • Rebai, I. & Benayed, Y. (2015). Text-to-speech synthesis system with Arabic diacritic recognition system. Computer Speech & Language, 34(1), 43–60.

    Article  Google Scholar 

  • Sebsibe, H., Mariam, S. P., Kishore, A. W., Black, R., Kumar, & Sangal, R. (2005). Unit selection voice for amharic using festivox. In 5th ISCA Speech Synthesis Workshop, Pittsburgh (pp. 103–107).

  • Sproat, R. (1996) Multilingual text analysis for text-to-speech synthesis. In Proceedings of ICSLP ’96 (Vol. 3).

  • Tatham, M., & Morton, K.(2005). Developments in speech synthesis. Chichester: Wiley.

    Book  Google Scholar 

  • Wielemaker, J. (2013) SWI-Prolog (Version 6.6.1), free software, Amsterdam.

  • Yang, D., Miao, R., Wu, H., & Zhou, Y. (2009). Product configuration knowledge modeling using ontology web language. Expert Systems with Applications, 36, 4399–4411.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hanane Tebbi.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Tebbi, H., Hamadouche, M. & Azzoune, H. A new hybrid approach for speech synthesis: application to the Arabic language. Int J Speech Technol 22, 629–637 (2019). https://doi.org/10.1007/s10772-018-9499-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10772-018-9499-4

Keywords

Navigation