Skip to main content
Log in

Energy bands and spectral cues for Arabic vowels recognition

  • Published:
International Journal of Speech Technology Aims and scope Submit manuscript

Abstract

The present study examines the short and long Arabic vowels (/a/, /a:/, /i/, /i:/, /u/ and /u:/) with a new approach based on three methods: formant frequencies extraction, spectral moments and energy bands. Among Arabic language characteristics compared to other languages are long vowels which can be pronounced with different duration length. The formant frequencies are the most exploited in characterizing vowels in different languages nevertheless using only formants was not very significant for vowels identification especially when production duration augments. Therefore, our approach is to broaden previous studies and present new tools in order to characterize long vowels compared to short ones.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  • Alghamdi, M. M. (1998). A spectrographic analysis of Arabic vowels: A cross-dialectal study. Journal of King Saud University, 10, 3–24.

    Google Scholar 

  • Alotaibi, Y. & Hussain, A. (2009). Speech recognition system and formant based analysis of spoken Arabic vowels. In Proceedings of First International Conference, December, FGIT, Jeju Island, Korea (pp. 10–12).

  • Anani, M. (1999). Arabic vowel formant frequencies. In Proceedings of the 14th International Congress of Phonetic Sciences (Vol. 9), San Francisco, CA.

  • Cooper, F. S., Liberman, A. M., & Borst, J. M. (1951). The interconversion of audible and visible patterns as a basis for research in the perception of speech. Proceedings of the National Academy of Sciences Washington, 37, 318325.

    Article  Google Scholar 

  • Fant, G. (1960). Accoustic theory of speech production: with calculations based on X-ray studies of Russian articulations. Gravenhage: Mouton Co.

    Google Scholar 

  • Feng, Y., Grace, J. H., Steve, A. X., & Max, L. (2011). Detecting anticipatory effects in speech articulation by means of spectral coefficient analyses. Speech Communication, 53, 842–854.

    Article  Google Scholar 

  • Hafiz, R., Awais, M., Masud, S., & Shamail, S. (2008). On vowels segmentation and identification using formant transitions in continuous recitation of Quranic Arabic. Study of Computer Intelligence, 134, 155–162.

    Google Scholar 

  • Hillenbrand, J., Getty, L. A., Clark, M. J., & Wheeler, K. (1995). Acoustic characteristics of American English vowels. Journal of the Acoustic Society of America, 97, 3099–3111.

    Article  Google Scholar 

  • Ladefoged, P. (1962). Elements of acoustic phonetics. Chicago - London: The University of Chicago Press.

    Google Scholar 

  • Ladefoged, P. (1985). The phonetic basis for computer speech processing. In F. Fall & W. A. Woods (Eds.), computer speech processing (pp. 3–27). London: Prentice Hall.

    Google Scholar 

  • Linville, S. E., & Rens, J. (2001). Vocal tract resonance analysis of aging voice using long-term average spectra. Journal of Voice, 15(3), 323–330.

    Article  Google Scholar 

  • Milenkovic, P., & Forrest, K. (1988). Classification of vowels using spectrum moments. Journal of the Acoustical Society of America, 83(s1), s67.

    Article  Google Scholar 

  • Natour, Y. S., Basem, Y., Mawada, A. S., & Tadros, Y. K. (2010). Formant frequency characteristics in normal Arabic speaking Jordanians. Journal of Voice, 25, 75–84. doi:10.1016/j.jvoice.

    Article  Google Scholar 

  • Pentti K. (2015). Spectral moments analysis of/s/coarticulation development in Finnish-speaking children. In ICPhS Proceedings of the 18th International Congress of Phonetic Sciences, ISBN 978-0-85261-941-4.

  • Sakayori, S., Kitama, T., Chimoto, S., Qin, L., & Sato, Y. (2002). Critical spectral regions for vowel identification”. Neuroscience Research, 43, 155–162.

    Article  Google Scholar 

  • Savela, J. (2009). Role of selected spectral attributes in the perception of synthetic vowels. Ph.D. thesis, University of Turku.

  • Savela, J., Ojala, S., Aaltonen, O., & Salakoski, T. (2007). Role of different spectral attributes in vowel categorization: the case of Udmurt. In NODALIDA Conference Proceedings (pp. 384–388).

  • Stevens, K. N., & House, A. S. (1955). Development of a quantitative description of vowel articulation. Journal of the Acoustical Society of America, 27, 484–493.

    Article  Google Scholar 

  • Thyer, N., Hickson, L., & Dodd, B. (2000). The perceptual magnet effect in Australian English vowels. Perception and Psychophysics, 62(1), 1–20.

    Article  Google Scholar 

  • Tsukada, K. (2009). An Acoustic Comparison of Vowel Length Contrasts in Standard Arabic, Japanese and Thai. IEEE International Conference on Asian Languages Processing. doi:10.1109/IALP.2009.25.

    Google Scholar 

  • Xue, S., Hao, G., & Mayo, R. (2006). Volumetric measurements of vocal tracts for male speakers from different races. Clinical Linguistics & Phonetics, 20, 691–702.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Karim Tahiry.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Tahiry, K., Mounir, B., Mounir, I. et al. Energy bands and spectral cues for Arabic vowels recognition. Int J Speech Technol 19, 707–716 (2016). https://doi.org/10.1007/s10772-016-9363-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10772-016-9363-3

Keywords

Navigation