Embedded Learning Segmentation Approach for Arabic Speech Recognition

Frihia, Hamza; Bahi, Halima

doi:10.1007/978-3-319-45510-5_44

Hamza Frihia¹⁷ &
Halima Bahi¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9924))

Included in the following conference series:

International Conference on Text, Speech, and Dialogue

1725 Accesses
4 Citations

Abstract

Building an Automatic Speech Recognition (ASR) system requires a well segmented and labeled speech corpus (often transcription is made by an expert). These resources are not always available for languages such as Arabic. This paper presents a system for automatic Arabic speech segmentation for speech recognition purpose. State-of-the-art models in ASR systems are the Hidden Markov Models (HMM), so that for the segmentation, we expect the use of embedded learning approach where an alignment between speech segments and HMMs is done iteratively to refine the segmentation. This approach needs the use of transcribed and labelled data, for this purpose, we built a dedicated corpus. Finally, the obtained results are close to those described in the literature and could be improved by handling more Arabic speech specificities.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Bahi, H., Sellami, M.: Combination of vector quantization and hidden Markov models for Arabic speech recognition. In: Proceedings ACS/IEEE International Conference on Computer Systems and Applications, Beirut, Liban, pp. 96–100 (2001)
Google Scholar
Bahi, H., Sellami, M.: Neural expert model applied to phonemes recognition. In: Perner, P., Imiya, A. (eds.) MLDM 2005. LNCS (LNAI), vol. 3587, pp. 507–515. Springer, Heidelberg (2005)
Chapter Google Scholar
Sangeetha, J., Jothilakshmi, S.: Robust automatic continuous speech segmentation for indian languages to improve speech to speech translation. Int. J. Comput. Appl. 53, 13–16 (2012)
Google Scholar
Khawaja, M.A., Haider, N.G.: Segmentation of Sindhi speech using formants. In: IEEE International Conference on Signal Processing and Communications (ICSPC 2007), Dubai, United Arab Emirates, pp. 24–27 (2017)
Google Scholar
Sharma, M., Mammone, R.J.: Blind speech segmentation: automatic segmentation of speech without linguistic knowledge. In: ICSLP (1996)
Google Scholar
Wang, H., Lee, T.: Acoustic segment modeling with spectral clustering methods. IEEE/ACM Trans. Audio Speech Lang. Process. (TASLP), 264–277 (2015)
Google Scholar
Brognaux, S., Drugman, T.: HMM-based speech segmentation: improvements of fully automatic approaches. IEEE/ACM Trans. Audio Speech Lang., 5–15 (2016)
Google Scholar
Young, S., et al.: The HTK Book. Cambridge University Engineering Department, Cambridge (2002)
Google Scholar
Galka, J., Ziolko, B.: Study of performance evaluation methods for non-uniform speech segmentation. Int. J. Circ. Syst. Sig. Process. (2007)
Google Scholar
Nofal, M., Abdel-Raheem, E., Henawy, H.E., Kader, N.S.A.: Arabic automatic segmentation system and its application for Arabic speech recognition system. In: IEEE 46th Midwest Symposium on Circuits and Systems, vol. 2, pp. 697–700 (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

Labged Laboratory, Universite de Badji Mokhtar Annaba, 23000, Annaba, Algeria
Hamza Frihia & Halima Bahi

Authors

Hamza Frihia
View author publications
You can also search for this author in PubMed Google Scholar
Halima Bahi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hamza Frihia .

Editor information

Editors and Affiliations

Masaryk University , Brno, Czech Republic
Petr Sojka
Masaryk University , Brno, Czech Republic
Aleš Horák
Masaryk University , Brno, Czech Republic
Ivan Kopeček
Masaryk University , Brno, Czech Republic
Karel Pala

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Frihia, H., Bahi, H. (2016). Embedded Learning Segmentation Approach for Arabic Speech Recognition. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds) Text, Speech, and Dialogue. TSD 2016. Lecture Notes in Computer Science(), vol 9924. Springer, Cham. https://doi.org/10.1007/978-3-319-45510-5_44

Download citation

DOI: https://doi.org/10.1007/978-3-319-45510-5_44
Published: 03 September 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-45509-9
Online ISBN: 978-3-319-45510-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics