Abstract
There are different methods for vocal pathology detection. These methods usually have three steps which are feature extraction, feature reduction and speech classification. The first and second steps present obstacles to attain high performance and accuracy of the classification system [20]. Indeed, feature reduction can create a loss of data. In this paper, we present an initial study of Arabic speech classification based on probabilistic approach and distance between reference speeches and speech to classify. The first step in our approach is dedicated to generate a standard distance (phonetic distance) between different healthy speech bases. In the second stage we will determine the distance between speech to classify and reference speeches (phonetic model proper to speaker and a reference phonetic model). Comparing these two distances (distance between speech to classify and reference speeches & standard distance), in the third step, we can classify the input speech to healthy or pathological. The proposed method is able to classify Arabic speeches with an accuracy of 96.25%, and we attain 100% by concatenation falsely classified sequences. Results of our method provide insights that can guide biologists and computer scientists to design high performance systems of vocal pathology detection.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Majidnezhad, V., Kheidorov, I.: An ANN-based Method for Detecting Vocal Fold Pathology. International Journal of Computer Applications 62(7) (January 2013)
Little, M., McSharry, P., Moroz, I., Roberts, S.: Nonlinear, Biophysically-Informed Speech Pathology Detection. In: ICASSP (2006)
Majidnezhad, V., Kheidorov, I.: A HMM-Based Method for V ocal Fold Pathology Diagnosis. IJCSI International Journal of Computer Science Issues 9(6(2)) (November 2012)
Bréhilin, L., Gascuel, O.: Modèles de Markov caches et apprentissage de sequences
Patane, G., Russo, M.: The enhanced LBG Algorithm. Neural Networks 14(9) (November 2001)
Makhoul, J., Roucos, S., Gish, H.: Vector Quantization in Speech Coding
Räsänen, O.J., Laine, U.K., Altosaar, T.: Self-learning Vector Quantization for Pattern Discovery from Speech. In: InterSpeech 2009, September 6-10, Brighton, United Kingdom (2009)
Ittichaichareon, C., Suksri, S., Yingthawornsuk, T.: Speech Recognition using MFCC. In: International Conference on Computer Graphics, Simulation and Modeling, ICGSM 2012, July 28-29, Pattaya, Thailand (2012)
Jolliffe, I.T.: Principal Component Analysis. 2nd edn. (2008)
Paquet, P.: L’utilisation des réseaux de neurones artificiels en finance. Document de Recherche n° 1997-1 (1997)
Kukharchik, P., Martynov, D., Kheidorov, I., Kotov, O.: Vocal Fold Pathology Detection using Modified Wavelet-like Features and Support Vector Machines. In: 15th European Signal Processing Conference, EUSIPCO 2007, September 3-7, Poznan, Poland, (2007); copyright by EURASIP
Archaux, C., Laanaya, H., Martin, A., Khenchaf, A.: An SVM based Churn Detector in Prepaid Mobile Telephony (2004)
Aguira-Conraria, L., Soares, M.J.: The Contnuous Wavelet Transform: A Primer (July 12, 2011)
Damerval, C.: Ondelettes pour la détection de caractéristiques en traitement d’images. Doctoral thesis (Mai 2008)
Hamila, R., Astola, J., Cheikh, F.A., Gabbouj, M., Renfors, M.: Teager energy and the ambiguity function. IEEE Transactions on Signal Processing 47(1) (1999)
Martinez César, E., Rufiner Hugo, L.: Acoustic analysis of speech for detection of laryngeal pathologies. In: 22nd Annual EMBS International Conference, July 23-28, Chicago Il (2000)
Fabrice, P., Christian, B.-V.: Détection acoustique des pathologies phonatoires chez l’enfant. Doctoral thesis (1993)
Terbeh, N., Labidi, M., Zrigui, M.: Automatic speech correction: A step to speech recognition for people with disabilities. In: ICTA 2013, October 23-26, Hammamet, Tunisia (2013)
Terbeh, N., Zrigui, M.: Vers la Correction Automatique de la Parole Arabe. In: Citala 2014, November 26-27, Oujda, Morocco (2014)
Zrigui, M., Ayadi, R., Mars, M., Maraoui, M.: Arabic Text Classification Framework Based on Latent Dirichlet Allocation. CIT 20(2), 125–140 (2012)
Zouaghi, A., Zrigui, M., Antoniadis, G.: Automatic Understanding of Spontaneous Arabic Speech - A Numerical Model. TAL 49(1), 141–166 (2008)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Terbeh, N., Maraoui, M., Zrigui, M. (2015). Probabilistic Approach for Detection of Vocal Pathologies in the Arabic Speech. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2015. Lecture Notes in Computer Science(), vol 9042. Springer, Cham. https://doi.org/10.1007/978-3-319-18117-2_45
Download citation
DOI: https://doi.org/10.1007/978-3-319-18117-2_45
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-18116-5
Online ISBN: 978-3-319-18117-2
eBook Packages: Computer ScienceComputer Science (R0)