Abstract
A novel text-independent verification system based on the fractional Brownian motion (M_dim_fBm) for automatic speaker recognition (ASR) is presented in this paper. The performance of the proposed M_dim_fBm was compared to those achieved with the GMM (Gaussian Mixture Models) classifier using the mel-cepstral coefficients. We have used a speech database – obtained from fixed and cellular phones – uttered by 75 different speakers. The results have shown the superior performance of the M_dim_fBm classifier in terms of recognition accuracy. In addition, the proposed classifier employs a much simpler modeling structure as compared to the GMM.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Barnsley, M., et al.: The Science of Fractal Images. Springer, USA (1988)
Hurst, E.: Long-term storage capacity of reservoirs. Transactions of American Society of Civil Engineers 116, 770–799 (1951)
Reynolds, D., Rose, R.: Robust text-independent speaker identification using gaussian mixture speaker models. IEEE Transactions on Speech, and Audio Processing 3, 72–83 (1995)
Esteller, R., Vachtsevandos, G., Henry, T.: Fractal dimensions characterizes seizure onset in epileptic patients. In: IEEE Proceedings, ICASSP 1999, vol. 4, pp. 2343–2346 (1999)
Morimoto, T., et al.: Pattern recognition of fruit shape based on the concept of chaos and neural networks. Computers and Electronics in Agriculture 26, 171–186 (2000)
Fernández, S., Feijóo, S., Balsa, R.: Fractal characterization of Spanish fricatives. Proceedings of the ICPhS 3, 2145–2148 (1999)
Petry, A., Barone, D.: Fractal dimension applied to speaker identification. In: Proceedings of the ICASSP (2001)
Beran, J.: Statistics for Long-Memory Processes. Chapman & Hall, Boca Raton (1994)
Veith, D., Abry, P.: A wavelet-based joint estimator of the parameters of long-range dependence. IEEE Trans. on Information Theory 45, 878–897 (1998)
Daubechies, I.: Ten Lectures on Wavelets. SIAM, Philadelphia (1992)
Martin, A., et al.: The det curve in assessment of detection task performance. In: Proceedings of EuroSpeech 1997, pp. 1895–1898 (1997)
Reynolds, D., Rose, R., Hosftetter, E.: Integrated models of signal and background with application to speaker identification in noise. IEEE Transactions on Speech, and Audio Processing 2, 245–267 (1994)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ana, R.S., Coelho, R., Alcaim, A. (2004). A New Classifier for Speaker Verification Based on the Fractional Brownian Motion Process. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2004. Lecture Notes in Computer Science(), vol 3206. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30120-2_32
Download citation
DOI: https://doi.org/10.1007/978-3-540-30120-2_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23049-6
Online ISBN: 978-3-540-30120-2
eBook Packages: Springer Book Archive