Skip to main content

A New Classifier for Speaker Verification Based on the Fractional Brownian Motion Process

  • Conference paper
Text, Speech and Dialogue (TSD 2004)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3206))

Included in the following conference series:

  • 863 Accesses

Abstract

A novel text-independent verification system based on the fractional Brownian motion (M_dim_fBm) for automatic speaker recognition (ASR) is presented in this paper. The performance of the proposed M_dim_fBm was compared to those achieved with the GMM (Gaussian Mixture Models) classifier using the mel-cepstral coefficients. We have used a speech database – obtained from fixed and cellular phones – uttered by 75 different speakers. The results have shown the superior performance of the M_dim_fBm classifier in terms of recognition accuracy. In addition, the proposed classifier employs a much simpler modeling structure as compared to the GMM.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Barnsley, M., et al.: The Science of Fractal Images. Springer, USA (1988)

    MATH  Google Scholar 

  2. Hurst, E.: Long-term storage capacity of reservoirs. Transactions of American Society of Civil Engineers 116, 770–799 (1951)

    Google Scholar 

  3. Reynolds, D., Rose, R.: Robust text-independent speaker identification using gaussian mixture speaker models. IEEE Transactions on Speech, and Audio Processing 3, 72–83 (1995)

    Article  Google Scholar 

  4. Esteller, R., Vachtsevandos, G., Henry, T.: Fractal dimensions characterizes seizure onset in epileptic patients. In: IEEE Proceedings, ICASSP 1999, vol. 4, pp. 2343–2346 (1999)

    Google Scholar 

  5. Morimoto, T., et al.: Pattern recognition of fruit shape based on the concept of chaos and neural networks. Computers and Electronics in Agriculture 26, 171–186 (2000)

    Article  Google Scholar 

  6. Fernández, S., Feijóo, S., Balsa, R.: Fractal characterization of Spanish fricatives. Proceedings of the ICPhS 3, 2145–2148 (1999)

    Google Scholar 

  7. Petry, A., Barone, D.: Fractal dimension applied to speaker identification. In: Proceedings of the ICASSP (2001)

    Google Scholar 

  8. Beran, J.: Statistics for Long-Memory Processes. Chapman & Hall, Boca Raton (1994)

    MATH  Google Scholar 

  9. Veith, D., Abry, P.: A wavelet-based joint estimator of the parameters of long-range dependence. IEEE Trans. on Information Theory 45, 878–897 (1998)

    Article  Google Scholar 

  10. Daubechies, I.: Ten Lectures on Wavelets. SIAM, Philadelphia (1992)

    MATH  Google Scholar 

  11. Martin, A., et al.: The det curve in assessment of detection task performance. In: Proceedings of EuroSpeech 1997, pp. 1895–1898 (1997)

    Google Scholar 

  12. Reynolds, D., Rose, R., Hosftetter, E.: Integrated models of signal and background with application to speaker identification in noise. IEEE Transactions on Speech, and Audio Processing 2, 245–267 (1994)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ana, R.S., Coelho, R., Alcaim, A. (2004). A New Classifier for Speaker Verification Based on the Fractional Brownian Motion Process. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2004. Lecture Notes in Computer Science(), vol 3206. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30120-2_32

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-30120-2_32

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-23049-6

  • Online ISBN: 978-3-540-30120-2

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics