VQ score normalisation for text-dependent and text-independent speaker recognition

Finan, R. A.; Sapeluk, A. T.; Damper, R. I.

doi:10.1007/BFb0015998

VQ score normalisation for text-dependent and text-independent speaker recognition

R. A. Finan¹,
A. T. Sapeluk¹ &
R. I. Damper²

Text-dependent Speaker Authentication
Conference paper
First Online: 01 January 2005

2423 Accesses
2 Citations

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1206))

Abstract

Individual weighting of speaker models in VQ-based recognition has some advantages but means that scores from different models may not be directly comparable, so making identification difficult. It is also problematic for verification as decision thresholds cannot easily be set without first testing models with genuine and imposter utterances. We present a novel normalisation method for VQ speaker recognition which applies an offset to each model, based on the average score between it and the imposter models, to bring particularly high- or low-scoring models into line with the general score range. It may be calculated a priori, before running any actual tests. The method works for both text-dependent and text-independent tasks and improves both the identification and verification error rates.

This is a preview of subscription content, log in via an institution.

Preview

Unable to display preview. Download preview PDF.

References

Booth, I., Barlow, M., Watson, B.: Enhancements to DTW and VQ decision algorithms for speaker recognition. Speech Comm. 13 (1993) 427–433.
Google Scholar
Furui, S.: An analysis of long-term variation of feature parameters of speech and its application to talker recognition. Electronic Comms. 57-A (1974) 34–42.
Google Scholar
Green, D.M., Swets, J.A.: Signal Detection Theory and Psychophysics. John Wiley, New York (1966).
Google Scholar
Hannah, M.I., Sapeluk, A.T., Damper, R.I., Roger, I.M.: Using genetic algorithms to improve speaker-verifier performance. Proc. IEE/IEEE Workshop on Natural Algorithms in Signal Processing, Chelmsford, UK (1993) 24/1–24/9.
Google Scholar
Li, K-P., Porter, J.E.: Normalizations and selection of speech segments for speaker recognition scoring. Proc. ICASSP '88 (1988) 595–598.
Google Scholar
Linde, J., Buzo, A., Gray, R.M.: An algorithm for vector quantizer design. IEEE Trans. Comms. 28 (1980) 84–95.
Google Scholar
Picone, J.: Signal modeling techniques in speech recognition. Proc. IEEE 81 (1990) 1215–1247.
Google Scholar
Rosenberg, A.E., Soong, F.K.: Evaluation of a vector quantization talker recognition system in text dependent and text independent modes. Comp. Speech Lang. 22 (1987) 143–157.
Google Scholar
Yu, K., Mason, J., Oglesby, J.: Speaker recognition using hidden Markov models, dynamic time warping and vector quantisation. IEE Proc. Vision, Image and Sig. Process. 142 (1995) 313–318.
Google Scholar

Download references

Author information

Authors and Affiliations

School of Engineering, University of Abertay Dundee, DD1 1HG, Scotland
R. A. Finan & A. T. Sapeluk
Department of Electronics and Computer Science, University of Southampton, SO17 1BJ, England
R. I. Damper

Authors

R. A. Finan
View author publications
You can also search for this author in PubMed Google Scholar
A. T. Sapeluk
View author publications
You can also search for this author in PubMed Google Scholar
R. I. Damper
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Josef Bigün Gérard Chollet Gunilla Borgefors

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Finan, R.A., Sapeluk, A.T., Damper, R.I. (1997). VQ score normalisation for text-dependent and text-independent speaker recognition. In: Bigün, J., Chollet, G., Borgefors, G. (eds) Audio- and Video-based Biometric Person Authentication. AVBPA 1997. Lecture Notes in Computer Science, vol 1206. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0015998

Download citation

DOI: https://doi.org/10.1007/BFb0015998
Published: 10 June 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-62660-2
Online ISBN: 978-3-540-68425-1
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics