Automatic Estimation of a Priori Speaker Dependent Thresholds in Speaker Verification

Saeta, Javier R.; Hernando, Javier

doi:10.1007/3-540-44887-X_9

Automatic Estimation of a Priori Speaker Dependent Thresholds in Speaker Verification

Javier R. Saeta⁶ &
Javier Hernando⁷

Conference paper
First Online: 01 January 2003

1222 Accesses
4 Citations

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2688))

Abstract

The selection of a suitable threshold is considered essential for the correct performance of automatic enrollment in speaker verification. Conventional methods have faced with the scarcity of data and the problem of an a priori decision, using biased client scores, impostor data, variances, a speaker independent threshold or some combination of them. Because of this lack of data, means and variances are estimated in most cases with very few scores. Noise or simply poor quality utterances, when comparing to the client model, can lead to some scores which produce a high variance in estimations. These scores are outliers and have an effect on the right estimation of mean and specially standard deviation. We propose here an algorithm to discard outliers. The method consists of iteratively selecting the most distant score with respect to mean. If this score goes beyond a certain threshold, the score is removed and mean and standard deviation estimations are recalculated. When there are only a few utterances to estimate mean and variance, this method leads to a great improvement. Text dependent and text independent experiments have been carried out by using a telephonic multisession database in Spanish with 184 speakers, that has been recently recorded by the authors.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

S. Furui, “Cepstral Analysis for Automatic Speaker Verification”, IEEE Trans. on Acoustics, Speech and Signal Processing, vol. 29, no. 2, pp. 254–272, 1981.
Article Google Scholar
F. Bimbot, D. Genoud, “Likelihood Ratio Adjustment for the Compensation of Model Mismatch in Speaker Verification”, Proc. Eurospeech’97, pp. 1387–1390.
Google Scholar
Q. Li, B.H. Juang, Q. Zhou, C.H. Lee, “Verbal Information Verification”, Proc. Eurospeech’97, pp. 839–842.
Google Scholar
D.A. Reynolds, “Comparison of Background Normalization Methods for Text-Independent Speaker Verification”, Proc. Eurospeech’97, pp. 963–966.
Google Scholar
G. Gravier, G. Chollet, “Comparison of Normalization Techniques for Speaker Verification”, Proc. RLA2C, Avignon, 1998, pp. 97–100.
Google Scholar
J.B. Pierrot, J. Lindberg, J. Koolwaaij, H.P. Hutter, D. Genoud, M. Blomberg, F. Bimbot, “A Comparison of A Priori Threshold Setting Procedures for Speaker Verification in the CAVE Project”, Proc. ICASSP’98, pp. 125–128.
Google Scholar
J. Lindberg, J. Koolwaaij, H.P. Hutter, D. Genoud, J.B. Pierrot, M. Blomberg, F. Bimbot, “Techniques for A Priori Decision Threshold Estimation in Speaker Verification”, Proc. RLA2C, Avignon 1998, pp. 89–92.
Google Scholar
W.D. Zhang, K.K. Yiu, M.W. Mak, C.K. Li, M.X. He, “A Priori Threshold Determination for Phrase-Prompted Speaker Verification”, Proc. Eurospeech’99, pp. 1203–1206.
Google Scholar
A.C. Surendran, C.H. Lee, “A Priori Threshold Selection for Fixed Vocabulary Speaker Verification Systems”, Proc. ICSLP’00, pp.246–249, vol. II.
Google Scholar
N. Mirghafori, L. Heck, “An Adaptive Speaker Verification System with Speaker Dependent A Priori Decision Thresholds”, Proc. ICSLP’02, pp. 589–592.
Google Scholar

Download references

Author information

Authors and Affiliations

Biometric Technologies, S.L. Barcelona, Spain
Javier R. Saeta
TALP Research Center, Universitat Politecnica de Catalunya, Barcelona, Spain
Javier Hernando

Authors

Javier R. Saeta
View author publications
You can also search for this author in PubMed Google Scholar
Javier Hernando
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Center for Vision, Speech and Signal Proc., University of Surrey, GU2 7XH, Guildford, Surrey, UK
Josef Kittler
Department of Electronics and Computer Science, University of Southampton, SO17 1BJ, Southampton, UK
Mark S. Nixon

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Saeta, J.R., Hernando, J. (2003). Automatic Estimation of a Priori Speaker Dependent Thresholds in Speaker Verification. In: Kittler, J., Nixon, M.S. (eds) Audio- and Video-Based Biometric Person Authentication. AVBPA 2003. Lecture Notes in Computer Science, vol 2688. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44887-X_9

Download citation

DOI: https://doi.org/10.1007/3-540-44887-X_9
Published: 24 June 2003
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40302-9
Online ISBN: 978-3-540-44887-7
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics