Score Normalization Technique for Text-Prompted Speaker Verification with Chinese Digits

Li, Jing; Dong, Yuan; Dong, Chengyu; Wang, Haila

doi:10.1007/978-3-540-74205-0_112

Score Normalization Technique for Text-Prompted Speaker Verification with Chinese Digits

Jing Li¹,
Yuan Dong^1,2,
Chengyu Dong² &
…
Haila Wang²

Conference paper

1744 Accesses
2 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4682))

Abstract

A text prompted speaker verification system is presented in this paper. This system is based on ten Chinese digits. Basic acoustic models are speaker dependent and content dependent phoneme HMMs which were generated by adapting speaker independent models to the utterances of specific speakers. An obvious constraint for normalization techniques used in TDSV is that the phrases with the same content should be used for competitive cohort models. So many of the score normalization techniques are either difficult to implement because of lack of data or not good for performance improvement because of poor estimation of the normalization parameters. We propose a method which combines the traditional T-Norm and Cohort Norm together to find a good tradeoff of testing utterance normalization and target speaker model normalization. The proposed method improved the system performance from the baseline equal error rate 3.42% for T-Norm and 2.72% for Cohort Norm to 2.50%.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Che, C.W., Lin, Q., Yuk, D.S.: An HMM Approach to Text-prompted Speaker Verification. In: Proc. ICASSP vol. 2, pp. 673–676 (1996)
Google Scholar
Kato, T., Shimizu, T.: Improved Speaker Verification Over the Cellular Phone Network Using Phoneme-Balanced and Digit-Sequence-Preserving Connected Digit Patterns. In: Proc. ICASSP vol. 2, II pp. 57–60 (2003)
Google Scholar
Melin, H., Lindberg, J.: Prompting of Passwords in Speaker Verification System. KTH, Dept. of Speech, Music and Hearing
Google Scholar
Li, K.P., Porter, J.E.: Normalizations and Selection of Speech Segments for Speaker Recognition Scoring. In: Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, New York, NY, USA, vol. 1, pp. 595–598 (April 1988)
Google Scholar
Reynolds, D.A.: The Effect of Handset Variability on Speaker Recognition Performance: Experiments on Switchboard Corpus. In: Proc. ICASSP vol. 1, pp. 113–116 (1996)
Google Scholar
Hebert, M., Boies, D., Communication, N.: T-Norm for Text-Dependent Commercial Speaker Verification Applications: Effect of Lexical Mismatch. In: Proc. ICASSP, vol. 1, pp. 729–732, 113-116
Google Scholar
Matsui, T., Furui, S.: Concatenated Phoneme Models for Text Variable Speaker Recognition. In: Proc. ICASSP, vol. 2, pp. 391–394 (1993)
Google Scholar
Auckenthaler, R., Carey, M., Lloyd-Thomas, H.: Score Normalization for Text-Independent Speaker Verification System. On Digital Signal Processing 10, 42–54 (2000)
Article Google Scholar
Sturim, D.E., Reynolds, D.A.: Speaker Adaptive Cohort Selection for Tnorm in Text-Independent Speaker Verification. In: Proc. ICASSP, vol. 1, pp. 741–744 (2005)
Google Scholar
Colombi, J.M., Ruck, D.W., Anderson, T.R., Rogers, S.K., Oxley, M.: Cohort Selection and Word Grammar Effects for Speaker Recognition. In: Proc. ICASSP, vol. 1, pp. 85–88 (1996)
Google Scholar

Download references

Author information

Authors and Affiliations

Beijing University of Posts and Telecommunications, 100876, P.R. China
Jing Li & Yuan Dong
France Telecom R&D Beijing Co, Ltd., Beijing, P.R. China
Yuan Dong, Chengyu Dong & Haila Wang

Authors

Jing Li
View author publications
You can also search for this author in PubMed Google Scholar
Yuan Dong
View author publications
You can also search for this author in PubMed Google Scholar
Chengyu Dong
View author publications
You can also search for this author in PubMed Google Scholar
Haila Wang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

De-Shuang Huang Laurent Heutte Marco Loog

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, J., Dong, Y., Dong, C., Wang, H. (2007). Score Normalization Technique for Text-Prompted Speaker Verification with Chinese Digits. In: Huang, DS., Heutte, L., Loog, M. (eds) Advanced Intelligent Computing Theories and Applications. With Aspects of Artificial Intelligence. ICIC 2007. Lecture Notes in Computer Science(), vol 4682. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74205-0_112

Download citation

DOI: https://doi.org/10.1007/978-3-540-74205-0_112
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74201-2
Online ISBN: 978-3-540-74205-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics