Abstract
It is well known that device, language and environmental mismatch adversely affect speaker recognition performance. Much less attention has been paid to the effect of age-related voice changes on speaker recognition performance. In this paper we attempt to answer whether speaker recognition algorithms are resistant to age-related changes, and how often voice biometric templates have to be updated. We investigated these effects using a speech database collected during the period 2006-2010 and found a clear trend of degradation in the performance of automatic speaker recognition systems over a time interval of up to 4 years.
Copyright information
© 2013 Springer International Publishing Switzerland
About this paper
Cite this paper
Matveev, Y. (2013). The Problem of Voice Template Aging in Speaker Recognition Systems. In: Železný, M., Habernal, I., Ronzhin, A. (eds) Speech and Computer. SPECOM 2013. Lecture Notes in Computer Science, vol 8113. Springer, Cham. https://doi.org/10.1007/978-3-319-01931-4_46
DOI: https://doi.org/10.1007/978-3-319-01931-4_46
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-01930-7
Online ISBN: 978-3-319-01931-4
eBook Packages: Computer Science (R0)