Abstract
This paper presents a method of multistep speaker identification using Gibbs-distribution-based extended Bayesian inference (GEBI) for rejecting unregistered speaker. The method is developed for our speaker recognition system which utilizes competitive associative nets (CAN2s) for learning piecewise linear approximation of nonlinear speech signal to extract feature vectors of pole distribution from piecewise linear coefficients reflecting nonlinear and time-varying vocal tract of the speaker. In this paper, we focus on the problem of Bayesian inference (BI) in multistep identification for rejecting unregistered speaker and introduce GEBI to solve the problem. The effectiveness of the present method is shown by means of experiments using real speech signals.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Ahalt, A.C., Krishnamurthy, A.K., Chen, P., Melton, D.E.: Competitive learning algorithms for vector quantization. Neural Networks 3, 277–290 (1990)
Kohonen, T.: Associative Memory. Springer (1977)
Kurogi, S., Ueno, T., Sawa, M.: A batch learning method for competitive associative net and its application to function approximation. In: Proc. SCI 2004, vol. 5, pp. 24–28 (2004)
Kurogi, S., Mineishi, S., Sato, S.: An Analysis of Speaker Recognition Using Bagging CAN2 and Pole Distribution of Speech Signals. In: Wong, K.W., Mendis, B.S.U., Bouzerdoum, A. (eds.) ICONIP 2010, Part I. LNCS, vol. 6443, pp. 363–370. Springer, Heidelberg (2010)
Kurogi, S., Mineishi, S., Tsukazaki, T., Nishida, T.: Naive Bayesian Multistep Speaker Recognition Using Competitive Associative Nets. In: Lu, B.-L., Zhang, L., Kwok, J. (eds.) ICONIP 2011, Part I. LNCS, vol. 7062, pp. 70–78. Springer, Heidelberg (2011)
Campbell, J.P.: Speaker Recognition: A Tutorial. Proc. the IEEE 85(9), 1437–1462 (1997)
Furui, S.: Speaker Recognition. In: Cole, R., Mariani, J., et al. (eds.) Survey of the State of the Art in Human Language Technology, pp. 36–42. Cambridge University Press (1998)
Hasan, M.R., Jamil, M., Rabbani, M.G., Rahman, M.S.: Speaker identification using Mel frequency cepstral coefficients. In: Proc. ICECE 2004, pp. 565–568 (2004)
Bocklet, T., Shriberg, E.: Speaker recognition using syllable-based constraints for cepstral frame selection. In: Proc. ICASSP (2009)
Beigi, H.: Fundamentals of speaker recognition. Springer-Verlag New York Inc. (2011)
Zhang, H.: The optimality of naive Bayes. In: Proc. FLAIRS 2004 Conference (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mizobe, Y., Kurogi, S., Tsukazaki, T., Nishida, T. (2012). Multistep Speaker Identification Using Gibbs-Distribution-Based Extended Bayesian Inference for Rejecting Unregistered Speaker. In: Huang, T., Zeng, Z., Li, C., Leung, C.S. (eds) Neural Information Processing. ICONIP 2012. Lecture Notes in Computer Science, vol 7667. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34500-5_30
Download citation
DOI: https://doi.org/10.1007/978-3-642-34500-5_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34499-2
Online ISBN: 978-3-642-34500-5
eBook Packages: Computer ScienceComputer Science (R0)