Multistep Speaker Identification Using Gibbs-Distribution-Based Extended Bayesian Inference for Rejecting Unregistered Speaker

Mizobe, Yuta; Kurogi, Shuichi; Tsukazaki, Tomohiro; Nishida, Takeshi

doi:10.1007/978-3-642-34500-5_30

Yuta Mizobe²⁰,
Shuichi Kurogi²⁰,
Tomohiro Tsukazaki²⁰ &
…
Takeshi Nishida²⁰

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7667))

Included in the following conference series:

International Conference on Neural Information Processing

3876 Accesses
2 Citations

Abstract

This paper presents a method of multistep speaker identification using Gibbs-distribution-based extended Bayesian inference (GEBI) for rejecting unregistered speaker. The method is developed for our speaker recognition system which utilizes competitive associative nets (CAN2s) for learning piecewise linear approximation of nonlinear speech signal to extract feature vectors of pole distribution from piecewise linear coefficients reflecting nonlinear and time-varying vocal tract of the speaker. In this paper, we focus on the problem of Bayesian inference (BI) in multistep identification for rejecting unregistered speaker and introduce GEBI to solve the problem. The effectiveness of the present method is shown by means of experiments using real speech signals.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Ahalt, A.C., Krishnamurthy, A.K., Chen, P., Melton, D.E.: Competitive learning algorithms for vector quantization. Neural Networks 3, 277–290 (1990)
Article Google Scholar
Kohonen, T.: Associative Memory. Springer (1977)
Google Scholar
Kurogi, S., Ueno, T., Sawa, M.: A batch learning method for competitive associative net and its application to function approximation. In: Proc. SCI 2004, vol. 5, pp. 24–28 (2004)
Google Scholar
Kurogi, S., Mineishi, S., Sato, S.: An Analysis of Speaker Recognition Using Bagging CAN2 and Pole Distribution of Speech Signals. In: Wong, K.W., Mendis, B.S.U., Bouzerdoum, A. (eds.) ICONIP 2010, Part I. LNCS, vol. 6443, pp. 363–370. Springer, Heidelberg (2010)
Chapter Google Scholar
Kurogi, S., Mineishi, S., Tsukazaki, T., Nishida, T.: Naive Bayesian Multistep Speaker Recognition Using Competitive Associative Nets. In: Lu, B.-L., Zhang, L., Kwok, J. (eds.) ICONIP 2011, Part I. LNCS, vol. 7062, pp. 70–78. Springer, Heidelberg (2011)
Chapter Google Scholar
Campbell, J.P.: Speaker Recognition: A Tutorial. Proc. the IEEE 85(9), 1437–1462 (1997)
Article Google Scholar
Furui, S.: Speaker Recognition. In: Cole, R., Mariani, J., et al. (eds.) Survey of the State of the Art in Human Language Technology, pp. 36–42. Cambridge University Press (1998)
Google Scholar
Hasan, M.R., Jamil, M., Rabbani, M.G., Rahman, M.S.: Speaker identification using Mel frequency cepstral coefficients. In: Proc. ICECE 2004, pp. 565–568 (2004)
Google Scholar
Bocklet, T., Shriberg, E.: Speaker recognition using syllable-based constraints for cepstral frame selection. In: Proc. ICASSP (2009)
Google Scholar
Beigi, H.: Fundamentals of speaker recognition. Springer-Verlag New York Inc. (2011)
Google Scholar
Zhang, H.: The optimality of naive Bayes. In: Proc. FLAIRS 2004 Conference (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

Kyushu Institute of Technology, Tobata, Kitakyushu, Fukuoka, 804-8550, Japan
Yuta Mizobe, Shuichi Kurogi, Tomohiro Tsukazaki & Takeshi Nishida

Authors

Yuta Mizobe
View author publications
You can also search for this author in PubMed Google Scholar
Shuichi Kurogi
View author publications
You can also search for this author in PubMed Google Scholar
Tomohiro Tsukazaki
View author publications
You can also search for this author in PubMed Google Scholar
Takeshi Nishida
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Texas A&M University at Qatar, Education City, P.O. Box 23874, Doha, Qatar
Tingwen Huang
Department of Control Science and Engineering, Huazhong University of Science and Technology, 1037 Luoyu Road, 430074, Wuhan, Hubei, China
Zhigang Zeng
College of Computer Science, Chongqing University, 174 Shazhengjie Street, 400044, Chongqing, China
Chuandong Li
Department of Electronic Engineering, City University of Hong Kong, 83 Tat Chee Avenue, Kowloon, Hong Kong, China
Chi Sing Leung

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Mizobe, Y., Kurogi, S., Tsukazaki, T., Nishida, T. (2012). Multistep Speaker Identification Using Gibbs-Distribution-Based Extended Bayesian Inference for Rejecting Unregistered Speaker. In: Huang, T., Zeng, Z., Li, C., Leung, C.S. (eds) Neural Information Processing. ICONIP 2012. Lecture Notes in Computer Science, vol 7667. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34500-5_30

Download citation

DOI: https://doi.org/10.1007/978-3-642-34500-5_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34499-2
Online ISBN: 978-3-642-34500-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics