Abstract
This paper presents an extension of the naive Bayesian classifier, called “homologous naive Bayes (HNB),” which is applied to the problem of text-independent, close-set speaker recognition. Unlike the standard naive Bayes, HNB can take advantage of the prior information that a sequence of input feature vectors belongs to the same unknown class. We refer to such a sequence a homologous set, which is naturally available in speaker recognition. We empirically compare HNB with the Gaussian mixture model (GMM), the most widely used approach to speaker recognition. Results show that, in spite of its simplisity, HNB can achieve comparable classification accuracies for up to a hundred speakers while taking much less resources in terms of time and code size for both training and classification.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Hsu, C.N., Huang, H.J., Wong, T.T.: Why discretization works for naive bayesian classifiers. In: Machine Learning: Proceedings of the 17th International Conference (ML 2000), San Francisco, CA (2000)
John, G., Langley, P.: Estimating continuous distributions in Bayesian classifiers. In: In Proceedings of the Eleventh Annual Conference on Uncertainty in Artificial Intelligence (UAI’ 95). (1995) 338–345
Fayyad, U.M., Irani, K.B.: Multi-interval discretization of continuous valued attributes for classification learning. In: Proceedings of the Thirteenth International Joint Conference on Artificial Intelligence (IJCAI’ 93), Chambery, France (1993) 1022–1027
Ross, S.: A First Course in Probability. Prentice Hall (1998)
Blake, C., Merz, C.: UCI repository of machine learning databases (1998)
Przybocki, M.A., Martin, A.F.: NIST speaker recognition evaluation. In: Workshop on Speaker Recognition and its Commercial and Forensic Applications (RLA2C), Avignon, France (1998)
Reynolds, D.A., Rose, R.C.: Robust text-independent speaker identification using gaussian mixture speaker models. IEEE Transactions on Speech and Audio Processing 3 (1995) 72–83
de Veth, J., Bourlard, H.: Comparison of hidden markov model techniques for automatic speaker verification in real-world conditions. Speech Communication 17 (1995) 81–90
Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society Series, B(39) (1977) 1–38
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Huang, HJ., Hsu, CN. (2002). Recognizing 100 Speakers Using Homologous Naive Bayes. In: Ishizuka, M., Sattar, A. (eds) PRICAI 2002: Trends in Artificial Intelligence. PRICAI 2002. Lecture Notes in Computer Science(), vol 2417. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45683-X_43
Download citation
DOI: https://doi.org/10.1007/3-540-45683-X_43
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44038-3
Online ISBN: 978-3-540-45683-4
eBook Packages: Springer Book Archive