Recognizing 100 Speakers Using Homologous Naive Bayes

Huang, Hung-Ju; Hsu, Chun-Nan

doi:10.1007/3-540-45683-X_43

Hung-Ju Huang³ &
Chun-Nan Hsu⁴

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2417))

Included in the following conference series:

Pacific Rim International Conference on Artificial Intelligence

846 Accesses
1 Citations

Abstract

This paper presents an extension of the naive Bayesian classifier, called “homologous naive Bayes (HNB),” which is applied to the problem of text-independent, close-set speaker recognition. Unlike the standard naive Bayes, HNB can take advantage of the prior information that a sequence of input feature vectors belongs to the same unknown class. We refer to such a sequence a homologous set, which is naturally available in speaker recognition. We empirically compare HNB with the Gaussian mixture model (GMM), the most widely used approach to speaker recognition. Results show that, in spite of its simplisity, HNB can achieve comparable classification accuracies for up to a hundred speakers while taking much less resources in terms of time and code size for both training and classification.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Hsu, C.N., Huang, H.J., Wong, T.T.: Why discretization works for naive bayesian classifiers. In: Machine Learning: Proceedings of the 17th International Conference (ML 2000), San Francisco, CA (2000)
Google Scholar
John, G., Langley, P.: Estimating continuous distributions in Bayesian classifiers. In: In Proceedings of the Eleventh Annual Conference on Uncertainty in Artificial Intelligence (UAI’ 95). (1995) 338–345
Google Scholar
Fayyad, U.M., Irani, K.B.: Multi-interval discretization of continuous valued attributes for classification learning. In: Proceedings of the Thirteenth International Joint Conference on Artificial Intelligence (IJCAI’ 93), Chambery, France (1993) 1022–1027
Google Scholar
Ross, S.: A First Course in Probability. Prentice Hall (1998)
Google Scholar
Blake, C., Merz, C.: UCI repository of machine learning databases (1998)
Google Scholar
Przybocki, M.A., Martin, A.F.: NIST speaker recognition evaluation. In: Workshop on Speaker Recognition and its Commercial and Forensic Applications (RLA2C), Avignon, France (1998)
Google Scholar
Reynolds, D.A., Rose, R.C.: Robust text-independent speaker identification using gaussian mixture speaker models. IEEE Transactions on Speech and Audio Processing 3 (1995) 72–83
Article Google Scholar
de Veth, J., Bourlard, H.: Comparison of hidden markov model techniques for automatic speaker verification in real-world conditions. Speech Communication 17 (1995) 81–90
Article Google Scholar
Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society Series, B(39) (1977) 1–38
MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer and Information Science, National Chiao-Tung University, Hsinchu City, 300, Taiwan
Hung-Ju Huang
Institute of Information Science, Academia Sinica, Nankang 115, Taipei City, Taiwan
Chun-Nan Hsu

Authors

Hung-Ju Huang
View author publications
You can also search for this author in PubMed Google Scholar
Chun-Nan Hsu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Information Science and Technology Department of Information and Communication Engineering, University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8656, Japan
Mitsuru Ishizuka
School of Information Technology Knowledge Representation and Reasoning Unit (KRRU) Faculty of Engineering and Information Technology, Griffith University, PMB 50 Gold Coast Mail Centre, Queensland, 9726, Australia
Abdul Sattar

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Huang, HJ., Hsu, CN. (2002). Recognizing 100 Speakers Using Homologous Naive Bayes. In: Ishizuka, M., Sattar, A. (eds) PRICAI 2002: Trends in Artificial Intelligence. PRICAI 2002. Lecture Notes in Computer Science(), vol 2417. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45683-X_43

Download citation

DOI: https://doi.org/10.1007/3-540-45683-X_43
Published: 21 August 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44038-3
Online ISBN: 978-3-540-45683-4
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics