A Study on the Relation Between the Frame Pruning and the Robust Speaker Identification with Multivariate t-Distribution

Lee, Younjeong; Lee, Joohun; Hahn, Hernsoo

doi:10.1007/11581772_70

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 3767)

Abstract

In this paper, we perform robust speaker identification with two methods, frame pruning and the multivariate t-distribution, and then study a theoretical basis for frame pruning using the other method. Based on the results of the two methods, we show that robust algorithms based on frame weights provide a theoretical basis for the frame pruning method, by considering the correspondence between the frame pruning weight and the conditional expectation of the t-distribution. Both methods perform well when coping with outliers that occur in a given time period: frame pruning, which removes the less reliable frames, is a recommended method, and multivariate t-distributions are commonly used in place of Gaussian mixture models (GMMs) as a robust approach to speaker identification. In experiments, robust speaker identification outperformed the typical GMM algorithm. Moreover, the trend of the frame likelihoods under frame pruning is similar to that of the robust algorithms.
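The correspondence between pruning weights and the t-distribution's conditional expectation can be illustrated with a small sketch. It is not the authors' implementation: the full text is not available here, so the example assumes a single diagonal-covariance component rather than a mixture, a simple pruning rule that drops a fixed fraction of the lowest-likelihood frames, and made-up frame values. The soft weight uses the standard conditional expectation from EM for the multivariate t-distribution, (ν + p) / (ν + δ), where p is the feature dimension and δ is the squared Mahalanobis distance of the frame from the mean. All function names are hypothetical.

```python
import math

def gaussian_loglik(x, mean, var):
    """Log-density of a diagonal-covariance Gaussian at frame x."""
    return sum(-0.5 * (math.log(2 * math.pi * vi) + (xi - mi) ** 2 / vi)
               for xi, mi, vi in zip(x, mean, var))

def pruning_weights(logliks, prune_fraction):
    """Hard 0/1 weights: the lowest-scoring fraction of frames gets weight 0.
    (Assumed pruning rule, chosen for illustration.)"""
    n_prune = int(len(logliks) * prune_fraction)
    order = sorted(range(len(logliks)), key=lambda i: logliks[i])
    pruned = set(order[:n_prune])
    return [0.0 if i in pruned else 1.0 for i in range(len(logliks))]

def t_weight(x, mean, var, nu):
    """Soft weight E[u | x] = (nu + p) / (nu + d) for a t-distribution with
    diagonal covariance, where d is the squared Mahalanobis distance."""
    p = len(x)
    d = sum((xi - mi) ** 2 / vi for xi, mi, vi in zip(x, mean, var))
    return (nu + p) / (nu + d)

# Toy 2-D frames: four near the model mean and one outlier (made-up values).
frames = [[0.1, -0.2], [0.3, 0.1], [-0.2, 0.0], [0.0, 0.4], [6.0, -5.0]]
mean, var, nu = [0.0, 0.0], [1.0, 1.0], 5.0

logliks = [gaussian_loglik(f, mean, var) for f in frames]
hard = pruning_weights(logliks, prune_fraction=0.2)   # frame pruning
soft = [t_weight(f, mean, var, nu) for f in frames]   # t-distribution

for f, h, s in zip(frames, hard, soft):
    print(f, h, round(s, 3))
```

In this toy case both schemes single out the same frame: pruning assigns the outlier a weight of zero, while the t-distribution weight shrinks smoothly as the frame moves away from the mean. The soft weights are not confined to [0, 1]; frames very close to the mean receive weights above 1.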

Author information

Authors and Affiliations

School of Electronic Engineering, Soongsil University, Dongjak-gu, Seoul, Korea
Younjeong Lee & Hernsoo Hahn
Dept. of Internet Broadcasting, Dong-Ah Broadcasting College, Anseong, Korea
Joohun Lee

Editor information

Editors and Affiliations

Gwangju Institute of Science and Technology (GIST), 1 Oryong-dong Buk-gu, 500-712, Gwangju, Korea
Yo-Sung Ho
Multimedia Security Lab, Korea University, Science Campus, 136-701, Seoul, Korea
Hyoung Joong Kim

About this paper

Cite this paper

Lee, Y., Lee, J., Hahn, H. (2005). A Study on the Relation Between the Frame Pruning and the Robust Speaker Identification with Multivariate t-Distribution. In: Ho, YS., Kim, H.J. (eds) Advances in Multimedia Information Processing - PCM 2005. PCM 2005. Lecture Notes in Computer Science, vol 3767. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11581772_70

DOI: https://doi.org/10.1007/11581772_70
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-30027-4
Online ISBN: 978-3-540-32130-9
eBook Packages: Computer Science, Computer Science (R0)