Abstract
The paper proposes the sub-band main peak frequencies( SMPF) for speaker identification (SI). The SMPF could be derived from the sub-band first formant frequencies by all-pole model of speech signal. Compared with MFCC features for SI based on a Gaussian mixture model (GMM), only SMPF features for SI is better than only the MFCC, with one of improved relative rate up to 15%. Experimental utterances are Chinese mandarin under clean background recording circumstances.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Reynolds, D.A., Rose, R.C.: Robust Text-Independent Speaker Identification Using Gaussian Mixture Speaker Models. IEEE Transactions Speech Audio Processing 3(1), 72–83 (1995)
Parham, A., Guangji, S., Maryam, M.S., Seyed, A.B.: Phase-based Speech Processing. World Scientific, USA (2006)
Kuldip, K.P., Leigh, D.A.: On the Usefulness of STFT phase spectrum in human listening test. Speech Communication 45, 153–170 (2005)
Leigh, D.A., Kuldip, K.P.: Iterative reconstruction of speech from short-time Fourier transform phase and magnitude spectra. Computer Speech and Language 21(1), 174–186 (2007)
Leigh, D.A., Kuldip, K.P.: Short-time phase spectrum in speech processing: A review and some experimental results. Digital Signal Processing: A Review Journal 17(3), 578–616 (2007)
Vibha, T., Jyoti, S.: AM-FM Features and Their Application to Noise Robust Speech Recognition: A Review. The IUP Journal of Telecommunications 2(1), 7–19 (2010)
Limin, H., Juanmin, X.: A New Approach to Extract Formant Instantaneous Characteristics for Speaker Identification. International Journal of Computer Information System and Management Applications 1, 295–302 (2009)
Limin, H., Juanmin, X.: Compensating function of Formant Instantaneous Characteristics in Speaker Identification. In: Fifth International Conference on Information Assurance and Security, IAS 2009, pp. 744–750 (2009)
Limin, H., Xiaoning, H., Juanmin, X.: Formant Instantaneous Characteristics application to Speech Recognition and Speaker Identification. Journal of Shanghai University 15(2), 123–127 (2011)
Marco, G., Fred, C.: Speaker Identification Using Instantaneous Frequencies. IEEE Trans. Speech and Language Processing 16(6), 1097–1111 (2008)
Thiruvaran, T., Ambikairajah, E., Epps, J.: Extraction of FM components from speech signals using all-pole model. Electronics Letters 44(6), 449–450 (2008)
Thiruvaran, T., Nosratighods, M., Ambikairajah, E., Epps, J.: Computationally efficient frame-averaged FM feature extraction for speaker recognition. Electronics Letters 45(6), 335–337 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hou, L., Xie, J., Xie, S. (2011). Sub-band Main Peak Frequency Application for Speaker Identification. In: Sun, Z., Lai, J., Chen, X., Tan, T. (eds) Biometric Recognition. CCBR 2011. Lecture Notes in Computer Science, vol 7098. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25449-9_23
Download citation
DOI: https://doi.org/10.1007/978-3-642-25449-9_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25448-2
Online ISBN: 978-3-642-25449-9
eBook Packages: Computer ScienceComputer Science (R0)