Sub-band Main Peak Frequency Application for Speaker Identification

Hou, Limin; Xie, Juanmin; Xie, Su

doi:10.1007/978-3-642-25449-9_23

Limin Hou¹⁹,
Juanmin Xie¹⁹ &
Su Xie¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7098))

Included in the following conference series:

Chinese Conference on Biometric Recognition

1588 Accesses

Abstract

The paper proposes the sub-band main peak frequencies( SMPF) for speaker identification (SI). The SMPF could be derived from the sub-band first formant frequencies by all-pole model of speech signal. Compared with MFCC features for SI based on a Gaussian mixture model (GMM), only SMPF features for SI is better than only the MFCC, with one of improved relative rate up to 15%. Experimental utterances are Chinese mandarin under clean background recording circumstances.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Multitaper MFCC and normalized multitaper phase-based features for speaker verification

Article 02 March 2019

A Comparative Study on Effect of Temporal Phase for Speaker Verification

Performance comparison of multitaper techniques for speaker verification with expressive speech

Article 27 November 2017

References

Reynolds, D.A., Rose, R.C.: Robust Text-Independent Speaker Identification Using Gaussian Mixture Speaker Models. IEEE Transactions Speech Audio Processing 3(1), 72–83 (1995)
Article Google Scholar
Parham, A., Guangji, S., Maryam, M.S., Seyed, A.B.: Phase-based Speech Processing. World Scientific, USA (2006)
Google Scholar
Kuldip, K.P., Leigh, D.A.: On the Usefulness of STFT phase spectrum in human listening test. Speech Communication 45, 153–170 (2005)
Article Google Scholar
Leigh, D.A., Kuldip, K.P.: Iterative reconstruction of speech from short-time Fourier transform phase and magnitude spectra. Computer Speech and Language 21(1), 174–186 (2007)
Article Google Scholar
Leigh, D.A., Kuldip, K.P.: Short-time phase spectrum in speech processing: A review and some experimental results. Digital Signal Processing: A Review Journal 17(3), 578–616 (2007)
Article Google Scholar
Vibha, T., Jyoti, S.: AM-FM Features and Their Application to Noise Robust Speech Recognition: A Review. The IUP Journal of Telecommunications 2(1), 7–19 (2010)
Google Scholar
Limin, H., Juanmin, X.: A New Approach to Extract Formant Instantaneous Characteristics for Speaker Identification. International Journal of Computer Information System and Management Applications 1, 295–302 (2009)
Google Scholar
Limin, H., Juanmin, X.: Compensating function of Formant Instantaneous Characteristics in Speaker Identification. In: Fifth International Conference on Information Assurance and Security, IAS 2009, pp. 744–750 (2009)
Google Scholar
Limin, H., Xiaoning, H., Juanmin, X.: Formant Instantaneous Characteristics application to Speech Recognition and Speaker Identification. Journal of Shanghai University 15(2), 123–127 (2011)
Article Google Scholar
Marco, G., Fred, C.: Speaker Identification Using Instantaneous Frequencies. IEEE Trans. Speech and Language Processing 16(6), 1097–1111 (2008)
Article Google Scholar
Thiruvaran, T., Ambikairajah, E., Epps, J.: Extraction of FM components from speech signals using all-pole model. Electronics Letters 44(6), 449–450 (2008)
Article Google Scholar
Thiruvaran, T., Nosratighods, M., Ambikairajah, E., Epps, J.: Computationally efficient frame-averaged FM feature extraction for speaker recognition. Electronics Letters 45(6), 335–337 (2009)
Article Google Scholar

Download references

Author information

Authors and Affiliations

School of Communication and Information Engineering, Shanghai University, 149 Yanchang Road, Shanghai, China
Limin Hou, Juanmin Xie & Su Xie

Authors

Limin Hou
View author publications
You can also search for this author in PubMed Google Scholar
Juanmin Xie
View author publications
You can also search for this author in PubMed Google Scholar
Su Xie
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

National Laboratory of Pattern Recognition, Center for Biometrics and Security Research, Chinese Academy of Sciences, Institute of Automation, P.O. Box 2728, 100190, Beijing, China
Zhenan Sun & Tieniu Tan &
School of Information Science and Technolog, Sun Yat-Sen University, 510275, Guangzhou, China
Jianhuang Lai
Institute of Computing Technology, Chinese Academy of Sciences, 100190, Beijing, China
Xilin Chen

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hou, L., Xie, J., Xie, S. (2011). Sub-band Main Peak Frequency Application for Speaker Identification. In: Sun, Z., Lai, J., Chen, X., Tan, T. (eds) Biometric Recognition. CCBR 2011. Lecture Notes in Computer Science, vol 7098. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25449-9_23

Download citation

DOI: https://doi.org/10.1007/978-3-642-25449-9_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25448-2
Online ISBN: 978-3-642-25449-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics