Features Extracted Using Frequency-Time Analysis Approach from Nyquist Filter Bank and Gaussian Filter Bank for Text-Independent Speaker Identification

Sen, Nirmalya; Basu, T. K

doi:10.1007/978-3-642-19530-3_12

Features Extracted Using Frequency-Time Analysis Approach from Nyquist Filter Bank and Gaussian Filter Bank for Text-Independent Speaker Identification

Nirmalya Sen²¹ &
T. K Basu²²

Conference paper

834 Accesses
1 Citations

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 6583))

Abstract

This paper compares the feature sets extracted using frequency-time analysis approach and time-frequency analysis approach for text-independent speaker identification. The impetus for the frequency-time analysis approach comes from the band pass filtering view of STFT. Nyquist filter bank and Gaussian filter bank both have been used for extracting features using frequency-time analysis approach. Experimental evaluation was conducted on the POLYCOST database with 130 speakers using Gaussian mixture speaker model. Results reveal that, the feature sets extracted using frequency-time analysis approach performs significantly better compared to the feature set extracted using time-frequency analysis approach.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Davis, S.B., Mermelsteine, P.: Comparison of parametric representation for monosyllabic word recognition in continuously spoken sentences. IEEE Trans. Acoust., Speech, Signal Processing ASSP-28(4), 357–365 (1980)
Article Google Scholar
Reynolds, D.A., Rose, R.C.: Robust Text-Independent speaker identification using Gaussian mixture speaker models. IEEE Trans. Speech and Audio Processing 3(1), 72–83 (1995)
Article Google Scholar
Hayakawa, S., Itakura, F.: Text-dependent speaker recognition using the information in the higher frequency band. In: ICASSP 1994, pp. 137–140 (1994)
Google Scholar
Besacier, L., Bonastre, J.-F.: Subband architecture for automatic speaker recognition. Signal Processing 80(7), 1245–1259 (2000)
Article MATH Google Scholar
Lu, X., Dang, J.: An investigation of dependencies between frequency components and speaker characteristics for text-independent speaker identification. Speech Communication 50, 312–322 (2008)
Article Google Scholar
Chakroborty, S., Roy, A., Saha, G.: Improved closed set text-independent speaker identification by combining MFCC with evidence from flipped filter banks. International Journal of Signal Processing 4(2), 1304–4478 (2007) ISSN 1304-4478
Google Scholar
Sen, N., Basu, T.K.: A New Nyquist window with near optimal time-bandwidth product. In: IEEE Conference INDICON (2009)
Google Scholar
Sen, N., Patil, H.A., Basu, T.K.: A New transform for robust Text-Independent speaker identification. In: IEEE Conference INDICON (2009)
Google Scholar
Sen, N., Basu, T.K., Patil, H.A.: Significant improvement in the closed set text-independent speaker identification using features extracted from Nyquist filterbank. In: 5th International Conference on Industrial and Information Systems, ICIIS 2010, pp. 61–66 (2010)
Google Scholar
Quatieri, T.F.: Discrete-Time Speech Signal Processing Principles and Practice. Pearson Education, London
Google Scholar
Haykin, S., Veen, B.V.: Signals and Systems. John Wiley & Sons, Inc., Chichester (2001)
MATH Google Scholar
Petrovska, D., et al.: POLYCOST: A Telephonic speech database for speaker recognition. In: RLA2C, Avignon, France, April 20-23, pp. 211–214 (1998)
Google Scholar

Download references

Author information

Authors and Affiliations

Signal Processing Research Group, C.E.T, IIT Kharagpur, India
Nirmalya Sen
Electrical Engineering Department, IIT Kharagpur, India
T. K Basu

Authors

Nirmalya Sen
View author publications
You can also search for this author in PubMed Google Scholar
T. K Basu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Bradenburg University of Applied Sciences, 14737, Brandenburg an der Havel, Germany
Claus Vielhauer
Faculty of Computer Science, Otto-von-Guericke-University Magdeburg, 39016, Magdeburg, Germany
Jana Dittmann
Swiss Federal Institute of Technology Lausanne (EPFL), 1015, Lausanne, Switzerland
Andrzej Drygajlo
Roskilde University, 4000, Roskilde, Denmark
Niels Christian Juul
Department of Electronics, University of Kent, CT2 7NT, Canterbury, Kent, UK
Michael C. Fairhurst

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sen, N., Basu, T.K. (2011). Features Extracted Using Frequency-Time Analysis Approach from Nyquist Filter Bank and Gaussian Filter Bank for Text-Independent Speaker Identification. In: Vielhauer, C., Dittmann, J., Drygajlo, A., Juul, N.C., Fairhurst, M.C. (eds) Biometrics and ID Management. BioID 2011. Lecture Notes in Computer Science, vol 6583. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19530-3_12

Download citation

DOI: https://doi.org/10.1007/978-3-642-19530-3_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19529-7
Online ISBN: 978-3-642-19530-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics