Multilingual Speaker Identification with the Constraint of Limited Data Using Multitaper MFCC

Nagaraja, B. G.; Jayanna, H. S.

doi:10.1007/978-3-642-34135-9_13

B. G. Nagaraja⁵ &
H. S. Jayanna⁵

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 335))

Included in the following conference series:

International Conference on Security in Computer Networks and Distributed Systems

1333 Accesses

Abstract

Feature extraction has the ability to improve the performance of speaker identification systems. This paper studies the significance of low-variance multitaper Mel-frequency cepstral coefficient (multitaper MFCC) features for Multilingual speaker identification with the constraint of limited data. The speaker identification study is conducted using 30 speakers of our own database. Sine-weighted cepstrum estimator (SWCE) taper MFCC features are extracted and modeled using Gaussian Mixture Model (GMM)-Universal Background Model (UBM). The results show that the multitaper MFCC approach performs better than the conventional Hamming window MFCC technique in all the speaker identification experiments.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Significance of Frequency Band Selection of MFCC for Text-Independent Speaker Identification

Improving short utterance speaker verification by combining MFCC and Entrocy in Noisy conditions

Article 25 March 2021

Language and Text-Independent Speaker Recognition System Using Energy Spectrum and MFCCs

References

Salman, A., Muhammad, E., Khurshid, K.: Speaker Verification using Boosted Cepstral Features with Gaussian Distributions. In: Proc. IEEE, INMIC 2007, pp. 1–5 (2007)
Google Scholar
Reynolds, D.A., Rose, R.C.: Robust Text-Independent Speaker Identification Using Gaussian Mixture Speaker Models. IEEE Trans. Speech and Audio Processing 3, 72–83 (1995)
Article Google Scholar
Jayanna, H.S., Mahadeva Prasanna, S.R.: Analysis, Feature Extraction, Modeling and Testing techniques for Speaker Recognition. IETE Technical Review 26, 181–190 (2009)
Article Google Scholar
Arjun, P.H.: Speaker Recognition in Indian Languages: A Feature Based Approach. Ph.D. dissertation, Indian Institute of Technology Kharagpur, INDIA (July 2005)
Google Scholar
Jayanna, H.S.: Limited data Speaker Recognition. Ph.D. dissertation, Indian Institute of Technology, Guwahati, INDIA (November 2009)
Google Scholar
Kinnunen, T., Saeidi, R., Sandberg, J., Hansson-Sandsten, M.: What Else is New Than the HammingWindow? Robust MFCCs for Speaker Recognition via Multitapering. In: Proc. Interspeech 2010, pp. 2734–2737 (September 2010)
Google Scholar
Durou, G.: Multilingual text-independent speaker identification. In: Proc. MIST 1999 Workshop, Leusden, Netherlands, pp. 115–118 (1999)
Google Scholar
Pandey, B., Ranjan, A., Kumar, R., Shukla, A.: Multilingual Speaker Recognition Using ANFIS. In: Proc. IEEE, ICSPS, vol. 3, pp. 714–718 (2010)
Google Scholar
Nagaraja, B.G., Jayanna, H.S.: Multi-lingual Speaker Identification with the constraint of Limited data. Accepted for publication in Proc. ICAdC 2012, MSRIT, Bengaluru. Springer (July 2012)
Google Scholar
Sandberg, J., Hansson-Sandsten, M., Kinnunen, T., Saeidi, R., Flandrin, P., Borgnat, P.: Multitaper Estimation of Frequency-Warped Cepstra With Application to Speaker Verification. IEEE Signal Processing Letters 17, 343–346 (2010)
Article Google Scholar
Alam, M.J., Kinnunen, T., Kenny, P., Ouellet, P., O’Shaughnessy, D.: Multi-taper MFCC Features for Speaker Verification using I-vectors. In: Proc. IEEE, ASRU 2011, pp. 547–552 (December 2011)
Google Scholar
Kinnunen, T., Saeidi, R., Sedlák, F., Lee, K.A., Sandberg, J., Hansson-Sandsten, M., Li, H.: Low-Variance Multitaper MFCC Features: A Case Study in Robust Speaker Verification. IEEE Transaction on Audio, Speech and Language Processing 20, 1990–2001 (2012)
Article Google Scholar
Percival, D.B., Walden, A.T.: Spectral Analysis for Physical Applications. Cambridge Univ. Press, Cambridge (1993)
Book MATH Google Scholar
Thomson, D.J.: Spectrum estimation and harmonic analysis. Proc. IEEE 70, 1055–1096 (1982)
Article Google Scholar
Riedel, K.S., Sidorenko, A.: Minimum bias multiple taper spectral estimation. IEEE Trans. Signal Process. 43, 188–195 (1995)
Article Google Scholar
Ku, J.M.K., Ambikairajan, E., Epps, J., Togneri, R.: Speaker Verification Using Sparse Representation Classification. In: Proc. IEEE, ICASSP, pp. 4548–4551 (2011)
Google Scholar
Hosseinzadeh, D., Krishnan, S.: Combining Vocal Source and MFCC Features for Enhanced Speaker Recognition Performance Using GMMs. In: Proc. IEEE, MMSP 2007, pp. 365–368 (October 2007)
Google Scholar
Reynolds, D.: Universal Background Models. Encyclopedia of Biometric Recognition, Journal Article (February 2008)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Information Science and Engineering, Siddaganga Institute of Technology, Tumkur, 572103, India
B. G. Nagaraja & H. S. Jayanna

Authors

B. G. Nagaraja
View author publications
You can also search for this author in PubMed Google Scholar
H. S. Jayanna
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Indian Institute of Information Technology and Management, Technopark Campus, 695581, Trivandrum, Kerala, India
Sabu M. Thampi & Tony Thomas &
School of Information Technologies, The University of Sydney, Building J12, 2006, Sydney, NSW, Australia
Albert Y. Zomaya
FG Peer-to-Peer-Netzwerke, TU Darmstadt - FB 20, Hochschulstr. 10, 64289, Darmstadt, Germany
Thorsten Strufe
Hewlett-Packard Laboratories, Stoke Gifford, BS34 8QZ, Bristol, UK
Jose M. Alcaraz Calero

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Nagaraja, B.G., Jayanna, H.S. (2012). Multilingual Speaker Identification with the Constraint of Limited Data Using Multitaper MFCC. In: Thampi, S.M., Zomaya, A.Y., Strufe, T., Alcaraz Calero, J.M., Thomas, T. (eds) Recent Trends in Computer Networks and Distributed Systems Security. SNDS 2012. Communications in Computer and Information Science, vol 335. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34135-9_13

Download citation

DOI: https://doi.org/10.1007/978-3-642-34135-9_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34134-2
Online ISBN: 978-3-642-34135-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics