A novel whispered speaker identification system based on extreme learning machine

Sangeetha, J.; Jayasankar, T.

doi:10.1007/s10772-017-9488-z

Published: 01 February 2018

Volume 21, pages 157–165 (2018)

International Journal of Speech Technology

J. Sangeetha¹ &
T. Jayasankar²

Abstract

Speaker identification from whispered speech is one of the most demanding tasks in automatic speaker recognition. Because the acoustic characteristics of neutral and whispered speech differ profoundly, the performance of conventional speaker identification systems designed for neutral speech degrades drastically when they are applied to whispered speech. This work presents a novel speaker identification system for whispered speech based on a learning algorithm known as the extreme learning machine (ELM). The features used in the proposed system are instantaneous frequency features with probability density models. Parametric and nonparametric probability density estimation with ELM is compared with hybrid parametric and nonparametric probability density estimation with ELM (HPNP-ELM) for instantaneous frequency modeling. The experimental results show a significant performance improvement for the proposed whispered speech speaker identification system.
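
For readers unfamiliar with the extreme learning machine, the following is a minimal sketch of the general technique as described by Huang et al. (2006): the hidden-layer weights are drawn at random and left untrained, and only the output weights are solved for by least squares. It is not the authors' implementation. The class name `ELMClassifier`, the helper `make_toy_data`, the sigmoid activation, and the hidden-layer size are assumptions made for illustration, and the Gaussian toy data is a hypothetical stand-in for the instantaneous frequency features, which the abstract does not specify in detail.

```python
import numpy as np


class ELMClassifier:
    """Single-hidden-layer ELM: random hidden weights, least-squares output weights."""

    def __init__(self, n_hidden=50, seed=0):
        self.n_hidden = n_hidden
        self.rng = np.random.default_rng(seed)

    def _hidden(self, X):
        # Sigmoid activation of the fixed random projection (assumed activation).
        return 1.0 / (1.0 + np.exp(-(X @ self.W + self.b)))

    def fit(self, X, y):
        X = np.asarray(X, dtype=float)
        y = np.asarray(y)
        self.classes_ = np.unique(y)
        # One-hot targets, one column per class (one class per speaker).
        T = (y[:, None] == self.classes_[None, :]).astype(float)
        # Input weights and biases are drawn once and never trained.
        self.W = self.rng.normal(size=(X.shape[1], self.n_hidden))
        self.b = self.rng.normal(size=self.n_hidden)
        H = self._hidden(X)
        # Output weights: minimum-norm least-squares solution beta = pinv(H) @ T.
        self.beta = np.linalg.pinv(H) @ T
        return self

    def predict(self, X):
        H = self._hidden(np.asarray(X, dtype=float))
        return self.classes_[np.argmax(H @ self.beta, axis=1)]


def make_toy_data(n_per_class=40, seed=1):
    """Hypothetical stand-in for per-speaker feature vectors: three Gaussian blobs."""
    rng = np.random.default_rng(seed)
    centers = np.array([[0.0, 0.0], [4.0, 4.0], [-4.0, 4.0]])
    X = np.vstack([c + rng.normal(scale=0.5, size=(n_per_class, 2)) for c in centers])
    y = np.repeat(np.arange(len(centers)), n_per_class)
    return X, y


X, y = make_toy_data()
model = ELMClassifier(n_hidden=30).fit(X, y)
train_accuracy = float(np.mean(model.predict(X) == y))
```

Because training reduces to a single pseudo-inverse, there is no iterative weight update. In a speaker identification setting, each class would correspond to one enrolled speaker and each row of `X` to a feature vector derived from that speaker's utterances.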

Author information

Authors and Affiliations

Department of IT/SOC, SASTRA Deemed University, Thanjavur, 613409, India
J. Sangeetha
Department of ECE, Anna University, BIT Campus, Tiruchirappalli, 620024, India
T. Jayasankar

Corresponding author

Correspondence to J. Sangeetha.

About this article

Cite this article

Sangeetha, J., Jayasankar, T. A novel whispered speaker identification system based on extreme learning machine. Int J Speech Technol 21, 157–165 (2018). https://doi.org/10.1007/s10772-017-9488-z

Received: 01 September 2017
Accepted: 26 December 2017
Published: 01 February 2018
Issue Date: March 2018
DOI: https://doi.org/10.1007/s10772-017-9488-z
