Abstract
This work evaluates the performance of speaker verification system based on Wavelet based Fuzzy Learning Vector Quantization (WLVQ) algorithm. The parameters of Gaussian mixture model (GMM) are designed using this proposed algorithm. Mel Frequency Cepstral Coefficients (MFCC) are extracted from the speech data and vector quantized through Wavelet based FLVQ algorithm. This algorithm develops a multi resolution codebook by updating both winning and nonwinning prototypes through an unsupervised learning process. This codebook is used as mean vector of GMM. The other two parameters, weight and covariance are determined from the clusters formed by the WLVQ algorithm. The multi resolution property of wavelet transform and ability of FLVQ in regulating the competition between prototypes during learning are combined in this algorithm to develop an efficient codebook for GMM. Because of iterative nature of Expectation Maximization (EM) algorithm, the applicability of alternative training algorithms is worth investigation. In this work, the performance of speaker verification system using GMM trained by LVQ, FLVQ and WLVQ algorithms are evaluated and compared with EM algorithm. FLVQ and WLVQ based training algorithms for modeling speakers using GMM yields better performance than EM based GMM.
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs10772-013-9191-7/MediaObjects/10772_2013_9191_Fig1_HTML.gif)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs10772-013-9191-7/MediaObjects/10772_2013_9191_Fig2_HTML.gif)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs10772-013-9191-7/MediaObjects/10772_2013_9191_Fig3_HTML.gif)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs10772-013-9191-7/MediaObjects/10772_2013_9191_Fig4_HTML.gif)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs10772-013-9191-7/MediaObjects/10772_2013_9191_Fig5_HTML.gif)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs10772-013-9191-7/MediaObjects/10772_2013_9191_Fig6_HTML.gif)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs10772-013-9191-7/MediaObjects/10772_2013_9191_Fig7_HTML.gif)
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Bezdek, J. C. (1981). Pattern recognition with fuzzy objective function algorithms. Norwell: Kluwer Academic.
Chen, S.-H., & Luo, Y.-R. (2009). Speaker verification using MFCC and support vector machine. In Proceedings of the international multiconference of engineers and computer scientists, IMECS, Hong Kong, March 18–20 (Vol. 1).
Daniel, J., & Ho, W. (2001). Fuzzy wavelet networks for function learning. IEEE Transactions on Fuzzy Systems, 9(1), 200–211.
Delyon, B., Juditsky, A., & Benvensite, A. (1995). Accuracy analysis for wavelet approximations. IEEE Transactions on Neural Networks, 6, 332–348.
Engin, A. (2007). An Automatic System For Turkish Word Recognition Using Discreet Wavelet Neural Network Based on Adoptive Entropy. The Arabian Journal for Science and Engineering, 32(2B).
He, J., Liu, L., & Palm, G. (1997). A new codebook training algorithm for VQ-based speaker recognition. In IEEE international conference on acoustics, speech, and signal processing (ICASSP’97) (Vol. 2, p. 1091).
Jayanna, H. S., & Mahadeva Prasanna, S. R. (2009). An experimental comparison of modelling techniques for speaker recognition under limited data condition. Sâdhana, 34(3), 717–728.
Jun, W., Jian, X., Hong, P., & Xiumei, G. (2005). Constructing Fuzzy Wavelet Network Modeling. International Journal of Information Technology, 11(6).
Karayiannis, N. B., & Bezdek, J. C. (1997). An Integrated Approach to Fuzzy Learning Vector Quantization and Fuzzy-Means Clustering. IEEE Transactions on Fuzzy Systems, 5(4).
Karayiannis, N. B., & Pai, P.-I. (1996). Fuzzy algorithms for learning vector quantization. IEEE Transactions on Neural Networks, 7, 1196–1211.
Karayiannis, N. B., Pai, P.-I., & Zervos, N. (1998). Image compression based on fuzzy algorithms for learning vector quantization and wavelet image decomposition. IEEE Transactions on Image Processing, 7(8).
Kohonen, T. (1990). The self-organizing map. Proceedings of the IEEE, 78, 1464–1480.
Linde, Y., Buzo, A., & Gray, R. M. (1980). An algorithm for vector quantizer design. IEEE Transactions on Communications, 28, 84–95.
Pati, Y. C., & Krishnaprasad, P. S. (1993). Analysis and synthesis of feed forward neural networks using affine wavelet. IEEE Transactions on Neural Networks, 4(1), 73–75.
Raj Apsingekar, V., & De Leon, P. L. (2009). Speaker model clustering for efficient speaker identification in large population applications. IEEE Transactions on Audio, Speech, and Language Processing, 17(4).
Shaban Al-Ani, M., Sultan Mohammed, T., & Aljebory, K. M. (2007). Speaker identification: a hybrid approach using neural networks and wavelet transform. Journal of Computer Science, 3(5), 304–309.
Shashidhara, H. L., Lohani, S., & Gadre, V. M. (2000). Function learning using wavelet neural networks. In Proceeding of IEEE international conference on industrial technology (Vol. 2, pp. 335–340).
Soleymani Baghshah, M., Bagheri Souraki, S., & Kasaei, S. (2005). A novel fuzzy approach to recognition of OnlinePersian handwriting. In ISDA’05, Wroclaw, Poland, September 2005.
Tsao, E. C.-K., Bezdek, J. C., & Pal, N. R. (1994). Fuzzy Kohonen clustering networks. Pattern Recognition, 27(5), 757–764.
Wu, X., Fu, H., Wu, B., & Zhao, J. (2010). Possibilistic fuzzy learning vector quantization. Journal of Information & Computational Science, 7(3), 777–783.
Zhang, Q., & Benveniste, A. (1992). Wavelet networks. IEEE Transactions on Neural Networks, 3, 889–898.
Zhang, J., Walter, G. G., Miao, Y., & Lee, W. N. W. (1995). Wavelet neural networks for function learning. IEEE Transactions on Signal Processing, 43, 1485–1497.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Shanmugapriya, P., Venkataramani, Y. Wavelet fuzzy LVQ based speaker verification system. Int J Speech Technol 16, 403–412 (2013). https://doi.org/10.1007/s10772-013-9191-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10772-013-9191-7