Parallel Implementation of a VQ-Based Text-Independent Speaker Identification

Soğanci, Ruhsar; Gürgen, Fikret; Topcuoğlu, Haluk

doi:10.1007/978-3-540-30198-1_30

Parallel Implementation of a VQ-Based Text-Independent Speaker Identification

Ruhsar Soğanci¹⁷,
Fikret Gürgen¹⁷ &
Haluk Topcuoğlu¹⁸

Conference paper

1413 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3261))

Abstract

This study presents parallel implementation of a vector quantization (VQ) based text-independent speaker identification system that uses Melfrequency cepstrum coefficients (MFCC) for feature extraction, Linde-Buzo-Gray (LBG) VQ algorithm for pattern matching and Euclidean distance for match score calculation. Comparing meaningful characteristics of voice samples and matching them with similar ones requires large amount of transformations and comparisons, which result in large memory usage and disk access. When the cost of computations is considered, it states the main motivation for a parallel speaker identification implementation, where the parallelism is achieved using domain decomposition. In this paper, we present a set of experiments using the YOHO speaker corpus and observe the effects of several parameters as VQ size, number of MFCC filter banks and threshold value. First we focus on the serial algorithm and improve the algorithm to give the best success rates and provide a strong base for parallel implementation, where a clear performance improvement on speedup is obtained.

Download to read the full chapter text

Chapter PDF

References

Campbell, J.P.: Speaker Recognition: A Tutorial. Proceedings of the IEEE 85(9) (September 1997)
Google Scholar
Quatieri, T.F.: Discrete-Time Speech Signal Processing: Principles and Practice. Prentice Hall, Englewood Cliffs (2001)
Google Scholar
Furui, S.: Digital Speech Processing, Synthesis and Recognition (February 2001) ISBN: 0824704525
Google Scholar
James, D., Hutter, H.P., Bimbot, F.: The CAVE Speaker Verification Project-Experiments on the YOHO and SESP corpora. In: 1st Inernational Conf. On AVBPA, Crans-Montana, Switzerland (1997)
Google Scholar
Pellom, B., Hansen, J.: An Efficient Scoring Algorithm for GMM based Speaker Identification. IEEE Signal Processing Letters 5(11), 281–284
Google Scholar
Park, A., Hazen, J.: ASR Dependent Techniques For Speaker Identification. In: Proceedings of the 7th Internatonal Conference on Spoken Kanguage Processing, Denver, Colorado, September 16-20, pp. 1337–1340 (2002)
Google Scholar
Zilca, R.D.: Text-independent speaker verification using covariance modeling. IEEE Signal Processing Letters 8(4) (April 2001)
Google Scholar
MPITB (2004), http://atc.ugr.es/javier-bin/mpitb_eng
MultiMatlab (2004), http://www.cs.cornell.edu/Info/People/lnt/multimatlab.html

Download references

Author information

Authors and Affiliations

Department of Computer Engineering, Boğaziçi University Bebek, Istanbul, Turkey
Ruhsar Soğanci & Fikret Gürgen
Department of Computer Engineering, Marmara University, Göztepe, İstanbul, Turkey
Haluk Topcuoğlu

Authors

Ruhsar Soğanci
View author publications
You can also search for this author in PubMed Google Scholar
Fikret Gürgen
View author publications
You can also search for this author in PubMed Google Scholar
Haluk Topcuoğlu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dokuz Eylül University, lzmir, Turkey
Tatyana Yakhno

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Soğanci, R., Gürgen, F., Topcuoğlu, H. (2004). Parallel Implementation of a VQ-Based Text-Independent Speaker Identification. In: Yakhno, T. (eds) Advances in Information Systems. ADVIS 2004. Lecture Notes in Computer Science, vol 3261. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30198-1_30

Download citation

DOI: https://doi.org/10.1007/978-3-540-30198-1_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23478-4
Online ISBN: 978-3-540-30198-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics