Speaker Verification Performance Evaluation Based on Open Source Speech Processing Software and TIMIT Speech Corpus

Kłosowski, Piotr; Dustor, Adam; Izydorczyk, Jacek

doi:10.1007/978-3-319-19419-6_38

Part of the book series: Communications in Computer and Information Science (CCIS, volume 522)

Included in the following conference series:

International Conference on Computer Networks

Abstract

Building a speaker recognition application requires advanced speech processing techniques implemented in specialized speech processing software. Speaker recognition research can be improved by using a speech processing platform based on open source software. This article presents an example of using open source speech processing software to perform speaker verification experiments designed to test various speaker recognition models under different scenarios. Speaker verification efficiency was evaluated for each scenario using the TIMIT speech corpus distributed by the Linguistic Data Consortium. The experimental results made it possible to compare the scenarios and select the best one for building a speaker model for a speaker verification application.
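The abstract does not say which metric was used to measure verification efficiency. One common choice in speaker verification is the equal error rate (EER), the operating point where the false acceptance rate equals the false rejection rate. The following is a minimal sketch of how scenarios could be compared on that basis, assuming each scenario yields a list of scores for genuine trials and a list for impostor trials. The function names, scenario labels, and scores are hypothetical and are not taken from the paper.

```python
from typing import Dict, Sequence, Tuple


def equal_error_rate(genuine: Sequence[float], impostor: Sequence[float]) -> float:
    """Approximate the EER by sweeping the threshold over all observed scores.

    A trial is accepted when its score is >= the threshold. The EER is taken
    at the threshold where FAR and FRR are closest, as their average.
    """
    thresholds = sorted(set(genuine) | set(impostor))
    best_gap, eer = float("inf"), 1.0
    for t in thresholds:
        far = sum(s >= t for s in impostor) / len(impostor)  # false acceptance rate
        frr = sum(s < t for s in genuine) / len(genuine)     # false rejection rate
        gap = abs(far - frr)
        if gap < best_gap:
            best_gap, eer = gap, (far + frr) / 2
    return eer


def best_scenario(
    scores: Dict[str, Tuple[Sequence[float], Sequence[float]]]
) -> Tuple[str, Dict[str, float]]:
    """Return the scenario with the lowest EER and the EER of every scenario."""
    eers = {name: equal_error_rate(g, i) for name, (g, i) in scores.items()}
    return min(eers, key=eers.get), eers


# Hypothetical scores for two illustrative scenarios (not data from the paper).
example_scores = {
    "scenario_A": ([0.9, 0.8, 0.7, 0.6], [0.1, 0.2, 0.3, 0.4]),
    "scenario_B": ([0.9, 0.8, 0.3, 0.7], [0.1, 0.2, 0.75, 0.4]),
}
winner, per_scenario = best_scenario(example_scores)
print(winner, per_scenario)
```

With so few trials the threshold sweep is coarse. A real evaluation would use many more trials per scenario and might interpolate between thresholds or report a full DET curve instead.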

Acknowledgements

This work was supported by The National Centre for Research and Development (www.ncbir.gov.pl) under Grant number POIG.01.03.01-24-107/12 (Innovative speaker recognition methodology for communications network safety).

Author information

Authors and Affiliations

Silesian University of Technology, Institute of Electronics, Akademicka Str. 16, 44-100, Gliwice, Poland
Piotr Kłosowski, Adam Dustor & Jacek Izydorczyk

Corresponding author

Correspondence to Piotr Kłosowski.

Editor information

Editors and Affiliations

Silesian University of Technology, Gliwice, Poland
Piotr Gaj
Silesian University of Technology, Gliwice, Poland
Andrzej Kwiecień
Silesian University of Technology, Gliwice, Poland
Piotr Stera

About this paper

Cite this paper

Kłosowski, P., Dustor, A., Izydorczyk, J. (2015). Speaker Verification Performance Evaluation Based on Open Source Speech Processing Software and TIMIT Speech Corpus. In: Gaj, P., Kwiecień, A., Stera, P. (eds) Computer Networks. CN 2015. Communications in Computer and Information Science, vol 522. Springer, Cham. https://doi.org/10.1007/978-3-319-19419-6_38

DOI: https://doi.org/10.1007/978-3-319-19419-6_38
Published: 28 May 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-19418-9
Online ISBN: 978-3-319-19419-6
eBook Packages: Computer Science, Computer Science (R0)
