Multivariability speaker recognition database in Indian scenario

Haris B C; Pradhan, G.; Misra, A.; Prasanna, S. R. M.; Das, R. K.; Sinha, R.

doi:10.1007/s10772-012-9140-x

Multivariability speaker recognition database in Indian scenario

Published: 28 March 2012

Volume 15, pages 441–453, (2012)
Cite this article

International Journal of Speech Technology Aims and scope Submit manuscript

Haris B C¹,
G. Pradhan¹,
A. Misra¹,
S. R. M. Prasanna¹,
R. K. Das¹ &
…
R. Sinha¹

503 Accesses
28 Citations
Explore all metrics

Abstract

In this paper we describe the collection and organization of the speaker recognition database in Indian scenario named as IITG Multivariability Speaker Recognition Database. The database contains speech from 451 speakers speaking English and other Indian languages both in conversational and read speech styles recorded using various sensors in parallel under different environmental conditions. The database is organized into four phases on the basis of different conditions employed for the recording. The results of the initial studies conducted on a speaker verification system exploring the impact of mismatch in training and test conditions using the collected data are also included. A copy of this database can be obtained from the authors by contacting them.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A review on face recognition systems: recent approaches and challenges

Article 30 July 2020

Survey on Virtual Assistant: Google Assistant, Siri, Cortana, Alexa

Analyzing Multilingual Automatic Speech Recognition Systems Performance

References

Campbell, J. P., & Reynolds, D. A. (1999). Corpora for the evaluation of speaker recognition systems. In Proceedings of international conference on acoustics, speech and signal processing 1999 (ICASSP ’99).
Google Scholar
Dehak, N., Kenny, P., Dehak, R., Dumouchel, P., & Ouellet, P. (2011). Front-end factor analysis for speaker verification. IEEE Transactions on Audio, Speech, and Language Processing, 19(4), 788–798.
Article Google Scholar
Doddington, G. (1985). Speaker recognition-identifying people by their voices. Proceedings of the IEEE, 73(11), 1651–1664.
Article Google Scholar
Ganchev, T., Fakotakis, N., & Kokkinakis, G. (2005). Comparative evaluation of various mfcc implementations on the speaker verification task. In Proc. SPECOM (pp. 191–194).
Google Scholar
Haris B C, Pradhan, G., Misra, A., Shukla, S., Sinha, R., & Prasanna, S. R. M. (2011). Multi-variability speech database for robust speaker recognition. In Proceedings of national conference on communications (pp. 1–5).
Google Scholar
KTH Royal Institute of Technology. (2005). wavesurfer. http://www.speech.kth.se/wavesurfer/index2.html.
Martin, A. (2003). NIST 2003 speaker recognition evaluation plan, http://www.itl.nist.gov/iad/mig/tests/sre/2003/2003-spkrec-evalplan-v2.2.pdf.
Martin, A., Doddington, G., Kamm, T., Ordowski, M., & Przybocki, M. (1997). The DET curve in assessment of detection task performance. In Proceedings of Eurospeech ’97, Rhodes, Greece (pp. 1895–1898).
Google Scholar
Patil, H., & Basu, T. (2008). Development of speech corpora for speaker recognition research and evaluation in Indian languages. International Journal of Speech Technology, 11, 17–32.
Article Google Scholar
Patil, H., Prakash, D., Kar, B., Bhatta, B., & Basu, T. (2006). Corpora for speaker recognition research and evaluation in oriya. In Proceedings of IEEE international conference on industrial technology (pp. 2217–2222).
Chapter Google Scholar
Reynolds, D. (1996). The effects of handset variability on speaker recognition performance: experiments on the switchboard corpus. In Proceedings of IEEE international conference on acoustics, speech, and signal processing 1996 (ICASSP ’96) (Vol. 1, pp. 113–116).
Chapter Google Scholar
Reynolds, D., Zissman, M., Quatieri, T., O’Leary, G., & Carlson, B. (1995). The effects of telephone transmission degradations on speaker recognition performance. In Proceedings of IEEE international conference on acoustics, speech, and signal processing 1995 (ICASSP ’95) (Vol. 1, pp. 329–332).
Google Scholar
Reynolds, D. A. (2002). An overview of automatic speaker recognition technology. In Proceedings of IEEE international conference on acoustics, speech, and signal processing 2002 (ICASSP ’02) (Vol. 4, pp. IV–4072–IV–4075)
Google Scholar
Reynolds, D. A., Quatieri, T. F., & Dunn, R. B. (2000). Speaker verification using adapted Gaussian mixture models. Digital Signal Processing, 10(1–3), 19–41.
Article Google Scholar
Yin, S.-C., Rose, R., & Kenny, P. (2007). A joint factor analysis approach to progressive model adaptation in text-independent speaker verification. IEEE Transactions on Audio, Speech, and Language Processing, 15(7), 1999–2010.
Article Google Scholar
Young, S., Evermann, G., Gales, M., Kershaw, D., Moore, G., Odell, J., Ollason, D., Povey, D., Valtchev, V., & Woodland, P. (2006). The HTK book version 3.4. Cambridge: Cambridge University Engineering Department.
Google Scholar

Download references

Acknowledgement

This work has been supported by the project grant No. 12(4)/2009-ESD sponsored by the Department of Information Technology, Government of India. The authors sincerely thank the efforts of Mr. Akhilesh Shukla and Mr. Sumit Shukla for their effort towards the collection and processing of database.

Author information

Authors and Affiliations

Department of Electronics and Electrical Engineering, Indian Institute of Technology Guwahati, Guwahati, 781039, India
Haris B C, G. Pradhan, A. Misra, S. R. M. Prasanna, R. K. Das & R. Sinha

Authors

Haris B C
View author publications
You can also search for this author in PubMed Google Scholar
G. Pradhan
View author publications
You can also search for this author in PubMed Google Scholar
A. Misra
View author publications
You can also search for this author in PubMed Google Scholar
S. R. M. Prasanna
View author publications
You can also search for this author in PubMed Google Scholar
R. K. Das
View author publications
You can also search for this author in PubMed Google Scholar
R. Sinha
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to S. R. M. Prasanna.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Haris B C, Pradhan, G., Misra, A. et al. Multivariability speaker recognition database in Indian scenario. Int J Speech Technol 15, 441–453 (2012). https://doi.org/10.1007/s10772-012-9140-x

Download citation

Received: 13 November 2011
Accepted: 13 March 2012
Published: 28 March 2012
Issue Date: December 2012
DOI: https://doi.org/10.1007/s10772-012-9140-x

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Multivariability speaker recognition database in Indian scenario

Abstract

Access this article

Similar content being viewed by others

A review on face recognition systems: recent approaches and challenges

Survey on Virtual Assistant: Google Assistant, Siri, Cortana, Alexa

Analyzing Multilingual Automatic Speech Recognition Systems Performance

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Multivariability speaker recognition database in Indian scenario

Abstract

Access this article

Similar content being viewed by others

A review on face recognition systems: recent approaches and challenges

Survey on Virtual Assistant: Google Assistant, Siri, Cortana, Alexa

Analyzing Multilingual Automatic Speech Recognition Systems Performance

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation