Abstract:
State-of-the-art speaker-recognition systems suffer from significant performance loss under degraded speech conditions and acoustic mismatch between enrolment and test phases. Past international evaluation campaigns, such as the NIST speaker recognition evaluation (SRE), have partly addressed these challenges in some evaluation conditions. This work aims at further assessing and compensating for the effect of a wide variety of speech-degradation processes on speaker-recognition performance. We present an open-source simulator that generates degraded telephone, VoIP, and interview-speech recordings using a comprehensive list of narrow-band, wide-band, and audio codecs, together with a database of over 60 h of environmental noise recordings and over 100 impulse responses collected from publicly available data. We provide speaker-verification results obtained with an i-vector-based system using either a clean or a degraded PLDA back-end on a NIST SRE subset of data corrupted by the proposed simulator. While error rates increase considerably under degraded speech conditions, large relative equal error rate (EER) reductions are observed when the PLDA model is trained with a large number of degraded sessions per speaker.
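For illustration, a minimal sketch of two of the degradation types the abstract describes (reverberation via impulse-response convolution and additive environmental noise at a target SNR) is given below. This is not the authors' released simulator; the function names, the synthetic signals, and the omission of the codec pass-through are assumptions made purely for the example.

import numpy as np

def add_reverb(speech: np.ndarray, rir: np.ndarray) -> np.ndarray:
    """Convolve speech with a room impulse response and rescale to the original RMS."""
    reverbed = np.convolve(speech, rir)[: len(speech)]
    return reverbed * (np.sqrt(np.mean(speech ** 2)) /
                       (np.sqrt(np.mean(reverbed ** 2)) + 1e-12))

def add_noise(speech: np.ndarray, noise: np.ndarray, snr_db: float) -> np.ndarray:
    """Mix environmental noise into speech at the requested SNR (in dB)."""
    noise = np.resize(noise, len(speech))          # loop/truncate noise to speech length
    p_speech = np.mean(speech ** 2)
    p_noise = np.mean(noise ** 2) + 1e-12
    gain = np.sqrt(p_speech / (p_noise * 10 ** (snr_db / 10.0)))
    return speech + gain * noise

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    fs = 8000                                      # narrow-band telephone sampling rate
    clean = rng.standard_normal(fs * 3)            # placeholder for a clean speech signal
    rir = (np.exp(-np.arange(int(0.3 * fs)) / (0.05 * fs))
           * rng.standard_normal(int(0.3 * fs)))   # placeholder exponentially decaying RIR
    noise = rng.standard_normal(fs * 3)            # placeholder environmental noise recording
    degraded = add_noise(add_reverb(clean, rir), noise, snr_db=10.0)

In the paper's setting, such degraded copies of enrolment and test sessions feed the multi-condition PLDA training referred to in the abstract; the codec stage (narrow-band, wide-band, and audio codecs) would be applied on top of this chain.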
Published in: IEEE Signal Processing Letters (Volume: 23, Issue: 4, April 2016)
Referenced in: IEEE Biometrics Compendium