Conferences >2008 IEEE International Confe...

GMM/SVM N-best speaker identification under mismatch channel conditions

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Under severe channel mismatch conditions, such as training with far-field speech and testing with telephone data, performance of speaker identification (SID) degrades sig...Show More

Metadata

Abstract:

Under severe channel mismatch conditions, such as training with far-field speech and testing with telephone data, performance of speaker identification (SID) degrades significantly, often below practical use. But for many SID tasks, it is sufficient to recognize an N-best list of speakers for further human analysis. We investigate N-best SID accuracy for matched (telephone/telephone) and mismatched (far-field/telephone) train/test channel conditions. Using an SVM-GMM supervector (GSV), pitch and formant frequency histograms (PFH) and cross-channel adaptation using cohorts, we reduced matched channel error rate by over 25% relative to the baseline (GMM-UBM), for top-1, and achieved mismatched N-best accuracy comparable to the baseline.

Published in: 2008 IEEE International Conference on Acoustics, Speech and Signal Processing

Date of Conference: 31 March 2008 - 04 April 2008

Date Added to IEEE Xplore: 12 May 2008

ISBN Information:

ISSN Information:

DOI: 10.1109/ICASSP.2008.4518563

Conference Location: Las Vegas, NV, USA

Contents

References is not available for this document.

GMM/SVM N-best speaker identification under mismatch channel conditions

Abstract:

Metadata

Abstract:

ISSN Information:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

GMM/SVM N-best speaker identification under mismatch channel conditions

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

References

IEEE Account

Purchase Details

Profile Information

Need Help?