HNM-based MFCC+F0 extractor applied to statistical speech synthesis

Abstract:

Currently, the statistical framework based on Hidden Markov Models (HMMs) plays a relevant role in speech synthesis, while voice conversion systems based on Gaussian Mixture Models (GMMs) are almost standard. In both cases, statistical modeling is applied to learn distributions of acoustic vectors extracted from speech signals, each vector containing a suitable parametric representation of one speech frame. The overall performance of the systems is often limited by the accuracy of the underlying speech parameterization and reconstruction method. The method presented in this paper allows accurate MFCC extraction and high-quality reconstruction of speech signals assuming a Harmonics plus Noise Model (HNM). Its suitability for high-quality HMM-based speech synthesis is shown through subjective tests.

Published in: 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Date of Conference: 22-27 May 2011

Date Added to IEEE Xplore: 11 July 2011

DOI: 10.1109/ICASSP.2011.5947411

Conference Location: Prague, Czech Republic
