Conferences >ICASSP 2019 - 2019 IEEE Inter...

Voice Conversion with Cyclic Recurrent Neural Network and Fine-tuned Wavenet Vocoder

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

This paper presents a novel framework for providing high-quality parallel voice conversion (VC) using a cyclic recurrent neural network (RNN) and a finely tuned WaveNet v...Show More

Metadata

Abstract:

This paper presents a novel framework for providing high-quality parallel voice conversion (VC) using a cyclic recurrent neural network (RNN) and a finely tuned WaveNet vocoder. Using the proposed system, we are tackling the quality degradation issue faced by WaveNet when it is fed with estimated (oversmoothed) speech features, such as mel-cepstrum parameters predicted from a statistical model. In VC, providing predicted features to fine-tune a pretrained WaveNet model is not straightforward owing to the difference in time-sequence alignment. To overcome this problem, we propose the use of a cyclic spectral conversion network that is capable of performing a conversion flow, i.e., source-to-target, and a cyclic flow, i.e., generate self-predicted target speaker features, and is trained by using both the conversion and cyclic losses. The experimental results demonstrate that, overall, the proposed system significantly improves the converted speech, resulting in a mean opinion score of 3.79 and a speaker similarity score of 73.86%.

Published in: ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Date of Conference: 12-17 May 2019

Date Added to IEEE Xplore: 17 April 2019

ISBN Information:

ISSN Information:

DOI: 10.1109/ICASSP.2019.8682156

Conference Location: Brighton, UK

Contents

References is not available for this document.

Voice Conversion with Cyclic Recurrent Neural Network and Fine-tuned Wavenet Vocoder

Abstract:

Metadata

Abstract:

ISSN Information:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Voice Conversion with Cyclic Recurrent Neural Network and Fine-tuned Wavenet Vocoder

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

References

IEEE Account

Purchase Details

Profile Information

Need Help?