Abstract:
Multipulse Linear Predictive Coding [1] has been shown to produce natural sounding speech at relatively low bit rates. So far, this technique has mostly been used for spe...Show MoreMetadata
Abstract:
Multipulse Linear Predictive Coding [1] has been shown to produce natural sounding speech at relatively low bit rates. So far, this technique has mostly been used for speech transmission or storage. In this paper, we show that a multipulse LPC synthesizer can also be used in a text-to-speech system based on diphone concatenation. The main problem is how to manipulate the prosodic parameters required for speech synthesis, and it is addressed here by a two-step procedure. First, a speech signal with relatively flat pitch contour is obtained by multipulse synthesis of concatenated diphones. Then the prosodic parameters of this signal are corrected using a special purpose phase vocoder. This method produces French synthetic speech of fairly good naturalness.
Date of Conference: 26-29 April 1985
Date Added to IEEE Xplore: 29 January 2003