Conferences >2014 International Conference...

Speech re-synthesis from spectrogram image through sinusoidal modelling

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

A novel method to extract parameters i.e. frequencies and their bandwidth for intelligible speech synthesis is presented in the paper. The parameters are extracted from t...Show More

Metadata

Abstract:

A novel method to extract parameters i.e. frequencies and their bandwidth for intelligible speech synthesis is presented in the paper. The parameters are extracted from the spectrogram image of the pre-recorded male and female voice samples and used to re-synthesize speech by employing sinusoidal signals. The phase continuity is preserved by quantifying time-scale and identifying phase at temporal boundaries for a given frequency. The amplitude distribution of the sinusoidals follow Gaussian distribution and use frequency overlap to extend the bandwidth from 4 kHz to 6 kHz for the improvement in clarity of synthesized speech. The synthesized speech is further passed through a weighting filter to improve the envelope of re-synthesized time-domain signal. The synthesized speech is synthetic but noticeably intelligible.

Published in: 2014 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

Date of Conference: 24-27 September 2014

Date Added to IEEE Xplore: 01 December 2014

ISBN Information:

DOI: 10.1109/ICACCI.2014.6968501

Conference Location: Delhi, India