Improving Quality of Voice Conversion Systems

Farhid, M.; Tinati, M. A.

doi:10.1007/978-3-540-89985-3_124

M. Farhid⁵ &
M. A. Tinati⁵

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 6))

Included in the following conference series:

Computer Society of Iran Computer Conference

920 Accesses

Abstract

New improvement scheme for voice conversion are proposed in this paper. We take Human factor cepstral coefficients (HFCC), a modification of MFCC that uses the known relationship between center frequency and critical bandwidth from human psychoacoustics to decouple filter bandwidth from filter spacing, as the basic feature. We propose U/V (Unvoiced/Voiced) decision rule such that two sets of codebooks are used to capture the difference between unvoiced and voiced segments of the source speaker. Moreover, we apply three schemes to refine the synthesized voice, including pitch refinement, energy equalization, and frame concatenation. The acceptable performance of the voice conversion system can be verified through ABX listening test and MOS grad.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Continuous vocoder applied in deep neural network based voice conversion

Article Open access 16 September 2019

Performance Analysis of LPC and MFCC Features in Voice Conversion Using Artificial Neural Networks

Voice Conversion Using Spectral Mapping and TD-PSOLA

References

Abe, M., Nakamura, S., Shikana, K., Kuwabara, H.: Voice Conversion though Vector Quantization. In: Proc. ICASSP, New York, USA, vol. 1, pp. 65–658 (1988)
Google Scholar
Stylianou, Y., Cappe, O., Moulines, E.: Continuous probabilistic transform for voice conversion. IEEE Trans. Speech Audio Proc. 6, 131–142 (1998)
Article Google Scholar
Skowronski, M.D.: Human Factor cepstral coefficient. Computational NeuroEngineering Laboratory University of Florida (2004)
Google Scholar
Lin, C.-Y., Roger Jang, J.-S.: New Refinement Schemes For Voice Convertion. IEEE, Los Alamitos (2003)
Google Scholar
Hassan, M.M.: A statistical mapping approach to voice conversion. Acoustics Soc. (December 2005)
Google Scholar

Download references

Author information

Authors and Affiliations

Tabriz University, Tabriz, Iran
M. Farhid & M. A. Tinati

Authors

M. Farhid
View author publications
You can also search for this author in PubMed Google Scholar
M. A. Tinati
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

IPM School of Computer Science, and Department of Computer Engineering, Sharif Unversity of Technology, Azadi Street, Tehran, Iran
Hamid Sarbazi-Azad
Department of Electrical and computer Engineering, UC Irvine, California, USA
Behrooz Parhami
Department of Computer Engineering, Sharif University of Technology,, Azadi Street, Tehran, Iran
Seyed-Ghassem Miremadi
Department of Computer Engineering, Sharif University of Technology, Azadi Street, Tehran, Iran
Shaahin Hessabi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Farhid, M., Tinati, M.A. (2008). Improving Quality of Voice Conversion Systems. In: Sarbazi-Azad, H., Parhami, B., Miremadi, SG., Hessabi, S. (eds) Advances in Computer Science and Engineering. CSICC 2008. Communications in Computer and Information Science, vol 6. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89985-3_124

Download citation

DOI: https://doi.org/10.1007/978-3-540-89985-3_124
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-89984-6
Online ISBN: 978-3-540-89985-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics