Gender-dependent and speaker-dependent speech enhancement | IEEE Conference Publication | IEEE Xplore

Gender-dependent and speaker-dependent speech enhancement


Abstract:

Our work introduces a speech enhancement technique that can explicitly incorporate prior information about the gender or speaker time-frequency characteristics in its formalism. We approximate the multimodal, clean speech linear spectrum magnitude with a mixture of Gaussian pdfs using the Expectation-Maximization (EM) algorithm. Subsequently, we apply the Bayesian inference framework to the degraded spectral coefficients and, by employing Minimum Mean Square Error (MMSE) estimation, we derive a closed-form solution for the spectral magnitude estimation task adapted to the spectral characteristics and noise variance of each band. We suggest that 2–3 minutes of phonetically balanced, non-degraded gender- or speaker-dependent speech is adequate to tune our algorithm. We demonstrate the benefit of using an enhancement technique tailored to a specific gender or speaker and propose its use in cases where message ambiguity is of critical importance. We evaluate our algorithm using Lynx helicopter noise and white Gaussian noise, both on the task of improving the quality of speech and in combination with a speech coder, and demonstrate its robustness at very low SNRs. Implementation code is available at: http://slt.wcl.ee.upatras.gr/potamitis/index.html
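
The abstract does not give the closed-form estimator itself, so the following is only a minimal sketch of the general idea for a single frequency band. It assumes a scalar Gaussian-mixture prior on the clean coefficient (already fitted, for example by EM) and additive zero-mean Gaussian noise of known variance; both modelling choices, and the hypothetical function name `gmm_mmse_estimate`, are assumptions made for illustration and may differ from the derivation in the paper. Under these assumptions the MMSE estimate is a responsibility-weighted sum of per-component Wiener-style posterior means.

```python
import numpy as np


def gmm_mmse_estimate(y, weights, means, variances, noise_var):
    """Posterior mean E[x | y] for y = x + n in one band.

    Assumptions (illustrative, not taken from the paper):
      x ~ sum_k weights[k] * N(means[k], variances[k])  (prior fitted beforehand)
      n ~ N(0, noise_var), independent of x
    """
    y = np.atleast_1d(np.asarray(y, dtype=float))[:, None]   # shape (N, 1)
    w = np.asarray(weights, dtype=float)[None, :]            # shape (1, K)
    mu = np.asarray(means, dtype=float)[None, :]
    var = np.asarray(variances, dtype=float)[None, :]

    # Each component predicts the noisy observation as N(mu_k, var_k + noise_var).
    total_var = var + noise_var

    # Log of w_k * N(y; mu_k, var_k + noise_var), normalised in a stable way
    # to obtain the posterior probability of each component given y.
    log_resp = (np.log(w)
                - 0.5 * np.log(2.0 * np.pi * total_var)
                - 0.5 * (y - mu) ** 2 / total_var)
    log_resp -= log_resp.max(axis=1, keepdims=True)
    resp = np.exp(log_resp)
    resp /= resp.sum(axis=1, keepdims=True)

    # Per-component posterior mean: a Wiener-style blend of observation and prior mean.
    comp_mean = (var * y + noise_var * mu) / total_var

    # MMSE estimate: responsibility-weighted average over components.
    return (resp * comp_mean).sum(axis=1)
```

In this sketch, a call such as `gmm_mmse_estimate(noisy_band, weights, means, variances, noise_var)` would be made once per frequency band with that band's own mixture parameters and noise variance. When the noise variance is small the estimate stays close to the observation, and when it is large the estimate falls back toward the prior mean.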
Date of Conference: 13-17 May 2002
Date Added to IEEE Xplore: 07 April 2011
Print ISBN: 0-7803-7402-9
Print ISSN: 1520-6149
Conference Location: Orlando, FL, USA
