Critical Band Subspace-Based Speech Enhancement Using SNR and Auditory Masking Aware Technique

Jia-Ching WANG
Hsiao-Ping LEE
Jhing-Fa WANG
Chung-Hsien YANG

Publication
IEICE TRANSACTIONS on Information and Systems, Vol.E90-D, No.7, pp.1055-1062
Publication Date: 2007/07/01
Online ISSN: 1745-1361
DOI: 10.1093/ietisy/e90-d.7.1055
Print ISSN: 0916-8532
Type of Manuscript: PAPER
Category: Speech and Hearing
Keyword: human auditory system, Karhunen-Loeve transform (KLT), in-car noise, signal subspace, speech enhancement, perceptual filterbank, wavelet transform




Summary: 
In this paper, a new subspace-based speech enhancement algorithm is presented. First, we construct a perceptual filterbank from a psychoacoustic model and incorporate it into the subspace-based enhancement approach. The filterbank is built through a five-level wavelet packet decomposition. The masking properties of the human auditory system are then derived from the perceptual filterbank. Finally, the prior SNR and the masking threshold of each critical band are used to determine the attenuation factor of the optimal linear estimator. Five types of in-car noise from the TAICAR database were used in our evaluation. The experimental results demonstrated that our approach outperformed conventional subspace and spectral subtraction methods.
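
As an illustration of the five-level wavelet packet decomposition mentioned in the summary, the following is a minimal sketch only, not the authors' implementation. It assumes an orthonormal Haar wavelet and a full (unpruned) tree, neither of which is stated in the summary; the function names `haar_split`, `wavelet_packet`, and `band_energies` and the 256-sample test frame are hypothetical.

```python
import math

def haar_split(x):
    """One orthonormal Haar analysis step: returns (lowpass, highpass) halves.
    The Haar wavelet is an assumption; the summary does not name the wavelet."""
    s = 1.0 / math.sqrt(2.0)
    low = [(x[i] + x[i + 1]) * s for i in range(0, len(x) - 1, 2)]
    high = [(x[i] - x[i + 1]) * s for i in range(0, len(x) - 1, 2)]
    return low, high

def wavelet_packet(x, levels=5):
    """Full wavelet packet tree: every node is split again at every level."""
    if len(x) % (2 ** levels) != 0:
        raise ValueError("frame length must be a multiple of 2**levels")
    nodes = [list(x)]
    for _ in range(levels):
        next_nodes = []
        for node in nodes:
            low, high = haar_split(node)
            next_nodes.extend([low, high])
        nodes = next_nodes
    # 2**levels leaf subbands in natural tree order (not strict frequency order)
    return nodes

def band_energies(nodes):
    """Energy of each leaf subband (sum of squared coefficients)."""
    return [sum(c * c for c in node) for node in nodes]

# Hypothetical test frame: 5 cycles of a sine over 256 samples.
frame = [math.sin(2 * math.pi * 5 * n / 256) for n in range(256)]
bands = wavelet_packet(frame, levels=5)
energies = band_energies(bands)
```

A full five-level tree yields 32 uniform subbands. A perceptual filterbank would presumably keep or merge nodes so that the resulting bands approximate critical bands; that pruning rule is not given in the summary, so it is not shown here.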

