Abstract:
This paper presents a novel algorithm for learning a hierarchical dictionary in the short-time Fourier (STFT) domain, which can improve the performance of dictionary lear...Show MoreMetadata
Abstract:
This paper presents a novel algorithm for learning a hierarchical dictionary in the short-time Fourier (STFT) domain, which can improve the performance of dictionary learning (DL) based single-channel speech separation (SCSS). The goal of SCSS is to separate the underlying clean speeches from a signal mixture, which was often achieved by learning a pair of discriminative sub-dictionaries and sparsely coding the mixture speech signal over the dictionary pair. The case of 2 source speech signals is considered in this paper. Unfortunately, the existing DL approaches cannot avoid the source confusion drastically, i.e., when we sparsely represent the mixture signal over the dictionary pair, parts of the object speech component are explained by interferer speech dictionary atoms and vice-versa. In order to suppress more source confusion, we divide the training sets into two layers of components and learn hierarchical sub-dictionaries using different layers. Experimental testing is shown to verify the superior performance compared with other existing approaches.
Date of Conference: 29 June 2014 - 02 July 2014
Date Added to IEEE Xplore: 28 August 2014
Electronic ISBN:978-1-4799-4975-5
Print ISSN: 2373-0803