Loading [a11y]/accessibility-menu.js
Linear demixed domain multichannel nonnegative matrix factorization for speech enhancement | IEEE Conference Publication | IEEE Xplore

Linear demixed domain multichannel nonnegative matrix factorization for speech enhancement


Abstract:

In this paper, we investigate blind source separation for audio signals based on multichannel nonnegative matrix factorization (MNMF) of magnitude spectrograms in a linea...Show More

Abstract:

In this paper, we investigate blind source separation for audio signals based on multichannel nonnegative matrix factorization (MNMF) of magnitude spectrograms in a linear demixed domain. The original magnitude MNMF by itself is less effective in general acoustic situations because it discards mutual information between input channels, which is represented by non-diagonal complex elements of the spatial covariance matrices of them. To deal with this problem, several linear transformations of the multichannel input have been proposed in order to diagonalize the covariance matrices without loss of the mutual information. However, when the number of microphones is small, it is difficult for static transformations to work well for various combinations of source positions. For this problem, we first prove that general linear transformations (linear demixing) can be applied as preprocessing of the magnitude MNMF, and then confirm that a transformation adaptive to source positions, such as using frequency domain independent component analysis, is better than the conventional static transformation by experimental comparison of 2- and 4-channel noisy speech enhancement tasks.
Date of Conference: 05-09 March 2017
Date Added to IEEE Xplore: 19 June 2017
ISBN Information:
Electronic ISSN: 2379-190X
Conference Location: New Orleans, LA, USA

Contact IEEE to Subscribe

References

References is not available for this document.