ABSTRACT
Convolutive blind source separation (BSS) refers the scenario that sources are recorded by multiple sensors in a reverberant environment, which can be depicted as a convolutive mixing model. A big task in convolutive BSS is to identify the number of the source before the sources are separated from mixtures. In this paper, it shows that this problem can be categorized as the detection of columns (hidden hyperlines) of the mixing matrix in frequency domain. Motivated by the observation that only one source is active, or locally dominant in the covariance domain, the hyperlines are first estimated by searching the optimal projection direction based on the locally Second Order of Statistics (SOSs) of mixtures. After the estimated hyperlines are estimated, a density-based clustering method is then proposed to evaluate the true number of hyperlines in terms of sorted scores, which is calculated from a product of the local density and the intracluster distance of hyperlines. Such scores are further utilized to automatically search the optimal estimate sources number by a gap-based detection method. Finally, the number with the highest proportion from a selected frequency bins is guaranteed as the estimated number of sources. The experiment results show that the proposed method achieves a stable performance in various of under-determined cases.
- K. Torkkola, "Blind separation of convolved sources based on information maximization," in IEEE Workshop Neural Netw. Signal Process., 1996, pp. 423--432. Google ScholarCross Ref
- M. S. Pedersen, J. Larsen, U. Kjems, and L. C. Parra, A Survey of Convolutive Blind Source Separation Methods. New York: Springer, 2007.Google Scholar
- S. Xie, L. Yang, J.-M. Yang, G. Zhou, and Y. Xiang, "Time-frequency approach to underdetermined blind source separation," IEEE Trans. Neural Netw. Learn. Syst., vol. 23, no. 2, pp. 306--316, Feb. 2012. Google ScholarCross Ref
- Z. Yang, G. Zhou, S. Xie, and S. Ding, "Blind spectral unmixing based on sparse nonnegative matrix factorization," IEEE Trans. Image Processing, vol. 20, no. 4, pp. 1112--1125, Apr. 2011. Google ScholarDigital Library
- Z. Yang, Y. Xiang, K. Xie, and Y. Lai, "Adaptive method for non-smooth nonnegative matrix factorization," IEEE Trans. Neural Networks and Learning Systems, to be appeared.Google Scholar
- L. Parra and C. Spence, "Convolutive blind separation of non-stationary sources," IEEE Trans. Speech Audio Process., vol. 8, no. 3, pp. 320--327, May 2000. Google ScholarCross Ref
- K. Rahbar and J. P. Reilly, "A frequency domain method for blind source separation of convolutive audio mixtures," IEEE Trans. Speech Audio Process., vol. 13, no. 5, pp. 832--844, Sept. 2005. Google ScholarCross Ref
- D. Nion, K. N. Mokios, N. D. Sidiropoulos, and A. Potamianos, "Batch and adaptive PARAFAC-based blind separation of convolutive speech mixtures," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 6, pp. 1193--1207, Aug. 2010. Google ScholarDigital Library
- P. Tichavsky and Z. Koldovsky, "Weight-adjusted tensor method for blind separation of underdetermined mixtures of nonstationary sources," IEEE Trans. Signal Process., vol. 59, no. 3, pp. 1037--1047, Mar. 2011. Google ScholarDigital Library
- P. Comon, "Blind identification and source separation in 2 x 3 underdetermined mixtures," IEEE Trans. Signal Process., vol. 52, no. 1, pp. 11--22, Jan. 2004. Google ScholarDigital Library
- X. Fu, W. K. Ma, K. Huang, and N. D. Sidiropoulos, "Blind separation of quasi-stationary sources: exploiting convex geometry in covariance domain," IEEE Trans. Signal Process., vol. 63, no. 9, pp. 2306--2320, May. 2015.Google ScholarDigital Library
- V. G. Reju, S. N. Koh, and I. Y. Soon, "Underdetermined convolutive blind source separation via time-frequency masking," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 1, pp. 101--116, Jan. 2010. Google ScholarDigital Library
- H. Sawada, S. Winter, R. Mukai, S. Araki, and S. Makino, "Estimating the number of sources for frequency-domain blind source separation," in ICA 2004, 2004, pp. 610--617.Google Scholar
- O. Yilmaz and S. Rickard, "Blind separation of speech mixtures via time-frequency masking," IEEE Trans. Signal Process., vol. 52, no. 7, pp. 1830--1847, Jul. 2004. Google ScholarDigital Library
- Z. Yang, Y. Xiang, S. Xie, S. Ding' and Y. Rong, "Nonnegative blind source separation by sparse component analysis based on determinant measure," IEEE Trans. Neural Networks and Learning Systems, vol. 23, no. 10, pp. 1601--1610, Oct. 2012. Google ScholarCross Ref
- Z. Yang, Y. Zhang, W. Yan, Y. Xiang, and S. Xie, "A fast non-smooth nonnegative matrix factorization for learning sparse representation," IEEE Access, vol. 4, pp. 5161--5168, 2016. Google ScholarCross Ref
- Rodriguez, Alex, and L. Alessandro, "Clustering by fast search and find of density peaks," Science, vol. 344, no. 6191, pp. 1492--1496, 2014. Google ScholarCross Ref
- Z. He, A. Cichocki, S. Xie, and K. Choi:, "Detecting the number of clusters in n-way probabilistic clustering," IEEE Trans. Pattern Anal. Mach. Intell., vol. 32, no. 11, pp. 2006--2021, 2010. Google ScholarDigital Library
- D. Nion, "Blind source separation (BSS) of convolutive speech mixtures," http://dimitri.nion.free.fr/bss/BSS.html, 2010, [Online]Google Scholar
Recommendations
MIMO-AR system identification and blind source separation for GMM-distributed sources
The problem of blind source separation (BSS) and system identification for multiple-input multiple-output (MIMO) auto-regressive (AR) mixtures is addressed in this paper. Two new time-domain algorithms for system identification and BSS are proposed ...
Blind Source Separation Based on Cumulants With Time and Frequency Non-Properties
This paper presents new results on blind separation of instantaneously mixed independent sources based on high-order statistics together with their time and frequency non-properties (i.e., the non-stationarity and non-whiteness of sources). Separation ...
Blind Source Separation of Postnonlinear Convolutive Mixture
In this paper, a novel solution is developed to solve blind source separation of postnonlinear convolutive mixtures. The proposed model extends the conventional linear instantaneous mixture model to include both convolutive mixing and postnonlinear ...
Comments