
Pattern Recognition

Volume 90, June 2019, Pages 464-475

Elastic nonnegative matrix factorization

https://doi.org/10.1016/j.patcog.2018.07.007

Highlights

  • ENMF introduces an elastic loss that takes advantage of both the Frobenius norm and the ℓ2,1 norm when the noise distribution is uncertain, making ENMF far less sensitive to noise and outliers.

  • ENMF takes the geometric information of the projected data points in the low dimensional manifold as feedback to construct the affinity graph, hence ENMF can handle the situation where a few exceptional data pairs are close in the original space but far away from each other in the manifold.

  • ENMF utilizes the exclusive LASSO to enhance the intra-cluster competition, and therefore the “winner” is more likely to stand out while the “loser” tends to recede in a sparse manner.

  • ENMF provides consistently better clustering results on several well-known data sets as compared to standard NMF and several other variants of the NMF algorithm.

Abstract

Nonnegative matrix factorization (NMF) plays a vital role in data mining and machine learning. Standard NMF utilizes the Frobenius norm while robust NMF uses the ℓ2,1-norm to measure the quality of factorization, under the assumptions of an i.i.d. Gaussian noise model and an i.i.d. Laplacian noise model, respectively. In this paper, we propose a novel elastic loss which interpolates adaptively between the Frobenius norm and the ℓ2,1-norm. Based on it, we derive an elastic NMF model guided by the elastic loss that incorporates manifold geometry information while enforcing sparsity of coefficients at the intra-cluster level via the ℓ1,2-norm. The new formulation is more robust to noise while preserving a stronger clustering capability. We propose an EM-like algorithm (using an auxiliary function) to solve the resultant optimization problem, whose convergence can be rigorously proved. Extensive experiments demonstrate the effectiveness of the novel elastic NMF model on benchmarks.

Introduction

In data mining and machine learning, the input data matrix in many applications is of very high dimension, hence dimensionality reduction is a crucial preprocessing step. The most widely used dimensionality reduction methods include Principal Component Analysis (PCA), Singular Value Decomposition (SVD), Non-negative Matrix Factorization (NMF), etc. Different from PCA and SVD, NMF seeks two nonnegative matrices whose product is approximately close to the original matrix. More specifically, given a nonnegative matrix $X \in \mathbb{R}^{p \times n}$ and $r \ll \min(p, n)$, X is approximately factorized into two nonnegative matrices $F \in \mathbb{R}^{p \times r}$ and $G \in \mathbb{R}^{r \times n}$. Thus, the original data point $x_i$ in the p-dimensional space is projected to $g_i$ in a lower r-dimensional subspace defined by the columns of F.
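For concreteness, the following is a minimal sketch of this factorization using the classical Frobenius-norm multiplicative updates of Lee and Seung (not the ENMF algorithm proposed in this paper; all names and parameters are illustrative):

    import numpy as np

    def nmf(X, r, n_iter=200, eps=1e-10):
        """Classical NMF with Frobenius-norm multiplicative updates.

        X : (p, n) nonnegative data matrix, one data point per column.
        r : target rank with r << min(p, n).
        Returns F of shape (p, r) and G of shape (r, n) with X ~= F @ G.
        """
        p, n = X.shape
        rng = np.random.default_rng(0)
        F = rng.random((p, r))
        G = rng.random((r, n))
        for _ in range(n_iter):
            # Multiplicative updates preserve nonnegativity of F and G.
            G *= (F.T @ X) / (F.T @ F @ G + eps)
            F *= (X @ G.T) / (F @ G @ G.T + eps)
        return F, G

Each column x_i of X is then represented by the coefficient vector g_i = G[:, i] in the r-dimensional subspace spanned by the columns of F.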

Since the initial work by Lee and Seung [1], recent years have seen expanding research on NMF. NMF has demonstrated its advantages in a variety of areas, such as document clustering [2], [3], multimedia data analysis [4], microarray data analysis [5], social network analysis [6], [7], single channel source separation [8], [9], [10], [11], visual tracking [12], audio source separation [13], [14], detecting topic hierarchies [15], graph matching [16], etc. NMF has also been extended algorithmically to accommodate a number of data analysis problems, e.g., consensus clustering [17], balanced clustering [18], [19], semi-supervised clustering [20], [21], classification [22], [23], multiple-domain learning [24], multi-kernel learning [25] and collaborative filtering [26].

Standard NMF, which utilizes the least square error function to measure the quality of factorization, is ideal for zero-mean Gaussian noise. However, data in many real-world applications may not satisfy this assumption. It has been proved that standard NMF is sensitive to outliers [27], which can have a significant impact on an objective function based on squared residual error. Robust matrix factorization with the ℓ1 norm in [28] reduces the negative impact posed by noise, but it is unable to preserve feature rotation invariance, which is required by many applications. Robust nonnegative matrix factorization using the ℓ2,1-norm (ℓ2,1-NMF) in [27] is ideal under the assumption of an i.i.d. Laplacian noise model. [29] utilizes a robust capped norm to handle extreme outliers. Several robust versions of NMF are proposed in [30], including NMF based on the Correntropy Induced Metric (CIM-NMF), row-based CIM-NMF (rCIM-NMF), and NMF based on the Huber function (Huber-NMF).

For many data sets in data mining and machine learning, there is a low dimensional manifold embedded in the high dimensional original space. Manifold learning algorithms including Locally Linear Embedding (LLE) [31], Isometric Mapping (ISOMAP) [32], Laplacian Eigenmap (LE) [33] and Locality Preserving Projections (LPP) [34] aim to detect the underlying manifold structure to improve learning performance. Several graph based clustering methods [35], [36], [37] have shown their effectiveness by exploiting the locally geometric structure of data. By learning a similarity graph with exactly r connected components (where r equals the number of clusters), the Constrained Laplacian Rank (CLR) [38] and Clustering with Adaptive Neighbors (CAN) [39] methods exhibit excellent performance. Graph Regularized NMF (GNMF) in [40] seeks a matrix factorization which respects the intrinsic geometry of the data. Rather than using a fixed graph as GNMF does, AdapGrNMF in [41] regularizes NMF with an adaptive graph constructed based on feature selection results. MultiGrNMF in [42] approximates the intrinsic manifold by a linear combination of several graphs with different models and parameters. The graph dual regularization nonnegative matrix factorization (DNMF) in [43] considers the geometric structures of the data manifold and the feature manifold together. The global discriminative-based nonnegative spectral clustering methods in [44] integrate the geometrical structure and discriminative structure in a joint framework. RMNMF in [45] integrates ℓ2,1-NMF and spectral clustering with an additional orthogonal constraint. In these algorithms, the geometrical information is usually encoded by an affinity graph, whose vertices represent the data points and whose edge weights indicate the affinity between data pairs. Meanwhile, the local invariance idea [46] is adopted, i.e., points that are nearby in the high dimensional original space are likely to remain close to each other in the low dimensional manifold.
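As an illustration of how such graph-based regularizers are commonly built, here is a minimal sketch under generic assumptions (GNMF-style k-NN graph with heat-kernel weights; the concrete graph construction of ENMF, which additionally feeds back distances in the manifold, is defined later in the paper, and all names and parameters below are illustrative):

    import numpy as np

    def heat_kernel_graph(X, k=5, sigma=1.0):
        """k-NN affinity graph with heat-kernel weights, a common way of
        encoding the local geometry of the data.

        X : (p, n) data matrix, one data point per column.
        Returns the (n, n) weight matrix W and the graph Laplacian L = D - W.
        """
        n = X.shape[1]
        # Pairwise squared Euclidean distances between data points.
        sq = np.sum(X ** 2, axis=0)
        D2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * (X.T @ X), 0.0)

        W = np.zeros((n, n))
        for i in range(n):
            idx = np.argsort(D2[i])[1:k + 1]   # k nearest neighbors, skipping the point itself
            W[i, idx] = np.exp(-D2[i, idx] / (2.0 * sigma ** 2))
        W = np.maximum(W, W.T)                 # symmetrize the graph

        L = np.diag(W.sum(axis=1)) - W         # unnormalized graph Laplacian
        return W, L

The manifold regularizer typically added to the factorization objective is tr(G L Gᵀ) = (1/2) Σ_ij W_ij ||g_i − g_j||², which encourages data points connected in the graph to have nearby low dimensional representations.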

Due to the nonnegativity constraint, NMF yields sparse basis and coefficient matrices which provide a parts-based representation. Such a parts-based representation is consistent with psychological and physiological evidence on the human brain [47], [48], [49]. However, in some cases, the sparsity obtained by NMF is insufficient. NMF with sparseness constraints (NMFsc) in [50] enhances the sparsity of the basis and coefficient matrices by explicitly setting both the ℓ1 and ℓ2 norms. Sparse NMF (SNMF) in [51] imposes sparsity at the intra-data-point level. The global Nonnegative Matrix Underapproximation (G-NMU) and recursive Nonnegative Matrix Underapproximation (R-NMU) methods in [52] utilize an underapproximation technique based on Lagrangian relaxation and provide sparse parts-based representations with low reconstruction error. The Logdet divergence based sparse NMF method (LDS-NMF) in [53] deals with the rank-deficiency problem and enhances the sparsity of coefficients using the standard LASSO regularization term.

In this paper, we propose a novel elastic loss which interpolates adaptively between the Frobenius norm and the ℓ2,1-norm. We then build an elastic NMF model (ENMF) guided by this elastic loss. To exploit the locally geometric structure of the data, ENMF incorporates a manifold regularization term. However, different from other graph-based algorithms, the affinity graph is constructed based not only on the high dimensional original space but also on the low dimensional manifold. Although most data points conform to the local invariance idea, there are a few exceptions, i.e., two data points may be close to each other in the high dimensional original space while the distance between them in the low dimensional manifold is very large. From this perspective, we take the geometric information of the projected data points in the manifold as feedback in order to reduce the edge weights between such exceptional data pairs in the affinity graph. Moreover, ENMF incorporates an exclusive LASSO regularization term to enhance the sparsity of coefficients. Note that coefficients within a single cluster, rather than coefficients across different clusters, should compete to survive; the exclusive LASSO term in the formulation is therefore naturally used to encourage intra-cluster competition while discouraging inter-cluster competition (see the sketch after this paragraph). Given all the above considerations, we derive the corresponding multiplicative updating algorithm (via an auxiliary function approach) and provide a rigorous analysis of its convergence and correctness.
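For intuition, a minimal sketch of one common form of the exclusive LASSO (ℓ1,2) penalty, here grouping coefficients by cluster, i.e. by row of G; the exact grouping and weighting used by ENMF are specified in the formulation in Section 2:

    import numpy as np

    def exclusive_lasso(G):
        """Exclusive LASSO penalty: sum over groups of the squared l1 norm
        inside each group. Treating each row of G (one cluster / basis
        direction) as a group, the l1 norm inside a group induces sparsity
        (intra-cluster competition), while the squared sum across groups
        acts like an l2 term and does not force competition between clusters.
        """
        return float(np.sum(np.abs(G).sum(axis=1) ** 2))

Under such a penalty, large coefficients within the same cluster compete with each other, so the "winner" stands out while the other coefficients in that cluster are driven toward zero.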

The contributions of our ENMF are summarized as follows.

  • ENMF introduces an elastic loss that leverages the advantages of both the Frobenius norm and the ℓ2,1 norm when the noise distribution is uncertain, making ENMF far less sensitive to noise and outliers.

  • ENMF takes the geometric information of the projected data points in the low dimensional manifold as feedback to construct the affinity graph, hence ENMF can handle the situation where a few exceptional data pairs are close in the original space but far away from each other in the manifold.

  • ENMF utilizes the exclusive LASSO to enhance the intra-cluster competition and therefore the “winner” is more likely to stand out while the “loser” tends to recede in a sparse manner.

  • ENMF provides consistently better clustering results on several well-known data sets as compared to standard NMF and several other variants of the NMF algorithm.

The rest of the paper is organized as follows. In Section 2, we present the formulation of our ENMF and its computational algorithm. In Section 3, we provide a rigorous analysis of the convergence of the algorithm. In Section 4, we prove that the converged solution satisfies the Karush-Kuhn-Tucker (KKT) condition, which further validates the correctness of the algorithm. In Section 5, we show experimental results on several well-known data sets, making comparisons with the k-means algorithm, the PCA k-means algorithm, the standard NMF algorithm and several other variants of the NMF algorithm. Finally, we conclude this paper.

Section snippets

Elastic loss

Given the observation $x_i \in \mathbb{R}^p$ (the $i$-th data point), NMF [1] decomposes it into a basis $F \in \mathbb{R}^{p \times r}$ and the representation $g_i \in \mathbb{R}^r$ given the basis $F$, i.e.,
$$\min_{F \geq 0,\; g_i \geq 0} z(x_i, F g_i).$$

There are several typical losses used for NMF, for example,
$$\text{least square loss:}\quad z_2(x_i, F g_i) = \sum_i \|x_i - F g_i\|^2,$$
$$\ell_{2,1}\ \text{loss:}\quad z_{21}(x_i, F g_i) = \sum_i \|x_i - F g_i\|,$$
$$\text{KL divergence:}\quad z_{kl}(x_i, F g_i) = \sum_{ij} \Big( X_{ij} \log \frac{X_{ij}}{(FG)_{ij}} - X_{ij} + (FG)_{ij} \Big).$$
The different loss functions have motivated different formulations of NMF, such as ℓ2,1-NMF [27], KL-divergence NMF [1], etc.
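A small numerical sketch of the three losses above (illustrative only, with generic NumPy names; the elastic loss proposed in this paper is introduced next):

    import numpy as np

    def least_square_loss(X, F, G):
        # Frobenius / least-square loss: sum_i ||x_i - F g_i||^2
        return float(np.sum((X - F @ G) ** 2))

    def l21_loss(X, F, G):
        # l_{2,1} loss: sum_i ||x_i - F g_i||, the l2 norm of each residual column
        return float(np.sum(np.linalg.norm(X - F @ G, axis=0)))

    def kl_divergence_loss(X, F, G, eps=1e-12):
        # Generalized KL divergence: sum_ij (X_ij log(X_ij / (FG)_ij) - X_ij + (FG)_ij)
        FG = F @ G
        return float(np.sum(X * np.log((X + eps) / (FG + eps)) - X + FG))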

Inspired by the idea of

Convergence of the algorithm

In this section, we provide the proof of the convergence of the algorithm, as stated in Theorem 3.

Theorem 3

(A) Updating G using the rule of Eq. (17) while fixing F, the objective function of Eq. (11) is nonincreasing. (B) Updating F using the rule of Eq. (16) while fixing G, the objective function of Eq. (11) is nonincreasing.

(A) and (B) of Theorem 3 will be proved respectively in the next two subsections.

Correctness of the algorithm

In this section, we prove that the converged solution satisfies the Karush-Kuhn-Tucker (KKT) condition of constrained optimization theory, which establishes the correctness of the algorithm. Since the proof of correctness of the algorithm w.r.t. F is similar to that w.r.t. G, the former is omitted due to space limitations.

Theorem 9

At convergence, the converged solution G* of the updating rule of Eq. (17) satisfies the KKT condition of the optimization theory.

Proof

The KKT condition

Experiment

In this section, we empirically evaluate the proposed ENMF algorithm by comparing it with several other clustering algorithms on 9 data sets.

Conclusion and future work

In this paper, we propose an elastic NMF model. Our ENMF utilizes an elastic loss function interpolated between the Frobenius norm and the ℓ2,1-norm to measure the quality of factorization, making it significantly less sensitive to noise and outliers. Also, our ENMF takes the geometric information of the data points in the manifold as feedback to construct the affinity graph. In addition, our ENMF achieves sparsity at the intra-cluster level via the exclusive LASSO regularization term. We also


References (65)

  • N. Gillis et al., Using underapproximations for sparse nonnegative matrix factorization, Pattern Recognit. (2010)
  • D.D. Lee et al., Algorithms for non-negative matrix factorization, Advances in Neural Information Processing Systems (2001)
  • W. Xu et al., Document clustering based on non-negative matrix factorization, Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (2003)
  • V.P. Pauca et al., Text mining using non-negative matrix factorizations, Proceedings of the 2004 SIAM International Conference on Data Mining (2004)
  • M. Cooper et al., Summarizing video using non-negative similarity matrix factorization, Proceedings of the 2002 IEEE Workshop on Multimedia Signal Processing (2002)
  • H. Kim et al., Sparse non-negative matrix factorizations via alternating non-negativity-constrained least squares for microarray data analysis, Bioinformatics (2007)
  • S. Zhang et al., Learning from incomplete ratings using non-negative matrix factorization, Proceedings of the 2006 SIAM International Conference on Data Mining (2006)
  • Y. Chi et al., Probabilistic polyadic factorization and its application to personalized recommendation, Proceedings of the 17th ACM Conference on Information and Knowledge Management (2008)
  • B. Gao et al., Machine learning source separation using maximum a posteriori nonnegative matrix factorization, IEEE Trans. Cybern. (2014)
  • B. Gao et al., Variational regularized 2-d nonnegative matrix factorization, IEEE Trans. Neural Netw. Learn. Syst. (2012)
  • P. Parathai et al., Single-channel blind separation using ℓ1-sparse complex non-negative matrix factorization for acoustic signals, J. Acoust. Soc. Am. (2015)
  • B. Gao et al., Adaptive sparsity non-negative matrix factorization for single-channel source separation, IEEE J. Sel. Top. Signal Process. (2011)
  • Y. Wu et al., Visual tracking via online nonnegative matrix factorization, IEEE Trans. Circuits Syst. Video Technol. (2014)
  • A. Al-Tmeme et al., Underdetermined convolutive source separation using GEM-MU with variational approximated optimum model order NMF2d, IEEE/ACM Trans. Audio Speech Lang. Process. (2017)
  • A. Al-Tmeme et al., Underdetermined reverberant acoustic source separation using weighted full-rank nonnegative tensor models, J. Acoust. Soc. Am. (2015)
  • T. Li et al., Solving consensus and semi-supervised clustering problems using nonnegative matrix factorization, Proceedings of the Seventh IEEE International Conference on Data Mining (ICDM 2007) (2007)
  • H. Liu et al., Balanced clustering with least square regression, AAAI (2017)
  • Z. Li et al., Balanced clustering via exclusive lasso: a pragmatic approach, AAAI (2018)
  • F. Wang et al., Semi-supervised clustering via matrix factorization, Proceedings of the 2008 SIAM International Conference on Data Mining (2008)
  • H. Liu et al., Constrained nonnegative matrix factorization for image representation, IEEE Trans. Pattern Anal. Mach. Intell. (2012)
  • F. Sha et al., Multiplicative updates for nonnegative quadratic programming in support vector machines, Advances in Neural Information Processing Systems (2003)
  • N. Srebro et al., Maximum-margin matrix factorization, Advances in Neural Information Processing Systems (2005)

He Xiong received the BS degree in automation from Hefei University of Technology, China, in 2009 and the master’s degree in automation from the University of Science and Technology of China, in 2013. He is currently a lecturer in the Department of Computer Science at Bengbu University. His research interests include machine learning, data mining and computer vision.

Deguang Kong received his Ph.D. degree in Computer Science from the University of Texas at Arlington in 2013. He is currently a senior research scientist (principal data scientist) at Yahoo Research (Sunnyvale), and previously worked at Los Alamos National Lab, NEC research lab, Penn State University and Samsung Research America as a researcher. His research interests focus on feature learning and compressive sensing, user engagement understanding and recommendation, etc. He has published over 30 refereed articles in top conferences, including ICML, NIPS, AAAI, CVPR, KDD, ICDM, SDM, WSDM, CIKM, ECML/PKDD, etc. He has served as a program committee member for NIPS, AAAI, IJCAI, KDD, SDM and as a reviewer for TPAMI, TKDE, DMKD, TIFS, TNNLS, TDSC, etc.
