Elsevier

Neurocomputing

Volume 273, 17 January 2018, Pages 78-88
Manifold NMF with L21 norm for clustering

https://doi.org/10.1016/j.neucom.2017.08.025

Abstract

Nonnegative matrix factorization has been widely used in data mining and machine learning as a clustering algorithm. The standard nonnegative matrix factorization algorithm uses the sum of squared errors to measure the quality of the factorization; however, noise and outliers in the dataset can degrade its performance significantly. This paper proposes a robust manifold nonnegative matrix factorization algorithm based on the L21 norm, with updating rules derived via the projected gradient method. The proposed algorithm uses the L21 norm to measure the quality of the factorization, which makes it insensitive to noise and outliers; it also exploits the geometrical structure of the dataset and preserves local invariance. Experimental results on several data sets, together with comparisons against other clustering algorithms, demonstrate the effectiveness of the proposed algorithm.

Introduction

The "semantic gap" between low-level image visual features and high-level semantics is the bottleneck in the development of image retrieval. The core problem in improving the utilization of image content and reducing the semantic gap is how to represent the visual features of an image effectively, that is, how to find a reasonable representation of the data [1], [2]. To process the information stored in a matrix, the matrix is usually decomposed [3]. Decomposition not only greatly reduces the dimension of the original matrix, but also summarizes and compresses the data it contains, and matrix decomposition techniques have therefore attracted more and more attention. Typical matrix decomposition methods include principal component analysis (PCA) [4], linear discriminant analysis (LDA) [5], independent component analysis (ICA) [6], and singular value decomposition (SVD) [7]. These methods usually decompose or linearly transform the original data matrix under certain restrictions. They have in common that they allow negative values in the decomposition results, which is valid from the point of view of calculation, but in practice negative values are meaningless [8].

Recently, non-negative matrix factorization (NMF) has attracted wide attention as an effective matrix decomposition method. In 1999, Lee and Seung [9] first proposed the concept of non-negative matrix factorization in the journal Nature. Unlike the typical decomposition methods mentioned above, NMF is unique in that all elements involved in the decomposition are non-negative. NMF decomposes the original matrix into two non-negative matrices: the left matrix is called the basis matrix, and the right matrix the coefficient matrix. Each column vector of the original matrix can therefore be interpreted as a weighted sum of the columns of the basis matrix, with the weights stored in the coefficient matrix. Such a combination has an intuitive semantic interpretation, in line with the notion that "parts constitute a whole" in human perception [10], [11]. Solution methods for NMF include the multiplicative update rules, the least-squares substitution method, and the projected gradient method [12]. By adding non-negativity constraints, NMF not only guarantees interpretable decomposition results, but also has the advantages of easy implementation and low storage requirements. It is therefore of practical significance to study non-negative matrix factorization.
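As a concrete illustration (not the method proposed in this paper), a minimal sketch of standard squared-error NMF with the classic Lee–Seung multiplicative updates might look as follows; the function name, iteration count, and initialization are illustrative choices:

```python
import numpy as np

def nmf(X, k, n_iter=300, eps=1e-10):
    """Minimal sketch of standard NMF: X (p x n, nonnegative) ~ F @ G,
    where F is the p x k basis matrix and G the k x n coefficient matrix."""
    rng = np.random.default_rng(0)
    p, n = X.shape
    F = rng.random((p, k))
    G = rng.random((k, n))
    for _ in range(n_iter):
        # Lee-Seung multiplicative updates for the squared-error objective;
        # eps guards against division by zero
        G *= (F.T @ X) / (F.T @ F @ G + eps)
        F *= (X @ G.T) / (F @ G @ G.T + eps)
    return F, G
```

Because the updates only multiply by nonnegative ratios, F and G stay nonnegative throughout, which is what gives the factors their parts-based interpretation.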

NMF has received considerable attention since it was first presented. To improve the efficiency and recognition rate of NMF, many researchers have studied the algorithm in depth and obtained a wealth of results. Akashi and Okatani [13] introduce sparse coding theory into NMF: they apply sparseness constraints to both the basis matrix and the coefficient matrix, and propose the sparse non-negative matrix factorization algorithm (SNMF). Liu et al. [14] design the constrained non-negative matrix factorization algorithm (CNMF), which uses known label information to guide the factorization process. Later, Shu and Zhao [15] improve CNMF by adding sparsity constraints and propose constrained non-negative matrix factorization with sparseness (CNMFS). Feng et al. [16] propose weighted non-negative matrix factorization (WNMF), in which each training sample carries a non-negative weight; the influence of a sample on the decomposition result can then be controlled by adjusting its weight. Bucak and Gunsel [17] combine incremental learning with NMF and propose the incremental non-negative matrix factorization algorithm (INMF), which reduces the computational scale using the idea of block matrices. Kong et al. [18] propose a robust non-negative matrix factorization algorithm based on the L21 norm (L21NMF); it overcomes the sensitivity of the standard NMF algorithm to noise and outliers and improves the robustness of the algorithm. In addition, NMF has been successfully applied in various fields, including text clustering [19], face recognition [20], visual tracking [21], image denoising [22], image retrieval [23], and social network analysis [24].
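To make the L21 idea concrete: the L21 norm of a residual matrix sums the Euclidean norms of its columns, so a single corrupted sample contributes linearly rather than quadratically to the objective. A small hedged sketch (the helper names are ours, not the paper's):

```python
import numpy as np

def l21_norm(E):
    """L21 norm: sum over columns (data points) of the column-wise
    Euclidean norms, i.e. sum_i ||e_i||_2."""
    return np.sum(np.sqrt(np.sum(E ** 2, axis=0)))

def frobenius_norm_sq(E):
    """Squared Frobenius norm, the error measure of standard NMF."""
    return np.sum(E ** 2)
```

If one column of the residual is scaled by 100, its contribution to the squared Frobenius norm grows by a factor of 10,000, but its contribution to the L21 norm grows only by a factor of 100; this is the intuition behind the robustness of L21NMF to outliers.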

A manifold is a curved space that locally has the properties of Euclidean space. Manifold learning achieves dimension reduction by finding the low-dimensional manifold embedded in a high-dimensional space. Manifold learning algorithms include isometric mapping (Isomap), locally linear embedding (LLE), and Laplacian eigenmaps (LE). They exploit the local-invariance property that two sample points close to each other in the high-dimensional space also remain close on the low-dimensional manifold [25]. Cai et al. [26] propose the graph regularized non-negative matrix factorization algorithm (GNMF), which combines manifold learning and NMF. GNMF takes into account the geometric structure of the original data and preserves the local geometric information of the data set during decomposition. Jiang et al. [27] propose graph regularized non-negative matrix factorization with sparse constraints (GNMFSC), which not only takes the geometric information of the data into account but also imposes sparse constraints on the coefficient matrix, so that the decomposed face images yield a higher recognition rate. In GNMF, however, the quality of the matrix decomposition is measured by the Frobenius norm, which is easily affected by noise and abnormal values in the data. In this case, the objective function may incur large errors and the performance of the algorithm may therefore decrease. To obtain more robust clustering results, this paper presents a manifold non-negative matrix factorization algorithm based on the L21 norm (MNMFL21), whose update rules are derived by the projected gradient method. MNMFL21 uses the L21 norm to measure the quality of the matrix decomposition, so it is insensitive to data noise. It also combines manifold learning with robust NMF and uses the local-invariance property to detect the geometric structure of the data.
The main contributions of this paper are: (1) we use the L21 norm to improve the GNMF algorithm and propose the MNMFL21 algorithm; (2) we give the objective function of MNMFL21 and derive an equivalent form that facilitates the calculation; (3) we present detailed update rules for the non-negative matrices in MNMFL21 based on the projected gradient method.
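The manifold ingredient in GNMF-style methods is typically encoded by the graph Laplacian L = D - W of a nearest-neighbour graph over the samples. A minimal sketch of that construction, assuming a binary, symmetrized k-NN affinity (the function name and the binary weighting are illustrative assumptions; heat-kernel weights are a common alternative):

```python
import numpy as np

def knn_laplacian(X, k=5):
    """Graph Laplacian L = D - W of a binary symmetrized k-nearest-
    neighbour graph. X is p x n with samples as columns."""
    n = X.shape[1]
    sq = np.sum(X ** 2, axis=0)
    # pairwise squared Euclidean distances between columns
    dist2 = sq[:, None] + sq[None, :] - 2.0 * (X.T @ X)
    W = np.zeros((n, n))
    for i in range(n):
        neighbours = np.argsort(dist2[i])[1:k + 1]  # skip the point itself
        W[i, neighbours] = 1.0
    W = np.maximum(W, W.T)  # symmetrize: connect i~j if either is a neighbour
    D = np.diag(W.sum(axis=1))
    return D - W
```

The Laplacian is symmetric with zero row sums; the regularizer built from it penalizes factorizations that map neighbouring samples far apart, which is how the local-invariance assumption enters the objective.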

The rest of the paper is organized as follows: Section 2 reviews the standard NMF, L21NMF, and GNMF algorithms; Section 3 gives the definition of the MNMFL21 algorithm and its equivalent objective function; Section 4 uses the projected gradient method to derive the update rules of MNMFL21; Section 5 reports experimental results of MNMFL21 on several data sets and demonstrates its validity by comparison with other clustering algorithms; finally, we summarize the work of this paper.


Related works

In this section, we briefly review the standard NMF algorithm, the L21NMF algorithm, and the GNMF algorithm. The input matrix is defined as X = {x1, x2, ..., xn}, where each p-dimensional column vector xi represents a non-negative data sample.

The proposed MNMFL21 algorithm

In this section, we will give the definition of the proposed MNMFL21 algorithm. For convenience of the calculation of update rules, we derive the equivalent objective function of MNMFL21 algorithm.
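The exact formulation and notation are given in the paper itself; as a hedged sketch of how the two ingredients above fit together, an objective of this type combines an L21 reconstruction term with a graph regularizer (here X ~ FG with basis matrix F, coefficient matrix G, graph Laplacian L, and regularization weight λ; the symbols are illustrative assumptions, not the paper's verbatim notation):

```latex
\min_{F \ge 0,\; G \ge 0} \;
  \|X - FG\|_{2,1} \;+\; \lambda \,\mathrm{Tr}\!\left(G L G^{\mathsf T}\right),
\qquad
\|E\|_{2,1} \;=\; \sum_{i=1}^{n} \sqrt{\sum_{j=1}^{p} E_{ji}^{2}} .
```

The first term sums the per-sample residual norms (robustness to outliers), while the trace term penalizes coefficient vectors that differ for neighbouring samples (local invariance).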

Update rules for MNMFL21 algorithm

In this section, we first use the projection gradient method to solve the general constrained optimization problem, and then use the projected gradient method to calculate the update rules of F and G in MNMFL21 algorithm.
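As a minimal illustration of the projected gradient idea (not the paper's actual update rules for F and G), the sketch below solves a generic nonnegativity-constrained least-squares subproblem: take a step along the negative gradient, then project back onto the nonnegative orthant with max(·, 0). The function names and the fixed step size are assumptions for illustration:

```python
import numpy as np

def projected_gradient_step(H, grad, step):
    """One projected-gradient update: gradient step, then projection
    onto the nonnegative orthant."""
    return np.maximum(H - step * grad, 0.0)

def pg_nnls(A, B, n_iter=2000):
    """Illustrative solver for min_{H >= 0} (1/2) ||A H - B||_F^2
    by projected gradient descent."""
    # 1/L step size, where L = ||A^T A||_2 is the gradient's Lipschitz constant
    step = 1.0 / np.linalg.norm(A.T @ A, 2)
    H = np.zeros((A.shape[1], B.shape[1]))
    for _ in range(n_iter):
        grad = A.T @ (A @ H - B)
        H = projected_gradient_step(H, grad, step)
    return H
```

The projection is what keeps the iterates feasible: unlike multiplicative updates, a plain gradient step can leave the nonnegative orthant, and clipping at zero is exactly the Euclidean projection back onto it.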

Contrast algorithms

We compare the MNMFL21 algorithm with the following algorithms:

  • (1) k-means algorithm [28]: the classical partitional clustering algorithm, which obtains the best clustering result by minimizing the distance between the data points and their cluster centers.

  • (2) PCA algorithm [29]: PCA is used to reduce the dimension of the original data set, and the low-dimensional data are then clustered with the k-means algorithm.

  • (3) SNMF algorithm [13]: the data set is represented by the sparse coding method, and the Frobenius norm is used to

Conclusion

This paper proposes the MNMFL21 algorithm, a robust manifold NMF clustering algorithm based on the L21 norm. The algorithm inherits the advantages of the L21NMF and GNMF algorithms: it uses the L21 norm to measure the quality of the matrix decomposition, and it takes the manifold structure and local invariance of the data into account. By introducing the Laplacian matrix, the low-dimensional representation G obtained by the MNMFL21 algorithm can accurately reflect the geometric structure of the data in the high-dimensional space.

Acknowledgments

This project is supported by the Fundamental Research Funds for the Central Universities (2014ZDPY23) and the Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD).

Baolei Wu received the M.S. degree in Computer Application Technology from China University of Mining and Technology, Xuzhou, China, in 2010. His current research interests include saliency detection and image processing.

References (31)

  • S.S. Bucak et al., Incremental subspace learning via non-negative matrix factorization, Pattern Recognit. (2009)
  • Q. Liu et al., Projective nonnegative matrix factorization for social image retrieval, Neurocomputing (2016)
  • R. Hettiarachchi et al., Multi-manifold LLE learning in pattern recognition, Pattern Recognit. (2015)
  • J. Zhao et al., Recovering seabed topography from sonar image with constraint of sounding data, J. China Univ. Min. Technol. (2017)
  • Z. Hu et al., Sparse principal component analysis via rotation and truncation, IEEE Trans. Neural Netw. Learn. Syst. (2016)

Enyuan Wang is a professor in the College of Safety Engineering at China University of Mining and Technology, Xuzhou, China. He received his Ph.D. degree from the same university in 1997 and worked as a postdoctoral researcher in the College of Electrical Engineering at China University of Mining and Technology from December 1997 to November 1999. His current research interests are rock mechanics and the monitoring and forecasting of coal-mining-induced disasters.

Zhen Zhen is a graduate student in the School of Safety Engineering at China University of Mining and Technology, Xuzhou, China. His current research interests are detection technology and automatic equipment.

Wei Chen received the Ph.D. degree in communications and information systems from China University of Mining and Technology, Beijing, China, in 2008. In 2008, he joined the School of Computer Science and Technology, China University of Mining and Technology, Xuzhou, where he is currently a professor. His research interests include machine learning, image processing, and wireless communications.

    Pengcheng Xiao is an assistant professor in the Department of Mathematics at University of Evansville. He received his Ph.D. degree from the University of Texas at Arlington in 2015. His current research interests are computational neuroscience and biomathematics.
