Learning a consensus affinity matrix for multi-view clustering via subspaces merging on Grassmann manifold
Introduction
Clustering aims to partition data into different groups such that data in the same group are similar [1], [2], [3]. In such a task, the similarity measurement among samples, encoded in an affinity matrix computed under some metric, strongly affects clustering performance. For example, heat-kernel-based similarity measures are widely used in spectral clustering; their performance depends on the heuristic metric and is easily corrupted by noise or outliers. To address this drawback, subspace clustering provides a simple yet effective way to learn a latent affinity matrix [4].
Subspace clustering exploits the self-expressiveness property of data samples: it assumes that each sample can be represented as a linear combination of the other samples. Under this assumption, a representation matrix under different regularizations is computed from the samples and then used to construct the affinity matrix. Low-rank representation (LRR) subspace clustering [5], [6] seeks the lowest-rank representation among all candidates; the low-rank regularization implies that the original data space is spanned by a small number of vectors, which captures the global structure of the data subspaces. Sparse subspace clustering (SSC) [7] takes the data matrix itself as the dictionary and computes the representation matrix as a sparse coding task; the sparse regularization implies that each data sample can be represented by a few other samples. To combine the advantages of LRR subspace clustering and SSC, methods with both low-rank and sparse regularizations [8], [9] have been developed to capture the global and local structures of the data space.
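To make the self-expressiveness idea concrete, the following sketch (not the paper's solver) computes a representation matrix by a ridge-regularized least-squares self-expression, zeros its diagonal so that no sample represents itself, and symmetrizes it into an affinity matrix. With ridge regularization in place of low-rank or sparse penalties, samples drawn from orthogonal subspaces still express themselves only within their own subspace:

```python
import numpy as np

# Illustrative sketch of self-expressiveness (ridge penalty stands in for the
# low-rank/sparse regularizations discussed in the text).
def self_expressive_affinity(X, lam=0.1):
    """X: (d, n) data matrix with samples as columns."""
    n = X.shape[1]
    # Closed-form solution of min_C ||X - X C||_F^2 + lam ||C||_F^2
    G = X.T @ X
    C = np.linalg.solve(G + lam * np.eye(n), G)
    np.fill_diagonal(C, 0.0)              # a sample should not represent itself
    W = 0.5 * (np.abs(C) + np.abs(C).T)   # symmetric affinity matrix
    return W

# Two orthogonal 1-D subspaces in R^3
rng = np.random.default_rng(0)
X1 = np.outer([1.0, 0.0, 0.0], rng.standard_normal(5))
X2 = np.outer([0.0, 1.0, 0.0], rng.standard_normal(5))
X = np.hstack([X1, X2])
W = self_expressive_affinity(X)
# Cross-subspace affinities vanish: each sample is expressed within its subspace
print(np.allclose(W[:5, 5:], 0.0, atol=1e-8))
```

Spectral clustering on `W` then recovers the two groups; the regularizers in LRR and SSC refine this basic construction rather than replace it.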
Although these subspace clustering approaches are effective, the information provided by single-source data is limited, especially when the observations are insufficient and/or grossly corrupted. In real-world applications, datasets often have multiple modalities or are composed of multiple representations (i.e., views). Multi-view clustering aims to partition data into groups by exploiting the complementary information of these heterogeneous views [10]. Multi-view learning methods learn a latent representation of the data under the assumption that all views share a common structure [11], [12]. Feature concatenation is a simple way to combine all data views; however, it cannot discover the common structure across different views. To tackle this problem, co-regularized multi-view spectral clustering minimizes the disagreement between the eigenvectors of the graph Laplacians of different views to learn a common representation of the multi-view data [13]. Most previous works construct affinity matrices on each individual view before learning the consensus affinity matrix [14], [3]; these methods split the learning into two independent processes without adaptive interaction. In contrast, methods such as multi-view low-rank sparse subspace clustering (MLRSSC) construct the multi-view affinity matrix in one step by learning a common representation matrix from the subspace representations of each view [15]. In addition, latent multi-view subspace clustering (LMSC) assumes that the views originate from one underlying latent representation, on which subspace clustering is then performed [16]. To deal with incomplete data, multi-view co-clustering with incomplete data [17] finds a consistent cluster partition based on rank-one matrix approximation. All of these methods focus on regularizing the individual representations and preserving consistency between the common representation and the individual ones.
However, data from different views usually have different structures. Therefore, direct affinity fusion among different views in Euclidean space is too rigid to align the learned subspaces. Such approaches easily break the local structure of each individual subspace and yield poor performance. To address this issue, this paper proposes to align the subspaces on a Grassmann manifold [18], [19] to learn the consensus affinity matrix. To preserve the local structure within each subspace, the consensus affinity matrix is also regularized to have the same structure properties as those of individual views, which helps learn the common structures instead of mixing or breaking them. The convex optimization problem is solved using the alternating direction method of multipliers (ADMM) [20]. We conduct extensive experiments on synthetic and real-world datasets to demonstrate the performance of the proposed method. Specifically, the main contributions of this paper can be summarized as follows:
- We merge the self-representative subspaces of individual views on a Grassmann manifold to obtain a robust integrative subspace. The obtained integrative subspace preserves the geometric uniformity of the subspaces from each view.
- The affinity matrix is learned directly on the integrative subspace and is further regularized with low-rank and sparse constraints. The regularization over the view-specific spaces and the Grassmann manifold ensures favorable subsequent clustering performance.
- We conduct extensive experiments on synthetic and real-world datasets and demonstrate that our method outperforms several state-of-the-art multi-view clustering methods, validating its effectiveness.
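Once a consensus affinity matrix is learned, cluster assignments are typically obtained by standard spectral clustering on its normalized graph Laplacian (this downstream step is standard practice, not specific to the proposed method). A minimal sketch:

```python
import numpy as np

# Standard spectral embedding from an affinity matrix W: build the normalized
# Laplacian L_sym = I - D^{-1/2} W D^{-1/2} and keep its k smallest eigenvectors;
# k-means on the rows of the embedding gives the final clusters.
def spectral_embed(W, k):
    d = W.sum(axis=1)
    d_inv_sqrt = 1.0 / np.sqrt(np.maximum(d, 1e-12))
    L_sym = np.eye(len(W)) - (d_inv_sqrt[:, None] * W) * d_inv_sqrt[None, :]
    vals, vecs = np.linalg.eigh(L_sym)   # eigenvalues in ascending order
    return vecs[:, :k]                   # eigenvectors of the k smallest eigenvalues

# A block-diagonal affinity matrix: rows of the same block get identical embeddings
W = np.zeros((6, 6))
W[:3, :3] = 1.0
W[3:, 3:] = 1.0
U = spectral_embed(W, 2)
print(np.allclose(U[0], U[1]) and np.allclose(U[3], U[4]))
```

The quality of this step depends entirely on how block-diagonal the learned affinity matrix is, which is why the regularization of the consensus matrix matters for the final clustering.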
The remainder of this paper is organized as follows. Section 2 briefly reviews the background and related works on multi-view subspace clustering. Section 3 introduces the proposed model and provides the optimization algorithms. Section 4 presents the experimental results on one synthetic dataset and eight real-world datasets. Section 5 concludes the paper.
Notation
Throughout the paper, scalars are represented by lower-case symbols, vectors by bold lower-case symbols, and matrices by bold upper-case symbols. The $i$-th column of a matrix $\mathbf{A}$ is denoted by $\mathbf{a}_i$, whereas $\mathbf{a}^j$ denotes its $j$-th row. The widely used $\ell_1$ norm, Frobenius norm, $\ell_{2,1}$ norm, and nuclear norm are denoted by $\|\cdot\|_1$, $\|\cdot\|_F$, $\|\cdot\|_{2,1}$, and $\|\cdot\|_*$, respectively. The Schatten $p$-norm is denoted by $\|\cdot\|_{S_p}$ and defined as $\|\mathbf{X}\|_{S_p} = \big(\sum_i \sigma_i^p\big)^{1/p}$, where the $\sigma_i$ are the singular values from the singular value decomposition $\mathbf{X} = \mathbf{U}\boldsymbol{\Sigma}\mathbf{V}^\top$.
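A quick numerical check of the Schatten $p$-norm definition above: for $p=1$ it reduces to the nuclear norm (sum of singular values), and for $p=2$ it coincides with the Frobenius norm.

```python
import numpy as np

# Schatten p-norm: ||X||_{S_p} = (sum_i sigma_i^p)^{1/p},
# with sigma_i the singular values of X.
def schatten_norm(X, p):
    s = np.linalg.svd(X, compute_uv=False)
    return (s ** p).sum() ** (1.0 / p)

X = np.array([[3.0, 0.0],
              [0.0, 4.0]])              # singular values: 4 and 3
print(schatten_norm(X, 1))              # nuclear norm: 3 + 4 = 7.0
print(abs(schatten_norm(X, 2) - np.linalg.norm(X, 'fro')) < 1e-12)  # p=2 is Frobenius
```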
Integrative affinity learning for multi-view subspace clustering through Grassmann alignment
In this section, we present an integrative learning model for multi-view subspace clustering. Two major merits distinguish this approach from other popular models. The first is that the learned subspace of each individual view is aligned to an integrative subspace on a Grassmann manifold to ensure geometric uniformity. The second is that a latent integrative affinity matrix is estimated directly, which facilitates and enhances the clustering performance.
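To illustrate what alignment on a Grassmann manifold measures (this is an illustrative sketch of the geometry, not the paper's exact merging objective), a $k$-dimensional subspace of $\mathbb{R}^n$ is represented by an orthonormal basis $\mathbf{U} \in \mathbb{R}^{n \times k}$, and the projection distance between two subspaces is computed from their principal angles as $d^2(\mathbf{U}_1, \mathbf{U}_2) = k - \|\mathbf{U}_1^\top \mathbf{U}_2\|_F^2$:

```python
import numpy as np

# Projection distance between two k-dimensional subspaces on the Grassmann
# manifold, each represented by an orthonormal basis (n x k matrix).
def grassmann_proj_dist(U1, U2):
    k = U1.shape[1]
    # ||U1^T U2||_F^2 = sum of squared cosines of the principal angles
    return np.sqrt(max(k - np.linalg.norm(U1.T @ U2, 'fro') ** 2, 0.0))

e1 = np.array([[1.0], [0.0], [0.0]])    # line spanned by the first axis
e2 = np.array([[0.0], [1.0], [0.0]])    # line spanned by the second axis
print(grassmann_proj_dist(e1, e1))      # identical subspaces -> 0.0
print(grassmann_proj_dist(e1, e2))      # orthogonal lines -> 1.0
```

Merging view-specific subspaces by minimizing such a distance respects the subspaces as geometric objects, rather than forcing entry-wise agreement of their basis matrices in Euclidean space.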
Experiments
In this section, we conduct experiments on synthetic data and real-world datasets to evaluate the clustering performance of our method compared with several state-of-the-art algorithms. We also propose an efficient parameter tuning strategy based on the parameter sensitivity.
Conclusion
In this paper, we propose an integrative affinity learning model for multi-view subspace clustering to enhance clustering performance. The method directly learns a consensus affinity matrix by merging the subspace representations of multiple individual views on a Grassmann manifold, rather than performing subspace learning or alignment in Euclidean space. We also penalize the consensus affinity matrix to extract the latent common information shared among the views.
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Acknowledgement
This work was partially supported by the Key-Area Research and Development Program of Guangdong Province (2020B010166002, 2020B111119001), the National Natural Science Foundation of China (61771007), the Science and Technology Planning Project of Guangdong Province (2017B020226004), and the Health & Medical Collaborative Innovation Project of Guangzhou City (202002020049).
References (36)
- Multiview and multifeature spectral clustering using common eigenvectors, Pattern Recogn. Lett., 2018.
- Multi-view kernel spectral clustering, Inform. Fusion, 2018.
- Structure-constrained low-rank and partial sparse representation with sample selection for image classification, Pattern Recogn., 2016.
- Multi-view learning overview: recent progress and new challenges, Inform. Fusion, 2017.
- Multi-view low-rank sparse subspace clustering, Pattern Recogn., 2018.
- Multi-view cluster analysis with incomplete data to understand treatment effects, Inf. Sci., 2019.
- Robust low-rank kernel multi-view subspace clustering based on the Schatten p-norm and correntropy, Inf. Sci., 2019.
- Multi-view subspace clustering with intactness-aware similarity, Pattern Recogn., 2019.
- Multimodal sparse and low-rank subspace clustering, Inform. Fusion, 2018.
- Convex sparse spectral clustering: single-view to multi-view, IEEE Trans. Image Process., 2016.
- Subspace clustering, IEEE Signal Process. Mag.
- Robust recovery of subspace structures by low-rank representation, IEEE Trans. Pattern Anal. Mach. Intell.
- Sparse subspace clustering: algorithm, theory, and applications, IEEE Trans. Pattern Anal. Mach. Intell.
- Provable subspace clustering: when LRR meets SSC, Advances in Neural Information Processing Systems.
- A survey of multi-view machine learning, Neural Comput. Appl.
- Co-regularized multi-view spectral clustering, Advances in Neural Information Processing Systems.
1. These authors contributed equally to the article.