Incomplete multiview nonnegative representation learning with multiple graphs

doi:10.1016/j.patcog.2021.108412

Pattern Recognition

Volume 123, March 2022, 108412

https://doi.org/10.1016/j.patcog.2021.108412 Get rights and content

Highlights

•
We build a novel incomplete multiview nonnegative representation learning framework, referred to as IMNRL.
•
IMNRL learns a consensus nonnegative representation and view-specific representations simultaneously.
•
The nonnegative representation retains the graph information on all views, and it reveals the clustering results.
•
IMNRL achieves state-of-the-art incomplete multiview clustering results on different incomplete cases.

Abstract

Multiview clustering has become an important research topic during the past decade. However, partial views of many data instances are missing in some realistic multiview learning scenarios. To handle this problem, we develop an effective incomplete multiview nonnegative representation learning (IMNRL) framework, which is suitable for incomplete multiview clustering in various situations. The IMNRL framework performs matrix factorization on multiple incomplete graphs and decomposes these incomplete graphs into a consensus nonnegative representation and view-specific spectral representations, which integrates the advantages of multiview nonnegative representation learning and graph learning. The proposed framework has the following merits: (1) it learns a consensus nonnegative embedding and view-specific embeddings simultaneously; (2) the nonnegative embedding satisfies the neighbor constraint on each incomplete view, which directly reveals the multiview clustering results. Experimental results show that the proposed framework outperforms other state-of-the-art incomplete multiview clustering algorithms.

Introduction

Recently, multiview clustering has become an important problem in machine learning and artificial intelligence fields [1]. In multiview clustering, each instance is associated with multiple features from diverse views which often contain complementary information to each other, and the objective is to solve the problem of the complex correlation among multiple views [2], [3]. Most multiview clustering methods usually learn the unified representation of multiview data [4], [5], [6] or the common graph structure among instances [7], [8] for data grouping. For example, the multiview spectral clustering via integrating nonnegative embedding and spectral embedding (NESE) [4] method performed matrix factorization on multiple graphs and obtained the consistent nonnegative embedding for clustering. Multiview learning with adaptive neighbors (MLAN) [7] learned a consistent similarity matrix with $k$ connected components, where each view shares the consistent similarity matrix. However, an important assumption for traditional multiview clustering methods is that all views of instances should be complete [9], [10]. In many real-world multiview tasks, many instances suffer from the absence of partial views [11], [12], which leads to difficulties in modeling the correlation among instances.

This view-missing problem in multiview clustering is commonly referred to as the incomplete multiview clustering (IMC) problem. Some efforts have been made in recent years to handle this problem [13], [14]. The graph-based method is an important technique for solving the IMC problem, which aims to learn the consensus embedding and preserve the graph structure information among multiple incomplete views [15], [16]. Trivedi et al. [17] made use of the Laplacian regularization of a complete view to establish the kernel representation of incomplete views. This is the primary work on IMC, with the limitation that at least one view must be complete. Gao et al. [18] utilized the mean of instances to fill incomplete views and performed the latent consensus representation learning, where the filled incomplete views may affect the subsequent multiview learning. Wang et al. [19] proposed a perturbation-oriented IMC method, which obtained the consensus representation from multiple similarity graphs. Incomplete multimodality grouping (IMG) [20] transformed the complete and incomplete instances into a complete representation and then learned a common graph structure, while incomplete multiview spectral clustering with adaptive graph learning (IMSC_AGL) [21] performed subspace learning and consensus representation learning simultaneously. Moreover, IMG and IMSC_AGL have performed well without filling incomplete views.

The matrix factorization method is another research hotspot for solving the IMC problem, such as partial multiview clustering (PVC) [22], multiple incomplete view clustering (MIC) [23], online multiview clustering (OMVC) [24], doubly aligned IMC (DAIMC) [25], and one-pass IMC (OPIMC) [26]. DAIMC [25] introduced a regression constraint into the weighted semi-nonnegative matrix factorization, which utilized the given instance alignment information to learn a common latent feature matrix for all the views. OPIMC [26] was an efficient and effective IMC method by adequately considering the instance missing information with the help of regularized matrix factorization and weighted matrix factorization. Matrix factorization-based IMC methods usually introduced weighted matrices containing missing view information, so that they can intuitively deal with the IMC problem. However, they have obvious shortcomings in the nonlinear structural learning among instances compared with graph-based IMC methods. Currently, there are some IMC works based on matrix factorization and graph learning, such as graph regularized partial multi-view clustering (GPMVC) [27] and generalized IMC with flexible locality structure diffusion (GIMC_FLSD) [28]. For example, GIMC_FLSD [28] flexibly performed local structural learning and individual representation learning flexibly, where all individual representations can be easily converted to a common representation. Compared with graph-based IMC methods, GIMC_FLSD can make fuller use of the local geometric information among instances by performing matrix factorization on the neighbors of the instances. Besides, GIMC_FLSD adaptively learned the importance of different views which was usually ignored in graph-based or matrix factorization-based IMC methods. Therefore, this paper focuses on the IMC method integrating the graph information and the nonnegative matrix factorization.

In this paper, we develop a novel incomplete multiview nonnegative representation learning (IMNRL) framework for IMC, which inherits the advantages of both graph-based and matrix factorization-based IMC methods. As shown in Fig. 1, IMNRL takes advantage of the neighbor structure of each individual incomplete view to construct multiple similarity graphs and decomposes these graphs into the consensus nonnegative embedding and view-specific graph embeddings. In this way, the consensus nonnegative embedding can contain nonlinear structural information on different views. Moreover, we employ an additional graph regularization term to constrain the consensus embedding, so that the learned consensus embedding can retain more graph structure information. In IMNRL, the final cluster labels are determined by the column index of the largest value in each row of the consensus embedding. To summarize, this papers main contributions are:

•
We build a novel incomplete multiview nonnegative representation learning framework, namely IMNRL. It can handle various incomplete cases.
•
IMNRL performs the consensus nonnegative representation learning and the view-specific representation learning simultaneously. The consensus nonnegative embedding retains local structural information on different incomplete views, and it directly reveals the clustering results.
•
We perform experiments to verify the proposed IMNRL, and the results on different incomplete scenarios demonstrate that IMNRL achieves state-of-the-art incomplete multiview clustering results.

The remainder of the paper is organized as follows. Section 2 introduces the related works. Section 3 explains the proposed IMNRL model, and the optimization of IMNRL is given in Section 4. In Section 5, experimental results show the feasibilities of the proposed method. Finally, some conclusions are given in the last section.

Section snippets

Related work

This section briefly reviews related work, including traditional single-view clustering methods and multiview clustering methods.

The proposed method

This section mainly introduces the IMNRL framework. IMNRL learns the consensus nonnegative embedding and the view-specific graph embeddings simultaneously.

Optimization

In this section, we first provide the inference and learning procedures of IMNRL, and then give the convergence analysis and the computational complexity.

Experiments

In this section, we compare the proposed IMNRL with state-of-the-art IMC methods on incomplete real-world multiview datasets.

Conclusion

In this paper, we have presented an effective incomplete multiview nonnegative representation learning (IMNRL) framework, which can handle the incomplete multiview clustering problem well without filling incomplete views. IMNRL uses the nonnegative term and the graph regularization term to constrain the consensus representation, and thus the consensus representation can retain local structural information on multiple incomplete views and final data partition information. Besides, the final

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

This work was supported by the National Natural Science Foundation of China (Grant nos. 62076096 and 62006076), the Shanghai Municipal Project (No. 20511100900), Shanghai Knowledge Service Platform Project ZF1213, the Open Research Fund of KLATASDS-MOE, and the Fundamental Research Funds for the Central Universities.

Nan Zhang received the Ph.D. degree in artificial intelligence and pattern recognition from China University of Mining and Technology in 2019. Now, he studies as postdoctoral fellow with the School of Computer Science and Technology and the Head of the Pattern Recognition and Machine Learning Research Group, East China Normal University. His research results have expounded in over 20 publications at peer-reviewed journals. His current research interests include multiview learning and deep

References (47)

Y. Chen et al.
Multi-view subspace clustering via simultaneously learning the representation tensor and affinity matrix
Pattern Recognit.
(2020)
J. Ma et al.
Discriminative subspace matrix factorization for multiview data clustering
Pattern Recognit.
(2021)
D. Wang et al.
Pseudo-label guided collective matrix factorization for multiview clustering
IEEE Trans. Cybern.
(2021)
S. Yu et al.
Bayesian co-training
J. Mach. Learn. Res.
(2011)
J. Guo et al.
Partial multi-view outlier detection based on collective learning
Proceedings of AAAI Conference on Artificial Intelligence
(2018)
W. Zhu et al.
Structured general and specific multi-view subspace clustering
Pattern Recognit.
(2019)
D. Greene et al.
A matrix factorization approach for integrating multiple data views
Proceedings of Joint European Conference on Machine Learning and Knowledge Discovery in Databases
(2009)
S. Sun et al.
Multiview Machine Learning
(2019)
N. Zhang et al.
Multiview graph restricted Boltzmann machines
IEEE Trans. Cybern.
(2021)
Z. Hu et al.
Multi-view spectral clustering via integrating nonnegative embedding and spectral embedding
Inf. Fusion
(2020)

F. Nie et al.

Multi-view clustering and semi-supervised classification with adaptive neighbours

Proceedings of AAAI Conference on Artificial Intelligence, San Francisco, California, USA

(2017)

J. Chen et al.

Dual distance adaptive multiview clustering

Neurocomputing

(2021)

J. Zhao et al.

Multi-view learning overview: recent progress and new challenges

Inf. Fusion

(2017)

X. Liu et al.

Multiple kernel $k$ -means with incomplete kernels

IEEE Trans. Pattern Anal. Mach. Intell.

(2020)

X. Liu et al.

Efficient and effective regularized incomplete multi-view clustering

IEEE Trans. Pattern Anal. Mach. Intell.

(2021)

J. Wen et al.

Adaptive graph completion based incomplete multi-view clustering

IEEE Trans. Multimed.

(2020)

H. Wang et al.

Incomplete multi-view clustering via structured graph learning

Proceedings of Pacific Rim International Conference on Artificial Intelligence

(2018)

H. Lian et al.

Partial multiview clustering with locality graph regularization

Int. J. Intell. Syst.

(2021)

A. Trivedi et al.

Multiview clustering with incomplete views

Proceedings of Advances in Neural Information Processing Systems Workshop

(2010)

H. Gao et al.

Incomplete multi-view clustering

Proceedings of International Joint Conference on Intelligent Information Processing

(2016)

H. Wang et al.

Spectral perturbation meets incomplete multi-view data

Proceedings of International Joint Conference on Artificial Intelligence

(2019)

H. Zhao et al.

Incomplete multi-modal visual data grouping

In Proceedings of. International Joint Conference on Artificial Intelligence, New York, NY, USA

(2016)

J. Wen et al.

Incomplete multiview spectral clustering with adaptive graph learning

IEEE Trans. Cybern.

(2020)

Cited by (10)

Discovering common information in multi-view data
2024, Information Fusion
We introduce an innovative and mathematically rigorous definition for computing common information from multi-view data, drawing inspiration from Gács-Körner common information in information theory. Leveraging this definition, we develop a novel supervised multi-view learning framework to capture both common and unique information. By explicitly minimizing a total correlation term, the extracted common information and the unique information from each view are forced to be independent of each other, which, in turn, theoretically guarantees the effectiveness of our framework. To estimate information-theoretic quantities, our framework employs matrix-based Rényi’s $α$ -order entropy functional, which forgoes the need for variational approximation and distributional estimation in high-dimensional space. Theoretical proof is provided that our framework can faithfully discover both common and unique information from multi-view data. Experiments on synthetic and seven benchmark real-world datasets demonstrate the superior performance of our proposed framework over state-of-the-art approaches.
Relaxed multi-view discriminant analysis
2024, Engineering Applications of Artificial Intelligence
Consistency and complementarity are two important principles in multiview feature extraction. However, most current multiview feature extraction methods only explore the former but neglect the latter. To alleviate this limitation, in this article we propose a relaxed multiview discriminant analysis (RMDA) model. Firstly, a relaxed loss function is formulated to make the projection matrices have more degrees of freedom. Then two scatter matrices are utilized to preserve cross-view between-class and within-class discriminative information. The proposed RMDA explores the complementarity of multiple views while maintaining the consistency across different views. To solve the RMDA problem efficiently, an iteration strategy is proposed. Theoretical analysis demonstrates the effectiveness and quadratic convergence rate of the RMDA algorithm. To further deal with nonlinearities present in the data, a relaxed kernel multiview discriminant analysis (RKMDA) is put forward too. Several corroborating numerical tests using artificial dataset and real datasets are provided to showcase the merits of the RMDA and RKMDA relative to several competing methods.
Incomplete multi-view learning: Review, analysis, and prospects
2024, Applied Soft Computing
Multi-view data, stemming from diverse information sources, often suffer from incompleteness due to various factors such as equipment failure and data transmission issues. This challenge has given rise to the emerging field of incomplete multi-view learning (IML). To provide guidance for newcomers and researchers in this field, this survey systematically presents an in-depth analysis of IML from generative and discriminative perspectives, focusing on all missing scenarios and various learning tasks. Within these categories, discriminative methods are further classified into matrix learning-based IML and graph learning-based IML, while generative methods encompass generative adversarial networks-based IML, auto-encoder-based IML, and contrastive learning-based IML. Meanwhile, practical applications across various domains are summarized, with extensions of IML to multiple labels as well as unaligned views. To advance this field, we conclude that adapting multi-view learning for incomplete data, addressing complex and arbitrary missing scenarios, tackling high missing ratios, exploring regularization techniques, reducing noise impact, and integrating IML with other learning paradigms are valuable research directions in the future.
Joint group and pairwise localities embedding for feature extraction
2024, Information Sciences
Many practical applications generate high-dimensional data, which poses challenges in terms of computational time and storage. To address this issue, feature extraction has become a popular research topic. Learning by way of graph embedding is useful for discovering potential intrinsic low-dimensional structures and has thus gained wide attention. However, traditional embedding models typically utilize only one graph or one type of multiple graphs to capture local relationships, which may be insufficient for high-dimensional data that may exhibit different kinds of local relationships. To overcome this drawback, we developed a joint embedding framework that incorporates multiple types of graphs. Under this framework, a novel joint group and pairwise locality embedding model (GPE) is proposed. The GPE model has the following distinctive merits: (1) it simultaneously incorporates simple graphs, hypergraphs, and probabilistic hypergraphs, enabling the use of not only one type of multiple graphs but also multiple types of multiple graphs; (2) it can leverage both group relationships and pairwise graphs to discover local information during feature extraction; and (3) its objective function can be solved using an alternating optimization strategy, which involves solving eigenvalue problems and quadratic programming problems, resulting in very fast convergence. Finally, we arranged classification tasks and clustering tasks on several high-dimensional real-world datasets, and the experimental results prove that the identification capability of GPE is encouraging.
Incremental unsupervised feature selection for dynamic incomplete multi-view data
2023, Information Fusion
Multi-view unsupervised feature selection has been proven to be efficient in reducing the dimensionality of multi-view unlabeled data with high dimensions. The previous methods assume that all views are complete. However, in real applications, the multi-view data are often incomplete, i.e., some views of instances are missing, which will result in the failure of these methods. Besides, while the data arrive in form of streams, these existing methods will suffer the issues of high storage cost and expensive computation time. To address these issues, we propose an Incremental Incomplete Multi-view Unsupervised Feature Selection method (I $^{2}$ MUFS) on incomplete multi-view streaming data. By jointly considering the consistent and complementary information across different views, I $^{2}$ MUFS embeds the unsupervised feature selection into an extended weighted non-negative matrix factorization model, which can learn a consensus clustering indicator matrix and fuse different latent feature matrices with adaptive view weights. Furthermore, we introduce the incremental learning mechanisms to develop an alternative iterative algorithm, where the feature selection matrix is incrementally updated, rather than recomputing on the entire updated data from scratch. A series of experiments are conducted to verify the effectiveness of the proposed method by comparing with several state-of-the-art methods. The experimental results demonstrate the effectiveness and efficiency of the proposed method in terms of the clustering metrics and the computational cost.
Multiview Jointly Sparse Discriminant Common Subspace Learning
2023, Pattern Recognition
Multiview data leads to the demand for classifying samples from various views, and the large gap between different views makes the classification task challenging. Recently, researchers have extended linear discriminant analysis (LDA) to multi-view scenarios. However, the extended methods are generally associated with the small-class problem, that is, the projection size is limited by the number of classes. In addition, they are sensitive to variations in images or outliers. To solve these problems, this study proposes a generalized robust multiview discriminant analysis (GRMDA) to obtain a linear transform for each view and for learning multiview jointly sparse discriminant common subspace. GRMDA aims to achieve both maximal between-class and minimal within-class variation for data from multiple views in a common space. Instead of formulating the ratio trace problem, we reformulate GRMDA inspired by maximum margin criterion (MMC) to address the small-class problem. Moreover, the proposed method achieves stronger robustness by reconstructing the within-class and between-class scatter terms from the definition of $L_{2, 1}$ norm. Furthermore, GRMDA ensures joint sparsity using the $L_{2, 1}$ norm-based regularization term. Additionally, we present an iterative algorithm, convergence proof, and complexity analysis. Experiments on six popular databases, that is, COIL100, USPS/MNIST, Extended Yale Face B, AR, BBCSport, and multiple feature datasets, were conducted to evaluate the performance of GRMDA against the state-of-the-art multiview methods. The experimental results demonstrate that the proposed method can achieve a significant performance with strong robustness and fast convergence.

View all citing articles on Scopus

Shiliang Sun received the Ph.D. degree in pattern recognition and intelligent systems from Tsinghua University, Beijing, China, in 2007. He is a Professor with the School of Computer Science and Technology and the Head of the Pattern Recognition and Machine Learning Research Group, East China Normal University, Shanghai, China. From 2009 to 2010, he was a Visiting Researcher with the School of Computer Science, Centre for Computational Statistics and Machine Learning, University College London, London, U.K. In 2014, he was a Visiting Researcher with the Department of Electrical Engineering, Columbia University, New York, NY, USA. His current research interests include kernel methods, multiview learning, learning theory, approximate inference, sequential modeling, deep learning and their applications. His research results have expounded in over 100 publications at peer-reviewed journals and conferences, such as IEEE T-PAMI, JMLR, IEEE T-NNLS, IEEE T-Cybernetics, PR, NIPS, ICML, IJCAI and ECML. Prof. Sun is on the Editorial Board of multiple international journals, including Pattern Recognition and IEEE Transactions on Neural Networks and Learning Systems.

View full text

Incomplete multiview nonnegative representation learning with multiple graphs

Highlights

Abstract

Introduction

Section snippets

Related work

The proposed method

Optimization

Experiments

Conclusion

Declaration of Competing Interest

Acknowledgments

Pattern Recognit.

Pattern Recognit.

IEEE Trans. Cybern.

J. Mach. Learn. Res.

Pattern Recognit.

Multiview Machine Learning

Multiview graph restricted Boltzmann machines

IEEE Trans. Cybern.

Multi-view spectral clustering via integrating nonnegative embedding and spectral embedding

Inf. Fusion

Multi-view clustering and semi-supervised classification with adaptive neighbours

Proceedings of AAAI Conference on Artificial Intelligence, San Francisco, California, USA

Dual distance adaptive multiview clustering

Neurocomputing

Multi-view learning overview: recent progress and new challenges

Inf. Fusion

Multiple kernel k-means with incomplete kernels

IEEE Trans. Pattern Anal. Mach. Intell.

Efficient and effective regularized incomplete multi-view clustering

IEEE Trans. Pattern Anal. Mach. Intell.

Adaptive graph completion based incomplete multi-view clustering

IEEE Trans. Multimed.

Incomplete multi-view clustering via structured graph learning

Proceedings of Pacific Rim International Conference on Artificial Intelligence

Partial multiview clustering with locality graph regularization

Int. J. Intell. Syst.

Multiview clustering with incomplete views

Proceedings of Advances in Neural Information Processing Systems Workshop

Incomplete multi-view clustering

Proceedings of International Joint Conference on Intelligent Information Processing

Spectral perturbation meets incomplete multi-view data

Proceedings of International Joint Conference on Artificial Intelligence

Incomplete multi-modal visual data grouping

In Proceedings of. International Joint Conference on Artificial Intelligence, New York, NY, USA

Incomplete multiview spectral clustering with adaptive graph learning

IEEE Trans. Cybern.

Multiple kernel $k$ -means with incomplete kernels