Semi-supervised learning via sparse model

doi:10.1016/j.neucom.2013.10.033

Neurocomputing

Volume 131, 5 May 2014, Pages 124-131

https://doi.org/10.1016/j.neucom.2013.10.033 Get rights and content

Abstract

Graph-based Semi-Supervised Learning (SSL) methods are the widely used SSL methods due to their high accuracy. They can well meet the manifold assumption with high computational cost, but don't meet the cluster assumption. In this paper, we propose a Semi-supervised learning via SPArse (SSPA) model. Since SSPA uses sparse matrix multiplication to depict the adjacency relations among samples, SSPA can approximate low dimensional manifold structure of samples with lower computational complexity than these graph-based SSL methods. Each column of this sparse matrix corresponds to one sparse representation of a sample. The rational is that the inner product of sparse representations can also be sparse under certain constraint. Since the dictionary in the SSPA model can depict the distribution of the entire samples, the sparse representation of a sample encodes its spatial location information. Therefore, in the SSPA model the manifold structure of samples is computed via their locations in the intrinsic geometry of the distribution instead of their feature vectors. In order to meet the cluster assumption, we propose an structured dictionary learning algorithm to explicitly reveal the cluster structure of the dictionary. We develop the SSPA algorithms with the structured dictionary besides non-structured one, and experiments show that our methods are efficient and outperform state-of-the-art graph-based SSL methods.

Introduction

Semi-supervised learning (SSL) uses a large number of unlabeled samples together with a small number of labeled samples to build a better learner. It has attracted great research interest in both theory and practice in the past decade [1]. There exist various approaches in SSL to exploit unlabeled samples, showing incremental performance improvements [2]. Among them, graph-based methods, also known as manifold methods [3], [4], are widely used due to their good performances [3], [5], [6].

Through establishing the graph structure to represent adjacency relations among samples, the graph-based SSL methods can approximate the intrinsically low-dimensional manifold structure well [7]. Existing ways for graph construction include K-nearest-neighbor(KNN) method and ε-ball based method [5], [6]. In general, they construct the graph structure in two steps: (1) choosing adjacency samples needed to be connected, and (2) determining the edge weights [6]. $l_{1}$ graph, recently proposed by [4], is constructed at one run by solving each sample's sparse representation based on the rest of the entire sample set. It can surpass KNN method and ε-ball based method due to its characteristics of the high discriminative power, sparsity, and adaptive neighborhood [8]. However, these graph-based SSL methods have common drawbacks: high computational cost always comes together with the graph construction. When the adjacency relations among samples are computed to construct the graph structure, the complexity is $O ({mn}^{2})$ [3], where n is the number of data samples and m is the dimensionality of the sample feature.

The graph-based SSL methods do not meet the cluster assumption. The cluster assumption states that if points are located in the same cluster, they are likely to belong to the same class. The success of SSL relies highly on certain semi-supervised assumptions (SSA) on the samples' distribution, which includes manifold and cluster assumptions [1]. The manifold assumption indicates that the high dimensional data lie on a low-dimensional manifold. Graph-based methods have not explored the cluster information of input distribution, so they may connect data points of different classes in cluster boundaries by the Euclidean distances of their feature vectors.

In order to overcome the above shortcomings of the graph-based SSL methods, we propose a new Semi-supervised learning method based on SPArse model, named SSPA. With the SSPA model, following advantages are achieved: Firstly adjacency relations among samples can be achieved by sparse matrix multiplication with a complexity of $O ({sn}^{2})$ [9], where s is the average number of nonzero elements in each sparse representation and far less than the feature dimensionality m, so $O ({sn}^{2}) ⪯¡ O ({mn}^{2})$ . Secondly, we explore the fact that the dictionary in SSPA model has the same cluster information as the training data samples, and thus we propose a structured dictionary learning algorithm to learn a structured dictionary which has explicit cluster information. With the structured dictionary the structured sparse model can be used in SSPA to gain better accuracy.

The main contribution of the work is that we propose a new SSPA model, with two kinds of dictionaries: non-structured [10] and structured. The SSPA model has a lower computational complexity than the graph-based SSL methods. With the structured dictionary the SSPA model can utilize the cluster information to improve the classification accuracy. The experimental results demonstrate that the proposed methods are efficient and outperform state-of-the-art graph-based SSL methods.

Section snippets

Related work

This work is closely related to the graph-based SSL methods and sparse model.

SSPA model

This section elaborates on the formulation of the SSPA model, and theoretically proves that the proposed SSPA model is capable of depicting the cluster and manifold structures of data samples.

Experiments

In this section, we conduct experiments to evaluate the accuracy and efficiency of the SSPA model at first, and then study the parameter sensitivity.

Conclusion and future work

In this paper, we propose a new model, SSPA, for semi-supervised learning based on sparse model. Since we compute adjacency matrix by a sparse matrix multiplication, the proposed method has lower computational cost than graph-based SSL methods. In order to meet the cluster assumption we implement SSPA algorithms with structured dictionary, besides non-structured one. With cluster information adjacency links among different clusters can be removed, and that can improve the accuracy of SSL.

Acknowledgments

This work was supported in part by the National Nature Science Foundation of China (61173054,61273247), and the National Key Technology Research and Development Program of China (2012BAH39B02).

Yu Wang received her Master's degree from Harbin Institute of Technology, China, in 2009. She is a Ph.D. candidate in the Institute of Computing Technology, Chinese Academy of Sciences (ICT-CAS), China.

Her current research interests include pattern recognition and content-based multimedia retrieval and indexing.

References (34)

Z. Lai et al.
Sparse two dimensional local discriminant projections for feature extraction
Neurocomputing
(2011)
K. Chen et al.
Semi-supervised learning via regularized boosting working on multiple semi-supervised assumptions
IEEE Trans. Pattern Anal. Mach. Intell.
(2011)
C. Leistner, A. Saffari, J. Santner, H. Bischof, Semi-supervised random forests, in: Proceedings of the 12th...
G.S. Mann, A. McCallum, Simple, robust, scalable semi-supervised learning via expectation regularization, in:...
B. Cheng et al.
Learning with l₁-graph for image analysis
IEEE Trans. Image Proc.
(2010)
X. Zhu, Z. Ghahramani, J. Lafferty, Semi-supervised learning using gaussian fields and harmonic functions, in:...
M. Belkin et al.
Manifold regularizationa geometric framework for learning from labeled and unlabeled examples
J. Mach. Learn. Res.
(2006)
J.B. Tenenbaum et al.
A global geometric framework for nonlinear dimensionality reduction
Science
(2000)
R. He, W.-S. Zheng, B.-G. Hu, X.-W. Kong, Nonnegative sparse coding for discriminative semi-supervised learning, in:...
S. Williams, L. Oliker, R. Vuduc, J. Shalf, K. Yelick, J. Demmel, Optimization of sparse matrix-vector multiplication...

Y. Wang, S. Tang, F. Liang, Y. Zhang, J. Li, Beyonds kmedoids: sparse model based medoids algorithm for representative...

M. Szummer, T. Jaakkola, Partially labeled classification with Markov random walks, in: Proceedings of the Advances in...

D. Cai, X. He, J. Han, Semi-supervised discriminant analysis, in: Proceedings of the 10th International Conference on...

B. Ni et al.

Learning a propagable graph for semisupervised learningclassification and regression

IEEE Trans. Knowl. Data Eng.

(2012)

W. Liu, J. He, S.-F. Chang, Large graph construction for scalable semi-supervised learning, in: Proceedings of the 27th...

J. Mairal et al.

Supervised dictionary learning

Adv. Neural Inf. Proc. Syst.

(2009)

Z. Lai et al.

Sparse approximation to the eigensubspace for discrimination

IEEE Trans. Neural Netw. Learn. Syst.

(2012)

Cited by (5)

Graph-based semi-supervised learning: A review
2020, Neurocomputing
Considering the labeled samples may be difficult to obtain because they require human annotators, special devices, or expensive and slow experiments. Semi-supervised learning (SSL) has tremendous practical value. Moreover, graph-based SSL methods have received more attention since their convexity, scalability and effectiveness in practice. The convexity of graph-based SSL guarantees that the optimization problems become easier to obtain local solution than the general case. The scalable graph-based SSL methods are convenient to deal with large-scale dataset for big data. Graph-based SSL methods aim to learn the predicted function for the labels of those unlabeled samples by exploiting the label dependency information reflected by available label information. The main purpose of this paper is to provide a comprehensive study of graph-based SSL. Specifically, the concept of the graph is first given before introducing graph-based semi-supervised learning. Then, we build a framework that divides the corresponding works into transductive graph-based SSL, inductive graph-based SSL, and scalable graph-based SSL. The core idea of these models is to impose graph constraints to the optimal function, which guarantees the smoothness over the graph. Next, several representative graph-based SSL methods are conducted on the three data sets, including two face data sets and a natural image data set. Finally, we outlook several directions for future work of graph-based SSL, and hope our review on graph-based SSL will offer insights for further research.
Expert list-wise ranking method based on sparse learning
2016, Neurocomputing
Expert ranking is the core issue of expert retrieval. Taking into consideration the complexity of feature redundancy in traditional dense listwise Learning to Rank method and local optimum in parameter learning, the article proposed the expert listwise Learning to Rank method based on sparse learning. The objective function was defined through the optimization process of experts listwise ranking performance index. Then the Learning to Rank loss function was solved by the objective function. Thus feature dimension reduction was achieved by the feature threshold from the loss-control function of sparse learning algorithm and the steps above. In order to verify whether the feature threshold is optimal, the article made cross validation with the feature threshold and the objective function of model parameter vector to get the optimal model parameters vector and to verify the feature threshold. Meanwhile the article realized expert ranking via the expert listwise ranking model based on sparse learning, which depends on feature dimension reduction and parameter tuning. At last, the contrast experiments of expert ranking proved the effectiveness of the proposed method, which supported expert listwise ranking strongly.
Semi-supervised image segmentation with globalized probability of boundary and simple linear iterative clustering
2018, ICNC-FSKD 2017 - 13th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery
An image segmentation method based on Simple Linear Iterative Clustering and graph-based semi-supervised learning
2016, Proceedings of 2015 International Conference on Orange Technologies, ICOT 2015
Semi-supervised two-dimensional manifold learning based on pair-wise constraints
2014, Proceedings of the 33rd Chinese Control Conference, CCC 2014

Her current research interests include pattern recognition and content-based multimedia retrieval and indexing.

Sheng Tang received his Ph.D. degree in computer application technology at the Institute of Computing Technology, Chinese Academy of Sciences (ICT-CAS), China, in 2006. He is now an associate professor in the ICT-CAS. He visited National University of Singapore (NUS) for participating in TREC Video Retrieval Evaluation (TRECVid) tasks, in 2006. From 2006 to 2008, he was the TRECVid team (MCG-ICT-CAS) leader of the Multimedia Computing Groupat ICT-CAS. From February 2009 to February 2010, he worked as a visiting research fellow in NUS under the instruction of Prof. Chua Tat-Seng. His current research interests are in the fields of pattern recognition and machine learning, multimedia information processing, in particular, on indexing, retrieval and extraction of information in image and video.

Dr. Tang is a member of ACM, IEEE, and senior member of China Computer Federation (CCF). He serves as the reviewer for a number of famous international conferences and journals, such as IEEE Transactions on Information Forensics & Security, Neurocomputing, Elsevier Data & Knowledge Engineering (DKE), Journal of Visual Communication and Image Representation, Multimedia Tools and Applications, Journal of Computer Science and Technology, Chinese Journal of Computers, Chinese Journal of Computer Research and Development, etc. His homepage: http://mcg.ict.ac.cn/people/shengtang.htm.

Yan-Tao Zheng received his Ph.D. from National University of Singapore (with Best Ph.D. Thesis Award) and B.E. (with 1st class Hons) from Nanyang Technological University, Singapore. He is a research scientist at Institute for Infocomm Research, Singapore. His research interests include geo-mining in multimedia, image annotation and video search. During his attachment at Google Inc., in 2008, he developed a world-scale landmark recognition engine together with Google engineers, which has been highly praised and well publicized.

Dr. Zheng is the recipient of a number of international awards, including Champion of Star Challenge, Microsoft Research Fellowship, IBM Waston Emerging Multimedia Leaders, and so on. He has served as program committee member and reviewer of a number of prestigious international conferences and journals.

Yong-Dong Zhang received his Ph.D. degree in Electronic Engineering from Tianjin University, Tianjin, China, in 2002. Dr. Zhang is currently a Professor in the ICT-CAS, China. His research interests are in the field of video coding and transcoding, video analysis and retrieval, and universal media access.

Jin-Tao Li received his Ph.D. degree in computer science at the Institute of Computing Technology, Chinese Academy of Sciences (ICT-CAS), China, in 1989. He spent the years of 1989, 1990 at Academy of Sciences of the Czech Republic as a visiting scholar.

Dr. Li is currently a professor and Ph.D. supervisor in the ICT-CAS, China. His current research interests focus on the areas of digital image processing, video coding, multimedia content analysis and retrieval, digital watermarking, and network computing technology, etc.

View full text

Semi-supervised learning via sparse model

Abstract

Introduction

Section snippets

Related work

SSPA model

Experiments

Conclusion and future work

Acknowledgments

Neurocomputing

Semi-supervised learning via regularized boosting working on multiple semi-supervised assumptions

IEEE Trans. Pattern Anal. Mach. Intell.

Learning with l1-graph for image analysis

IEEE Trans. Image Proc.

Manifold regularizationa geometric framework for learning from labeled and unlabeled examples

J. Mach. Learn. Res.

A global geometric framework for nonlinear dimensionality reduction

Science