Abstract
In many practical data mining applications, such as web categorization, key gene selection, etc., generally, unlabeled training examples are readily available, but labeled ones are fairly expensive to obtain. Therefore, in recent years, semi-supervised learning algorithms such as graph-based methods have attracted much attention. However, most of these traditional methods adopted a Gaussian function to calculate the edge weights of the graph. In this paper, a novel weight for semi-supervised graph-based methods is proposed. The new method adds the label information from problem into the target function, and adopts the geodesic distance rather than Euclidean distance as the measure of the difference between two data points when conducting the calculation. In addition, we also add class prior knowledge from problem into semi-supervised learning algorithm. Here we address the problem of learning with local and global consistency (LGC). It was found that the effect of class prior knowledge was probably different between under high label rate and low label rate. Furthermore, we integrate sparse representation (SR) in LGC algorithm. Experiments on a UCI data set show that our proposed method outperforms the original algorithms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Chapelle, O., Schölkopf, B., Zien, A., et al.: Semi-supervised learning, vol. 2. MIT Press, Cambridge (2006)
Seeger, M.: Learning with labeled and unlabeled data, Tech. Rep., University of Edinburgh (2001)
Zhu, X.: Semi-supervised learning literature survey (2005), http://www.cs.wisc.edu/~jerryzhu/pub/ssl_survey.pdf
Zhou, D., Bousquet, O., Lal, T., Weston, J., Schölkopf, B.: Learning with local and global consistency. In: Advances in Neural Information Processing Systems, vol. 16, pp. 321–328 (2004)
Belkin, M., Niyogi, P.: Semi-supervised learning on riemannian manifolds. Machine Learning 56(1), 209–239 (2004)
Zhu, X., Ghahramani, Z., Lafferty, J.: Semi-supervised learning using gaussian fields and harmonic functions. In: Proceedings of the Twentieth International Conference on Machine Learning, vol. 20, p. 912 (2003)
Wright, J., Yang, A., Ganesh, A., Sastry, S., Ma, Y.: Robust face recognition via sparse representation. IEEE Transactions on Pattern Analysis and Machine Intelligence 31(2), 210–227 (2009)
Cheng, B., Yang, J., Yan, S., Fu, Y., Huang, T.: Learning with l 1-graph for image analysis. IEEE Transactions on Image Processing 19(4), 858–866 (2010)
Kim, K., Kwon, Y.: Single-image super-resolution using sparse regression and natural image prior. IEEE Transactions on Pattern Analysis and Machine Intelligence 32(6), 1127–1133 (2010)
Rish, I., Grabarnik, G.: Sparse signal recovery with exponential-family noise. In: Proceedings of the 47th Annual Allerton Conference on Communication, Control, and Computing, pp. 60–66. IEEE (2009)
Yan, S., Wang, H.: Semi-supervised learning by sparse representation. In: Proceedings of SIAM International Conference on Data Mining, pp. 792–801 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gui, J., Zhao, Z., Hu, R., Jia, W. (2013). Semi-supervised Learning with Local and Global Consistency by Geodesic Distance and Sparse Representation. In: Yang, J., Fang, F., Sun, C. (eds) Intelligent Science and Intelligent Data Engineering. IScIDE 2012. Lecture Notes in Computer Science, vol 7751. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-36669-7_16
Download citation
DOI: https://doi.org/10.1007/978-3-642-36669-7_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-36668-0
Online ISBN: 978-3-642-36669-7
eBook Packages: Computer ScienceComputer Science (R0)