Abstract
Graph-based semi-supervised learning (GSSL) is one of the most important semi-supervised learning (SSL) paradigms. Though GSSL methods are helpful in many situations, they may hurt performance when using unlabeled data. In this paper, we propose a new GSSL method GsslIs based on instance selection in order to reduce the chances of performance degeneration. Our basic idea is that given a set of unlabeled instances, it is not the best to exploit all the unlabeled instances; instead, we should exploit the unlabeled instances which are highly possible to help improve the performance, while do not take the ones with high risk into account. Experiments on a board range of data sets show that the chance of performance degeneration of our proposal is much smaller than that of many state-of-the-art GSSL methods.
This research was supported by NSFC (61403186), JiangsuSF (BK20140613), 863 Program (2015AA015406).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
References
Belkin, M., Niyogi, P.: Semi-supervised learning on Riemannian manifolds. Mach. Learn. 56, 209–239 (2004)
Belkin, M., Niyogi, P.: Laplacian eigenmaps and spectral techniques for embedding and clustering. In: Advances in Neural Information Processing Systems, vol. 14, pp. 585–591. MIT Press (2002)
Blum, A., Mitchell, T.: Combining labeled and unlabeled data with co-training. In: Proceedings of the 7th Annual Conference on Computational Learning Theory, Madison, WI (1998)
Blum, A., Chawla, S.: Learning from labeled and unlabeled data using graph mincuts. In: Proceedings of the 18th International Conference on Machine Learning, Williamstown, MA, pp. 19–26 (2001)
Camps-Valls, G., Marsheva, T.V.B., Zhou, D.: Semi-supervised graph-based hyperspectral image classification. IEEE Trans. Geosci. Remote Sens. 45(10), 3044–3054 (2007)
Chapelle, O., Schölkopf, B., Zien, A., et al.: Semi-supervised learning (2006)
Joachims, T.: Transductive inference for text classification using support vector machines. In: Proceedings of the 16th International Conference on Machine Learning, Bled, Slovenia, pp. 200–209 (1999)
Joachims, T.: Transductive learning via spectral graph partitioning. In: Proceedings of the 20th International Conference on Machine Learning, Washington, DC, pp. 290–297 (2003)
Karlen, M., Weston, J., Erkan, A., Collobert, R.: Large scale manifold transduction. In: Proceedings of the 25th International Conference on Machine Learning, Helsinki, Finland, pp. 775–782 (2008)
Kuncheva, L.I., Whitaker, C.J., Shipp, C.A., Duin, R.P.: Limits on the majority vote accuracy in classifier fusion. Pattern Anal. Appl. 6(1), 22–31 (2003)
Li, Y.-F., Wang, S.-B., Zhou, Z.-H.: Graph quality judgement: a large margin expedition. In: Proceedings of the 25th International Joint Confernece on Artificial Intelligence, New York, NY (2016)
Wang, F., Zhang, C.: Label propagation through linear neighborhoods. IEEE Trans. Knowl. Data Eng. 20(1), 55–67 (2008)
Zhou, D., Bousquet, O., Lal, T.N., Weston, J., Schölkopf, B.: Learning with local and global consistency. In: Advances in Neural Information Processing Systems, vol. 16, pp. 595–602. MIT Press, Cambridge (2004)
Zhu, X.: Semi-supervised learning literature survey. Technical report. University of Wisconsin-Madison (2007)
Zhu, X., Ghahramani, Z., Lafferty, J.: Semi-supervised learning using Gaussian fields and harmonic functions. In: Proceedings of the 20th International Conference on Machine Learning, Washington, DC, pp. 912–919 (2003)
Zhu, X., Lafferty, J., Rosenfeld, R.: Semi-supervised learning with graphs. Ph.D. thesis. Carnegie Mellon University (2005)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Wang, H., Wang, SB., Li, YF. (2016). Instance Selection Method for Improving Graph-Based Semi-supervised Learning. In: Booth, R., Zhang, ML. (eds) PRICAI 2016: Trends in Artificial Intelligence. PRICAI 2016. Lecture Notes in Computer Science(), vol 9810. Springer, Cham. https://doi.org/10.1007/978-3-319-42911-3_47
Download citation
DOI: https://doi.org/10.1007/978-3-319-42911-3_47
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-42910-6
Online ISBN: 978-3-319-42911-3
eBook Packages: Computer ScienceComputer Science (R0)