Skip to main content

Instance Selection Method for Improving Graph-Based Semi-supervised Learning

  • Conference paper
  • First Online:
PRICAI 2016: Trends in Artificial Intelligence (PRICAI 2016)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9810))

Included in the following conference series:

Abstract

Graph-based semi-supervised learning (GSSL) is one of the most important semi-supervised learning (SSL) paradigms. Though GSSL methods are helpful in many situations, they may hurt performance when using unlabeled data. In this paper, we propose a new GSSL method GsslIs based on instance selection in order to reduce the chances of performance degeneration. Our basic idea is that given a set of unlabeled instances, it is not the best to exploit all the unlabeled instances; instead, we should exploit the unlabeled instances which are highly possible to help improve the performance, while do not take the ones with high risk into account. Experiments on a board range of data sets show that the chance of performance degeneration of our proposal is much smaller than that of many state-of-the-art GSSL methods.

This research was supported by NSFC (61403186), JiangsuSF (BK20140613), 863 Program (2015AA015406).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Downloaded from http://archive.ics.uci.edu/ml/datasets.html.

  2. 2.

    http://pages.cs.wisc.edu/~jerryzhu/pub/harmonic_function.m.

References

  • Belkin, M., Niyogi, P.: Semi-supervised learning on Riemannian manifolds. Mach. Learn. 56, 209–239 (2004)

    Article  MATH  Google Scholar 

  • Belkin, M., Niyogi, P.: Laplacian eigenmaps and spectral techniques for embedding and clustering. In: Advances in Neural Information Processing Systems, vol. 14, pp. 585–591. MIT Press (2002)

    Google Scholar 

  • Blum, A., Mitchell, T.: Combining labeled and unlabeled data with co-training. In: Proceedings of the 7th Annual Conference on Computational Learning Theory, Madison, WI (1998)

    Google Scholar 

  • Blum, A., Chawla, S.: Learning from labeled and unlabeled data using graph mincuts. In: Proceedings of the 18th International Conference on Machine Learning, Williamstown, MA, pp. 19–26 (2001)

    Google Scholar 

  • Camps-Valls, G., Marsheva, T.V.B., Zhou, D.: Semi-supervised graph-based hyperspectral image classification. IEEE Trans. Geosci. Remote Sens. 45(10), 3044–3054 (2007)

    Article  Google Scholar 

  • Chapelle, O., Schölkopf, B., Zien, A., et al.: Semi-supervised learning (2006)

    Google Scholar 

  • Joachims, T.: Transductive inference for text classification using support vector machines. In: Proceedings of the 16th International Conference on Machine Learning, Bled, Slovenia, pp. 200–209 (1999)

    Google Scholar 

  • Joachims, T.: Transductive learning via spectral graph partitioning. In: Proceedings of the 20th International Conference on Machine Learning, Washington, DC, pp. 290–297 (2003)

    Google Scholar 

  • Karlen, M., Weston, J., Erkan, A., Collobert, R.: Large scale manifold transduction. In: Proceedings of the 25th International Conference on Machine Learning, Helsinki, Finland, pp. 775–782 (2008)

    Google Scholar 

  • Kuncheva, L.I., Whitaker, C.J., Shipp, C.A., Duin, R.P.: Limits on the majority vote accuracy in classifier fusion. Pattern Anal. Appl. 6(1), 22–31 (2003)

    Article  MathSciNet  MATH  Google Scholar 

  • Li, Y.-F., Wang, S.-B., Zhou, Z.-H.: Graph quality judgement: a large margin expedition. In: Proceedings of the 25th International Joint Confernece on Artificial Intelligence, New York, NY (2016)

    Google Scholar 

  • Wang, F., Zhang, C.: Label propagation through linear neighborhoods. IEEE Trans. Knowl. Data Eng. 20(1), 55–67 (2008)

    Article  Google Scholar 

  • Zhou, D., Bousquet, O., Lal, T.N., Weston, J., Schölkopf, B.: Learning with local and global consistency. In: Advances in Neural Information Processing Systems, vol. 16, pp. 595–602. MIT Press, Cambridge (2004)

    Google Scholar 

  • Zhu, X.: Semi-supervised learning literature survey. Technical report. University of Wisconsin-Madison (2007)

    Google Scholar 

  • Zhu, X., Ghahramani, Z., Lafferty, J.: Semi-supervised learning using Gaussian fields and harmonic functions. In: Proceedings of the 20th International Conference on Machine Learning, Washington, DC, pp. 912–919 (2003)

    Google Scholar 

  • Zhu, X., Lafferty, J., Rosenfeld, R.: Semi-supervised learning with graphs. Ph.D. thesis. Carnegie Mellon University (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yu-Feng Li .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Wang, H., Wang, SB., Li, YF. (2016). Instance Selection Method for Improving Graph-Based Semi-supervised Learning. In: Booth, R., Zhang, ML. (eds) PRICAI 2016: Trends in Artificial Intelligence. PRICAI 2016. Lecture Notes in Computer Science(), vol 9810. Springer, Cham. https://doi.org/10.1007/978-3-319-42911-3_47

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-42911-3_47

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-42910-6

  • Online ISBN: 978-3-319-42911-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics