Combining Committee-Based Semi-supervised and Active Learning and Its Application to Handwritten Digits Recognition

Abdel Hady, Mohamed Farouk; Schwenker, Friedhelm

doi:10.1007/978-3-642-12127-2_23

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 5997)

Included in the following conference series:

International Workshop on Multiple Classifier Systems

Abstract

Semi-supervised learning reduces the cost of labeling training data for a supervised learning algorithm by using unlabeled data together with labeled data to improve performance. Co-Training is a popular semi-supervised learning algorithm that requires multiple redundant and independent sets of features (views). In many real-world application domains, this requirement cannot be satisfied. In this paper, a single-view variant of Co-Training, CoBC (Co-Training by Committee), is proposed, which requires an ensemble of diverse classifiers instead of redundant and independent views. We then introduce two new learning algorithms, QBC-then-CoBC and QBC-with-CoBC, which combine the merits of committee-based semi-supervised learning and committee-based active learning. An empirical study on handwritten digit recognition is conducted in which the random subspace method (RSM) is used to create ensembles of diverse C4.5 decision trees. Experiments show that these two combinations outperform the other, non-committee-based ones.
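
The two committee-based selection steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the use of vote entropy as the disagreement measure, the majority-vote fraction as the confidence measure, and the example votes are all assumptions made for illustration. The idea shown is only that an active learning step (QBC, Query by Committee) asks for labels where the committee disagrees most, while a semi-supervised step (in the spirit of CoBC) pseudo-labels examples on which the committee agrees strongly, and that a random subspace committee gives each member its own random subset of the features.

```python
import math
import random
from collections import Counter


def random_subspaces(n_features, n_members, subspace_size, seed=0):
    """Draw one random feature-index subset per committee member (RSM-style)."""
    rng = random.Random(seed)
    return [sorted(rng.sample(range(n_features), subspace_size))
            for _ in range(n_members)]


def vote_entropy(votes):
    """Committee disagreement on one example; zero means a unanimous vote."""
    counts = Counter(votes)
    total = len(votes)
    return -sum((c / total) * math.log(c / total) for c in counts.values())


def majority_vote(votes):
    """Return (winning label, fraction of members that voted for it)."""
    label, count = Counter(votes).most_common(1)[0]
    return label, count / len(votes)


def qbc_select(committee_votes, n_queries):
    """Active step: indices of the examples with the highest disagreement."""
    ranked = sorted(range(len(committee_votes)),
                    key=lambda i: vote_entropy(committee_votes[i]),
                    reverse=True)
    return ranked[:n_queries]


def cobc_select(committee_votes, threshold):
    """Semi-supervised step: (index, pseudo-label) pairs with confident votes."""
    selected = []
    for i, votes in enumerate(committee_votes):
        label, confidence = majority_vote(votes)
        if confidence >= threshold:
            selected.append((i, label))
    return selected


# Hypothetical votes of a 5-member committee on 4 unlabeled digit images.
votes = [
    [3, 3, 3, 3, 3],  # unanimous
    [1, 7, 1, 7, 4],  # strong disagreement
    [8, 8, 8, 8, 3],  # mostly agreed
    [5, 6, 5, 6, 5],  # split between two labels
]

print(qbc_select(votes, n_queries=1))     # candidates to send to a human labeler
print(cobc_select(votes, threshold=0.8))  # candidates to pseudo-label
```

In a full loop, the queried and pseudo-labeled examples would be added to the labeled set and the committee retrained. Whether the active step runs before the semi-supervised step or is interleaved with it is what the names QBC-then-CoBC and QBC-with-CoBC suggest, though the abstract does not spell out the details.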

Author information

Authors and Affiliations

Institute of Neural Information Processing, University of Ulm, D-89069, Ulm, Germany
Mohamed Farouk Abdel Hady & Friedhelm Schwenker

Editor information

Editors and Affiliations

Center for Informatics Science, Nile University, 12677, Giza, Egypt
Neamat El Gayar
Centre for Vision, Speech and Signal Processing, University of Surrey, GU2 7XH, Guildford, Surrey, UK
Josef Kittler
Department of Electrical and Electronic Engineering, University of Cagliari, Piazza d’Armi, 09123, Cagliari, Italy
Fabio Roli

About this paper

Cite this paper

Abdel Hady, M.F., Schwenker, F. (2010). Combining Committee-Based Semi-supervised and Active Learning and Its Application to Handwritten Digits Recognition. In: El Gayar, N., Kittler, J., Roli, F. (eds) Multiple Classifier Systems. MCS 2010. Lecture Notes in Computer Science, vol 5997. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12127-2_23

DOI: https://doi.org/10.1007/978-3-642-12127-2_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12126-5
Online ISBN: 978-3-642-12127-2
eBook Packages: Computer Science, Computer Science (R0)