Abstract
Active learning aims to reduce the number of training examples that must be labeled by automatically processing the unlabeled examples and then selecting the most informative ones, with respect to a given cost function, for a human to label. The main problem is to find the selection strategy that reaches high classification accuracy most quickly. The Query-by-Committee (QBC) method of active learning requires less computation than other active learning approaches, but its classification accuracy cannot match that of passive learning. In this paper, a new selection strategy for the QBC method is presented that combines Vote Entropy with Kullback-Leibler divergence. Experimental results show that the proposed algorithm achieves higher classification accuracy than the previous QBC approach. It can reach the same accuracy as passive learning with few labeled training examples.
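The abstract does not say how Vote Entropy and Kullback-Leibler divergence are combined, so the following is only a minimal sketch of a committee disagreement score, not the authors' method. It assumes an unweighted sum of the two measures, and the names `vote_entropy`, `mean_kl_to_consensus`, `disagreement`, and `select_query` are hypothetical. Each committee member is represented simply by its predicted class distribution for an unlabeled example.

```python
import math


def vote_entropy(votes, num_classes):
    """Entropy of the committee's hard votes for one example."""
    k = len(votes)
    entropy = 0.0
    for c in range(num_classes):
        share = votes.count(c) / k
        if share > 0:
            entropy -= share * math.log(share)
    return entropy


def mean_kl_to_consensus(member_probs):
    """Average KL divergence of each member's distribution from the committee mean."""
    k = len(member_probs)
    num_classes = len(member_probs[0])
    consensus = [sum(p[c] for p in member_probs) / k for c in range(num_classes)]
    total = 0.0
    for p in member_probs:
        # Terms with p[c] == 0 contribute nothing; consensus[c] > 0 whenever p[c] > 0.
        total += sum(
            p[c] * math.log(p[c] / consensus[c])
            for c in range(num_classes)
            if p[c] > 0
        )
    return total / k


def disagreement(member_probs):
    """ASSUMPTION: unweighted sum of both measures; the paper's rule may differ."""
    num_classes = len(member_probs[0])
    # Each member votes for its most probable class.
    votes = [max(range(num_classes), key=p.__getitem__) for p in member_probs]
    return vote_entropy(votes, num_classes) + mean_kl_to_consensus(member_probs)


def select_query(pool):
    """Index of the unlabeled example on which the committee disagrees most."""
    return max(range(len(pool)), key=lambda i: disagreement(pool[i]))


# Two unlabeled examples, each scored by a two-member committee.
pool = [
    [[0.9, 0.1], [0.9, 0.1]],  # members agree
    [[0.9, 0.1], [0.1, 0.9]],  # members disagree
]
query_index = select_query(pool)
```

In this toy pool the second example would be sent to the human labeler, because the members vote for different classes and their distributions diverge from the committee mean. A weighted sum or a product of the two measures would be an equally plausible reading of "combining".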
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhao, Y., Xu, C., Cao, Y. (2006). Research on Query-by-Committee Method of Active Learning and Application. In: Li, X., Zaïane, O.R., Li, Z. (eds) Advanced Data Mining and Applications. ADMA 2006. Lecture Notes in Computer Science, vol 4093. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11811305_107
DOI: https://doi.org/10.1007/11811305_107
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37025-3
Online ISBN: 978-3-540-37026-0
eBook Packages: Computer Science (R0)