Abstract
Since the number of instances in the training set is very large, data annotating task consumes plenty of time and energy. Active learning algorithms can efficiently reduce the number of instances that need to be annotated. In this paper, authors propose a new active learning algorithm. The algorithm is mainly proposed for multi-class classification model based on support vector machine (SVM). In the algorithm, the unlabeled instances that can promote several SVM classifiers in the multi-class classification model will be selected firstly. So when one newly selected instance is added into training set, more than one classification hyper-planes in the multi-class classification model will be promoted. During the process of instance selection, the algorithm also tries to choose the instance that is least similar with the instances that have already been annotated. In this way, the instances selected by the algorithm for annotating will perfectly represent the feature of the whole dataset.
Similar content being viewed by others
References
Tong S, Koller D (2002) Support vector machine active learning with applications to text classification. J Mach Learn Res 2:45–66
Bottou L, et al (1994) Comparison of classifier methods: a case study in handwriting digit recognition. In: International conference on pattern recognition, p 77
Freund Y, Seung HS, Shamir E, Tishby N (1997) Selective sampling using the query by committee algorithm. Mach Learn 28:133–168
Xu Z, Yu K, Tresp V, Xu X, Wang J (2003) Representative sampling for text classification using support vector machine. In: European conference on information retrieval
Dasgupta S, Hsu D (2008) Hierarchical sampling for active learning. In: Proceedings the 25th international conference on machine learning, pp 208–215
Nguyen HT, Smeulders A (2004) Active learning using pre-clustering. In: Proceedings of the 21st international conference on machine learning, p 79
Holub A, Perona P, Burl MC (2008) Entropy-based active learning for object recognition. In: Computer vision and pattern recognition
Abe N, Mamitsuka H (2002) Query learning using boosting and bagging. In: International conference on machine learning
Thompson Cynthia A, Califf ME, Raymond J (1999) Active learning for natural language parsing and information extraction. In: ICML, pp 406–414
Melville P, Yang SM, Saar-Tsechansky M, Mooney R (2005) Active learning for probability estimation using Jensen–Shannon divergence. In: European conference on machine learning, pp 268–279
Scheffer T, Wrobel S (2001) Active learning of partially hidden markov models. In: Proceedings of the ECML/PKDD workshop on instance selection
Roy N, Mccallum A (1978) Toward optimal active learning through sampling estimation of error reduction. In: Proceedings international conference on machine learning, p 441448
Tong S, Koller D (2000) Active learning for parameter estimation in bayesian networks. In: NIPS, pp 647–653
Lindenbaum M, Markovitch S, Rusakov D (1999) Selective sampling for nearest neighbor classifiers. In: Proceedings of the sixteenth national conference on artificial intelligence (AAAI-99) and the eleventh conference on innovative applications of artificial intelligence (IAAI-99), pp 366–371
Pasolli E, Melgani F, Tuia D et al (2014) SVM active learning approach for image classification using spatial information. IEEE Trans Geosci Remote Sens 52:2217–2233
Li X, Guo Y (2013) Active learning with multi-Label SVM Classification. In: IJCAI, pp 1479–1485
Gan J, Li A, Lei QL, et al (2017) K-means based on active learning for support vector machine. In: 2017 IEEE/ACIS 16th international conference on computer and information science (ICIS). IEEE, pp 727–731
Tuia D, Ratle F, Pacifici F et al (2009) Active learning methods for remote sensing image classification. IEEE Trans Geosci Remote Sens 47:2218–2232
Settles B, Craven M (2008) An analysis of active learning strategies for sequence labeling tasks. In: Proceedings of the conference on empirical methods in natural language processing. Association for Computational Linguistics, pp 1070–1079
Smailovic J, Grcar M, Lavrac N et al (2014) Stream-based active learning for sentiment analysis in the financial domain. Inf Sci 285:181–203
Hu X, Tang J, Gao H, et al (2013) Actnet: active learning for networked texts in microblogging. In: Proceedings of the 2013 SIAM international conference on data mining, society for industrial and applied mathematics, pp 306–314
Weigl E, Heidl W, Lughofer E, Eitzinger C, Radauer T (2016) On improving performance of surface inspection systems by on-line active learning and flexible classifier updates. Mach Vis Appl 27:103–127
Lughofer E, Weigl E, Heidl W, Eitzinger C, Radauer T (2016) Recognizing input space and target concept drifts with scarcely labelled and unlabelled instances. Inf Sci 355:127–151
Settles B (2012) Active learning. Synth Lect Artif Intell Mach Learn 6:1–114
Lughofer E (2017) On-line active learning: a new paradigm to improve practical useability of data stream modeling methods. Inf Sci 415–416:356–376
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Liu, D., Liu, Y. An active learning algorithm for multi-class classification. Pattern Anal Applic 22, 1051–1063 (2019). https://doi.org/10.1007/s10044-018-0716-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10044-018-0716-1