
An active learning algorithm for multi-class classification

  • Theoretical Advances
Pattern Analysis and Applications

Abstract

Because training sets are often very large, annotating the data consumes considerable time and effort. Active learning algorithms can substantially reduce the number of instances that need to be annotated. In this paper, the authors propose a new active learning algorithm designed mainly for multi-class classification models based on the support vector machine (SVM). The algorithm first selects the unlabeled instances that can improve several of the SVM classifiers in the multi-class model, so that adding one newly selected instance to the training set improves more than one classification hyperplane. During instance selection, the algorithm also tries to choose the instance that is least similar to the instances that have already been annotated. In this way, the instances selected for annotation are representative of the features of the whole dataset.
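The two selection criteria described in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm, whose details are not given in the abstract. It assumes that an instance "can improve" a classifier when it falls inside that classifier's margin (|f(x)| < 1), that similarity is measured by cosine similarity, and that the two criteria are combined lexicographically. The function names and the `margin` parameter are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def select_instance(unlabeled, labeled, decision_values, margin=1.0):
    """Return the index of the unlabeled instance to annotate next.

    decision_values[i][k] is the signed output of the k-th binary SVM
    for unlabeled[i]. Both criteria below are assumptions made for
    illustration, not the paper's definitions:
      1. prefer instances lying inside the margin of many classifiers;
      2. among those, prefer the one least similar to labeled data.
    """
    best_index, best_key = None, None
    for i, x in enumerate(unlabeled):
        # Assumed proxy for "improves several classifiers": count the
        # binary SVMs whose margin contains this instance.
        n_affected = sum(1 for f in decision_values[i] if abs(f) < margin)
        # Highest similarity to any already-annotated instance.
        max_sim = max((cosine_similarity(x, z) for z in labeled), default=0.0)
        # Lexicographic key: more affected classifiers, then lower similarity.
        key = (n_affected, -max_sim)
        if best_key is None or key > best_key:
            best_index, best_key = i, key
    return best_index
```

In a full active learning loop, the selected instance would be annotated, moved from the unlabeled pool to the training set, and the SVMs retrained before the next selection. The lexicographic ordering is only one way to combine the criteria; a weighted score would be an equally plausible reading of the abstract.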



Author information


Correspondence to Dongjiang Liu.


About this article


Cite this article

Liu, D., Liu, Y. An active learning algorithm for multi-class classification. Pattern Anal Applic 22, 1051–1063 (2019). https://doi.org/10.1007/s10044-018-0716-1
