Skip to main content
Log in

A Model-Free Subject Selection Method for Active Learning Classification Procedures

  • Published:
Journal of Classification Aims and scope Submit manuscript

Abstract

To construct a classification rule via an active learning method, during the learning process, users select training subjects sequentially, without knowing their labels, based on the model learned at the current stage. For a parametric-model-based classification rule, methods of statistical experimental design are popular guidelines for selecting new learning subjects. However, there is a lack of a counterpart for non-parametric-model-based classifiers, such as support vector machines. Thus, we propose a subject selection scheme via an extended influential index for the area under a receiver operating characteristic curve, which is applicable to general classifiers with continuous scores.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

References

  • Agresti, A. (2018). An introduction to categorical data analysis. New York: Wiley.

    MATH  Google Scholar 

  • Antal, B., & Hajdu, A. (2014). An ensemble-based system for automatic screening of diabetic retinopathy. Knowledge-Based Systems, 60, 20–27.

    Article  Google Scholar 

  • Chang, Y.-C.I., & Chen, R.-B. (2019). Active learning with simultaneous subject and variable selections. Neurocomputing, 329, 495–505.

    Article  Google Scholar 

  • Chen, Z., Wang, Z., & Chang, Y.-C.I. (2020). Sequential adaptive variables and subject selection for gee methods. Biometrics, 76(2), 496–507.

    Article  MathSciNet  Google Scholar 

  • Cook, R.D. (1986). Assessment of local influence. Journal of the Royal Statistical Society, Series B, 48(2), 133–169.

    MathSciNet  MATH  Google Scholar 

  • Deng, X., Joseph, V.R., Sudjianto, A., & Wu, C.J. (2009). Active learning through sequential design, with applications to detection of money laundering. Journal of the American Statistical Association, 104(487), 969–981.

    Article  MathSciNet  Google Scholar 

  • Dua, D., & Graff, C. (2017). UCI machine learning repository.

  • Hampel, F.R. (1974). The influence curve and its role in robust estimation. Journal of the American Statistical Association, 69(346), 383–393.

    Article  MathSciNet  Google Scholar 

  • Owen, A.B. (2001). Empirical likelihood. CRC Press.

  • Pepe, M. (2003). The statistical evaluation of medical tests for classification and prediction. Oxford University Press.

  • Pepe, M.S., & Cai, T. (2004). The analysis of placement values for evaluating discriminatory measures. Biometrics, 60(1), 528–535.

    Article  MathSciNet  Google Scholar 

  • Schein, A.I., & Ungar, L.H. (2007). Active learning for logistic regression: an evaluation. Machine Learning, 68(3), 235–265.

    Article  Google Scholar 

  • Tong, S., & Koller, D. (2001). Support vector machine active learning with applications to text classification. Journal of Machine Learning Research, 2(Nov), 45–66.

    MATH  Google Scholar 

  • Wang, J., & Park, E. (2017). Active learning for penalized logistic regression via sequential experimental design. Neurocomputing, 222, 183–190.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yuan-chin Ivan Chang.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

(PDF 193 KB)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ke, BS., Chang, Yc.I. A Model-Free Subject Selection Method for Active Learning Classification Procedures. J Classif 38, 544–555 (2021). https://doi.org/10.1007/s00357-021-09388-3

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00357-021-09388-3

Keywords