Summary
Skewed binary classification concerns the assignment of a new unknown object to one of two populations, 0 or 1, on the basis of a q-dimensional vector x = (x1, …xq), where one of the populations, for example population 0, is the prevalent class. Assignment rules are developed from learning samples of known objects, that is, objects known to come from each of the two populations. Since population 1 is the rare class, overfitting and generalization problems arise easily for many classification models. We propose an effective solution by assigning more weights to class 1. The idea is to produce noisy replicates of the rare cases while keeping the dominant class 0 cases unchanged. The classification models considered are: nearest neighbor method, neural networks, classification trees, and quadratic discriminant. Noisy replication of the rare cases was applied to three real world and simulated data sets. Encouraging results were obtained for all the classification models considered.






Similar content being viewed by others
6 References
Bishop, C. (1995). Training with noise is equivalent to Tikohonov regularization. Neural Computation, 7, pp.108–116.
Breiman, L. (1996). Bagging predictors. Machine Learning, 26, No. 2, pp.123–140.
Breiman, L., Friedman, J. H., Olshen, R. A., and Stone, C. J. (1984). Classification and Regression Trees. Wadsworth & Brooks, Monterey, California.
Hanley, J.A. and McNeil, B.J. (1982). The meaning and use of the area under a receiver operating characteristics (ROC) curve. Radiology, 143, pp.29–36.
Hertz, J., Krogh, A., and Palmer, R.G. (1991). Introduction to the Theory of Neural Computation. Addison-Wesley, Redwood City, CA.
Mkhadri, A., Celeux G., and Nasroallah A. (1997). Regularization in discriminant analysis: An overview. Computational Statistics and Data Analysis, 23, pp.403–423.
Quinlan, J. R. (1993). C4.5: Program for Machine Learning. Morgan Kaufmann, San Mateo.
Raviv, Y. and Intrator, N. (1995). Bootstrapping with noise: An effective regularization technique. Technical Report, Tel-Aviv University, Israel.
Ripley, B.D. (1996). Pattern Recognition and Neural Networks. Cambridge University Press.
Sietsma, J. and Dow, R.J.F. (1991). Creating artificial networks that generalize. Neural Networks, 4, pp.67–79.
Venables, W.N. and Ripley, B.D. (1994). Modern Applied Statistics with S-plus. Springer-Verlag, New York.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Lee, S.S. Regularization in skewed binary classification. Computational Statistics 14, 277–292 (1999). https://doi.org/10.1007/s001800050018
Published:
Issue Date:
DOI: https://doi.org/10.1007/s001800050018