Abstract:
Class imbalance problems are very common in real-world applications, for example, fraud detection, medical diagnosis, and anomaly detection. In this paper, we propose an ...Show MoreMetadata
Abstract:
Class imbalance problems are very common in real-world applications, for example, fraud detection, medical diagnosis, and anomaly detection. In this paper, we propose an approach to solve the problem based on stacking and inverse random undersampling (SIRUS). First, the method of inverse random undersampling is used to undersample the majority class samples in order to generate a large number of different training subsets. Second, a group of different component classifiers are to learn the decision boundary between the minority and the majority classes for each training subset. A stacking model is applied to separate the minority class from the majority one, where the result produced by each classifier is taken as a feature to train a meta classifier. Comparison experiments are conducted based on 17 datasets from UCI machine learning repository. Many metrics such as AUC, F1, and G-mean illustrate the effectiveness of our approach.
Date of Conference: 27-29 March 2018
Date Added to IEEE Xplore: 21 May 2018
ISBN Information: