Abstract:
Machine learning (ML)-based classification strategy has been successfully applied in actual industrial monitoring but it is often hindered when the dataset is imbalanced....Show MoreMetadata
Abstract:
Machine learning (ML)-based classification strategy has been successfully applied in actual industrial monitoring but it is often hindered when the dataset is imbalanced. Technically, the misclassification phenomenon, as a serious performance degradation of generalization ability, often occurs in minority class. For this problem, borderline-synthetic minority oversampling technique (B-SMOTE), which aims to enrich the quantity of minority samples around decision boundaries, has received considerable attention. However, most imbalanced classification techniques under the framework of B-SMOTE generate instances by a random weight number from 0 to 1, which may result in an authentic reduction of newly born samples. Herein, a novel oversampling strategy, which aims to provide new safety criteria and reassign the threshold of weight coefficient, is proposed to boost the authenticity of generated samples and classification accuracy. In addition, light gradient boosting machine (LightGBM) is adopted to build the classification model. Related experiments show the effectiveness and superiority of the proposed method in handling imbalanced classification tasks.
Published in: IEEE Transactions on Instrumentation and Measurement ( Volume: 72)