Skip to main content
Log in

A software defect prediction method with metric compensation based on feature selection and transfer learning

一种基于特征选择与迁移学习的度量补偿软件缺陷预测方法

  • Research Article
  • Published:
Frontiers of Information Technology & Electronic Engineering Aims and scope Submit manuscript

Abstract

Cross-project software defect prediction solves the problem of insufficient training data for traditional defect prediction, and overcomes the challenge of applying models learned from multiple different source projects to target project. At the same time, two new problems emerge: (1) too many irrelevant and redundant features in the model training process will affect the training efficiency and thus decrease the prediction accuracy of the model; (2) the distribution of metric values will vary greatly from project to project due to the development environment and other factors, resulting in lower prediction accuracy when the model achieves cross-project prediction. In the proposed method, the Pearson feature selection method is introduced to address data redundancy, and the metric compensation based transfer learning technique is used to address the problem of large differences in data distribution between the source project and target project. In this paper, we propose a software defect prediction method with metric compensation based on feature selection and transfer learning. The experimental results show that the model constructed with this method achieves better results on area under the receiver operating characteristic curve (AUC) value and F1-measure metric.

摘要

跨项目软件缺陷预测解决了传统缺陷预测中训练数据不足的问题, 克服了将多个不同源项目中学习的模型应用于目标项目的挑战。与此同时, 出现两个新问题: (1) 模型训练过程中过多无关和冗余特征影响训练效率, 降低了模型预测精度; (2) 由于开发环境等因素, 度量值的分布因项目而异, 当模型用于跨项目预测时, 预测精度较低。本文引入皮尔逊特征选择方法解决数据冗余问题, 采用基于迁移学习的度量补偿技术解决源项目和目标项目之间数据分布差异较大的问题。提出一种基于特征选择和迁移学习的度量补偿软件缺陷预测方法。实验结果表明, 用该方法构建的模型在AUC (接收器工作特性曲线下面积) 值和F1度量指标上取得较好结果。

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

Download references

Author information

Authors and Affiliations

Authors

Contributions

Jinfu CHEN and Saihua CAI designed the research. Xiaoli WANG, Saihua CAI, and Jiaping XU processed the data. Jinfu CHEN, Xiaoli WANG, and Saihua CAI drafted the paper. Xiaoli WANG, Jiaping XU, Jingyi CHEN, and Haibo CHEN finished the experiments. Jingyi CHEN and Haibo CHEN helped organize the paper. Jinfu CHEN, Xiaoli WANG, and Saihua CAI revised and finalized the paper.

Corresponding author

Correspondence to Saihua Cai  (蔡赛华).

Additional information

Compliance with ethics guidelines

Jinfu CHEN, Xiaoli WANG, Saihua CAI, Jiaping XU, Jingyi CHEN, and Haibo CHEN declare that they have no conflict of interest.

Project supported by the National Natural Science Foundation of China (Nos. 62172194 and U1836116), the National Key R&D Program of China (No. 2020YFB1005500), the Leadingedge Technology Program of Jiangsu Provincial Natural Science Foundation, China (No. BK20202001), the China Postdoctoral Science Foundation (No. 2021M691310), the Postdoctoral Science Foundation of Jiangsu Province, China (No. 2021K636C), and the Future Network Scientific Research Fund Project, China (No. FNSRFP-2021-YB-50)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chen, J., Wang, X., Cai, S. et al. A software defect prediction method with metric compensation based on feature selection and transfer learning. Front Inform Technol Electron Eng 23, 715–731 (2022). https://doi.org/10.1631/FITEE.2100468

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1631/FITEE.2100468

Key words

关键词

CLC number

Navigation