Abstract
Traditional software defect prediction approaches often focus on static code metrics. Software evolution could cause changes to the source code of the software, as well as changes to the code metrics and label. In recent years, researchers have proposed many process metrics for evolutionary projects, but they are mainly used to predict the defect proneness of software modules. Whether the change of code metrics (CCMs) could be used to predict the label change on evolutionary projects, and which CCMs are more correlated to the label change? To answer these questions, this paper proposes a framework for building the new datasets with CCMs and label change, and explores the correlations between CCMs and label change with feature ranking approaches. An empirical study is conducted on 40 versions of 11 open-source projects. The experimental results indicate that CCMs can predict the label change.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Yuan, W., Wang, P., Guo, Y., He, L., He, T.: Mining the software engineering forums: what’s new and what’s left. In: Proceedings of Web Information Systems and Applications, pp. 513–524 (2020)
Hosseini, S., Turhan, B., Gunarathna, D.: A systematic literature review and meta-analysis on cross project defect prediction. IEEE Trans. Softw. Eng. 45(2), 111–147 (2019)
Li, N., Shepperd, M., Guo, Y.: A systematic review of unsupervised learning techniques for software defect prediction. Inf. Softw. Technol. 122, 106287 (2020)
Jiang, L., Jiang, S., Gong, L., Dong, Y., Yu, Q.: Which process metrics are significantly important to change of defects in evolving projects: an empirical study. IEEE Access 8, 93705–93722 (2020)
Menzies, T., Milton, Z., Turhan, B., Cukic, B., Jiang, Y., Bener, A.: Defect prediction from static code features: current results, limitations, new approaches. Autom. Softw. Eng. 17(4), 375–407 (2010)
Shatnawi, R., Li, W.: The effectiveness of software metrics in identifying error-prone classes in post-release software evolution process. J. Syst. Softw. 81(11), 1868–1882 (2008)
Kpodjedo, S., Ricca, F., Galinier, P., Guéhéneuc, Y., Antoniol, G.: Design evolution metrics for defect prediction in object oriented systems. Empir. Softw. Eng. 16(1), 141–175 (2011)
Bhattacharya, P., Iliofotou, M., Neamtiu, I., Faloutsos, M.: Graph-based analysis and prediction for software evolution. In: Proceedings of International Conference on Software Engineering, pp. 419–429 (2012)
Rahman, F., Devanbu, P.: How, and why, process metrics are better. In: Proceedings of International Conference on Software Engineering, pp. 432–441 (2013)
Madeyski, L., Jureczko, M.: Which process metrics can significantly improve defect prediction models? An empirical study. Softw. Qual. J. 23(3), 393–422 (2015)
Wang, D., Wang, Q.: Improving the performance of defect prediction based on evolution data. J. Softw. 27, 3014–3029 (2016)
Liu, J., Zhou, Y., Yang, Y., Lu, H., Xu, B.: Code churn: a neglected metric in effort-aware just-in-time defect prediction. In: Proceedings of International Symposium on Empirical Software Engineering and Measurement, pp. 11–19 (2017)
Stanić, B., Afzal, W.: Process metrics are not bad predictors of fault proneness. In: Proceedings of International Conference on Software Quality, Reliability and Security Companion, pp. 493–499 (2017)
Yu, Q., Jiang, S., Qian, J., Bo, L., Jiang, L., Zhang, G.: Process metrics for software defect prediction in object-oriented programs. IET Softw. 14(3), 283–292 (2020)
Yu, Q., Zhu, Y., Han, H., Zhao, Y., Jiang, S., Qian, J.: Evolutionary measures for object-oriented projects and impact on the performance of cross-version defect prediction. In: Proceedings of Asia-Pacific Symposium on Internetware, pp. 192–201 (2022)
Breiman, L.: Random forests. Mach. Learn. 45, 5–32 (2001)
Freund, Y., Schapire, R.E.: Experiments with a new boosting algorithm. In: Proceedings of the International Conference on Machine Learning, pp. 148–156 (1996)
Gong, L., Jiang, S., Jiang, L.: Research progress of software defect prediction. J. Softw. 30, 3090–3114 (2019)
Yu, Q., Jiang, S., Zhang, Y., Wang, X., Gao, P., Qian, J.: The impact study of class imbalance on the performance of software defect prediction models. Chin. J. Comput. 41(4), 809–824 (2018)
Tantithamthavorn, C., McIntosh, S., Hassan, A.E., Matsumoto, K.: The impact of automated parameter optimization on defect prediction models. IEEE Trans. Softw. Eng. 45(7), 683–711 (2018)
Guyon, I., Elisseeff, A.: An introduction to variable and feature selection. J. Mach. Learn. Res. 3, 1157–1182 (2003)
Karegowda, A., Manjunath, A.S., Jayaram, M.A.: Comparative study of attribute selection using gain ratio and correlation based feature selection. Int. J. Inf. Technol. Knowl. Manag. 2(2), 271–277 (2010)
Acknowledgments
This work was supported in part by the National Natural Science Foundation of China (61902161 and 62077029), the CCF-Huawei Populus Grove Fund (CCF-HuaweiFM202209), the Guangxi Key Laboratory of Trusted Software (kx201704), and the Research Support Program for Doctorate Teachers of Jiangsu Normal University (17XLR001).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Yu, Q., Jiang, S., Zhu, Y., Han, H., Zhao, Y., Jiang, Y. (2022). The Change of Code Metrics for Predicting the Label Change on Evolutionary Projects: An Empirical Study. In: Zhao, X., Yang, S., Wang, X., Li, J. (eds) Web Information Systems and Applications. WISA 2022. Lecture Notes in Computer Science, vol 13579. Springer, Cham. https://doi.org/10.1007/978-3-031-20309-1_27
Download citation
DOI: https://doi.org/10.1007/978-3-031-20309-1_27
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-20308-4
Online ISBN: 978-3-031-20309-1
eBook Packages: Computer ScienceComputer Science (R0)