Abstract
Edge-cloud system is a crucial computing infrastructure for the innovations of modern society. In addition, the high interest in the edge-cloud system leads to various studies for testing to ensure the reliability of the system. However, like traditional software systems, the amount of resources for testing is always limited. Thus, we suggest CodeBERT Based Just-In-Time (JIT) Software Defect Prediction (SDP) model to address the limitation. This method helps practitioners prioritize the limited testing resources for the defect-prone functions in commits and improves the system’s reliability. We generate GitHub Pull-Request (GHPR) datasets on two open-source framework projects for edge-cloud system in GitHub. After that, we evaluate the performance of the proposed model on the GHPR datasets in within-project environment and cross-project environment. To the best of our knowledge, it is the first attempt to apply SDP to edge-cloud systems, and as a result of the evaluation, we can confirm the applicability of JIT SDP in edge-cloud project. In addition, we expect the proposed method would be helpful for the effective allocation of limited resources when developing edge-cloud systems.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
baetyl. https://github.com/baetyl/baetyl
Codebert on huggingface. https://huggingface.co/microsoft/codebert-base
Edgex foundry. https://github.com/edgexfoundry/edgex-go
ghpr-tools. https://github.com/soroushj/ghpr-tools
Github restful API. https://docs.github.com/en/rest
Kubeedge. https://github.com/kubeedge/kubeedge
Simpleilot. https://github.com/simpleiot/simpleiot
Blondet, M.V.R., Badarinath, A., Khanna, C., Jin, Z.: A wearable real-time BCI system based on mobile cloud computing. In: 2013 6th International IEEE/EMBS Conference on Neural Engineering (NER), pp. 739–742. IEEE (2013)
Butterfield, E.H.: Fog computing with go: a comparative study (2016)
Buyya, R., Srirama, S.N.: Fog and Edge Computing: Principles and Paradigms. Wiley, Hoboken (2019)
Deng, J., Lu, L., Qiu, S.: Software defect prediction via LSTM. IET Softw. 14(4), 443–450 (2020)
Feng, Z., et al.: Codebert: A pre-trained model for programming and natural languages. arXiv preprint arXiv:2002.08155 (2020)
Husain, H., Wu, H.H., Gazit, T., Allamanis, M., Brockschmidt, M.: CodeSearchNet challenge: evaluating the state of semantic code search. arXiv preprint arXiv:1909.09436 (2019)
Khanan, C., et al.: JITBot: an explainable just-in-time defect prediction bot. In: Proceedings of the 35th IEEE/ACM International Conference on Automated Software Engineering, pp. 1336–1339 (2020)
Li, J., He, P., Zhu, J., Lyu, M.R.: Software defect prediction via convolutional neural network. In: 2017 IEEE International Conference on Software Quality, Reliability and Security (QRS), pp. 318–328. IEEE (2017)
de Matos, F.F.S., Rego, P.A., Trinta, F.A.M.: An empirical study about the adoption of multi-language technique in computation offloading in a mobile cloud computing scenario. In: CLOSER, pp. 207–214 (2021)
Pan, C., Lu, M., Xu, B.: An empirical study on software defect prediction using CodeBERT model. Appl. Sci. 11(11), 4793 (2021)
Pandey, S.K., Mishra, R.B., Tripathi, A.K.: Machine learning based methods for software fault prediction: a survey. Expert Syst. Appl. 172, 114595 (2021)
Shi, K., Lu, Y., Chang, J., Wei, Z.: Pathpair2vec: an AST path pair-based code representation method for defect prediction. J. Comput. Lang. 59, 100979 (2020)
Tantithamthavorn, C., Hassan, A.E., Matsumoto, K.: The impact of class rebalancing techniques on the performance and interpretation of defect prediction models. IEEE Trans. Softw. Eng. 46(11), 1200–1219 (2018)
Wahono, R.S.: A systematic literature review of software defect prediction. J. Softw. Eng. 1(1), 1–16 (2015)
Xu, J., Wang, F., Ai, J.: Defect prediction with semantics and context features of codes based on graph representation learning. IEEE Trans. Reliab. 70(2), 613–625 (2020)
Xu, J., Yan, L., Wang, F., Ai, J.: A GitHub-based data collection method for software defect prediction. In: 2019 6th International Conference on Dependable Systems and Their Applications (DSA), pp. 100–108. IEEE (2020)
Zhou, X., Han, D., Lo, D.: Assessing generalizability of CodeBERT. In: 2021 IEEE International Conference on Software Maintenance and Evolution (ICSME), pp. 425–436. IEEE (2021)
Acknowledgment
This research was supported by the National Research Foundation of Korea (NRF-2020R1F1A1071888), the Ministry of Science and ICT (MSIT), Korea, under the Information Technology Research Center (ITRC) support program supervised by the Institute of Information & Communications Technology Planning & Evaluation (IITP-2022-2020-0-01795), and the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (NRF- 2022R1I1A3069233).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Kwon, S., Jang, JI., Lee, S., Ryu, D., Baik, J. (2023). CodeBERT Based Software Defect Prediction for Edge-Cloud Systems. In: Agapito, G., et al. Current Trends in Web Engineering. ICWE 2022. Communications in Computer and Information Science, vol 1668. Springer, Cham. https://doi.org/10.1007/978-3-031-25380-5_1
Download citation
DOI: https://doi.org/10.1007/978-3-031-25380-5_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-25379-9
Online ISBN: 978-3-031-25380-5
eBook Packages: Computer ScienceComputer Science (R0)