Skip to main content

CodeBERT Based Software Defect Prediction for Edge-Cloud Systems

  • Conference paper
  • First Online:
Current Trends in Web Engineering (ICWE 2022)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1668))

Included in the following conference series:

Abstract

Edge-cloud system is a crucial computing infrastructure for the innovations of modern society. In addition, the high interest in the edge-cloud system leads to various studies for testing to ensure the reliability of the system. However, like traditional software systems, the amount of resources for testing is always limited. Thus, we suggest CodeBERT Based Just-In-Time (JIT) Software Defect Prediction (SDP) model to address the limitation. This method helps practitioners prioritize the limited testing resources for the defect-prone functions in commits and improves the system’s reliability. We generate GitHub Pull-Request (GHPR) datasets on two open-source framework projects for edge-cloud system in GitHub. After that, we evaluate the performance of the proposed model on the GHPR datasets in within-project environment and cross-project environment. To the best of our knowledge, it is the first attempt to apply SDP to edge-cloud systems, and as a result of the evaluation, we can confirm the applicability of JIT SDP in edge-cloud project. In addition, we expect the proposed method would be helpful for the effective allocation of limited resources when developing edge-cloud systems.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 54.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 69.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. baetyl. https://github.com/baetyl/baetyl

  2. Codebert on huggingface. https://huggingface.co/microsoft/codebert-base

  3. Edgex foundry. https://github.com/edgexfoundry/edgex-go

  4. ghpr-tools. https://github.com/soroushj/ghpr-tools

  5. Github restful API. https://docs.github.com/en/rest

  6. Kubeedge. https://github.com/kubeedge/kubeedge

  7. Simpleilot. https://github.com/simpleiot/simpleiot

  8. Blondet, M.V.R., Badarinath, A., Khanna, C., Jin, Z.: A wearable real-time BCI system based on mobile cloud computing. In: 2013 6th International IEEE/EMBS Conference on Neural Engineering (NER), pp. 739–742. IEEE (2013)

    Google Scholar 

  9. Butterfield, E.H.: Fog computing with go: a comparative study (2016)

    Google Scholar 

  10. Buyya, R., Srirama, S.N.: Fog and Edge Computing: Principles and Paradigms. Wiley, Hoboken (2019)

    Book  Google Scholar 

  11. Deng, J., Lu, L., Qiu, S.: Software defect prediction via LSTM. IET Softw. 14(4), 443–450 (2020)

    Article  Google Scholar 

  12. Feng, Z., et al.: Codebert: A pre-trained model for programming and natural languages. arXiv preprint arXiv:2002.08155 (2020)

  13. Husain, H., Wu, H.H., Gazit, T., Allamanis, M., Brockschmidt, M.: CodeSearchNet challenge: evaluating the state of semantic code search. arXiv preprint arXiv:1909.09436 (2019)

  14. Khanan, C., et al.: JITBot: an explainable just-in-time defect prediction bot. In: Proceedings of the 35th IEEE/ACM International Conference on Automated Software Engineering, pp. 1336–1339 (2020)

    Google Scholar 

  15. Li, J., He, P., Zhu, J., Lyu, M.R.: Software defect prediction via convolutional neural network. In: 2017 IEEE International Conference on Software Quality, Reliability and Security (QRS), pp. 318–328. IEEE (2017)

    Google Scholar 

  16. de Matos, F.F.S., Rego, P.A., Trinta, F.A.M.: An empirical study about the adoption of multi-language technique in computation offloading in a mobile cloud computing scenario. In: CLOSER, pp. 207–214 (2021)

    Google Scholar 

  17. Pan, C., Lu, M., Xu, B.: An empirical study on software defect prediction using CodeBERT model. Appl. Sci. 11(11), 4793 (2021)

    Article  Google Scholar 

  18. Pandey, S.K., Mishra, R.B., Tripathi, A.K.: Machine learning based methods for software fault prediction: a survey. Expert Syst. Appl. 172, 114595 (2021)

    Google Scholar 

  19. Shi, K., Lu, Y., Chang, J., Wei, Z.: Pathpair2vec: an AST path pair-based code representation method for defect prediction. J. Comput. Lang. 59, 100979 (2020)

    Google Scholar 

  20. Tantithamthavorn, C., Hassan, A.E., Matsumoto, K.: The impact of class rebalancing techniques on the performance and interpretation of defect prediction models. IEEE Trans. Softw. Eng. 46(11), 1200–1219 (2018)

    Article  Google Scholar 

  21. Wahono, R.S.: A systematic literature review of software defect prediction. J. Softw. Eng. 1(1), 1–16 (2015)

    Google Scholar 

  22. Xu, J., Wang, F., Ai, J.: Defect prediction with semantics and context features of codes based on graph representation learning. IEEE Trans. Reliab. 70(2), 613–625 (2020)

    Article  Google Scholar 

  23. Xu, J., Yan, L., Wang, F., Ai, J.: A GitHub-based data collection method for software defect prediction. In: 2019 6th International Conference on Dependable Systems and Their Applications (DSA), pp. 100–108. IEEE (2020)

    Google Scholar 

  24. Zhou, X., Han, D., Lo, D.: Assessing generalizability of CodeBERT. In: 2021 IEEE International Conference on Software Maintenance and Evolution (ICSME), pp. 425–436. IEEE (2021)

    Google Scholar 

Download references

Acknowledgment

This research was supported by the National Research Foundation of Korea (NRF-2020R1F1A1071888), the Ministry of Science and ICT (MSIT), Korea, under the Information Technology Research Center (ITRC) support program supervised by the Institute of Information & Communications Technology Planning & Evaluation (IITP-2022-2020-0-01795), and the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (NRF- 2022R1I1A3069233).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jongmoon Baik .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Kwon, S., Jang, JI., Lee, S., Ryu, D., Baik, J. (2023). CodeBERT Based Software Defect Prediction for Edge-Cloud Systems. In: Agapito, G., et al. Current Trends in Web Engineering. ICWE 2022. Communications in Computer and Information Science, vol 1668. Springer, Cham. https://doi.org/10.1007/978-3-031-25380-5_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-25380-5_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-25379-9

  • Online ISBN: 978-3-031-25380-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics