DOI: 10.1145/3651640.3651644
Research article · Open access

A Practical Failure Prediction Model based on Code Smells and Software Development Metrics

Published: 02 July 2024

Abstract

Making errors during software development is unavoidable. Developers inevitably introduce defects that take additional time to fix later, so bug-fixing effort competes with the implementation of new features. Typically, the later a bug is found, the higher the cost of remediation. To address this concern, software testing should start as early as possible in the software development lifecycle. Static analysis supports this goal, but it typically reports too many findings and therefore does not support development teams appropriately. It would thus be beneficial to detect early which static analysis findings will actually result in failures, in order to notably reduce subsequent effort. The purpose of this paper is to analyze failure data from issue tracking systems and correlate it with findings from static analysis. On this basis, an artificial-intelligence-based approach is used to train models, practicable in a business environment, that enable effective prediction of software faults. The results from static analysis show that predefined complexity measures encompassed the most defects. While there are commonalities among the relevant defect findings in static analysis reports, meaningful prediction models cannot be built on this data alone. In addition to the static analysis findings, metrics such as code changes within a time period or the number of authors involved in those changes were therefore considered for building the prediction models. Two of the developed prediction models achieve high accuracy and an excellent utility rate. These prediction models are currently used at Raiffeisen Software GmbH for a long-term study on failure prediction based on code smells.
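The modeling step the abstract describes, combining static-analysis findings with development metrics such as code churn and author count, can be sketched as a simple logistic classifier over per-file features. The feature set, weights, and bias below are illustrative assumptions for the sketch, not the coefficients or model family actually learned in the study.

```python
import math

# Hypothetical per-file features mirroring the paper's inputs:
# static-analysis finding count, lines changed in a recent period,
# and number of distinct authors. Weights are illustrative only.
WEIGHTS = {"findings": 0.08, "churn": 0.004, "authors": 0.35}
BIAS = -2.0

def failure_probability(findings: int, churn: int, authors: int) -> float:
    """Logistic model: sigmoid of a weighted sum of the three metrics."""
    z = (BIAS
         + WEIGHTS["findings"] * findings
         + WEIGHTS["churn"] * churn
         + WEIGHTS["authors"] * authors)
    return 1.0 / (1.0 + math.exp(-z))

# A heavily churned, multi-author file with many findings scores far
# higher than a stable file touched by a single author.
risky = failure_probability(findings=25, churn=800, authors=6)
stable = failure_probability(findings=2, churn=40, authors=1)
```

In practice such coefficients would be fitted on labeled failure data from the issue tracker rather than set by hand; the point of the sketch is only how the two data sources combine into one feature vector per file.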


Cited By

  • Optimizing Pre-Trained Code Embeddings With Triplet Loss for Code Smell Detection. IEEE Access, 13 (2025), 31335–31350. https://doi.org/10.1109/ACCESS.2025.3542566
  • A Methodology for Analysing Code Anomalies in Open-Source Software Using Big Data Analytics. In 2024 IEEE International Conference on Big Data (BigData), 8216–8218. Online publication date: 15 Dec 2024. https://doi.org/10.1109/BigData62323.2024.10825952


Published In

ESSE '23: Proceedings of the 4th European Symposium on Software Engineering
December 2023
116 pages
ISBN:9798400708817
DOI:10.1145/3651640
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States



Author Tags

  1. change metrics and failure prediction
  2. failure prediction
  3. machine learning for failure prediction
  4. static analysis
  5. technical debt

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ESSE 2023


Article Metrics

  • Downloads (Last 12 months)181
  • Downloads (Last 6 weeks)36
Reflects downloads up to 05 Mar 2025

