Predicting Bugs in Large Industrial Software Systems

Ostrand, Thomas J.; Weyuker, Elaine J.

doi:10.1007/978-3-642-36054-1_3

Part of the book series: Lecture Notes in Computer Science (LNPSE, volume 7171)

Abstract

This chapter surveys close to ten years of software fault prediction research performed by our group. We describe our initial motivation and the variables used to make predictions, present our standard model based on Negative Binomial Regression, and summarize the results of using this model to make predictions for nine large industrial software systems. The systems range in size from hundreds of thousands to millions of lines of code. All have been in the field for multiple years and many releases, and continue to be maintained and enhanced, usually at three-month intervals.
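
As a rough illustration of how a negative binomial regression model turns file attributes into a predicted fault count, the sketch below evaluates a log-linear mean and the corresponding count distribution. The predictor names, the coefficient values, and the dispersion parameter are hypothetical and chosen only for illustration; they are not the fitted values or the exact variable set reported in the chapter.

```python
import math

# Hypothetical coefficients for a log-linear mean model (illustrative only,
# not taken from the chapter).
COEFFS = {"intercept": -2.0, "log_kloc": 0.5, "prior_faults": 0.3, "is_new_file": 0.8}


def expected_faults(log_kloc, prior_faults, is_new_file):
    """Predicted mean fault count: exp of a linear combination of file attributes."""
    linear = (
        COEFFS["intercept"]
        + COEFFS["log_kloc"] * log_kloc
        + COEFFS["prior_faults"] * prior_faults
        + COEFFS["is_new_file"] * (1.0 if is_new_file else 0.0)
    )
    return math.exp(linear)


def neg_binomial_pmf(k, mean, dispersion):
    """P(Y = k) for a negative binomial count with the given mean and
    variance mean + dispersion * mean**2 (dispersion must be > 0)."""
    r = 1.0 / dispersion          # shape parameter
    p = r / (r + mean)            # success probability
    log_pmf = (
        math.lgamma(k + r) - math.lgamma(r) - math.lgamma(k + 1)
        + r * math.log(p) + k * math.log(1.0 - p)
    )
    return math.exp(log_pmf)


# Example: a 2 KLOC file with one fault in the prior release, not new.
mu = expected_faults(math.log(2.0), prior_faults=1, is_new_file=False)
prob_fault_free = neg_binomial_pmf(0, mu, dispersion=0.5)
```

The dispersion term is what distinguishes this from a Poisson model: it lets the variance of the fault count exceed its mean, which suits data where a few files hold most of the faults.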

We assess the effectiveness of the fault predictions using two different metrics. We also compare the standard model with augmented models that add variables related to developer counts, to inter-file calling structure, and to information about the specific developers who modified the code.
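
The abstract does not name the two metrics. As an assumed example of a ranking-based measure for this kind of model, the sketch below computes the share of actual faults that fall in the top fraction of files when files are ordered by predicted fault count. The function name, the 20% default, and the sample data are hypothetical.

```python
def faults_in_top_fraction(predicted, actual, fraction=0.2):
    """Share of all actual faults found in the top `fraction` of files,
    where files are ranked by predicted fault count (highest first)."""
    if len(predicted) != len(actual):
        raise ValueError("predicted and actual must have the same length")
    total = sum(actual)
    if total == 0:
        return 0.0
    # Indices of files sorted from highest to lowest predicted count.
    order = sorted(range(len(predicted)), key=lambda i: predicted[i], reverse=True)
    n_top = max(1, round(fraction * len(predicted)))
    return sum(actual[i] for i in order[:n_top]) / total


# Made-up data for ten files: predicted counts and faults actually found.
predicted = [5.0, 0.1, 2.0, 0.3, 0.0, 1.0, 0.2, 0.1, 0.0, 0.4]
actual = [4, 0, 3, 0, 0, 1, 0, 1, 0, 1]
share = faults_in_top_fraction(predicted, actual)
```

A measure of this shape rewards a model for ranking the faultiest files first, even when its predicted counts are not numerically exact.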

We also evaluate alternate prediction models based on different training algorithms, including Recursive Partitioning, Bayesian Additive Regression Trees, and Random Forests.

Author information

Authors and Affiliations

AT&T Labs - Research, 180 Park Avenue, Florham Park, NJ 07932, USA
Thomas J. Ostrand & Elaine J. Weyuker

Editor information

Editors and Affiliations

Software Engineering Lab, University of Salerno, Fisciano, SA, Italy
Andrea De Lucia
Università di Salerno, Via Ponte don Melillo, 84081, Fisciano, SA, Italy
Filomena Ferrucci

About this chapter

Cite this chapter

Ostrand, T.J., Weyuker, E.J. (2013). Predicting Bugs in Large Industrial Software Systems. In: De Lucia, A., Ferrucci, F. (eds) Software Engineering. ISSSE 2009, ISSSE 2010, ISSSE 2011. Lecture Notes in Computer Science, vol 7171. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-36054-1_3

DOI: https://doi.org/10.1007/978-3-642-36054-1_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-36053-4
Online ISBN: 978-3-642-36054-1
eBook Packages: Computer Science, Computer Science (R0)