Reference Hub24
An Empirical Comparison of Machine Learning Techniques in Predicting the Bug Severity of Open and Closed Source Projects

An Empirical Comparison of Machine Learning Techniques in Predicting the Bug Severity of Open and Closed Source Projects

K. K. Chaturvedi, V.B. Singh
Copyright: © 2012 |Volume: 4 |Issue: 2 |Pages: 28
ISSN: 1942-3926|EISSN: 1942-3934|EISBN13: 9781466613782|DOI: 10.4018/jossp.2012040103
Cite Article Cite Article

MLA

Chaturvedi, K. K., and V.B. Singh. "An Empirical Comparison of Machine Learning Techniques in Predicting the Bug Severity of Open and Closed Source Projects." IJOSSP vol.4, no.2 2012: pp.32-59. http://doi.org/10.4018/jossp.2012040103

APA

Chaturvedi, K. K. & Singh, V. (2012). An Empirical Comparison of Machine Learning Techniques in Predicting the Bug Severity of Open and Closed Source Projects. International Journal of Open Source Software and Processes (IJOSSP), 4(2), 32-59. http://doi.org/10.4018/jossp.2012040103

Chicago

Chaturvedi, K. K., and V.B. Singh. "An Empirical Comparison of Machine Learning Techniques in Predicting the Bug Severity of Open and Closed Source Projects," International Journal of Open Source Software and Processes (IJOSSP) 4, no.2: 32-59. http://doi.org/10.4018/jossp.2012040103

Export Reference

Mendeley
Favorite Full-Issue Download

Abstract

Bug severity is the degree of impact that a defect has on the development or operation of a component or system, and can be classified into different levels based on their impact on the system. Identification of severity level can be useful for bug triager in allocating the bug to the concerned bug fixer. Various researchers have attempted text mining techniques in predicting the severity of bugs, detection of duplicate bug reports and assignment of bugs to suitable fixer for its fix. In this paper, an attempt has been made to compare the performance of different machine learning techniques namely Support vector machine (SVM), probability based Naïve Bayes (NB), Decision Tree based J48 (A Java implementation of C4.5), rule based Repeated Incremental Pruning to Produce Error Reduction (RIPPER) and Random Forests (RF) learners in predicting the severity level (1 to 5) of a reported bug by analyzing the summary or short description of the bug reports. The bug report data has been taken from NASA’s PITS (Projects and Issue Tracking System) datasets as closed source and components of Eclipse, Mozilla & GNOME datasets as open source projects. The analysis has been carried out in RapidMiner and STATISTICA data mining tools. The authors measured the performance of different machine learning techniques by considering (i) the value of accuracy and F-Measure for all severity level and (ii) number of best cases at different threshold level of accuracy and F-Measure.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.