Reference Hub7
Fault-Prone Module Prediction Approaches Using Identifiers in Source Code

Fault-Prone Module Prediction Approaches Using Identifiers in Source Code

Osamu Mizuno, Naoki Kawashima, Kimiaki Kawamoto
Copyright: © 2015 |Volume: 3 |Issue: 1 |Pages: 14
ISSN: 2166-7160|EISSN: 2166-7179|EISBN13: 9781466680593|DOI: 10.4018/ijsi.2015010103
Cite Article Cite Article

MLA

Mizuno, Osamu, et al. "Fault-Prone Module Prediction Approaches Using Identifiers in Source Code." IJSI vol.3, no.1 2015: pp.36-49. http://doi.org/10.4018/ijsi.2015010103

APA

Mizuno, O., Kawashima, N., & Kawamoto, K. (2015). Fault-Prone Module Prediction Approaches Using Identifiers in Source Code. International Journal of Software Innovation (IJSI), 3(1), 36-49. http://doi.org/10.4018/ijsi.2015010103

Chicago

Mizuno, Osamu, Naoki Kawashima, and Kimiaki Kawamoto. "Fault-Prone Module Prediction Approaches Using Identifiers in Source Code," International Journal of Software Innovation (IJSI) 3, no.1: 36-49. http://doi.org/10.4018/ijsi.2015010103

Export Reference

Mendeley
Favorite Full-Issue Download

Abstract

Prediction of fault-prone modules is an important area of software engineering. The authors assumed that the occurrence of faults is related to the semantics in the source code modules. Semantics in a software module can be extracted from identifiers in the module. Identifiers such as variable names and function names in source code are thus essential information to understand code. The naming for identifiers affects on code understandability; thus, the authors expect that they affect software quality. In this study, the authors examine the relationship between the length of identifiers and existence of software faults in a software module. Furthermore, the authors analyze the relationship between occurrence of “words” in identifiers and the existence of faults. From the experiments using the data from open source software, the authors modeled the relationship between the fault occurrence and the length of identifiers, and the relationship between the fault occurrence and the word in identifiers by the random forest technique. The result of the experiment showed that the length of identifiers can predict the fault-proneness of the software modules. Also, the result showed that the word occurrence model is as good a measure as traditional CK and LOC metrics models.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.