skip to main content
10.1145/2896387.2900324acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccConference Proceedingsconference-collections
research-article

Software Defect Prediction Using Feature Space Transformation

Published: 22 March 2016 Publication History

Abstract

In software quality estimation research, software defect prediction is a key topic. A defect prediction model is generally constructed using a variety of software attributes and each attribute may have positive, negative or neutral effect on a specific model. Selection of an optimal set of attributes for model development remains a vital yet unexplored issue. In this paper, we have introduced a new feature space transformation process with a normalization technique to improve the defect prediction accuracy. We proposed a feature space transformation technique and classify the instances using Support Vector Machine (SVM) with its histogram intersection kernel. The proposed method is evaluated using the data sets from NASA metric data repository and its application demonstrates acceptable accuracy.

References

[1]
K. Gao, T. M. Khoshgoftaar, H. Wang, and N. Seliya, "Choosing software metrics for defect prediction: an investigation on feature selection techniques," Software: Practice and Experience, vol. 41, no. 5, pp. 579--606, 2011.
[2]
H. Wang, T. M. Khoshgoftaar, and A. Napolitano, "A comparative study of ensemble feature selection techniques for software defect prediction," in Machine Learning and Applications (ICMLA), 2010 Ninth International Conference on. IEEE, 2010, pp. 135--140.
[3]
G. Ilczuk, R. Mlynarski, W. Kargul, and A. Wakulicz-Deja, "New feature selection methods for qualification of the patients for cardiac pacemaker implantation," in Computers in Cardiology, 2007. IEEE, 2007, pp. 423--426.
[4]
T. M. Khoshgoftaar and K. Gao, "Feature selection with imbalanced data for software defect prediction," in Machine Learning and Applications, 2009. ICMLA'09. International Conference on. IEEE, 2009, pp. 235--240.
[5]
T. Menzies, J. Greenwald, and A. Frank, "Data mining static code attributes to learn defect predictors," Software Engineering, IEEE Transactions on, vol. 33, no. 1, pp. 2--13, 2007.
[6]
Q. Song, Z. Jia, M. Shepperd, S. Ying, and J. Liu, "A general software defect-proneness prediction framework," Software Engineering, IEEE Transactions on, vol. 37, no. 3, pp. 356--370, 2011.
[7]
D. Gray, D. Bowes, N. Davey, Y. Sun, and B. Christianso, "The misuse of the nasa metrics data program data sets for automated software defect prediction," in Evaluation and Assessment in Software Engineering (EASE 2011), 15th Annual Conference on. IET, 2011, pp. 96--103.
[8]
J. Nam, S. J. Pan, and S. Kim, "Transfer defect learning," in Proceedings of the 2013 International Conference on Software Engineering. IEEE Press, 2013, pp. 382--391.
[9]
J. I. Khan, A. U. Gias, M. S. Siddik, M. H. Rahman, S. M. Khaled, and M. Shoyaib, "An attribute selection process for software defect prediction," in Informatics, Electronics & Vision (ICIEV), 2014 International Conference on. IEEE, 2014, pp. 1--4.
[10]
D. Rodriguez, R. Ruiz, J. Cuadrado-Gallego, and J. Aguilar-Ruiz, "Detecting fault modules applying feature selection to classifiers," in Information Reuse and Integration, 2007. IRI 2007. IEEE International Conference on. IEEE, 2007, pp. 667--672.
[11]
K. Jong, E. Marchiori, M. Sebag, and A. van der Vaart, "Feature selection in proteomic pattern data with support vector machines," in Computational Intelligence in Bioinformatics and Computational Biology, 2004. CIBCB'04. Proceedings of the 2004 IEEE Symposium on. IEEE, 2004, pp. 41--48.
[12]
G. Forman, "An empirical study of feature selection metrics for text classification," The Journal of machine learning research, vol. 3, pp. 1289--1305, 2003.
[13]
H. Wang, T. M. Khoshgoftaar, and N. Seliya, "How many software metrics should be selected for defect prediction?" in FLAIRS Conference-, 2011.
[14]
P. He, B. Li, X. Liu, J. chen, and Y. Ma, "An empirical study on software defect prediction with a simplified metric set," Information and Software Technology, vol. 59, pp. 170--190, 2015.
[15]
T. Lee, J. Nam, D. Han, S. Kim, and H. P. In, "Micro interaction metrics for defect prediction," in Proceedings of the 19th ACM SIGSOFT symposium and the 13th European conference on Foundations of software engineering. ACM, 2011, pp. 311--321.
[16]
X. Shi, Q. Liu, W. Fan, P. S. Yu, and R. Zhu, "Transfer learning on heterogenous feature spaces via spectral transformation," in Data Mining (ICDM), 2010 IEEE 10th International Conference on. IEEE, 2010, pp. 1049--1054.
[17]
J. Chen, S. Liua, W. Liu, X. C. Q. Gu, and D. Chen, "A two-stage data preprocessing approach for software fault prediction," in Software Security and Reliability, 2014 Eighth International Conference on. IEEE, 2014, pp. 20--29.
[18]
R. Verma and A. Gupta, "Software defect prediction using two level data pre-processing," in Recent Advances in Computing and Software Systems (RACSS), 2012 International Conference on. IEEE, 2012, pp. 311--317.
[19]
A. Okutan and O. T. Yıldız, "Software defect prediction using bayesian networks," Empirical Software Engineering, vol. 19, no. 1, pp. 154--181, 2014.
[20]
G. Denaro, "Estimating fault-proneness for tuning testing activities," in Proceedings of the 22nd international conference on Software engineering. ACM, 2000, pp. 311--317.
[21]
H. Can, X. Jianchuna, Z. Ruide, L. Juelong, Y. Qiliang, and X. Liqiang, "A new model for software defect prediction using particle swarm optimization and support vector machine," in Control and Decision Conference (CCDC), 2013 25th Chinese. IEEE, 2013, pp. 4106--4110.
[22]
B.-J. Park, S.-K. Oh, and W. Pedrycz, "The design of polynomial function-based neural network predictors for detection of software defects," Information Sciences, vol. 229, pp. 40--57, 2013.
[23]
S. Maji, A. C. Berg, and J. Malik, "Classification using intersection kernel support vector machines is efficient," in Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on. IEEE, 2008, pp. 1--8.
[24]
B. Turhan and A. Bener, "Software defect prediction: Heuristics for weighted naïve bayes." in ICSOFT (SE), 2007, pp. 244--249.
[25]
X.-Y. Jing, S. Ying, Z.-W. Zhang, S.-S. Wu, and J. Liu, "Dictionary learning based software defect prediction," in Proceedings of the 36th International Conference on Software Engineering. ACM, 2014, pp. 414--423.

Cited By

View all
  • (2023)Vote-Based Feature Selection Method for Stratigraphic Recognition in Tunnelling Process of Shield MachineChinese Journal of Mechanical Engineering10.1186/s10033-023-00932-336:1Online publication date: 24-Oct-2023
  • (2023)Analysis of Feature Selection Methods in Software Defect Prediction ModelsIEEE Access10.1109/ACCESS.2023.334324911(145954-145974)Online publication date: 2023
  • (2023)Machine learning-based defect prediction model using multilayer perceptron algorithm for escalating the reliability of the softwareThe Journal of Supercomputing10.1007/s11227-023-05836-680:7(10122-10147)Online publication date: 19-Dec-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICC '16: Proceedings of the International Conference on Internet of things and Cloud Computing
March 2016
535 pages
ISBN:9781450340632
DOI:10.1145/2896387
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 22 March 2016

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Attribute selection
  2. Feature space transformation
  3. Software defect prediction

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICC '16

Acceptance Rates

Overall Acceptance Rate 213 of 590 submissions, 36%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)4
  • Downloads (Last 6 weeks)0
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Vote-Based Feature Selection Method for Stratigraphic Recognition in Tunnelling Process of Shield MachineChinese Journal of Mechanical Engineering10.1186/s10033-023-00932-336:1Online publication date: 24-Oct-2023
  • (2023)Analysis of Feature Selection Methods in Software Defect Prediction ModelsIEEE Access10.1109/ACCESS.2023.334324911(145954-145974)Online publication date: 2023
  • (2023)Machine learning-based defect prediction model using multilayer perceptron algorithm for escalating the reliability of the softwareThe Journal of Supercomputing10.1007/s11227-023-05836-680:7(10122-10147)Online publication date: 19-Dec-2023
  • (2022)Data quality issues in software fault prediction: a systematic literature reviewArtificial Intelligence Review10.1007/s10462-022-10371-656:8(7839-7908)Online publication date: 21-Dec-2022
  • (2021)Classification Algorithms for Software Defect Prediction: A Systematic Literature Review2021 9th International Conference in Software Engineering Research and Innovation (CONISOFT)10.1109/CONISOFT52520.2021.00034(189-196)Online publication date: Oct-2021
  • (2017)Integrated Approach to Software Defect PredictionIEEE Access10.1109/ACCESS.2017.27591805(21524-21547)Online publication date: 2017

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media