Identifying Association between Longer Itemsets and Software Defects

Rana, Zeeshan A.; Malik, Sehrish Abdul; Shamail, Shafay; Awais, Mian M.

doi:10.1007/978-3-642-42051-1_18

Zeeshan A. Rana²⁰,
Sehrish Abdul Malik²⁰,
Shafay Shamail²⁰ &
…
Mian M. Awais²⁰

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 8228))

Included in the following conference series:

International Conference on Neural Information Processing

4456 Accesses
1 Citations

Abstract

Software defects are an indicator of software quality. Software with lesser number of defective modules are desired. Prediction of software defects using software measurements facilitates early identification of defect-prone modules. Association relationship between software measures and defects improves prediction of defective modules. To find association relationship between software measures and defects, each numeric measure is divided into bins. Each bin is called 1-itemset (or an itemset of length 1). When certain itemsets and defective modules appear together in a dataset, they are considered associated with each other. Frequency of their co-occurrence depicts the strength of the association relationship. Existing studies find the relationship between 1-itemsets and defective modules. Itemsets that have high association with defects are called focused itemsets. Focused itemsets can be used to build prediction models with higher Recall values. This paper explores the relationship between defective modules and itemsets with length greater than 1. Focused itemsets with length greater than 1 involve multiple bins at same time. Identification of the focused itemsets has improved the performance of decision tree based defect prediction model.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Optimization of association rules using hybrid data mining technique

Article 18 April 2021

Enhancing Security and Performance of Software Defect Prediction Models: A Literature Review

Contributing Features-Based Schemes for Software Defect Prediction

References

Anwar, S., Rana, Z.A., Shamail, S., Awais, M.M.: Using association rules to identify similarities between software datasets. In: The 8th International Conference on the Quality of Information and Communications Technology (QUATIC), pp. 114–119. IEEE Computer Society, Lisbon (2012)
Google Scholar
Baojun, M., Dejaeger, K., Vanthienen, J., Baesens, B.: Software defect prediction based on association rule classification. Open Access publications from Katholieke Universiteit Leuven urn:hdl:123456789/296322, Katholieke Universiteit Leuven (February 2011)
Google Scholar
Boetticher, G., Menzies, T., Ostrand, T.: Promise repository of empirical software engineering data (2007)
Google Scholar
Challagulla, V.U.B., Bastani, F.B., Paul, R.A.: Empirical assessment of machine learning based sofwtare defect prediction techniques. In: Proceedings of 10th Workshop on Object-Oriented Real-Time Dependable Systems (WORDS 2005), pp. 263–270. IEEE Computer Society, Washington, DC (2005)
Google Scholar
Fenton, N.E., Neil, M.: A critique of software defect prediction models. IEEE Transactions on Software Engineering 25(5), 675–687 (1999)
Article Google Scholar
Jiawei, H., Micheline, K.: Data Mining - Concepts and Techniques. Morgan Kaufmann (2002)
Google Scholar
Kamei, Y., Monden, A., Morisaki, S., Matsumoto, K.-I.: A hybrid faulty module prediction using association rule mining and logistic regression analysis. In: Proceedings of the Second ACM-IEEE International Symposium on Empirical Software Engineering and Measurement, ESEM 2008, pp. 279–281. ACM, New York (2008)
Chapter Google Scholar
Menzies, T., Greenwald, J., Frank, A.: Data mining static code attributes to learn defect predictors. Software Engineering, IEEE Transactions on 33(1), 2–13 (2007)
Article Google Scholar
Rana, Z.A., Shamail, S., Awais, M.M.: Towards a generic model for software quality prediction. In: WoSQ 2008: Proceedings of the 6th International Workshop on Software Quality, pp. 35–40. ACM (May 2008)
Google Scholar
Song, Q., Shepperd, M., Cartwright, M., Mair, C.: Software defect association mining and defect correction effort prediction. IEEE Transactions on Software Engineering 32(2), 69–82 (2006)
Article Google Scholar
Witten, I.H., Frank, E., Trigg, L., Hall, M., Holmes, G., Cunningham, S.J.: The waikato environment for knowledge analysis, weka (2008)
Google Scholar
Zafar, H., Rana, Z.A., Shamail, S., Awais, M.M.: Finding focused itemsets from software defect data. In: Proceedings of the 15th International Multitopic Conference (INMIC 2012). IEEE (2012)
Google Scholar

Download references

Author information

Authors and Affiliations

LUMS School of Science and Engineering Lahore, Pakistan
Zeeshan A. Rana, Sehrish Abdul Malik, Shafay Shamail & Mian M. Awais

Authors

Zeeshan A. Rana
View author publications
You can also search for this author in PubMed Google Scholar
Sehrish Abdul Malik
View author publications
You can also search for this author in PubMed Google Scholar
Shafay Shamail
View author publications
You can also search for this author in PubMed Google Scholar
Mian M. Awais
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Kyungpook National University, 1370 Sankyuk-Dong, Puk-Gu, 702-701, Taegu, Korea
Minho Lee
The University of Tokyo, 7-3-1 Hongo, 113-8656, Bunkyo-ku, Tokyo, Japan
Akira Hirose
Institute of Automation, Key Laboratory of Complex Systems and Intelligence Science, Chinese Academy of Sciences, 100190, Beijing, China
Zeng-Guang Hou
Sungkyunkwan University, 2066, Seobu-ro, Jangan-gu, 440-746, Suwon, Korea
Rhee Man Kil

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Rana, Z.A., Malik, S.A., Shamail, S., Awais, M.M. (2013). Identifying Association between Longer Itemsets and Software Defects. In: Lee, M., Hirose, A., Hou, ZG., Kil, R.M. (eds) Neural Information Processing. ICONIP 2013. Lecture Notes in Computer Science, vol 8228. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-42051-1_18

Download citation

DOI: https://doi.org/10.1007/978-3-642-42051-1_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-42050-4
Online ISBN: 978-3-642-42051-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics