
Effort-aware just-in-time defect prediction: simple unsupervised models could be better than supervised models

Published: 01 November 2016

Abstract

Unsupervised models do not require defect data to build prediction models, and hence they incur a low building cost and have a wide application range. Consequently, it would be desirable for practitioners to apply unsupervised models in effort-aware just-in-time (JIT) defect prediction if these models can predict defect-inducing changes well. However, little is currently known about their prediction effectiveness in this context. We aim to investigate the predictive power of simple unsupervised models in effort-aware JIT defect prediction, especially compared with the state-of-the-art supervised models in the recent literature. We first use the most commonly used change metrics to build simple unsupervised models. Then, we compare these unsupervised models with the state-of-the-art supervised models under cross-validation, time-wise cross-validation, and across-project prediction settings to determine whether they are of practical value. The experimental results, obtained on open-source software systems, show that many simple unsupervised models perform better than the state-of-the-art supervised models in effort-aware JIT defect prediction.
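
A minimal Python sketch of the kind of simple unsupervised model the abstract describes: rank code changes by a single change metric (here, churned lines of code, smallest changes first, i.e., in descending order of the metric's reciprocal) and measure how many defect-inducing changes are caught within a fixed inspection budget. The field names, the LOC-based effort proxy, and the 20% effort cutoff are illustrative assumptions, not the paper's exact implementation.

from dataclasses import dataclass

@dataclass
class Change:
    loc: int      # churned lines of code in the change (proxy for inspection effort)
    buggy: bool   # ground-truth label: did the change induce a defect?

def rank_unsupervised(changes):
    # "Unsupervised" model: no defect labels are used; changes are simply
    # sorted in ascending order of the metric (equivalently, descending 1/LOC).
    return sorted(changes, key=lambda c: c.loc)

def recall_at_effort(ranked, effort_fraction=0.2):
    # Effort-aware evaluation: fraction of defect-inducing changes found when
    # inspecting only `effort_fraction` of the total churned LOC.
    total_effort = sum(c.loc for c in ranked)
    total_buggy = sum(c.buggy for c in ranked)
    budget = effort_fraction * total_effort
    spent, found = 0, 0
    for c in ranked:
        if spent + c.loc > budget:
            break
        spent += c.loc
        found += c.buggy
    return found / total_buggy if total_buggy else 0.0

# Usage (hypothetical data):
#   changes = [Change(loc=12, buggy=True), Change(loc=300, buggy=False)]
#   print(recall_at_effort(rank_unsupervised(changes)))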


    Published In

    FSE 2016: Proceedings of the 2016 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering
    November 2016
    1156 pages
    ISBN: 9781450342186
    DOI: 10.1145/2950290

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. Defect
    2. changes
    3. effort-aware
    4. just-in-time
    5. prediction

    Qualifiers

    • Research-article

    Conference

    FSE'16

    Acceptance Rates

    Overall Acceptance Rate: 17 of 128 submissions, 13%

