Abstract
Cost estimation is important in software development for controlling and planning software risks and schedule. Good estimation models, such as COCOMO, can avoid insufficient resources being allocated to a project. In this study, we find that COCOMO's estimates can be improved via WRAPPER- a feature subset selection method developed by the data mining community. Using data sets from the PROMISE repository, we show WRAPPER significantly and dramatically improves COCOMO's predictive power.
- B. Boehm, Software Engineering Economics. Prentice Hall, 1981. Google ScholarDigital Library
- B. Boehm, E. Horowitz, R. Madachy, D. Reifer, B. K. Clark, B. Steece, A. W. Brown, S. Chulani, and C. Abts, Software Cost Estimation with Cocomo II. Prentice Hall, 2000. Google ScholarDigital Library
- P. S. L. M. L. NJ, "Your guide to price-s: Estimating cost and schedule of software development and support," 1998.Google Scholar
- L. H. Putnam, Software Cost Estimating and Life-Cycle Control: Getting the Software Numbers, New York. The Institute of Electrical and Electronics Engineers, Inc., 1980.Google Scholar
- D. of USA, "Parametric cost estimating handbook, second edition," 1999.Google Scholar
- J. Sayyad Shirabad and T. Menzies, "The PROMISE Repository of Software Engineering Databases." School of Information Technology and Engineering, University of Ottawa, Canada, 2005. Available from http://promise.site.uottawa.ca/SERepository.Google Scholar
- S. Chulani, B. Boehm, and B. Steece, "Bayesian analysis of empirical software engineering cost models," IEEE Transactions on Software Engineering, vol. 25, July/August 1999. Google ScholarDigital Library
- M. Hall and G. Holmes, "Benchmarking attribute selection techniques for discrete class data mining," IEEE Transactions On Knowledge And Data Engineering, vol. 15, no. 6, pp. 1437--1447, 2003. Google ScholarDigital Library
- I. H. Witten and E. Frank, Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann, 1999. Google ScholarDigital Library
- R. Kohavi and G. H. John, "Wrappers for feature subset selection," Artificial Intelligence, vol. 97, no. 1--2, pp. 273--324, 1997. Google ScholarDigital Library
- J. Yang and V. Honavar, "Feature subset selection using a genetic algorithm," IEEE Intelligent Systems, vol. 13, no. 2, pp. 44--49, 1998. Google ScholarDigital Library
Index Terms
- Feature subset selection can improve software cost estimation accuracy
Recommendations
Feature subset selection can improve software cost estimation accuracy
PROMISE '05: Proceedings of the 2005 workshop on Predictor models in software engineeringCost estimation is important in software development for controlling and planning software risks and schedule. Good estimation models, such as COCOMO, can avoid insufficient resources being allocated to a project. In this study, we find that COCOMO's ...
Finding the Right Data for Software Cost Modeling
Strange to say, when building a software cost model, sometimes it's useful to ignore much of the available cost data. One way to do this is to perform data-pruning experiments after data collection and before model building. Experiments involving a set ...
Simple software cost analysis: safe or unsafe?
PROMISE '05: Proceedings of the 2005 workshop on Predictor models in software engineeringDelta estimation uses changes to old projects to estimate new projects. Delta estimation assumes that new costs can be extrapolated from old projects. In this study, we show that in certain real-world data sets. there exists attributes where this ...
Comments