Abstract
The objective of this paper is to investigate the use of Support Vector Regression (SVR) for Web development effort estimation when using a cross-company data set. Four kernels of SVR were used, linear, polynomial, Gaussian and sigmoid and two preprocessing strategies of the variables were applied, namely normalization and logarithmic. The hold-out validation process was carried out for all the eight configurations using a training set and a validation set from the Tukutuku data set. Our results suggest that the predictions obtained with linear kernel applying a logarithmic transformation of variables (LinLog) are significantly better than those obtained with the other configurations. In addition, SVR has been compared with the traditional estimation techniques, such as Manual StepWise Regression, Case-Based Reasoning, and Bayesian Networks. Our results suggest that SVR with LinLog configuration can provide significantly superior prediction accuracy than other techniques.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Abrahao, S.M., Mendes, E., Gomez, J., Insfran, E.: A Model-Driven Measurement Procedure for Sizing Web Applications: Design, Automation and Validation. In: Engels, G., Opdyke, B., Schmidt, D.C., Weil, F. (eds.) MODELS 2007. LNCS, vol. 4735, pp. 467–481. Springer, Heidelberg (2007)
Bailey, J.W., Basili, V.R.: A meta model for software development resource expenditure. In: Proceedings of the Fifth International Conference on Software Engineering, San Diego, California, USA, pp. 107–116 (1981)
Baresi, L., Morasca, S.: Three Empirical Studies on Estimating the Design Effort of Web Applications. Transactions on Software Engineering and Methodology 16(4)
Braga, P.L., Oliveira, A.L.I., Meira, S.R.L.: Software Effort Estimation using Machine Learning Techniques with Robust Confidence Intervals. In: HIS 2007, pp. 352–357 (2007)
Brieman, L., Friedman, J., Olshen, R., Stone, C.: Classification and Regression Trees. Wadsworth Inc., Belmont (1984)
Christodoulou, S.P., Zafiris, P.A., Papatheodorou, T.S.: WWW2000: The Developer’s view and a practitioner’s approach to Web Engineering. In: Proc. ICSE Workshop on Web Engineering, Limerick, Ireland, pp. 75–92 (2000)
Chulani, S., Boehm, B., Steece, B.: Bayesian Analysis of Empirical Software Engineering Cost Models. IEEE Transactions on Software Engineering 25, 573–583 (1999)
Conte, S.D., Dunsmore, H.E., Shen, V.Y.: Software Engineering Metrics and Models. Benjamin-Cummins (1986)
Costagliola, G., Di Martino, S., Ferrucci, F., Gravino, C., Tortora, G., Vitiello, G.: Effort estimation modeling techniques: a case study for web applications. In: Procs. Intl. Conference on Web Engineering (ICWE 2006), pp. 9–16 (2006)
Desharnais, J.M.: Analyse statistique de la productivitie des projets in 834 formatique a partie de la technique des point des fonction, Ph.D. thesis, 835 Unpublished Masters Thesis, University of Montreal (1989)
Di Martino, S., Ferrucci, F., Gravino, C., Mendes, E.: Comparing Size Measures for Predicting Web Application Development Effort: A Case Study. In: Proceedings of Empirical Software Engineering and Measurement, pp. 324–333. IEEE Press, Los Alamitos (2007)
Kitchenham, B.A.: A Procedure for Analyzing Unbalanced Datasets. IEEE Transactions on Software Engineering 24(4), 278–301 (1998)
Kitchenham, B., Pickard, L.M., MacDonell, S.G., Shepperd, M.J.: What accuracy statistics really measure. IEE Proceedings Software 148(3), 81–85 (2001)
Kitchenham, B.A., Mendes, E.: A Comparison of Cross-company and Single-company Effort Estimation Models for Web Applications. In: Procs. EASE 2004, pp. 47–55 (2004)
Kitchenham, B.A., Mendes, E., Travassos, G.: A Systematic Review of Cross- and Within-company Cost Estimation Studies. In: Proceedings of Empirical Assessment in Software Engineering, pp. 89–98 (2006)
Kitchenham, B., Mendes, E.: Travassos, Cross versus Within-Company Cost Estimation Studies: A systematic Review. IEEE Transactions on Software Engineering 33(5) (2007)
Jeffery, R., Ruhe, M., Wieczorek, I.: Using public domain metrics to estimate software development effort. In: Proceedings Metrics 2001, London, pp. 16–27 (2001)
Joachims, T.: Making large-Scale SVM Learning Practical. In: Schölkopf, B., Burges, C., Smola, A. (eds.) Advances in Kernel Methods - Support Vector Learning. MIT-Press, Cambridge (1999)
Mattera, D., Haykin, S.: Support vector machines for dynamic reconstruction of a chaotic system. In: Scholkopf, B., Burges, C.J.C., Smola, A.J. (eds.) Advances in Kernel Methods— Support Vector Learning, pp. 211–242. MIT Press, Cambridge (1999)
Maxwell, K.: Applied Statistics for Software Managers. Software Quality Institute Series. Prentice Hall, Englewood Cliffs (2002)
Mendes, E.: The Use of Bayesian Networks for Web Effort Estimation: Further Investigation. In: Proceedings of International Conference on Web Engineering (2008)
Mendes, E., Kitchenham, B.A.: Further Comparison of Cross-company and Within-company Effort Estimation Models for Web Applications. In: Proc. IEEE Metrics, pp. 348–357 (2004)
Mendes, E., Counsell, S.: Web Development Effort Estimation using Analogy. In: Proc. 2000 Australian Software Engineering Conference, pp. 203–212 (2000)
Mendes, E., Mosley, N., Counsell, S.: Investigating Web Size Metrics for Early Web Cost Estimation. Journal of Systems and Software 77(2), 157–172 (2005)
Mendes, E., Mosley, N.: Bayesian Network Models for Web Effort Prediction: A Comparative Study. IEEE Transactions on Software Engineering 34(6), 723–737 (2008)
Mendes, E., Mosley, N., Counsell, S.: Web Effort Estimation. In: Mendes, E., Mosley, N. (eds.) Web Engineering. Springer, Heidelberg (2005)
Mendes, E., Mosley, N., Counsell, S.: Early Web Size Measures and Effort Prediction for Web Costimation. In: Proceedings of the IEEE Metrics Symposium, pp. 18–29 (2003)
Mendes, E., Mosley, N., Counsell, S.: Comparison of Length, complexity and functionality as size measures for predicting Web design and authoring effort. IEE Proc. Software 149(3), 86–92 (2002)
Mendes, E., Counsell, S., Mosley, N., Triggs, C., Watson, I.: A Comparative Study of Cost Estimation Models for Web Hypermedia Applications. Empirical Software Engineering 8(23), 163–196 (2003)
Mendes, E., Mosley, N., Counsell, S.: Web metrics - Metrics for estimating effort to design and author Web applications. IEEE MultiMedia, 50–57 (January-March 2001)
Mendes, E., Martino, S.D., Ferrucci, F., Gravino, C.: Cross-company vs. single-company web effort models using the Tukutuku database: An extended study. Journal of System & Software 81(5), 673–690 (2008)
Oliveira, A.L.I.: Estimation of software project effort with support vector regression. Neurocomputing 69(13-15), 1749–1753 (2006)
Reifer, D.J.: Web Development: Estimating Quick-to-Market Software. IEEE Software, 57–64 (November-December 2000)
Reifer, D.J.: Ten deadly risks in Internet and intranet software development. IEEE Software, 12–14 (March-April 2002)
Ruhe, M., Jeffery, R., Wieczorek, I.: Cost estimation for Web applications. In: Proc. ICSE 2003, pp. 285–294 (2003)
Scholkopf, B.: Support Vector Learning. R. Oldenbourg Verlag, Munchen. Doktorarbeit, TU Berlin (1997), http://www.kernel-machines.org
Scholkopf, B., Smola, A.J.: Learning with Kernels. MIT Press, Cambridge (2002)
Smola, A.J., Schölkopf, B.: A tutorial on support vector regression. Statistics and Computing 14(3), 199–222 (2004)
Shepperd, M.J., Kadoda, G.: Using Simulation to Evaluate Prediction Techniques. In: Proceedings IEEE Metrics 2001, London, UK, pp. 349–358 (2001)
Vapnik, V., Lerner, A.: Pattern recognition using generalized portrait method. Automation and Remote Control 24, 774–780 (1963)
Vapnik, V., Chervonenkis, A.: A note on one class of perceptrons. Automatics and Remote Control, 25 (1964)
Vapnik, V.: The nature of Statistical Learning Theory. Springer, Heidelberg (1995)
Vapnik, V.: Statistical Learning Theory. Wiley, Chichester (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Corazza, A., Di Martino, S., Ferrucci, F., Gravino, C., Mendes, E. (2009). Using Support Vector Regression for Web Development Effort Estimation. In: Abran, A., Braungarten, R., Dumke, R.R., Cuadrado-Gallego, J.J., Brunekreef, J. (eds) Software Process and Product Measurement. IWSM 2009. Lecture Notes in Computer Science, vol 5891. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-05415-0_19
Download citation
DOI: https://doi.org/10.1007/978-3-642-05415-0_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-05414-3
Online ISBN: 978-3-642-05415-0
eBook Packages: Computer ScienceComputer Science (R0)