Software Project Effort Estimation Based on Multiple Parametric Models Generated Through Data Clustering

Gallego, Juan J. Cuadrado; Rodríguez, Daniel; Sicilia, Miguel Ángel; Rubio, Miguel Garre; Crespo, Angel García

doi:10.1007/s11390-007-9043-5

Software Project Effort Estimation Based on Multiple Parametric Models Generated Through Data Clustering

Regular Paper
Published: 30 May 2007

Volume 22, pages 371–378, (2007)
Cite this article

Journal of Computer Science and Technology Aims and scope Submit manuscript

Juan J. Cuadrado Gallego¹,
Daniel Rodríguez¹,
Miguel Ángel Sicilia¹,
Miguel Garre Rubio¹ &
…
Angel García Crespo²

162 Accesses
Explore all metrics

Abstract

Parametric software effort estimation models usually consists of only a single mathematical relationship. With the advent of software repositories containing data from heterogeneous projects, these types of models suffer from poor adjustment and predictive accuracy. One possible way to alleviate this problem is the use of a set of mathematical equations obtained through dividing of the historical project datasets according to different parameters into subdatasets called partitions. In turn, partitions are divided into clusters that serve as a tool for more accurate models. In this paper, we describe the process, tool and results of such approach through a case study using a publicly available repository, ISBSG. Results suggest the adequacy of the technique as an extension of existing single-expression models without making the estimation process much more complex that uses a single estimation model. A tool to support the process is also presented.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Negative results for software effort estimation

Article 21 November 2016

Stepwise Regression Clustering Method in Function Points Estimation

Investigating the use of moving windows to improve software effort prediction: a replicated study

Article 26 August 2016

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

References

Boehm B, Abts C, Chulani S. Software development cost estimation approaches — A survey. USC Center for Software Engineering Technical Report USC-CSE-2000-505, 2000.
Parametric Estimating Initiative. Parametric Estimating Handbook, 2nd Edition, 1999.
Stensrud E, Foss T, Kitchenham B, Myrtveit I. An empirical validation of the relationship between the magnitude of relative error and project size. In Proc. the Eighth IEEE Symp. Software Metrics, Ottawa, Canada, 2002, pp.3–12.
Cuadrado-Gallego J J, Sicilia M A, Garre M et al. An empirical study of process-related attributes in segmented software cost-estimation relationships. Journal of Systems and Software, 2006, 79(3): 351–361.
Google Scholar
Shepperd M, Schofield C, Kitchenham B. Effort estimation using analogy. In Proc. 8th Int. Conf. Software Engineering, IEEE Computer Society Press, Berlin, 1996, pp.170–178.
Xu Z, Khoshgoftaar T. Identification of fuzzy models of software cost estimation. Fuzzy Sets and Systems, 2004, 145(1): 141–163.
Article MathSciNet Google Scholar
Pedrycz W, Succi G. Genetic granular classifiers in modeling software quality. The Journal of Systems and Software, 2002, 76(3): 277–285.
Article Google Scholar
Dick S, Meeks A, Last M et al. Data mining in software metrics databases. Fuzzy Sets and Systems, 2004, 145(1): 81–110.
Article MathSciNet Google Scholar
Lung C H, Zaman M, Nandi A. Applications of clustering techniques to software partitioning, recovery and restructuring. Journal of Systems and Software, 2004, 73(2): 227–244.
Article Google Scholar
Dolado J. On the problem of the software cost function. Information and Software Technology, 2001, 43(1): 61–72.
Article Google Scholar
Shepperd M, Schofield C. Estimating software project effort using analogies. IEEE Trans. Software Engineering, 1997, 23(11): 736–743.
Article Google Scholar
Oligny S, Bourque P, Abran A, Fournier B. Exploring the relation between effort and duration in software engineering project. In Proc. World Computer Congress, Beijing, China, August 21–25, 2000, pp.175–178.
Marquardt W. An algorithm for least squares estimation of non-linear parameters. J. Soc. Indust. Appl. Math., 1963, 11: 431–441.
Article MATH MathSciNet Google Scholar
Conte S D, Dunsmore H E, Shen V Y. Software Engineering Metrics and Models. Menlo Park: Benjamin/Cummings, CA, 1986.
Google Scholar
Kohavi R, John G. Automatic parameter selection by minimizing estimated error. In Proc. 12th Int. Conf. Machine Learning, San Francisco, 1995, pp.304–312.
Witten I H, Frank E. Data Mining, Practical Machine Learning Tools and Techniques with Java Implementations. San Francisco: Morgan Kaufmann Publishers, USA, 2005.
Google Scholar
NESMA. NESMA FPA counting practices manual (CPM 2.0), 1996.
Dreger J B. Function Point Analysis. Englewood Cliffs, NJ: Prentice Hall, 1989.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, The University of Alcalá, Alcalá, Spain
Juan J. Cuadrado Gallego, Daniel Rodríguez, Miguel Ángel Sicilia & Miguel Garre Rubio
Department of Computer Science, Carlos III University, Madrid, Spain
Angel García Crespo

Authors

Juan J. Cuadrado Gallego
View author publications
You can also search for this author inPubMed Google Scholar
Daniel Rodríguez
View author publications
You can also search for this author inPubMed Google Scholar
Miguel Ángel Sicilia
View author publications
You can also search for this author inPubMed Google Scholar
Miguel Garre Rubio
View author publications
You can also search for this author inPubMed Google Scholar
Angel García Crespo
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to Juan J. Cuadrado Gallego.

Additional information

This work is supported by the Spanish Ministry of Science and Technology under Grant No. CICYT TIN2004-06689-C03.

Electronic supplementary material

Supplementary material - Chinese Abstract (PDF 33 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Gallego, J.J.C., Rodríguez, D., Sicilia, M.Á. et al. Software Project Effort Estimation Based on Multiple Parametric Models Generated Through Data Clustering. J Comput Sci Technol 22, 371–378 (2007). https://doi.org/10.1007/s11390-007-9043-5

Download citation

Received: 15 May 2006
Revised: 15 February 2007
Published: 30 May 2007
Issue Date: May 2007
DOI: https://doi.org/10.1007/s11390-007-9043-5

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Software Project Effort Estimation Based on Multiple Parametric Models Generated Through Data Clustering

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Negative results for software effort estimation

Stepwise Regression Clustering Method in Function Points Estimation

Investigating the use of moving windows to improve software effort prediction: a replicated study

Explore related subjects

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Electronic supplementary material

Supplementary material - Chinese Abstract (PDF 33 kb)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now