Domain-based effort distribution model for software cost estimation

In software cost estimation, effort allocation is an important and usually challenging task for project management. Due to the Cone of Uncertainty effect on overall effort estimation and lack of representative effort distribution data, project managers often find it difficult to plan for staffing and other team resources. This often leads to risky decisions to assign too few or too many people to complete software lifecycle activities. As a result, projects with inaccurate resource allocation will generally experience serious schedule delay or cost overrun, which has been the outcome of 44% of the projects reported by the Standish Group [Standish, 2009]. Due to lack of data, most effort estimation models, including COCOMO II, use a one-size-fits-all distribution of effort by phase and activity. The availability of a critical mass of data from U.S. Defense Department software projects on effort distribution has enabled me to test several hypotheses that effort distributions vary by project size, personnel capability, and application domains. This dissertation will summarize the analysis approach, describe the techniques and methodologies used, and report the results. The key results were that size and personnel capability were not significant sources of effort distribution variability, but that analysis of the influence of application domain on effort distribution rejected the null hypothesis that the distributions do not vary by domains, at least for the U.S. Defense Department sector. The results were then used to produce an enhanced version of the COCOMO II model and tool for better estimation of the effort distributions for the data-supported domains.

[1]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[2]  Michel R. V. Chaudron,et al.  Evaluating RUP Software Development Processes Through Visualization of Effort Distribution , 2008, 2008 34th Euromicro Conference Software Engineering and Advanced Applications.

[3]  Caroline Davis,et al.  Oxford University Press in Africa, 1927â80 , 2013 .

[4]  Peter V. Norden Curve Fitting for a Model of Applied Research and Development Scheduling , 1958, IBM J. Res. Dev..

[5]  E. Abt Understanding statistics 3 , 2010, Evidence-Based Dentistry.

[6]  F. David,et al.  Statistical Estimates and Transformed Beta-Variables. , 1960 .

[7]  Barry W. Boehm,et al.  Phase distribution of software development effort , 2008, ESEM '08.

[8]  H. Sebastian Seung,et al.  Algorithms for Non-negative Matrix Factorization , 2000, NIPS.

[9]  Ellis Horowitz,et al.  Software Cost Estimation with COCOMO II , 2000 .

[10]  K. Pearson On the Criterion that a Given System of Deviations from the Probable in the Case of a Correlated System of Variables is Such that it Can be Reasonably Supposed to have Arisen from Random Sampling , 1900 .

[11]  Steve McConnell Software Estimation: Demystifying the Black Art , 2006 .

[12]  Karl Pearson F.R.S. X. On the criterion that a given system of deviations from the probable in the case of a correlated system of variables is such that it can be reasonably supposed to have arisen from random sampling , 2009 .

[13]  P. Krutchen,et al.  The Rational Unified Process: An Introduction , 2000 .

[14]  Mark Schlack,et al.  Digital Equipment Corp. , 1993 .

[15]  M. Pagano,et al.  Student's t test. , 1993, Nutrition.

[16]  Barry W. Boehm,et al.  Software Engineering Economics , 1993, IEEE Transactions on Software Engineering.

[17]  Barry Boehm,et al.  Shrinking the cone of uncertainty with continuous assessment for software team dynamics in design and development , 2012 .

[18]  M. Stephens EDF Statistics for Goodness of Fit and Some Comparisons , 1974 .

[19]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[20]  S. Shapiro,et al.  An Analysis of Variance Test for Normality (Complete Samples) , 1965 .

[21]  Ayse Basar Bener,et al.  Domain specific phase by phase effort estimation in software projects , 2009, 2009 24th International Symposium on Computer and Information Sciences.