Consolidating evidence based studies in software cost/effort estimation — A tertiary study

Software Effort Estimation is key to the success of any project since all downstream activities such as planning, budgeting, developing and Monitoring cannot be executed without clarity on the scope of the activity that needs to be performed. This is a tertiary study that follows the Systematic Literature Review (SLR) process as put forth by Kitchenham in her seminal paper, based on five criteria: estimation technique, estimation accuracy, type of dataset and independent variables used in empirical research on effort estimation. Our study covering 820 Primary Studies through 14 SLRs, shows that Software Effort Estimation studies focus more on statistical techniques and Machine Learning is taking precedence in comparison to the others; whereas Expert Judgement is preferred by the industry due to its intuitiveness. There is a need for models that are simple to understand and global, due to the distributed nature of software development. The studies are inconclusive about the accuracy benefits of using a within company dataset vs.external datasets. Machine learning techniques such FL and GA in combination with Analogy methods generate more accurate estimates. There is increasing consensus on the use of Mean Magnitude of Relative Error (MMRE), Median Magnitude of Relative Error (MdMRE) and Prediction Pred (25%) as the accuracy metric. 78% of the Primary Studies reported accuracy using MMRE. The best MMRE reported is in the range of 7 to 75. ISBSG (International Software Benchmarking Standards Group) and Desharnais datasets with 27% and 17% usage respectively are the most widely used datasets in empirical studies on effort estimation. Fewer than 20 independent variables account for more than 90% impact of variables in empirical analysis on Software effort estimation.

[1]  Kai Petersen,et al.  Measuring and predicting software productivity: A systematic map and review , 2011, Inf. Softw. Technol..

[2]  Tony Gorschek,et al.  Evaluation and Measurement of Software Process Improvement—A Systematic Literature Review , 2012, IEEE Transactions on Software Engineering.

[3]  Beata Czarnacka-Chrobot The Economic Importance of Business Software Systems Size Measurement , 2010, 2010 Fifth International Multi-conference on Computing in the Global Information Technology.

[4]  Emilia Mendes,et al.  A systematic review of web resource estimation , 2012, PROMISE '12.

[5]  Karen T. Lum,et al.  Selecting Best Practices for Effort Estimation , 2006, IEEE Transactions on Software Engineering.

[6]  Ayse Bener,et al.  Evaluation of Feature Extraction Methods on Software Cost Estimation , 2007, ESEM 2007.

[7]  Carolyn B. Seaman,et al.  A comparison of software cost, duration, and quality for waterfall vs. iterative and incremental development: A systematic review , 2009, 2009 3rd International Symposium on Empirical Software Engineering and Measurement.

[8]  Pearl Brereton,et al.  A systematic review of systematic review process research in software engineering , 2013, Inf. Softw. Technol..

[9]  Emilia Mendes,et al.  A replicated comparison of cross-company and within-company effort estimation models using the ISBSG database , 2005, 11th IEEE International Software Metrics Symposium (METRICS'05).

[10]  Emilia Mendes,et al.  Effort estimation in agile software development: a systematic literature review , 2014, PROMISE.

[11]  Tore Dybå,et al.  The Future of Empirical Methods in Software Engineering Research , 2007, Future of Software Engineering (FOSE '07).

[12]  Christopher J. Lokan,et al.  The usage of ISBSG data fields in software effort estimation: A systematic mapping study , 2016, J. Syst. Softw..

[13]  Jongmoon Baik,et al.  MND-SCEMP: an empirical study of a software cost estimation modeling process in the defense domain , 2012, Empirical Software Engineering.

[14]  Barry W. Boehm,et al.  US DoD Application Domain Empirical Software Cost Analysis , 2011, 2011 International Symposium on Empirical Software Engineering and Measurement.

[15]  Barry W. Boehm,et al.  Maintenance Effort Estimation for Open Source Software: A Systematic Literature Review , 2016, 2016 IEEE International Conference on Software Maintenance and Evolution (ICSME).

[16]  Yong Hu,et al.  Systematic literature review of machine learning based software development effort estimation models , 2012, Inf. Softw. Technol..

[17]  Carolyn Seaman,et al.  A comparison of software cost, duration, and quality for waterfall vs. iterative and incremental development: A systematic review , 2009, ESEM 2009.

[18]  Alain Abran,et al.  Analogy-based software development effort estimation: A systematic mapping and review , 2015, Inf. Softw. Technol..

[19]  Pearl Brereton,et al.  Protocol for a Tertiary study of Systematic Literature Reviews and Evidence-based Guidelines in IT and Software Engineering , 2009 .

[20]  Magne Jørgensen,et al.  A Systematic Review of Software Development Cost Estimation Studies , 2007, IEEE Transactions on Software Engineering.

[21]  Stephen G. MacDonell,et al.  Comparing Local and Global Software Effort Estimation Models -- Reflections on a Systematic Review , 2007, First International Symposium on Empirical Software Engineering and Measurement (ESEM 2007).

[22]  Tore Dybå,et al.  A Systematic Review of Theory Use in Software Engineering Experiments , 2007, IEEE Transactions on Software Engineering.

[23]  Tim Menzies,et al.  How to Find Relevant Data for Effort Estimation? , 2011, 2011 International Symposium on Empirical Software Engineering and Measurement.

[24]  Christopher J. Lokan An empirical analysis of function point adjustment factors , 2000, Inf. Softw. Technol..

[25]  Guilherme Horta Travassos,et al.  A systematic review of cross- vs. within- company cost estimation studies , 2006 .