Analogy-based effort estimation: a new method to discover set of analogies from dataset characteristics

Analogy-based effort estimation (ABE) is an efficient method for software effort estimation because of its strong performance and its ability to handle noisy datasets. Conventional ABE models usually use the same number of analogies for every project in a dataset. The authors claim that using the same number of analogies may produce the best overall performance for the whole dataset, but not necessarily the best performance for each individual project. There is therefore a need to better understand dataset characteristics in order to discover the optimum set of analogies for each project, rather than relying on a static k nearest projects. The authors propose a new technique, based on the bisecting k-medoids clustering algorithm, that finds the best set of analogies for each individual project before making the prediction. With bisecting k-medoids it is possible to better understand the dataset characteristics and to automatically find the best set of analogies for each test project. The performance figures of the proposed estimation method are promising and better than those of other regular ABE models.
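
The idea of letting a clustering step, rather than a fixed k, decide how many analogies a project gets can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not give the splitting or stopping rules, so the farthest-pair initialisation, the min_size stopping rule, the mean-of-efforts adaptation, and all function names and data below are assumptions made for illustration only.

```python
from math import dist
from statistics import mean

# A project is a pair (features, effort); features is a tuple of numbers.

def medoid(projects):
    """Return the project with the smallest total distance to the others."""
    return min(projects, key=lambda p: sum(dist(p[0], q[0]) for q in projects))

def bisect(projects, iters=10):
    """Split projects into two clusters with a simple 2-medoids loop."""
    # Assumed initialisation: start from the two projects farthest apart.
    m1, m2 = max(((a, b) for a in projects for b in projects),
                 key=lambda ab: dist(ab[0][0], ab[1][0]))
    for _ in range(iters):
        left = [p for p in projects if dist(p[0], m1[0]) <= dist(p[0], m2[0])]
        right = [p for p in projects if dist(p[0], m1[0]) > dist(p[0], m2[0])]
        if not left or not right:
            break
        n1, n2 = medoid(left), medoid(right)
        if (n1, n2) == (m1, m2):  # medoids are stable, clustering converged
            break
        m1, m2 = n1, n2
    return (m1, left), (m2, right)

def select_analogies(train, target, min_size=3):
    """Follow the target down the bisection tree to pick its analogies.

    Assumed stopping rule: stop before entering a cluster that holds
    fewer than min_size projects.
    """
    cluster = list(train)
    while len(cluster) > min_size:
        (m1, left), (m2, right) = bisect(cluster)
        if not left or not right:
            break
        nearer = left if dist(target, m1[0]) <= dist(target, m2[0]) else right
        if len(nearer) < min_size:
            break
        cluster = nearer
    return cluster

def estimate(train, target, min_size=3):
    """Estimate effort as the mean effort of the selected analogies."""
    return mean(effort for _, effort in select_analogies(train, target, min_size))

# Hypothetical toy data: (size, complexity) -> effort in person-hours.
train = [((1, 1), 10), ((1, 2), 12), ((2, 1), 11),
         ((8, 8), 100), ((9, 8), 110), ((8, 9), 105), ((9, 9), 120)]

for target in [(1.2, 1.1), (9, 8.5)]:
    analogies = select_analogies(train, target)
    print(target, len(analogies), estimate(train, target))
```

On this toy data the two target projects end up with analogy sets of different sizes, which is the behaviour a single dataset-wide k cannot provide.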
