Analogy-based software effort estimation using Fuzzy numbers

Background: Early stage software effort estimation is a crucial task for project bedding and feasibility studies. Since collected data during the early stages of a software development lifecycle is always imprecise and uncertain, it is very hard to deliver accurate estimates. Analogy-based estimation, which is one of the popular estimation methods, is rarely used during the early stage of a project because of uncertainty associated with attribute measurement and data availability. Aims: We have integrated analogy-based estimation with Fuzzy numbers in order to improve the performance of software project effort estimation during the early stages of a software development lifecycle, using all available early data. Particularly, this paper proposes a new software project similarity measure and a new adaptation technique based on Fuzzy numbers. Method: Empirical evaluations with Jack-knifing procedure have been carried out using five benchmark data sets of software projects, namely, ISBSG, Desharnais, Kemerer, Albrecht and COCOMO, and results are reported. The results are compared to those obtained by methods employed in the literature using case-based reasoning and stepwise regression. Results: In all data sets the empirical evaluations have shown that the proposed similarity measure and adaptation techniques method were able to significantly improve the performance of analogy-based estimation during the early stages of software development. The results have also shown that the proposed method outperforms some well know estimation techniques such as case-based reasoning and stepwise regression. Conclusions: It is concluded that the proposed estimation model could form a useful approach for early stage estimation especially when data is almost uncertain.

[1]  Witold Pedrycz,et al.  Software cost estimation with fuzzy models , 2000, SIAP.

[2]  Les Carr,et al.  Proceedings of the fourteenth ACM conference on Hypertext and hypermedia , 2003 .

[3]  Kevin D. Reilly,et al.  Simulating continuous fuzzy systems , 2005, Inf. Sci..

[4]  Hsuan-Shih Lee An optimal aggregation method for fuzzy opinions of group decision , 1999, IEEE SMC'99 Conference Proceedings. 1999 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.99CH37028).

[5]  Ramesh C. Jain A procedure for multiple-aspect decision making using fuzzy sets , 1977 .

[6]  Barbara A. Kitchenham,et al.  A Simulation Study of the Model Evaluation Criterion MMRE , 2003, IEEE Trans. Software Eng..

[7]  Emilia Mendes,et al.  A replicated assessment of the use of adaptation rules to improve Web cost estimation , 2003, 2003 International Symposium on Empirical Software Engineering, 2003. ISESE 2003. Proceedings..

[8]  Taghi M. Khoshgoftaar,et al.  Identification of fuzzy models of software cost estimation , 2004, Fuzzy Sets Syst..

[9]  Stephen G. MacDonell,et al.  What accuracy statistics really measure , 2001, IEE Proc. Softw..

[10]  Ramesh Jain,et al.  DECISION MAKING IN THE PRESENCE OF FUZZY VARIABLES , 1976 .

[11]  John E. Gaffney,et al.  Software Function, Source Lines of Code, and Development Effort Prediction: A Software Science Validation , 1983, IEEE Transactions on Software Engineering.

[12]  Stefan Biffl,et al.  Increasing the accuracy and reliability of analogy-based cost estimation with extensive project feature dimension weighting , 2004, Proceedings. 2004 International Symposium on Empirical Software Engineering, 2004. ISESE '04..

[13]  Sabyasachi Ghoshray,et al.  A linear regression model using triangular fuzzy number coefficients , 1999, Fuzzy Sets Syst..

[14]  Shyi-Ming Chen,et al.  OPERATIONS ON FUZZY NUMBERS WITH FUNCTION PRINCIPAL , 1985 .

[15]  Peter I. Cowling,et al.  Software Project Similarity Measurement Based on Fuzzy C-Means , 2008, ICSP.

[16]  Shyi-Ming Chen,et al.  Fuzzy risk analysis based on interval-valued fuzzy numbers , 2009, Expert Syst. Appl..

[17]  Shyi-Ming Chen,et al.  Fuzzy risk analysis based on measures of similarity between interval-valued fuzzy numbers , 2008, Comput. Math. Appl..

[18]  Emilia Mendes,et al.  A Comparative Study of Cost Estimation Models for Web Hypermedia Applications , 2003, Empirical Software Engineering.

[19]  Martin J. Shepperd,et al.  Estimating Software Project Effort Using Analogies , 1997, IEEE Trans. Software Eng..

[20]  Gerardo Beni,et al.  A Validity Measure for Fuzzy Clustering , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  Shari Lawrence Pfleeger,et al.  Software Cost Estimation and Sizing Methods, Issues, and Guidelines , 2005 .

[22]  Qinbao Song,et al.  Using grey relational analysis to predict software effort with small data sets , 2005, 11th IEEE International Software Metrics Symposium (METRICS'05).

[23]  Shyi-Ming Chen NEW METHODS FOR SUBJECTIVE MENTAL WORKLOAD ASSESSMENT AND FUZZY RISK ANALYSIS , 1996 .

[24]  Magne Jørgensen,et al.  How large are software cost overruns? A review of the 1994 CHAOS report , 2006, Inf. Softw. Technol..

[25]  Magne Jørgensen,et al.  Software effort estimation by analogy and "regression toward the mean" , 2003, J. Syst. Softw..

[26]  Shyi-Ming Chen,et al.  Fuzzy risk analysis based on similarity measures of generalized fuzzy numbers , 2003, IEEE Trans. Fuzzy Syst..

[27]  Barbara A. Kitchenham,et al.  Experiments with Analogy-X for Software Cost Estimation , 2008, 19th Australian Conference on Software Engineering (aswec 2008).

[28]  Shyi-Ming Chen,et al.  Fuzzy risk analysis based on fuzzy numbers with different shapes and different deviations , 2008, Expert Syst. Appl..

[29]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[30]  Alain Abran,et al.  Fuzzy Analogy: A New Approach for Software Cost Estimation , 2001 .

[31]  D. Dubois,et al.  Operations on fuzzy numbers , 1978 .

[32]  Peter I. Cowling,et al.  Fuzzy grey relational analysis for software effort estimation , 2010, Empirical Software Engineering.

[33]  Barry W. Boehm,et al.  Achievements and Challenges in Software Resource Estimation , 2005 .

[34]  Sun-Jen Huang,et al.  The adjusted analogy-based software effort estimation based on similarity distances , 2007, J. Syst. Softw..

[35]  Thomas J. Ostrand,et al.  \{PROMISE\} Repository of empirical software engineering data , 2007 .

[36]  Shyi-Ming Chen,et al.  A new approach for fuzzy risk analysis based on similarity measures of generalized fuzzy numbers , 2009, Expert Syst. Appl..

[37]  Magne Jørgensen Realism in assessment of effort estimation uncertainty: it matters how you ask , 2004, IEEE Transactions on Software Engineering.

[38]  Emilia Mendes,et al.  Do adaptation rules improve web cost estimation? , 2003, HYPERTEXT '03.