Software Project Similarity Measurement Based on Fuzzy C-Means

Reliable and accurate measurement of the similarity between two software projects has always been a challenge for analogy-based software cost estimation. Since the effort for a new project is retrieved from similar historical projects, it is essential to use a similarity measure that finds the closest projects, which in turn increases estimation accuracy. In the software engineering literature, relatively little research has addressed how to measure the similarity between two software projects when they are described by both numerical and categorical features. Existing similarity techniques such as Euclidean distance, weighted Euclidean distance, and maximum distance are simple, but they have difficulty handling categorical features. In this paper we present two approaches to measuring the similarity between two software projects, based on fuzzy C-means clustering and fuzzy logic. The new approaches are suitable for both numerical and categorical features.
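
As an illustration of the existing measures named above, the following minimal sketch computes Euclidean, weighted Euclidean, and maximum distance between two projects. It assumes each project is a vector of numerical features already normalized to [0, 1], and it takes "maximum distance" to mean the largest absolute per-feature difference; the function names, feature values, and weights are hypothetical and do not come from the paper.

```python
import math

def euclidean(a, b):
    # Square root of the summed squared per-feature differences.
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def weighted_euclidean(a, b, weights):
    # Same as euclidean, but each squared difference is scaled by a feature weight.
    return math.sqrt(sum(w * (x - y) ** 2 for x, y, w in zip(a, b, weights)))

def maximum_distance(a, b):
    # Largest absolute per-feature difference (assumed definition).
    return max(abs(x - y) for x, y in zip(a, b))

# Hypothetical projects described by three normalized numerical features.
project_a = [0.2, 0.5, 0.9]
project_b = [0.5, 0.1, 0.9]
weights = [1.0, 0.5, 2.0]  # illustrative feature weights

d_euclid = euclidean(project_a, project_b)
d_weighted = weighted_euclidean(project_a, project_b, weights)
d_max = maximum_distance(project_a, project_b)
```

All three functions subtract feature values, so a categorical feature such as a programming language or application type has no natural place in them unless it is first given some numeric encoding. That is the limitation the abstract refers to.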
