Further investigation into the use of CBR and stepwise regression to predict development effort for Web hypermedia applications

To date studies using CBR for Web hypermedia effort prediction have not applied adaptation rules to adjust effort according to a given criterion. In addition, when applying n-fold cross-validation, their analysis has been limited to a maximum of three training sets, which according to recent studies, may lead to untrustworthy results. This paper has therefore two objectives. The first is to further investigate the use of CBR for Web hypermedia effort prediction by comparing the prediction accuracy of eight CBR techniques, of which three have previously been compared. The second objective is to compare the prediction accuracy of the best CBR technique against stepwise regression, using a twenty-fold cross-validation. All prediction accuracies were measured using Mean Magnitude of Relative Error (MMRE), Median Magnitude of Relative Error, Prediction at level 1 (1=25%), and boxplots of the residuals. One dataset was used in the estimation process and, according to all measures of prediction accuracy, stepwise regression showed the best prediction accuracy.

[1]  Chris F. Kemerer,et al.  An empirical validation of software cost estimation models , 1987, CACM.

[2]  Isabella Wieczorek,et al.  Resource Estimation in Software Engineering , 2002 .

[3]  Barbara A. Kitchenham,et al.  Effort estimation using analogy , 1996, Proceedings of IEEE 18th International Conference on Software Engineering.

[4]  Tomás Isakowitz,et al.  RMM: a methodology for structured hypermedia design , 1995, CACM.

[5]  Stephen G. MacDonell,et al.  What accuracy statistics really measure , 2001, IEE Proc. Softw..

[6]  Emilia Mendes,et al.  A comparison of development effort estimation techniques for Web hypermedia applications , 2002, Proceedings Eighth IEEE Symposium on Software Metrics.

[7]  Adrian Cowderoy Measures of size and complexity for web-site content , 2000 .

[8]  Larry Schroeder,et al.  Understanding Regression Analysis: An Introductory Guide , 2016 .

[9]  Stefan Schulz CBR-Works A State-of-the-Art Shell for Case-Based Application Building , 1999 .

[10]  Adrian J. C. Cowderoy,et al.  A metrics framework for multimedia creation , 1998, Proceedings Fifth International Software Metrics Symposium. Metrics (Cat. No.98TB100262).

[11]  Ian D. Watson,et al.  Applying case-based reasoning - techniques for the enterprise systems , 1997 .

[12]  Barbara Kitchenham,et al.  The MERMAID Approach to software cost estimation , 1990 .

[13]  Ioannis Stamelos,et al.  A Simulation Tool for Efficient Analogy Based Cost Estimation , 2000, Empirical Software Engineering.

[14]  Emilia Mendes,et al.  A comparison of case-based reasoning approaches , 2002, WWW '02.

[15]  Michelle Cartwright,et al.  Issues on the Effective Use of CBR Technology for Software Project Prediction , 2001, ICCBR.

[16]  Martin Shepperd,et al.  Using Simulation to Evaluate Prediction Techniques , 2001 .

[17]  Martin J. Shepperd,et al.  Making inferences with small numbers of training sets , 2002, IEE Proc. Softw..

[18]  D. Ross Jeffery,et al.  An Empirical Study of Analogy-based Software Effort Estimation , 1999, Empirical Software Engineering.

[19]  Lionel C. Briand,et al.  An assessment and comparison of common software cost estimation modeling techniques , 1999, Proceedings of the 1999 International Conference on Software Engineering (IEEE Cat. No.99CB37002).

[20]  Donald J. Reifer,et al.  Web Development: Estimating Quick-to-Market Software , 2000, IEEE Softw..

[21]  Emilia Mendes,et al.  A Comparison of Length , Complexity and Functionality as Size Measures for Predicting Web Design and Authoring Effort , 2001 .

[22]  Ingunn Myrtveit,et al.  A Controlled Experiment to Assess the Benefits of Estimating with Analogy and Regression Models , 1999, IEEE Trans. Software Eng..

[23]  Franca Garzotto,et al.  Towards a software engineering approach to Web site development , 1998, Proceedings Ninth International Workshop on Software Specification and Design.

[24]  R. S. Pressman,et al.  What a tangled Web we weave [Web engineering] , 2000 .

[25]  Gustavo Rossi,et al.  From Domain Models to Hypermedia Applications: an Object-Oriented Approach , 1994 .

[26]  Emilia Mendes,et al.  Web Metrics-Estimating Design and Authoring Effort , 2001, IEEE Multim..

[27]  Franca Garzotto,et al.  HDM—a model-based approach to hypertext application design , 1993, TOIS.

[28]  Seishi Okamoto,et al.  An Average-Case Analysis of k-Nearest Neighbor Classifier , 1995, ICCBR.

[29]  Martin Shepperd,et al.  Experiences Using Case-Based Reasoning to Predict Software Project Effort , 2000 .

[30]  Emilia Mendes,et al.  Measurement and Effort Prediction for Web Applications , 2001, Web Engineering.

[31]  H. E. Dunsmore,et al.  Software engineering metrics and models , 1986 .