Covariance effect analysis of similarity measurement methods for early construction cost estimation using case-based reasoning

Abstract Quality source data are essential to secure the reliability of early cost estimation, accuracy, and transparency of the estimation process. To improve the reliability of cost estimation results using CBR, a recurring issue in similarity measurement has been to accurately compute the distance among attributes and cases for the retrieval of the most similar singular or plural cases. However, these existing similarity measures have limitations in taking the covariance among attributes into consideration, and reflecting the effects of covariance in the computation of distances among attributes. To deal with this challenging issue, this research examines the weighted Mahalanobis distance based similarity measure applied to CBR cost estimation, and carries out comparative research on the existing distance measurement methods of CBR. To validate the suggested CBR cost model, simulation data test and case studies using 99 multi-family housing complexes are carried out. As a consequence, this research is expected to improve academic value and mathematical and statistical logicality of similarity measurement, especially in application to CBR cost estimation. The results can be further referenced to not only the area of construction cost estimation, but also international market selection, planning and scheduling, noise control and management, and energy management where case-based reasoning can be applied.

[1]  Wensheng Gao,et al.  A case-based reasoning approach to power transformer fault diagnosis using dissolved gas analysis data , 2009 .

[2]  Moonseo Park,et al.  Case Adaptation Method of Case-Based Reasoning for Construction Cost Estimation in Korea , 2012 .

[3]  Hans-Dieter Burkhard Similarity and Distance in Case Based Reasoning , 2001, Fundam. Informaticae.

[4]  Daniel S. Yeung,et al.  Weighted Mahalanobis Distance Kernels for Support Vector Machines , 2007, IEEE Transactions on Neural Networks.

[5]  B. Park,et al.  Choice of neighbor order in nearest-neighbor classification , 2008, 0810.5276.

[6]  Fred L. Collopy,et al.  Error Measures for Generalizing About Forecasting Methods: Empirical Comparisons , 1992 .

[7]  Christopher K. Riesbeck,et al.  Inside Case-Based Reasoning , 1989 .

[8]  Hyun-Soo Lee,et al.  Cost estimation model for building projects using case-based reasoning , 2011 .

[9]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[10]  Chang-Taek Hyun,et al.  A CBR-based hybrid model for predicting a construction duration and cost based on project characteristics in multi-family housing projects , 2010 .

[11]  G. McLachlan Discriminant Analysis and Statistical Pattern Recognition , 1992 .

[12]  Hung-Chi Chen,et al.  A novel CBR system for numeric prediction , 2012, Inf. Sci..

[13]  Kyoungmin Kim,et al.  Preliminary Cost Estimation Model Using Case-Based Reasoning and Genetic Algorithms , 2010, J. Comput. Civ. Eng..

[14]  Joseph Ahn,et al.  Military Facility Cost Estimation System Using Case-Based Reasoning in Korea , 2011, J. Comput. Civ. Eng..

[15]  S. Pal,et al.  Foundations of Soft Case-Based Reasoning: Pal/Soft Case-Based Reasoning , 2004 .

[16]  E. Tulving,et al.  Organization of memory. , 1973 .

[17]  Gavin C. Cawley,et al.  Fast exact leave-one-out cross-validation of sparse least-squares support vector machines , 2004, Neural Networks.

[18]  Chris Tofallis,et al.  Erratum: A better measure of relative prediction accuracy for model selection and model estimation , 2015, J. Oper. Res. Soc..

[19]  Kyung In Kang,et al.  A case-based reasoning cost estimating model using experience by analytic hierarchy process , 2007 .

[20]  Barry Smyth,et al.  Adaptation-Guided Retrieval: Questioning the Similarity Assumption in Reasoning , 1998, Artif. Intell..

[21]  Ken Black,et al.  Business Statistics: Contemporary Decision Making , 1994 .

[22]  Stephen D. Schuette,et al.  Building construction estimating , 1994 .

[23]  P. Mahalanobis On the generalized distance in statistics , 1936 .

[24]  Onur Behzat Tokdemir,et al.  Comparison of Case-Based Reasoning and Artificial Neural Networks , 1999 .

[25]  Ingoo Han,et al.  Global optimization of feature weights and the number of neighbors that combine in a case‐based reasoning system , 2006, Expert Syst. J. Knowl. Eng..

[26]  Beliz Ozorhon,et al.  Case-Based Reasoning Model for International Market Selection , 2006 .

[27]  Jing Du,et al.  Improved Similarity Measure in Case-Based Reasoning with Global Sensitivity Analysis: An Example of Construction Quantity Estimating , 2014, J. Comput. Civ. Eng..

[28]  Moonseo Park,et al.  Data Preprocessing–Based Parametric Cost Model for Building Projects: Case Studies of Korean Construction Projects , 2010 .

[29]  Ian D. Watson,et al.  Applying case-based reasoning - techniques for the enterprise systems , 1997 .

[30]  Oscar Castillo,et al.  Hybrid intelligent system for cardiac arrhythmia classification with Fuzzy K-Nearest Neighbors and neural networks combined with a fuzzy system , 2012, Expert Syst. Appl..

[31]  Richard A. Johnson,et al.  Applied Multivariate Statistical Analysis , 1983 .

[32]  Jui-Sheng Chou,et al.  Web-based CBR system applied to early cost budgeting for pavement maintenance project , 2009, Expert Syst. Appl..

[33]  ChangTaek Hyun,et al.  MRA-based revised CBR model for cost prediction in the early stage of construction projects , 2012, Expert Syst. Appl..

[34]  Dorian Pyle,et al.  Data Preparation for Data Mining , 1999 .

[35]  D. Massart,et al.  The Mahalanobis distance , 2000 .

[36]  Joseph Ahn,et al.  The attribute impact concept: Applications in case-based reasoning and parametric cost estimation , 2014 .

[37]  David C. Rubin,et al.  A Basic-Systems Approach to Autobiographical Memory , 2005 .

[38]  Agnar Aamodt,et al.  Case-Based Reasoning: Foundational Issues, Methodological Variations, and System Approaches , 1994, AI Commun..

[39]  Sevgi Zeynep Dogan,et al.  Determining Attribute Weights in a CBR Model for Early Cost Prediction of Structural Systems , 2006 .

[40]  Nie-Jia Yau,et al.  Case‐Based Reasoning in Construction Management , 1998 .

[41]  Donald Eugene. Farrar,et al.  Multicollinearity in Regression Analysis; the Problem Revisited , 2011 .

[42]  Roger C. Schank,et al.  Dynamic Memory Revisited , 1999 .

[43]  S. Rogers,et al.  Cluster merging based on weighted mahalanobis distance with application in digital mammograph , 1998, Proceedings of the IEEE 1998 National Aerospace and Electronics Conference. NAECON 1998. Celebrating 50 Years (Cat. No.98CH36185).

[44]  Taehoon Hong,et al.  Revised Case-Based Reasoning Model Development Based on Multiple Regression Analysis for Railroad Bridge Construction , 2012 .

[45]  Ratnadip Adhikari,et al.  A neural network based linear ensemble framework for time series forecasting , 2015, Neurocomputing.

[46]  Moonseo Park,et al.  Construction Planning Method Using Case-Based Reasoning (CONPLA-CBR) , 2007 .