Knowledge Discovery Based Simulation System in Construction

Uncertainty is an entrenched characteristic of most construction projects. Typically, probability distributions are utilized to accommodate uncertainty when estimating duration of project’s activities. Distributions are fitted, based on the collected data from construction projects, to estimate activity durations, to assess productivity and cost, and to identify resource bottlenecks using simulation. The subjectivity in selecting these fitted probability distributions is an imprecise process and may significantly affect simulation outputs. Most research works in simulating construction operations has focused predominantly on modeling and has neglected to study the effect of subjective variables on simulation process. Therefore, there is a need for a system, which: (1) handles uncertainty, fuzziness, missing data, and outliers in input data, (2) effectively utilizes historical data, (3) models the effect of qualitative and quantitative variables on the simulation process, (4) enhances simulation modeling capabilities, and (5) optimizes simulation system output(s). The main objective of this research is to develop a knowledge discovery based simulation system for construction operations, which achieves the abovementioned necessities. This system comprises three stages: (i) a Knowledge Discovery Stage (KDS), (ii) a Simulation Stage (SS), and (iii) an Optimization Stage (OS). In the KDS, raw data are prepared for the SS where patterns, which represent knowledge implicitly stored or captured in large databases, are extracted using Fuzzy K-means technique. During the KDS, the effect of qualitative and quantitative variables on construction operation(s) is modelled using Fuzzy Clustering technique. This stage improves the efficiency of data modeling by 10% closer to real data. The movement of units is modeled in the SS where the interaction between flow units and idle times of resources can be examined to discover any bottlenecks and estimate the operation's productivity and cost. The OS, using Pareto ranking technique, assists in selecting and ranking feasible productivity-cost solution(s) for diverse resource combinations under different conditions. An automated general purpose construction simulation language (KEYSTONE) is developed using C#. The developed system is validated and verified using several case studies with sound and satisfactory results, i.e. 4% - 11% digression. The developed research/system benefits both researchers and practitioners because it provides robust simulation modeling tool(s) and optimum resources allocation for construction operations.

[1]  Daniel W. Halpin,et al.  Simulation experiment for improving construction processes , 2004, Proceedings of the 2004 Winter Simulation Conference, 2004..

[2]  Witold Pedrycz,et al.  Advances in Fuzzy Clustering and its Applications , 2007 .

[3]  Robert I. Carr,et al.  Knowledge Based Simulation of Construction , 1987 .

[4]  Christopher M. Taylor A Multi-Tiered Genetic Algorithm for Data Mining and Hypothesis Refinement , 2009 .

[5]  Gregory W. Corder,et al.  Nonparametric Statistics for Non-Statisticians: A Step-by-Step Approach , 2009 .

[6]  Lucio Soibelman,et al.  Data Preparation Process for Construction Knowledge Generation through Knowledge Discovery in Databases , 2002 .

[7]  James H. Garrett,et al.  A Knowledge Discovery Framework for Civil Infrastructure: A Case Study of the Intelligent Workplace , 2000, Engineering with Computers.

[8]  Paul J. Werbos,et al.  The roots of backpropagation , 1994 .

[9]  Jerry M. Mendel,et al.  Generating fuzzy rules by learning from examples , 1992, IEEE Trans. Syst. Man Cybern..

[10]  Semiha Kiziltas,et al.  Prediction of organizational effectiveness in construction companies , 2005 .

[11]  Hiroshi Nakajima,et al.  Efficient Processing of Nested Fuzzy SQL Queries in a Fuzzy Database , 2001, IEEE Trans. Knowl. Data Eng..

[12]  Aminah Robinson Fayek,et al.  A Framework for Integrating Fuzzy Expert Systems and Discrete Event Simulation , 2005 .

[13]  Daniel W. Halpin An Investigation of the Use of Simulation Network for Modeling Construction Operations , 1973 .

[14]  Peter J. Fleming,et al.  An Overview of Evolutionary Algorithms in Multiobjective Optimization , 1995, Evolutionary Computation.

[15]  L. D. Raedt,et al.  Three companions for data mining in first order logic , 2001 .

[16]  C. Fonseca,et al.  GENETIC ALGORITHMS FOR MULTI-OBJECTIVE OPTIMIZATION: FORMULATION, DISCUSSION, AND GENERALIZATION , 1993 .

[17]  Jack P. C. Kleijnen,et al.  Statistical aspects of simulation : An updated survey , 1982 .

[18]  Paul A. Fishwick,et al.  Method for resolving the consistency problem between rule-based and quantitative models using fuzzy simulation , 1997, Defense, Security, and Sensing.

[19]  Fei Hong-xiao Decision Support System Based on the Data Warehouse , 2004 .

[20]  V. Rao Vemuri,et al.  Multiniche Crowding in Genetic Algorithms and Its Application to the Assembly of DNA Restriction-Fragments , 1994, Evolutionary Computation.

[21]  Mohamed Marzouk,et al.  Fuzzy Clustering Model for Estimating Haulers’ Travel Time , 2004 .

[22]  Jitender S. Deogun,et al.  Towards Missing Data Imputation: A Study of Fuzzy K-means Clustering Method , 2004, Rough Sets and Current Trends in Computing.

[23]  A. Sawhney,et al.  Agent-based modeling and simulation in construction , 2003, Proceedings of the 2003 Winter Simulation Conference, 2003..

[24]  R. Suganya,et al.  Data Mining Concepts and Techniques , 2010 .

[25]  Earl Cox,et al.  Fuzzy Modeling And Genetic Algorithms For Data Mining And Exploration , 2005 .

[26]  Jean-François Boulicaut,et al.  Constraint-based Data Mining , 2005, Data Mining and Knowledge Discovery Handbook.

[27]  Kalyanmoy Deb,et al.  Muiltiobjective Optimization Using Nondominated Sorting in Genetic Algorithms , 1994, Evolutionary Computation.

[28]  Kwok-wing Chau,et al.  Application of data warehouse and Decision Support System in construction management , 2003 .

[29]  X. Ye,et al.  Fuzzy SQL Queries with Weights Based on Fuzzy Logic Conjunctions , 2009, 2009 WRI Global Congress on Intelligent Systems.

[30]  Averill M. Law,et al.  Simulation Modeling and Analysis , 1982 .

[31]  Wei Peng,et al.  Improving the management of building life cycle : a data mining approach , 2006 .

[32]  Peter Lawrence,et al.  Discrete-Event Simulation in the Analysis of Construction Processes , 2000 .

[33]  Vineet R. Kamat,et al.  Validating Complex Construction Simulation Models Using 3D Visualization , 2003 .

[34]  Kalyanmoy Deb,et al.  Multiobjective optimization , 1997 .

[35]  David M. Levine,et al.  Statistics for Managers Using Microsoft® Excel , 1997 .

[36]  Tommy W. S. Chow,et al.  A neural-based crowd estimation by hybrid global learning algorithm , 1999, IEEE Trans. Syst. Man Cybern. Part B.

[37]  T. Ross Fuzzy Logic with Engineering Applications , 1994 .

[38]  Deborah J Fisher,et al.  The knowledge process , 1997 .

[39]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery in Databases , 1996, AI Mag..

[40]  Riza C. Berkan,et al.  Fuzzy System Design Principles , 1997 .

[41]  Daniel W. Halpin,et al.  Pile construction productivity assessment , 2005 .

[42]  David A. Bell,et al.  Discovering Case Knowledge Using Data Mining , 1998, PAKDD.

[43]  Yinghua Lin,et al.  A new approach to fuzzy-neural system modeling , 1995, IEEE Trans. Fuzzy Syst..

[44]  D.W. Halpin,et al.  Real world applications of construction process simulation , 1999, WSC'99. 1999 Winter Simulation Conference Proceedings. 'Simulation - A Bridge to the Future' (Cat. No.99CH37038).

[45]  Simaan M. AbouRizk,et al.  Unified Modeling Methodology for Construction Simulation , 2002 .

[46]  Zafar Ullah Khan,et al.  Modeling and parameter ranking of construction labor productivity , 2005 .

[47]  L. Wasserman All of Nonparametric Statistics , 2005 .

[48]  Fang Wang On-site labor productivity estimation using neural networks , 2005 .

[49]  HyunJoo Kim Knowledge Discovery and Machine Learning in Construction Project Databases , 2002 .

[50]  Anil Sawhney,et al.  Adaptive Probabilistic Neural Network-Based Crane Type Selection System , 2002 .

[51]  Daniel W. Halpin,et al.  Visual construction operation simulation: The DISCO approach , 1994 .

[52]  Isao Hayashi,et al.  NN-driven fuzzy reasoning , 1991, Int. J. Approx. Reason..

[53]  Simaan M. AbouRizk,et al.  Simphony-an integrated environment for construction simulation , 2000, 2000 Winter Simulation Conference Proceedings (Cat. No.00CH37165).

[54]  David Arditi,et al.  Integrated Simulation System for Construction Operation and Project Scheduling , 2010, J. Comput. Civ. Eng..

[55]  Khaled A El-Rayes,et al.  Estimating weather impact on the duration of construction activities , 1997 .

[56]  Benjamin C. M. Fung,et al.  A framework for privacy-preserving cluster analysis , 2008, 2008 IEEE International Conference on Intelligence and Security Informatics.

[57]  Raja R. A. Issa,et al.  A Model for Integrating Construction Design and Schedule Data , 2003 .

[58]  John L. Bishop,et al.  General Purpose Visual Simulation System: A functional description , 1990, 1990 Winter Simulation Conference Proceedings.

[59]  Raymond T. Ng,et al.  Finding Aggregate Proximity Relationships and Commonalities in Spatial Data Mining , 1996, IEEE Trans. Knowl. Data Eng..

[60]  Photios G. Ioannou,et al.  Animation of Complex Construction Simulation Models , 1996 .

[61]  Hani G. Melhem,et al.  PREDICTION OF REMAINING SERVICE LIFE OF BRIDGE DECKS USING MACHINE LEARNING , 2003 .

[62]  Daniel W. Halpin,et al.  Simulation of cable-stayed bridges using DISCO , 1994, Proceedings of Winter Simulation Conference.

[63]  Ralf-Detlef Kutsche,et al.  Model-Based Software and Data Integration - First International Workshop, MBSDI 2008, Berlin, Germany, April 1-3, 2008. Proceedings , 2008, MBSDI.