Decision tree for selecting retaining wall systems based on logistic regression analysis

Machine learning techniques generally require thousands of cases to derive a reliable conclusion, but such a large number of excavation cases are very difficult to acquire in the construction domain. There have been efforts to develop retaining wall selection systems using machine learning techniques but based only on a couple of hundred cases of excavation work. The resultant rules were inconsistent and unreliable. This paper proposes an improved decision tree for selecting retaining wall systems. After retaining wall systems were divided into three components, i.e., the retaining wall, the lateral support, and optional grouting, a series of logistic regression analyses, analysis of variance (ANOVA), and chi-square tests were used to derive the variables and a decision tree for selecting retaining wall systems. The prediction accuracy rates for the retaining walls, lateral supports, and grouting were 82.6%, 80.4%, and 76.9%, respectively. These values were higher than the prediction accuracy rate (58.7%) of the decision tree built by an automated machine learning algorithm, Classification and Regression Trees (CART), with the same data set.

[1]  S J Boone DATABASE FOR RETAINING WALL AND GROUND MOVEMENT DUE TO DEEP EXCAVATIONS. DISCUSSION AND CLOSURE , 2002 .

[2]  Lior Rokach,et al.  Data Mining And Knowledge Discovery Handbook , 2005 .

[3]  KimGwangHee,et al.  A Study on the Selection of Retaining Wall Methods Using Neural Networks and Case-Based Reasoning , 2006 .

[4]  S. Menard Six Approaches to Calculating Standardized Logistic Regression Coefficients , 2004 .

[5]  Nie-Jia Yau,et al.  Applying case-based reasoning technique to retaining wall selection , 1998 .

[6]  J. Habbema,et al.  Prognostic Modeling with Logistic Regression Analysis , 2001, Medical decision making : an international journal of the Society for Medical Decision Making.

[7]  Sai On Cheung,et al.  Logistic Likelihood Analysis of Mediation Outcomes , 2006 .

[8]  Ian Witten,et al.  Data Mining , 2000 .

[9]  Hans C. Jessen,et al.  Applied Logistic Regression Analysis , 1996 .

[10]  U-Yeol Park,et al.  A Study on the Selection Model of Retaining Wall Methods Using Support Vector Machines , 2006 .

[11]  Michael J. A. Berry,et al.  Mastering Data Mining: The Art and Science of Customer Relationship Management , 1999 .

[12]  J V Tu,et al.  Advantages and disadvantages of using artificial neural networks versus logistic regression for predicting medical outcomes. , 1996, Journal of clinical epidemiology.

[13]  David Arditi,et al.  Predicting the Outcome of Construction Litigation Using Boosted Decision Trees , 2005 .

[14]  Jyh-Bin Yang,et al.  A rule induction-based knowledge system for retaining wall selection , 2002, Expert Syst. Appl..

[15]  Chee Hong Wong,et al.  Contractor Performance Prediction Model for the United Kingdom Construction Contractor: Study of Logistic Regression Approach , 2004 .

[16]  David W. Hosmer,et al.  Applied Logistic Regression , 1991 .

[17]  Nie-Jia Yau,et al.  Inducing rules for selecting retaining wall systems , 1999 .

[18]  P. Allison Multiple Regression: A Primer , 1994 .

[19]  Javier Artola A solution to the braced excavation collapse in Singapore , 2005 .

[20]  M. Arockiasamy,et al.  Selection of Earth Retention Systems Using Expert System , 1994 .

[21]  Ulrich Smoltczyk,et al.  Geotechnical engineering handbook , 2002 .

[22]  J. B. Yang Hybrid AI system for retaining wall selection , 2004 .