Predictive models from statistically nonconforming databases

Data sets in civil and structural engineering are often statistically challenging. This is because the data are from one-of-a-kind systems such as buildings and other large facilities, as opposed to replicated systems as found in most other fields of engineering. Special care is required in developing predictive models from such data. Herein a database of building natural period and damping is used to provide a rich context for analyzing one-of-a-kind systems. The database is statistically nonconforming in three ways. The data are nested, where measurements from different excitation sources are obtained for each building; the data set is unbalanced with measurements unevenly distributed among different building categories; and the variability is nonuniform. Furthermore, the number of possibly relevant building parameters is large. The goal is to develop a relatively simple, yet general approach for deriving predictive models based on such statistically nonconforming data sets. The approach is based on the statistical framework of generalized linear models and is structured in a manner to allow for engineering insights into the model. In the companion paper, it is shown how this approach can be applied to develop comprehensive models for building natural period and damping.

[1]  Risto Lethonen Multilevel Statistical Models (3rd ed.) , 2005 .

[2]  Harold R. Lindman Analysis of Variance in Experimental Design , 1991 .

[3]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[4]  David R. Anderson,et al.  Model selection and multimodel inference : a practical information-theoretic approach , 2003 .

[5]  M. Woodroofe On Model Selection and the ARC Sine Laws , 1982 .

[6]  S. R. Searle,et al.  Generalized, Linear, and Mixed Models , 2005 .

[7]  Yukio Tamura,et al.  Damping Evaluation Using Full-Scale Data of Buildings in Japan , 2003 .

[8]  Purushottam W. Laud,et al.  Predictive Variable Selection in Generalized Linear Models , 2002 .

[9]  H. Akaike A new look at the statistical model identification , 1974 .

[10]  Sergio Lagomarsino,et al.  Forecast models for damping and vibration periods of buildings , 1993 .

[11]  H. Bozdogan Model selection and Akaike's Information Criterion (AIC): The general theory and its analytical extensions , 1987 .

[12]  H. Goldstein Multilevel Statistical Models , 2006 .

[13]  A. P. Jeary,et al.  Random damping in buildings and its AR model , 1999 .

[14]  Anil K. Chopra,et al.  Period formulas for moment-resisting frame buildings , 1997 .

[15]  Anil K. Chopra,et al.  PERIOD FORMULAS FOR CONCRETE SHEAR WALL BUILDINGS , 1998 .

[16]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[17]  P. McCullagh,et al.  Generalized Linear Models , 1992 .

[18]  J. Berger,et al.  Optimal predictive model selection , 2004, math/0406464.

[19]  H. Scheffé,et al.  The Analysis of Variance , 1960 .