Selection Criterion of Working Correlation Structure for Spatially Correlated Data

Abstract To obtain regression parameter estimates in generalized estimation equation modeling, whether in longitudinal or spatially correlated data, it is necessary to specify the structure of the working correlation matrix. The regression parameter estimates can be affected by the choice of this matrix. Within spatial statistics, the correlation matrix also influences how spatial variability is modeled. Therefore, this study proposes a new method for selecting a working matrix, based on conditioning the variance-covariance matrix naive. The method performance is evaluated by an extensive simulation study, using the marginal distributions of normal, Poisson, and gamma for spatially correlated data. The correlation structure specification is based on semivariogram models, using the Wendland, Matérn, and spherical model families. The results reveal that regarding the hit rates of the true spatial correlation structure of simulated data, the proposed criterion resulted in better performance than competing criteria: quasi-likelihood under the independence model criterion QIC, correlation information criterion CIC, and the Rotnizky–Jewell criterion RJC. The application of an appropriate spatial correlation structure selection was shown using the first-semester average rainfall data of 2021 in the state of Pernambuco, Brazil.

[1]  M. A. Cirillo,et al.  Selection criterion of work matrix as a function of limiting estimates of the covariance matrix of correlated data in GEE , 2018, Biometrical journal. Biometrische Zeitschrift.

[2]  F. De Bastiani,et al.  Local Influence for Spatially Correlated Binomial Data: An Application to the Spodoptera frugiperda Infestation in Corn , 2017 .

[3]  M. Bevilacqua,et al.  Estimation and prediction using generalized Wendland covariance functions under fixed domain asymptotics , 2016, The Annals of Statistics.

[4]  A. Wahed,et al.  A determinant‐based criterion for working correlation structure selection in generalized estimating equations , 2016, Statistics in medicine.

[5]  Isao Yoshimura,et al.  Criterion for the Selection of a Working Correlation Structure in the Generalized Estimating Equation Approach for Longitudinal Balanced Data , 2011 .

[6]  Galit Shmueli,et al.  On Generating Multivariate Poisson Data in Management Science Applications , 2009 .

[7]  You-Gan Wang,et al.  Working‐correlation‐structure identification in generalized estimating equations , 2009, Statistics in medicine.

[8]  V. Carey,et al.  Criteria for Working–Correlation–Structure Selection in GEE , 2007 .

[9]  D. Botter,et al.  Diagnostic techniques in generalized estimating equations , 2007 .

[10]  V. P. Zastavnyi,et al.  On some properties of Buhmann functions , 2006 .

[11]  Donald Hedeker,et al.  Longitudinal Data Analysis , 2006 .

[12]  T. Gneiting Compactly Supported Correlation Functions , 2002 .

[13]  W. Pan Akaike's Information Criterion in Generalized Estimating Equations , 2001, Biometrics.

[14]  B C Sutradhar,et al.  On the Accuracy of Efficiency of Estimating Equation Approach , 2000, Biometrics.

[15]  Kalyan Das,et al.  Miscellanea. On the efficiency of regression estimators in generalised linear models for longitudinal data , 1999 .

[16]  P S Albert,et al.  A generalized estimating equations approach for spatially correlated binary data: applications to the analysis of neuroimaging data. , 1995, Biometrics.

[17]  Lue Ping Zhao,et al.  Multivariate Mean Parameter Estimation by Using a Partly Exponential Model , 1992 .

[18]  N. Jewell,et al.  Hypothesis testing of regression parameters in semiparametric generalized linear models for cluster correlated data , 1990 .

[19]  R. Prentice,et al.  Correlated binary regression with covariates specific to each binary observation. , 1988, Biometrics.

[20]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[21]  K Y Liang,et al.  Longitudinal data analysis for discrete and continuous outcomes. , 1986, Biometrics.

[22]  I. Olkin,et al.  Generating Correlation Matrices , 1984 .

[23]  David A. Belsley,et al.  Regression Diagnostics: Identifying Influential Data and Sources of Collinearity , 1980 .

[24]  R. W. Wedderburn Quasi-likelihood functions, generalized linear models, and the Gauss-Newton method , 1974 .

[25]  Wagner Hugo Bonat Modelling Mixed Types of Outcomes in Additive Genetic Models , 2017, The international journal of biostatistics.

[26]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[27]  B. Abbasi,et al.  Generating correlation matrices for normal random vectors in NORTA algorithm using artificial neural networks , 2008 .

[28]  Brian D. Ripley,et al.  Modern applied statistics with S, 4th Edition , 2002, Statistics and computing.