Influence analysis for linear mixed‐effects models

In this paper, we extend several regression diagnostic techniques commonly used in linear regression, such as leverage, infinitesimal influence, case deletion diagnostics, Cook's distance, and local influence to the linear mixed-effects model. In each case, the proposed new measure has a direct interpretation in terms of the effects on a parameter of interest, and collapses to the familiar linear regression measure when there are no random effects. The new measures are explicitly defined functions and do not necessitate re-estimation of the model, especially for cluster deletion diagnostics. The basis for both the cluster deletion diagnostics and Cook's distance is a generalization of Miller's simple update formula for case deletion for linear models. Pregibon's infinitesimal case deletion diagnostics is adapted to the linear mixed-effects model. A simple compact matrix formula is derived to assess the local influence of the fixed-effects regression coefficients. Finally, a link between the local influence approach and Cook's distance is established. These influence measures are applied to an analysis of 5-year Medicare reimbursements to colon cancer patients to identify the most influential observations and their effects on the fixed-effects coefficients.

[1]  M. Berger,et al.  Detection of Influential Observations in Longitudinal Mixed Effects Regression Models , 2001 .

[2]  R. Dennis Cook,et al.  Leverage and Superleverage in Nonlinear Regression , 1992 .

[3]  R. Cook Detection of influential observation in linear regression , 2000 .

[4]  Eugene Demidenko,et al.  Efficient estimation of general linear mixed effects models , 2002 .

[5]  B Y Yeap,et al.  Robust Two‐Stage Estimation in Hierarchical Nonlinear Models , 2001, Biometrics.

[6]  H. Trussell,et al.  The Distribution of an Arbitrary Studentized Residual and the Effects of Updating in Multiple Regression , 1974 .

[7]  A. Zaslavsky,et al.  Downweighting Influential Clusters in Surveys , 2001 .

[8]  Edward W. Frees,et al.  Influence Diagnostics for Linear Longitudinal Models , 1997 .

[9]  D. Pregibon Logistic Regression Diagnostics , 1981 .

[10]  R. Dennis Cook,et al.  Leverage, local influence and curvature in nonlinear regression , 1993 .

[11]  L. Skovgaard NONLINEAR MODELS FOR REPEATED MEASUREMENT DATA. , 1996 .

[12]  Ronald Christensen,et al.  Case-deletion diagnostics for mixed models , 1992 .

[13]  R. Jennrich,et al.  Unbalanced repeated-measures models with structured covariance matrices. , 1986, Biometrics.

[14]  J. Ware,et al.  Random-effects models for longitudinal data. , 1982, Biometrics.

[15]  Jianxin Pan,et al.  Growth curve models and statistical diagnostics , 2002 .

[16]  D. Bates,et al.  Newton-Raphson and EM Algorithms for Linear Mixed-Effects Models for Repeated-Measures Data , 1988 .

[17]  Christopher J. Nachtsheim,et al.  Diagnostics for mixed-model analysis of variance , 1987 .

[18]  David Wennberg,et al.  The Implications of Regional Variations in Medicare Spending. Part 1: The Content, Quality, and Accessibility of Care , 2003, Annals of Internal Medicine.

[19]  E. Vonesh,et al.  Linear and Nonlinear Models for the Analysis of Repeated Measurements , 1996 .

[20]  Rupert G. Miller An Unbalanced Jackknife , 1974 .

[21]  E Lesaffre,et al.  Local influence in linear mixed models. , 1998, Biometrics.

[22]  S. Chatterjee,et al.  Influential Observations, High Leverage Points, and Outliers in Linear Regression , 1986 .

[23]  S. Weisberg,et al.  Residuals and Influence in Regression , 1982 .