Generalized Collinearity Diagnostics

Abstract Working in the context of the linear model y = Xβ + e, we generalize the concept of variance inflation as a measure of collinearity to a subset of parameters in β (denoted by β 1, with the associated columns of X given by X 1). The essential idea underlying this generalization is to examine the impact on the precision of estimation—in particular, the size of an ellipsoidal joint confidence region for β 1—of less-than-optimal selection of other columns of the design matrix (X 2), treating still other columns (X 0) as unalterable, even hypothetically. In typical applications, X 1 contains a set of dummy regressors coding categories of a qualitative variable or a set of polynomial regressors in a quantitative variable; X 2 contains all other regressors in the model, save the constant, which is in X 0. If σ 2 V denotes the realized variance of , and σ 2 U is the variance associated with an optimal selection of X 2, then the corresponding scaled dispersion ellipsoids to be compared are ℰ v = {x : x′V ...

[1]  Nonresponse on surveys: a Canada- United States comparison* , 1985 .

[2]  Donald W. Marquaridt Generalized Inverses, Ridge Regression, Biased Linear Estimation, and Nonlinear Estimation , 1970 .

[3]  H. Theil Principles of econometrics , 1971 .

[4]  Calyampudi R. Rao,et al.  Linear Statistical Inference and Its Applications. , 1975 .

[5]  M. L. Eaton Multivariate statistics : a vector space approach , 1985 .

[6]  John Fox,et al.  Linear Statistical Models and Related Methods; With Applications to Social Research. , 1985 .

[7]  Klaus Eichner,et al.  Predicting Reponse Rates to Mailed Questionnaires , 1981 .

[8]  T. Heberlein,et al.  Factors affecting response rates to mailed questionnaires: A quantitative analysis of the published literature. , 1978 .

[9]  R. Cook [Demeaning Conditioning Diagnostics through Centering]: Comment , 1984 .

[10]  J. Goyder,et al.  Further Evidence on Factors Affecting Response Rates to Mailed Questionnaires , 1982 .

[11]  Edward E. Leamer,et al.  Model choice and specification analysis , 1983 .

[12]  A. Buse,et al.  Elements of econometrics , 1972 .

[13]  David A. Belsley Demeaning Conditioning Diagnostics through Centering , 1984 .

[14]  Edward Leamer Multicollinearity: A Bayesian Interpretation , 1973 .

[15]  D. Rubin,et al.  Statistical Analysis with Missing Data. , 1989 .

[16]  D. F. Andrews,et al.  Finding the Outliers that Matter , 1978 .

[17]  Thomas A. Heberlein,et al.  The Effectiveness of the Heberlein-Baumgartner Models for Predicting Response Rates to Mailed Questionnaires: European and U.S. Examples , 1981 .