Building Multiple Regression Models Interactively

Automated multiple regression model-building techniques often hide important aspects of data from the data analyst. Such features as nonlinearity, collinearity, outliers, and points with high leverage can profoundly affect automated analyses, yet remain undetected. An alternative technique uses interactive computing and exploratory methods to discover unexpected features of the data. One important advantage of this approach is that the data analyst can use knowledge of the subject matter in the resolution of difficulties. The methods are illustrated with reanalyses of the two data sets used by Hocking (1976, Biometrics 32, 1-44) to illustrate the use of automated regression methods.

[1]  J. J. Sylvester,et al.  Lectures on the Principles of Universal Algebra , 1883 .

[2]  M. Ezekiel A Method of Handling Curvilinear Correlation for Any Number of Variables , 1924 .

[3]  R. Gnanadesikan,et al.  Probability plotting methods for the analysis of data. , 1968, Biometrika.

[4]  A. E. Hoerl,et al.  Ridge Regression: Applications to Nonorthogonal Problems , 1970 .

[5]  W. A. Larsen,et al.  The Use of Partial Residual Plots in Regression Analysis , 1972 .

[6]  G. C. McDonald,et al.  Instabilities of Regression Estimates Relating Air Pollution to Mortality , 1973 .

[7]  Herman Chernoff,et al.  The Use of Faces to Represent Points in k- Dimensional Space Graphically , 1973 .

[8]  C. L. Mallows Some Comments onCp , 1973 .

[9]  F. S. Wood The Use of Individual Effects and Residuals in Fitting Equations to Data , 1973 .

[10]  T. A. Ryan,et al.  Minitab Student Handbook , 1979 .

[11]  R. R. Hocking The analysis and selection of variables in linear regression , 1976 .

[12]  John W. Tukey,et al.  Exploratory Data Analysis. , 1979 .

[13]  Frederick Mosteller,et al.  Data Analysis and Regression , 1978 .

[14]  R. Welsch,et al.  The Hat Matrix in Regression and ANOVA , 1978 .

[15]  Gary C. McDonald,et al.  SOME APPLICATIONS OF THE “CHERNOFF FACES”: A TECHNIQUE FOR GRAPHICALLY REPRESENTING MULTIVARIATE DATA , 1978 .

[16]  W. J. Dixon,et al.  BMDP-77, Biomedical Computer Programs, P-Series , 1979 .

[17]  A. J. Barr,et al.  SAS user's guide , 1979 .

[18]  David C. Hoaglin,et al.  Applications, basics, and computing of exploratory data analysis , 1983 .

[19]  David A. Belsley,et al.  Regression Analysis and its Application: A Data-Oriented Approach.@@@Applied Linear Regression.@@@Regression Diagnostics: Identifying Influential Data and Sources of Collinearity , 1981 .

[20]  F. Hearne,et al.  BMDP-79 Biomedical Computer Programs, P Series , 1982 .