A fast model selection procedure for large families of models

Abstract: An efficient procedure for model selection from large families of models is described. It is closely related to the all possible models approach but is considerably faster. It is based on two principles: first, if a model is accepted, then all models that include it are considered to be accepted; second, if a model is rejected, then all of its submodels are considered to be rejected. Application of the procedure to variable selection in multiple regression is illustrated. General algorithms are described that enable the procedure to be applied to any family of models that forms a lattice. As an example, a problem in multiple comparisons is considered.
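The two principles can be illustrated with a minimal sketch for variable subsets. This is not the paper's algorithm, only a simplified search under stated assumptions: models are sets of variable names, `accepts` is a hypothetical user-supplied test (for example, a lack-of-fit test against the full model), and the visiting order (alternating between the smallest and largest untested subset sizes) is an illustrative choice. The function name `coherent_search` is likewise hypothetical.

```python
from itertools import combinations


def coherent_search(variables, accepts):
    """Classify every subset of `variables` as accepted or rejected.

    `accepts(model)` is an assumed, user-supplied test on a frozenset of
    variable names. A model is tested only when its status is not already
    implied by the two principles:
      1. any model that includes an accepted model is accepted;
      2. any submodel of a rejected model is rejected.
    Returns (minimal accepted models, maximal rejected models, tests run).
    """
    variables = list(variables)
    p = len(variables)

    # Visit subset sizes alternately from the bottom and the top of the
    # lattice (0, p, 1, p-1, ...) so that both principles can prune.
    sizes = []
    lo, hi = 0, p
    while lo <= hi:
        sizes.append(lo)
        if hi != lo:
            sizes.append(hi)
        lo += 1
        hi -= 1

    accepted = []  # minimal accepted models found so far
    rejected = []  # maximal rejected models found so far
    n_tests = 0

    for size in sizes:
        for combo in combinations(variables, size):
            model = frozenset(combo)
            if any(a <= model for a in accepted):
                continue  # principle 1: includes an accepted model
            if any(model <= r for r in rejected):
                continue  # principle 2: submodel of a rejected model
            n_tests += 1
            if accepts(model):
                # Drop accepted models that the new one makes non-minimal.
                accepted = [a for a in accepted if not model <= a]
                accepted.append(model)
            else:
                # Drop rejected models that the new one makes non-maximal.
                rejected = [r for r in rejected if not r <= model]
                rejected.append(model)
    return accepted, rejected, n_tests


# Toy usage: a model is "accepted" exactly when it contains x1 and x2.
def toy_accepts(model):
    return {"x1", "x2"} <= model


acc, rej, n = coherent_search(["x1", "x2", "x3", "x4"], toy_accepts)
print(acc, rej, n)
```

In the toy example the search fits fewer than the 16 models an all possible models approach would require, and it reports only the boundary of the accepted and rejected sets: the minimal accepted models and the maximal rejected models. The sketch assumes the test behaves coherently; with a real significance test the implied decisions are imposed by the procedure, not guaranteed by the data.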
