Improving on "Data mining reconsidered" by K.D. Hoover and S.J. Perez

Kevin Hoover and Stephen Perez take important steps towards resolving some key issues in econometric methodology. They simulate general-to-specific selection for linear, dynamic regression models, and find that their algorithm performs well in re-mining the ?Lovell database?. We discuss developments that improve on their results, automated in PcGets. Monte Carlo experiments and re-analyses of empirical studies show that pre-selection F-tests, encompassing tests, and sub-sample reliability checks all help eliminate ?spuriously-significant? regressors, without impugning recovery of the correct specification.

[1]  H. Akaike Prediction and Entropy , 1985 .

[2]  Kevin D. Hoover,et al.  Data mining reconsidered: encompassing and the general-to-specific approach to specification search , 1997 .

[3]  H. White A Heteroskedasticity-Consistent Covariance Matrix Estimator and a Direct Test for Heteroskedasticity , 1980 .

[4]  R. Engle Autoregressive conditional heteroscedasticity with estimates of the variance of United Kingdom inflation , 1982 .

[5]  J. B. Ramsey,et al.  Tests for Specification Errors in Classical Linear Least‐Squares Regression Analysis , 1969 .

[6]  Rand R. Wilcox,et al.  The statistical implications of pre-test and Stein-rule estimators in econometrics , 1978 .

[7]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[8]  David F. Hendry,et al.  Econometrics-Alchemy or Science? , 1980 .

[9]  J. Stock,et al.  INFERENCE IN LINEAR TIME SERIES MODELS WITH SOME UNIT ROOTS , 1990 .

[10]  David F. Hendry,et al.  The Demand for M1 in the U.S.A., 1960–1988 , 1992 .

[11]  Bruce E. Hansen,et al.  Testing for parameter instability in linear models , 1992 .

[12]  David F. Hendry,et al.  Computer Automation of General-to-Specific Model Selection Procedures , 2001 .

[13]  George G. Judge,et al.  The Statistical Consequences of Preliminary Test Estimators in Regression , 1973 .

[14]  L. Godfrey,et al.  REGRESSION EQUATIONS WHEN THE REGRESSORS INCLUDE LAGGED DEPENDENT VARIABLES , 1978 .

[15]  B. G. Quinn,et al.  The determination of the order of an autoregression , 1979 .