Linear model selection by cross-validation

We consider the problem of model (or variable) selection in the classical regression model, based on cross-validation with an added penalty term that penalizes overfitting. Under weak conditions, the new criterion is shown to be strongly consistent in the sense that, with probability one, for all large n the criterion chooses the smallest true model. The penalty function, denoted C_n, depends on the sample size n and is chosen to ensure consistent selection of the true model. Various choices of C_n have been suggested in the model-selection literature. In this paper we show that a particular choice of C_n based on the observed data, which makes the penalty random, preserves the consistency property and provides improved performance over a fixed choice of C_n.
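To make the idea concrete, here is a minimal sketch of a penalized cross-validation criterion of this general flavor. The specific penalty form C_n * (model size) / n, the leave-one-out error, and the choice C_n = log(n) are illustrative assumptions for this sketch, not the paper's exact criterion.

```python
import numpy as np

def loo_cv_score(X, y):
    """Leave-one-out CV mean squared error for least squares,
    computed via the hat-matrix shortcut (no refitting)."""
    H = X @ np.linalg.solve(X.T @ X, X.T)          # hat matrix
    resid = y - H @ y
    loo_resid = resid / (1.0 - np.diag(H))          # LOO residuals
    return float(np.mean(loo_resid ** 2))

def select_model(X, y, candidates, C_n):
    """Choose the candidate variable subset minimizing
    CV error + C_n * (model size) / n  (illustrative penalty form)."""
    n = len(y)
    scores = {c: loo_cv_score(X[:, list(c)], y) + C_n * len(c) / n
              for c in candidates}
    return min(scores, key=scores.get), scores

# Synthetic demo: only the first two of four predictors matter.
rng = np.random.default_rng(0)
n, p = 200, 4
X = rng.standard_normal((n, p))
y = 3.0 * X[:, 0] - 2.0 * X[:, 1] + 0.5 * rng.standard_normal(n)

candidates = [(0,), (0, 1), (0, 1, 2), (0, 1, 2, 3)]
best, scores = select_model(X, y, candidates, C_n=np.log(n))
```

With a growing penalty such as C_n = log(n), larger models that add only irrelevant variables pay a penalty that eventually outweighs their small reduction in cross-validation error, which is the intuition behind the consistency result.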