A monte carlo experiment was used to evaluate four procedures for estimating the population squared cross-validity of a sample least squares re gression equation. Four levels of population squared multiple correlation (Rp 2) and three levels of number of predictors (n) were factorially crossed to produce 12 population covariance matrices. Ran dom samples at four levels of sample size (N) were drawn from each population. The levels of N, n, and RP 2 were carefully selected to ensure relevance of simulation results for much applied research. The least squares regression equation from each isample was applied in its respective population to obtain the actual population squared cross-validity (Rcv 2). Estimates of Rcv 2 were computed using three formula estimators and the double cross-validation procedure. The results of the experiment demon strate that two estimators which have previously been advocated in the literature were negatively biased and exhibited poor accuracy. The negative bias for these two estimators increased as Rp 2 de creased and as the ratio of N to n decreased. As a consequence, their biases were most evident in small samples where cross-validation is imperative. In contrast, the third estimator was quite accurate and virtually unbiased within the scope of this simulation. This third estimator is recommended for applied settings which are adequately approxi mated by the correlation model.
[1]
R. Wherry,et al.
A New Formula for Predicting the Shrinkage of the Coefficient of Multiple Correlation
,
1931
.
[2]
John G. Claudy.
Multiple Regression and Validity Estimation in One Sample
,
1978
.
[3]
George R. Burket,et al.
A study of reduced rank models for multiple prediction
,
1943
.
[4]
Neal Schmitt,et al.
A Monte Carlo evaluation of three formula estimates of cross-validated multiple correlation.
,
1977
.
[5]
H. Gulliksen.
Theory of mental tests
,
1952
.
[6]
Frank L. Schmidt,et al.
The Relative Efficiency of Regression and Simple Unit Predictor Weights in Applied Differential Psychology
,
1971
.
[7]
F. Lord.
EFFICIENCY OF PREDICTION WHEN A REGRESSION EQUATION FROM ONE SAMPLE IS USED IN A NEW SAMPLE
,
1950
.
[8]
Michael W. Browne,et al.
PREDICTIVE VALIDITY OF A LINEAR REGRESSION EQUATION
,
1975
.
[9]
C. I. Mosier.
I. Problems and Designs of Cross-Validation 1
,
1951
.
[10]
P. Herzberg.
The Parameters of Cross-Validation
,
1967
.