Alternatives to least squares in multiple linear regression to predict production traits

Summary The objectives of this study were to describe and compare the efficacy of least-squares, ridge-and principal-components-regression methodology for analysis of data for which multicollinearity existed among regressor variables. A data set composed of calfhood traits (regressor variables) and first-progeny weaning weight (the dependent variable) of 771 Angus females was used. The statistics variance-inflation factors, eigenvalues, condition numbers and variance proportions were used in diagnosing multicollinearity. Although biased, the regression-coefficient estimates produced by ridge and principal components regression had smaller variances than estimates obtained from least squares. Between the two alternatives to least squares, ridge regression produced better results than principal-components regression. Ridge and least-squares regression produced similar regression-coefficient estimates, prediction sum of squares (PRESS-like) statistics and, to a lesser extent, conceptual predictive criteria (Cp-like) statistics. For prediction, the performance of least-squares and ridge-regression models was very similar. Zusammenfassung Alternativen zu Minimum-Quadrat bei multipler linearer Regression zur Voraussage von Leistungsmerkmalen Die Untersuchung bezieht auf Beschreibung und Vergleich der Wirksamkeit von Minimum-Quadrat, Ridge und Hauptkomponentenregressionsmethodik zur Analyse von Daten mit Multicollinearitat zwischen Regressorvariablen. Das Datenmaterial aus Kalberleistungen (Regressorvariable) und Absatzgewicht der ersten Nachkommen (abhangige Variable) von 771 Anguskuhen wurde verwendet. Die statistischen Grosen Varianz inflationsfaktoren, Eigenwerte, Bedingungszahl und Varianzverhaltnisse wurden zur Diagnose der Multicollinearitat verwendet. Obwohl verzerrt, haben die Regressionskoeffizientenschatzungen aus Ridge-und Hauptkomponentenregression geringere Varianzen als Minimum-Quadratschatzungen. Von den zwei Alternativen hat Ridge-Regression bessere Resultate als Hauptkomponentenregression ergeben. Ridge und Miniumum-Quadratregression haben ahnliche Regressionskoeffizienten ergeben, Schatzungsquadratsumme (PRESS-ahnlich), Statistik und, in einem geringeren Ausmas, Schatzungskriterien (Cp-ahnliche)-statistik. Im Hinblich auf Voraussage war die Leistung der Minimum-Quadrat-und Ridge-Regressions-Modelle sehr anlich.

[1]  Robert W. Kennard,et al.  A Note on the Cp Statistic , 1971 .

[2]  G. M. Furnival All Possible Regressions with Less Computation , 1971 .

[3]  Lawrence S. Mayer,et al.  On Biased Estimation in Linear Models , 1973 .

[4]  Hrishikesh D. Vinod,et al.  Application of New Ridge Regression Methods to a Study of Bell System Scale Economies , 1976 .

[5]  A. E. Hoerl,et al.  Ridge Regression: Applications to Nonorthogonal Problems , 1970 .

[6]  R. Bourdon,et al.  Genetic analysis of absolute growth measurements, relative growth rate and restricted selection indices in red Angus cattle. , 1990, Journal of animal science.

[7]  Shizhong Xu,et al.  The application of ridge regression to multiple trait selection indices1 , 1990 .

[8]  Douglas M. Hawkins,et al.  On the Investigation of Alternative Regressions by Principal Component Analysis , 1973 .

[9]  Estimating Returns to Agricultural Research, Extension, and Teaching at the State Level , 1983, Journal of Agricultural and Applied Economics.

[10]  R. R. Hocking,et al.  A Class of Biased Estimators in Linear Regression , 1976 .

[11]  Robert W. Kennard,et al.  A Note on the Cp Statistic , 1971 .

[12]  Ronald D. Snee,et al.  Validation of Regression Models: Methods and Examples , 1977 .

[13]  C. L. Mallows Some comments on C_p , 1973 .

[14]  W. Massy Principal Components Regression in Exploratory Statistical Research , 1965 .

[15]  R. R. Hocking The analysis and selection of variables in linear regression , 1976 .