Estimating Optimal Transformations for Multiple Regression Using the ACE Algorithm

This paper introduces the alternating conditional expectation (ACE) algorithm of Breiman and Friedman (1985) for estimating the trans- formations of a response and a set of predictor variables in multiple re- gression that produce the maximum linear effect between the (transformed) independent variables and the (transformed) response variable. These trans- formations can give the data analyst insight into the relationships between these variables so that relationship between them can be best described and non-linear relationships can be uncovered. The power and usefulness of ACE guided transformation in multivariate analysis are illustrated using a simulated data set as well as a real data set. The results from these exam- ples clearly demonstrate that ACE is able to identify the correct functional forms, to reveal more accurate relationships, and to improve the model fit considerably compared to the conventional linear model.

[1]  R. Tibshirani,et al.  Generalized Additive Models , 1991 .

[2]  D. Cox,et al.  An Analysis of Transformations , 1964 .

[3]  Frederick Mosteller,et al.  Data Analysis and Regression , 1978 .

[4]  J. Friedman,et al.  Estimating Optimal Transformations for Multiple Regression and Correlation. , 1985 .

[5]  Adrian E. Raftery,et al.  Model Selection for Generalized Linear Models via GLIB, with Application to Epidemiology , 1993 .

[6]  J. R. Koehler,et al.  Modern Applied Statistics with S-Plus. , 1996 .

[7]  B. Silverman,et al.  Nonparametric Regression and Generalized Linear Models: A roughness penalty approach , 1993 .

[8]  David J. Hand,et al.  A Handbook of Small Data Sets , 1993 .

[9]  A. Raftery Bayesian Model Selection in Social Research , 1995 .

[10]  D. Cox Nonparametric Regression and Generalized Linear Models: A roughness penalty approach , 1993 .

[11]  J. Friedman,et al.  Projection Pursuit Regression , 1981 .

[12]  D. Ruppert,et al.  Transformation and Weighting in Regression , 1988 .

[13]  Richard D. Deveaux,et al.  Applied Smoothing Techniques for Data Analysis , 1999, Technometrics.

[14]  J. Kruskal Analysis of Factorial Experiments by Estimating Monotone Transformations of the Data , 1965 .

[15]  P. Royston,et al.  Regression using fractional polynomials of continuous covariates: parsimonious parametric modelling. , 1994 .

[16]  R. Simon,et al.  Flexible regression models with cubic splines. , 1989, Statistics in medicine.

[17]  Jerome H. Friedman,et al.  Smoothing of Scatterplots , 1982 .

[18]  H. Akaike,et al.  Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .

[19]  S. Weisberg,et al.  Residuals and Influence in Regression , 1982 .

[20]  F. James Rohlf,et al.  Biometry: The Principles and Practice of Statistics in Biological Research , 1969 .

[21]  P Royston,et al.  A strategy for modelling the effect of a continuous covariate in medicine and epidemiology. , 2000, Statistics in medicine.

[22]  G. Box,et al.  Transformation of the Independent Variables , 1962 .

[23]  R. Tibshirani Estimating Transformations for Regression via Additivity and Variance Stabilization , 1988 .