SUMMARY The relationship between a response variable and one or more continuous covariates is often curved. Attempts to represent curvature in single- or multiple-regression models are usually made by means of polynomials of the covariates, typically quadratics. However, low order polynomials offer a limited family of shapes, and high order polynomials may fit poorly at the extreme values of the covariates. We propose an extended family of curves, which we call fractional polynomials, whose power terms are restricted to a small predefined set of integer and non-integer values. The powers are selected so that conventional polynomials are a subset of the family. Regression models using fractional polynomials of the covariates have appeared in the literature in an ad hoc fashion over a long period; we provide a unified description and a degree of formalization for them. They are shown to have considerable flexibility and are straightforward to fit using standard methods. We suggest an iterative algorithm for covariate selection and model fitting when several covariates are available. We give six examples of the use of fractional polynomial models in three types of regression analysis: normal errors, logistic and Cox regression. The examples all relate to medical data: fetal measurements, immunoglobulin concentrations in children, diabetes in children, infertility in women, myelomatosis (a type of leukaemia) and leg ulcers.
[1]
A. C. Atkinson,et al.
Diagnostic Tests for Transformations
,
1986
.
[2]
Sue J. Welham,et al.
Genstat 5 release 3 reference manual
,
1994
.
[3]
L. Walford,et al.
Bioenergetics and Growth
,
1947
.
[4]
D. Ruppert,et al.
Transformation and Weighting in Regression
,
1988
.
[5]
J. Kent.
Information gain and a general measure of correlation
,
1983
.
[6]
T J Cole,et al.
Smoothing reference centile curves: the LMS method and penalized likelihood.
,
1992,
Statistics in medicine.
[7]
D. Altman.
Construction of age-related reference centiles using absolute residuals.
,
1993,
Statistics in medicine.
[8]
R. Tibshirani,et al.
Generalized Additive Models
,
1991
.
[9]
A. Atkinson,et al.
Grouped Likelihood for the Shifted Power Transformation
,
1991
.
[10]
R. Tibshirani,et al.
Generalized Additive Models
,
1986
.
[11]
F. Richards.
A Method of Maximum‐Likelihood Estimation
,
1961
.
[12]
D. Altman,et al.
Charts of fetal size: 3. Abdominal measurements
,
1994,
British journal of obstetrics and gynaecology.
[13]
Standard errors resilient to error variance misspecification
,
1988
.
[14]
David W. Hosmer,et al.
Applied Logistic Regression
,
1991
.
[15]
John T. Kent,et al.
Measures of dependence for censored survival data
,
1988
.