A strategy for modelling the effect of a continuous covariate in medicine and epidemiology.

Low-dimensional parametric models are well understood, straightforward to communicate to other workers, yield very smooth curves and may easily be checked for consistency with background scientific knowledge. They should therefore be ideal tools for representing smooth relationships between a continuous predictor and an outcome variable in medicine and epidemiology. Unfortunately, only a severely restricted set of such models is used routinely in practical data analysis: typically linear, quadratic or occasionally cubic polynomials, or sometimes a power or logarithmic transformation of the covariate. Since their flexibility is limited, it is not surprising that the fit of such models is often poor. Royston and Altman's recent work on fractional polynomials has extended the range of available functions. It is clearly crucial that the final model chosen fits the data well. Achieving a good fit with minimal restriction on the functional form has motivated the major recent research effort on non-parametric curve-fitting techniques. Here I propose that one such model, a (possibly over-fitted) cubic smoothing spline, may be used to define a reference curve against which the fit of a parametric model may be checked. I suggest a significance test for this purpose and examine its type I error and power in a small simulation study. Several families of parametric models, including some with sigmoid curves, are considered, and their suitability for fitting regression relationships found in several real data sets is investigated. In all the example data sets, a simple parametric model can be found which fits the data approximately as well as a cubic smoothing spline, but without the latter's tendency towards artefacts in the fitted curve.
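
As an illustration of the kind of low-dimensional parametric family referred to above, the following minimal sketch selects a first-degree fractional polynomial by least squares. It is not the procedure of the paper: the spline reference curve and the significance test are not shown, normal errors are assumed, and the function names (`fp_transform`, `fit_line`, `best_fp1`) and the noise-free example data are hypothetical. The power set is the conventional one for fractional polynomials, with power 0 read as log(x).

```python
import math

# Conventional fractional polynomial power set; power 0 denotes log(x).
FP_POWERS = (-2, -1, -0.5, 0, 0.5, 1, 2, 3)


def fp_transform(x, p):
    """Return x**p, with p == 0 read as the natural logarithm (x must be > 0)."""
    return math.log(x) if p == 0 else x ** p


def fit_line(z, y):
    """Ordinary least squares of y on one regressor z; returns (b0, b1, rss)."""
    n = len(z)
    zbar = sum(z) / n
    ybar = sum(y) / n
    szz = sum((zi - zbar) ** 2 for zi in z)
    szy = sum((zi - zbar) * (yi - ybar) for zi, yi in zip(z, y))
    b1 = szy / szz
    b0 = ybar - b1 * zbar
    rss = sum((yi - b0 - b1 * zi) ** 2 for zi, yi in zip(z, y))
    return b0, b1, rss


def best_fp1(x, y):
    """Fit y = b0 + b1 * x^(p) for each candidate power; keep the smallest RSS."""
    fits = []
    for p in FP_POWERS:
        z = [fp_transform(xi, p) for xi in x]
        b0, b1, rss = fit_line(z, y)
        fits.append((rss, p, b0, b1))
    rss, p, b0, b1 = min(fits)
    return {"power": p, "intercept": b0, "slope": b1, "rss": rss}


# Hypothetical noise-free data generated from y = 2 + 3*log(x).
x = [0.5 + 0.25 * i for i in range(40)]
y = [2 + 3 * math.log(xi) for xi in x]

result = best_fp1(x, y)
linear_rss = fit_line(x, y)[2]  # straight-line fit, for comparison
```

In this constructed example the logarithmic member of the family reproduces the data, whereas the straight line leaves a clearly larger residual sum of squares. With real data the candidate powers would be compared by deviance, and the chosen curve would then be checked against a flexible reference fit such as the smoothing spline proposed here.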
