Problems of correlations between explanatory variables in multiple regression analyses in the dental literature

Multivariable analysis is a widely used statistical methodology for investigating associations amongst clinical variables. However, the problems of collinearity and multicollinearity, which can give rise to spurious results, have in the past frequently been disregarded in dental research. This article illustrates and explains the problems which may be encountered, in the hope of increasing awareness and understanding of these issues, thereby improving the quality of the statistical analyses undertaken in dental research. Three examples from different clinical dental specialities are used to demonstrate how to diagnose the problem of collinearity/multicollinearity in multiple regression analyses and to illustrate how collinearity/multicollinearity can seriously distort the model development process. Lack of awareness of these problems can give rise to misleading results and erroneous interpretations. Multivariable analysis is a useful tool for dental research, though only if its users thoroughly understand the assumptions and limitations of these methods. It would benefit evidence-based dentistry enormously if researchers were more aware of both the complexities involved in multiple regression when using these methods and of the need for expert statistical consultation in developing study design and selecting appropriate statistical methodologies.

[1]  M S Gilthorpe,et al.  Is Reduction of Pocket Probing Depth Correlated with the Baseline Value or is it “Mathematical Coupling”? , 2002, Journal of dental research.

[2]  S A Glantz,et al.  Multiple regression for physiological data analysis: the problem of multicollinearity. , 1985, The American journal of physiology.

[3]  R. F. Ling,et al.  Some cautionary notes on the use of principal components regression , 1998 .

[4]  S. Chatterjee,et al.  Regression Analysis by Example (2nd ed.). , 1992 .

[5]  D G Altman,et al.  Statistics in medical journals: developments in the 1980s. , 1991, Statistics in medicine.

[6]  E. L. Batista,et al.  Guided tissue regeneration associated with bovine-derived anorganic bone in mandibular class II furcation defects. 6-month results at re-entry. , 2000, Journal of periodontology.

[7]  Mark W. Watson Introduction to econometrics. , 1968 .

[8]  Yu-Kang Tu,et al.  Collinearity in linear regression is a serious problem in oral health research. , 2004, European journal of oral sciences.

[9]  E. Pedhazur Multiple Regression in Behavioral Research: Explanation and Prediction , 1982 .

[10]  D G Altman,et al.  Statistics in medical journals. , 1982, Statistics in medicine.

[11]  J F Osborn,et al.  Further statistics in dentistry Part 6: Multiple linear regression , 2002, British Dental Journal.

[12]  J. Elashoff,et al.  Multiple Regression in Behavioral Research. , 1975 .

[13]  Y Yamashita,et al.  Evaluation of mutans streptococci in plaque and saliva: correlation with caries development in preschool children. , 2003, Journal of dentistry.

[14]  S. Chatterjee,et al.  Regression Analysis by Example , 1979 .

[15]  A. E. Hoerl,et al.  Ridge Regression: Applications to Nonorthogonal Problems , 1970 .

[16]  J. Sterne,et al.  Essential Medical Statistics , 2003 .

[17]  N. Draper,et al.  Applied Regression Analysis: Draper/Applied Regression Analysis , 1998 .

[18]  Yu-Kang Tu,et al.  Ratio variables in regression analysis can give rise to spurious results: illustration from two studies in periodontology. , 2004, Journal of dentistry.

[19]  D. Moles Further statistics in dentistry: Introduction , 2002, British Dental Journal.

[20]  P. Knauf,et al.  Monthly publication in 1986 , 1985 .

[21]  A. E. Hoerl,et al.  Ridge regression: biased estimation for nonorthogonal problems , 2000 .

[22]  Leif Jansson,et al.  Prediction of marginal bone loss and tooth loss--a prospective study over 20 years. , 2002, Journal of clinical periodontology.

[23]  N. Draper,et al.  Applied Regression Analysis , 1967 .

[24]  Robert H. Shumway,et al.  Applied Regression and Analysis of Variance for Stationary Time Series , 1970 .

[25]  H. Theil Introduction to econometrics , 1978 .

[26]  Richard F. Gunst,et al.  Applied Regression Analysis , 1999, Technometrics.

[27]  Arthur E. Hoerl,et al.  Ridge Regression: Biased Estimation for Nonorthogonal Problems , 2000, Technometrics.

[28]  Yu-Kang Tu,et al.  Mathematical coupling can undermine the statistical assessment of clinical research: illustration from the treatment of guided tissue regeneration. , 2004 .