Dichotomising continuous data while retaining statistical power using a distributional approach

Dichotomisation of continuous data is known to be hugely problematic because information is lost, power is reduced and relationships may be obscured or changed. However, not only are differences in means difficult for clinicians to interpret, but thresholds also occur in many areas of medical practice and cannot be ignored. In recognition of both the problems of dichotomisation and the ways in which it may be useful clinically, we have used a distributional approach to derive a difference in proportions with a 95% CI that retains the precision and the power of the CI for the equivalent difference in means. In this way, we propose a dual approach that analyses continuous data using both means and proportions to replace dichotomisation alone and that may be useful in certain situations. We illustrate this work with examples and simulations that show good performance of the parametric approach under standard distributional assumptions from our own research and from the literature.

[1]  S. Senn U is for Unease: Reasons for Mistrusting Overlap Measures for Reporting Clinical Trials , 2011 .

[2]  E. de Souza,et al.  Prematurity and low birth weight among Brazilian adolescents and young adults. , 2010, Journal of pediatric and adolescent gynecology.

[3]  S. Suissa,et al.  Binary methods for continuous outcomes: a parametric alternative. , 1991, Journal of clinical epidemiology.

[4]  S. Dodd,et al.  Substance abuse during pregnancy: effect on pregnancy outcomes. , 2010, European journal of obstetrics, gynecology, and reproductive biology.

[5]  B. Tom,et al.  The Impact of Dichotomization on the Efficiency of Testing for an Interaction Effect in Exponential Family Models , 2004 .

[6]  H. R. Anderson,et al.  Maternal cotinine level during pregnancy and birthweight for gestational age. , 1998, International journal of epidemiology.

[7]  Douglas G Altman,et al.  Dichotomizing continuous predictors in multiple regression: a bad idea , 2006, Statistics in medicine.

[8]  An efficient test for the analysis of dichotomized variables when the reliability is known , 2006, Statistics in medicine.

[9]  Herng‐Ching Lin,et al.  No increased risk of adverse pregnancy outcomes in women with urinary tract infections: a nationwide population‐based study , 2010, Acta obstetricia et gynecologica Scandinavica.

[10]  Stephen Senn,et al.  Measurement in clinical trials: A neglected issue for statisticians? , 2009, Statistics in medicine.

[11]  Barry Kurt Moser,et al.  Odds ratios for a continuous outcome variable without dichotomizing , 2004, Statistics in medicine.

[12]  J. Fleiss,et al.  A simple approximation for calculating sample sizes for comparing independent proportions. , 1980, Biometrics.

[13]  A. Kosinski,et al.  Power considerations when a continuous outcome variable is dichotomized. , 1998, Journal of biopharmaceutical statistics.

[14]  S. Chinn A simple method for converting an odds ratio to effect size for use in meta-analysis. , 2000, Statistics in medicine.

[15]  Patrick Royston,et al.  The cost of dichotomising continuous variables , 2006, BMJ : British Medical Journal.

[16]  D. Ragland,et al.  Dichotomizing Continuous Outcome Variables: Dependence of the Magnitude of Association and Statistical Power on the Cutpoint , 1992, Epidemiology.

[17]  David L Streiner,et al.  Breaking up is Hard to Do: The Heartbreak of Dichotomizing Continuous Data , 2002, Canadian journal of psychiatry. Revue canadienne de psychiatrie.

[18]  Valerii Fedorov,et al.  Consequences of dichotomization , 2009, Pharmaceutical statistics.