Bivariate median splits and spurious statistical significance.

Despite pleas from methodologists, researchers often continue to dichotomize continuous predictor variables. The primary argument against this practice has been that it underestimates the strength of relationships and reduces statistical power. Although this argument is correct for relationships involving a single predictor, a different problem can arise when multiple predictors are involved. Specifically, dichotomizing 2 continuous independent variables can lead to false statistical significance. As a result, the typical justification for using a median split as long as results continue to be statistically significant is invalid, because such results may in fact be spurious. Thus, researchers who dichotomize multiple continuous predictor variables not only may lose power to detect true predictor-criterion relationships in some situations but also may dramatically increase the probability of Type I errors in other situations. For many years, behavioral statisticians have chided psychological researchers for artificially dichotomizing continuous variables. Despite many such methodological pleas (e.g., Cohen, 1983,1990; Humphreys, 1978; McNemar, 1969, pp. 444449), the ubiquitous median split has retained its popularity in many areas of psychology. Indeed, some well-known psychological theories such as the Type A/Type B personality distinction, the reflection/impulsivity distinction, and the models of sex roles discussed by Bern (1977) and Spence and Helmreich (1978) are based on forming dichotomies from continuous scores obtained from psychological instruments. Studies have demonstrated that dichotomization is not just an abstract methodological issue but can in fact greatly impact the interpretation of empirical results (e.g., Block, Block, & Harrington, 1974; Lubinski, Tellegen, & Butcher, 1983; Spence, 1983; Tellegen & Lubinski, 1983). Why have researchers continued to ignore methodologists' advice not to dichotomize their measures? Certainly one obvious reason is that data analysis procedures are generally somewhat simpler for dichotomous measures than for continuous measures. A common defense on the part of researchers is that as long as they can obtain statistical significance with a dichotomous measure, why should they have to bother with the more complicated statistical technique likely to be required by using a continuous measure? Underlying this argument is an implicit assumption that the effect of artificially dichotomizing a continuous measure is necessarily to lower the power of obtaining statistical significance. In fact, various methodologists have noted that dichotomizing a continuous measure in effect throws away information because individuals within a subgroup are treated as if they were identical with respect to the attribute in question, when there is evidence to the contrary. This loss of information typically re

[1]  F. F. Stephan,et al.  Statistical Procedures and their Mathematical Bases. , 1936 .

[2]  Maurice G. Kendall The advanced theory of statistics , 1958 .

[3]  S. Rosenbaum Moments of a Truncated Bivariate Normal Distribution , 1961 .

[4]  W. G. Cochran Errors of Measurement in Statistics , 1968 .

[5]  Jacob Cohen Statistical Power Analysis for the Behavioral Sciences , 1969, The SAGE Encyclopedia of Research Design.

[6]  L. Humphreys,et al.  Pseudo-Orthogonal and Other Analysis of Variance Designs Involving Individual-Differences Variables. , 1974 .

[7]  J. Block,et al.  Some Misgivings About the Matching Familiar Figures Test as a Measure of Reflection-Impulsivity. , 1974 .

[8]  Jacob Cohen,et al.  Applied multiple regression/correlation analysis for the behavioral sciences , 1979 .

[9]  L. Cronbach,et al.  Aptitudes and instructional methods: A handbook for research on interactions , 1977 .

[10]  S. Bem,et al.  On the utility of alternative procedures for assessing psychological androgyny. , 1977, Journal of consulting and clinical psychology.

[11]  Lloyd G. Humphreys,et al.  Research on individual differences requires correlational analysis, not ANOVA , 1978 .

[12]  R. Helmreich,et al.  Masculinity & femininity: Their psychological dimensions, correlates, and antecedents , 1978 .

[13]  Jacob Cohen Partialed products are interactions; partialed powers are curve components. , 1978 .

[14]  D. A. Kenny,et al.  Correlation and causality , 1979 .

[15]  William S. Cleveland The elements of graphing data , 1980 .

[16]  A. Ashman Strategic behavior and linguistic functions of institutionalized moderately retarded persons. , 1982, International journal of rehabilitation research. Internationale Zeitschrift fur Rehabilitationsforschung. Revue internationale de recherches de readaptation.

[17]  An Attempted Modification of Impulsivity and Self-Esteem in Kindergartners. , 1982 .

[18]  J. Spence Comment on Lubinski, Tellegen, and Butcher's "Masculinity, femininity, and androgyny viewed and assessed as distinct concepts." , 1983 .

[19]  Jacob Cohen The Cost of Dichotomization , 1983 .

[20]  James N. Butcher,et al.  Masculinity, femininity, and androgyny viewed and assessed as distinct concepts. , 1983 .

[21]  Scott E. Maxwell,et al.  Another look at ANCOVA versus blocking. , 1984 .

[22]  Humor Preference and Coping with Stress , 1984, Psychological reports.

[23]  Mark Snyder,et al.  "To Carve Nature at Its Joints": On the Existence of Discrete Classes in Personality , 1985 .

[24]  H. F. Gollob,et al.  The Power of Hypothesis Tests for Comparisons , 1986 .

[25]  L. S. Madison,et al.  Infant behavior and development in relation to fetal movement and habituation. , 1986, Child development.

[26]  P. A. Thompson,et al.  Robustness properties of nonorthogonal analysis of variance. , 1987 .

[27]  T. Tang Effects of type A personality and leisure ethic on Chinese college students' leisure activities and academic performance. , 1987, The Journal of social psychology.

[28]  P. Lachenbruch Statistical Power Analysis for the Behavioral Sciences (2nd ed.) , 1989 .

[29]  J. Galassi,et al.  Internal attributions and types of depression in college students: The learned helplessness model revisited. , 1989 .

[30]  J. Jaccard,et al.  Interaction effects in multiple regression , 1992 .

[31]  Jacob Cohen,et al.  THINGS I HAVE LEARNED (SO FAR) , 1990 .

[32]  Scott E. Maxwell,et al.  Designing Experiments and Analyzing Data: A Model Comparison Perspective , 1990 .

[33]  L. Humphreys,et al.  Assessing spurious "moderator effects": Illustrated substantively with the hypothesized ("synergistic") relation between spatial and mathematical ability. , 1990, Psychological bulletin.

[34]  S. West,et al.  Multiple Regression: Testing and Interpreting Interactions. , 1994 .

[35]  R. Fildes Journal of the Royal Statistical Society (B): Gary K. Grunwald, Adrian E. Raftery and Peter Guttorp, 1993, “Time series of continuous proportions”, 55, 103–116.☆ , 1993 .