Loss of Power in Logistic, Ordinal Logistic, and Probit Regression When an Outcome Variable Is Coarsely Categorized

Variables that have been coarsely categorized into a small number of ordered categories are often modeled as outcome variables in psychological research. The authors employ a Monte Carlo study to investigate the effects of this coarse categorization of dependent variables on power to detect true effects using three classes of regression models: ordinary least squares (OLS) regression, ordinal logistic regression, and ordinal probit regression. Both the loss of power and the increase in required sample size to regain the lost power are estimated. The loss of power and required sample size increase were substantial under conditions in which the coarsely categorized variable is highly skewed, has few categories (e.g., 2, 3), or both. Ordinal logistic and ordinal probit regression protect marginally better against power loss than does OLS regression.

[1]  David L Streiner,et al.  Breaking up is Hard to Do: The Heartbreak of Dichotomizing Continuous Data , 2002, Canadian journal of psychiatry. Revue canadienne de psychiatrie.

[2]  F. Hsieh,et al.  Sample size tables for logistic regression. , 1989, Statistics in medicine.

[3]  Jacob Cohen Statistical Power Analysis for the Behavioral Sciences , 1969, The SAGE Encyclopedia of Research Design.

[4]  M. Kendall,et al.  Kendall's advanced theory of statistics , 1995 .

[5]  Jacob Cohen,et al.  Applied multiple regression/correlation analysis for the behavioral sciences , 1979 .

[6]  S. Maxwell Sample size and multiple regression analysis. , 2000, Psychological methods.

[7]  W. Hauck,et al.  Wald's Test as Applied to Hypotheses in Logit Analysis , 1977 .

[8]  K. Bollen,et al.  Pearson's R and Coarsely Categorized Measures , 1981 .

[9]  B. Muthén BEYOND SEM: GENERAL LATENT VARIABLE MODELING , 2002 .

[10]  J. A. Calvin Regression Models for Categorical and Limited Dependent Variables , 1998 .

[11]  E. Krieg Biases Induced by Coarse Measurement Scales , 1999 .

[12]  Jacob Cohen Multiple regression as a general data-analytic system. , 1968 .

[13]  J. Vermunt Latent Class Models , 2004 .

[14]  James Jaccard,et al.  Measurement error in the analysis of interaction effects between continuous predictors using multiple regression: Multiple indicator and structural equation approaches. , 1995 .

[15]  Jacob Cohen The Cost of Dichotomization , 1983 .

[16]  B. Muthén,et al.  A comparison of some methodologies for the factor analysis of non‐normal Likert variables , 1985 .

[17]  David W. Hosmer,et al.  Applied Logistic Regression , 1991 .

[18]  Kristopher J Preacher,et al.  On the practice of dichotomization of quantitative variables. , 2002, Psychological methods.