The 'improved' brown and forsythe test for mean equality: some things can't be fixed

Mehrotra (1997) presented an ‘;improved’ Brown and Forsythe (1974) statistic which is designed to provide a valid test of mean equality in independent groups designs when variances are heterogeneous. In particular, the usual Brown and Fosythe procedure was modified by using a Satterthwaite approximation for numerator degrees of freedom instead of the usual value of number of groups minus one. Mehrotra then, through Monte Carlo methods, demonstrated that the ‘improved’ method resulted in a robust test of significance in cases where the usual Brown and Forsythe method did not. Accordingly, this ‘improved’ procedure was recommended. We show that under conditions likely to be encountered in applied settings, that is, conditions involving heterogeneous variances as well as nonnormal data, the ‘improved’ Brown and Forsythe procedure results in depressed or inflated rates of Type I error in unbalanced designs. Previous findings indicate, however, that one can obtain a robust test by adopting a heteroscedastic st...

[1]  B. L. Welch ON THE COMPARISON OF SEVERAL MEAN VALUES: AN ALTERNATIVE APPROACH , 1951 .

[2]  G. S. James THE COMPARISON OF SEVERAL GROUPS OF OBSERVATIONS WHEN THE RATIOS OF THE POPULATION VARIANCES ARE UNKNOWN , 1951 .

[3]  P. Hall On the Bootstrap and Confidence Intervals , 1986 .

[4]  P. Hall On the Number of Bootstrap Simulations Required to Construct a Confidence Interval , 1986 .

[5]  Rand R. Wilcox,et al.  New monte carlo results on the robustness of the anova f, w and f statistics , 1986 .

[6]  T. Micceri The unicorn, the normal curve, and other improbable creatures. , 1989 .

[7]  T. C. Oshima,et al.  Type I error rates for James's second-order test and Wilcox's Hm test under heteroscedasticity and non-normality , 1992 .

[8]  J. Booth,et al.  Resampling-Based Multiple Testing. , 1994 .

[9]  Ralph A. Alexander,et al.  A New and Simpler Approximation for ANOVA Under Variance Heterogeneity , 1994 .

[10]  Rand R. Wilcox,et al.  A one-way random effects model for trimmed means , 1994 .

[11]  Rand R. Wilcox,et al.  Some Results on the Tukey-Mclaughlin and Yuen Methods for Trimmed Means when Distributions are Skewed , 1994 .

[12]  Rand R. Wilcox,et al.  ANOVA: The practical importance of heteroscedastic methods, using trimmed means versus means, and designing simulation studies , 1995 .

[13]  R. Wilcox ANOVA: A Paradigm for Low Power and Misleading Measures of Effect Size? , 1995 .

[14]  J. Algina,et al.  Univariate and Multivariate Omnibus Hypothesis Tests Selected to Control Type I Error Rates When Population Variances Are Not Necessarily Equal , 1996 .

[15]  R. Wilcox Introduction to Robust Estimation and Hypothesis Testing , 1997 .

[16]  D. Mehrotra Improving the brown-forsythe solution to the generalized behrens-fisher problem , 1997 .

[17]  Carl J. Huberty,et al.  Statistical Practices of Educational Researchers: An Analysis of their ANOVA, MANOVA, and ANCOVA Analyses , 1998 .

[18]  Lisa M. Lix,et al.  To Trim or Not to Trim: Tests of Location Equality Under Heteroscedasticityand Nonnormality , 1998 .

[19]  Rand R. Wilcox,et al.  Can tests for treatment group equality be improved?: The bootstrap and trimmed means conjecture , 1998 .