A Revisit to the Behrens–Fisher Problem: Comparison of Five Test Methods

We revisit the well-known Behrens–Fisher problem and apply a newly developed ‘Computational Approach Test’ (CAT) to test the equality of two population means where the populations are assumed to be normal with unknown and possibly unequal variances. An advantage of the CAT is that it does not require the explicit knowledge of the sampling distribution of the test statistic. The CAT is then compared with three widely accepted tests—Welch–Satterthwaite test (WST), Cochran–Cox test (CCT), ‘Generalized p-value’ test (GPT)—and a recently suggested test based on the jackknife procedure, called Singh–Saxena–Srivastava test (SSST). Further, model robustness of these five tests are studied when the data actually came from t-distributions, but wrongly perceived as normal ones. Our detailed study based on a comprehensive simulation indicate some interesting results including the facts that the GPT is quite conservative, and the SSST is not as good as it has been claimed in the literature. To the best of our knowledge, the trends observed in our study have not been reported earlier in the existing literature.

[1]  Welch Bl THE GENERALIZATION OF ‘STUDENT'S’ PROBLEM WHEN SEVERAL DIFFERENT POPULATION VARLANCES ARE INVOLVED , 1947 .

[2]  A. Aspin,et al.  An examination and further development of a formula arising in the problem of comparing two mean values. , 1948, Biometrika.

[3]  R. A. Fisher,et al.  THE ASYMPTOTIC APPROACH TO BEHRENS'S INTEGRAL, WITH FURTHER TABLES FOR THE d TEST OF SIGNIFICANCE , 1941 .

[4]  Thomas E. Wehrly Fitting Statistical Distributions: The Generalized Lambda Distribution and Generalized Bootstrap Methods , 2002, Technometrics.

[5]  Ying Y. Wang,et al.  Probabilities of the Type I Errors of the Welch Tests for the Behrens-Fisher Problem , 1971 .

[6]  Satterthwaite Fe An approximate distribution of estimates of variance components. , 1946 .

[7]  G. Casella,et al.  Explaining the Gibbs Sampler , 1992 .

[8]  Nabendu Pal,et al.  A Computational Approach to Statistical Inferences , 2007 .

[9]  B. Efron The jackknife, the bootstrap, and other resampling plans , 1987 .

[10]  J. Gurland,et al.  Size and Power of Tests for Equality of Means of Two Normal Populations with Unequal Variances , 1975 .

[11]  Debashis Kushary,et al.  Bootstrap Methods and Their Application , 2000, Technometrics.

[12]  Pradeep Singh,et al.  Power Comparisons of Solutions to the Behrens-Fisher Problem , 2002 .

[13]  D. Giles,et al.  PRE-TEST ESTIMATION AND TESTING IN ECONOMETRICS: RECENT DEVELOPMENTS , 1993 .

[14]  Some tests with specified size for the behrens-fisher problem , 1999 .

[15]  Allan S. Cohen,et al.  On the Behrens-Fisher Problem: A Review , 1998 .

[16]  Anthony C. Davison,et al.  Bootstrap Methods and Their Application , 1998 .

[17]  J. S. Urban Hjorth,et al.  Computer Intensive Statistical Methods: Validation, Model Selection, and Bootstrap , 1993 .

[18]  The Specification of Rules for Rejecting Too Variable a Product, with Particular Reference to an Electric Lamp Problem , 1936 .

[19]  Ramon C. Littell,et al.  Exact Statistical Methods for Data Analysis , 2001, Technometrics.

[20]  H. Ruben A SIMPLE CONSERVATIVE AND ROBUST SOLUTION OF THE BEHRENS-FISHER PROBLEM , 2002 .

[21]  Asad Zaman,et al.  Statistical Foundations for Econometric Techniques , 1996 .

[22]  Kam-Wah Tsui,et al.  Generalized p-Values in Significance Testing of Hypotheses in the Presence of Nuisance Parameters , 1989 .

[23]  R. Fisher THE FIDUCIAL ARGUMENT IN STATISTICAL INFERENCE , 1935 .

[24]  S. T. Buckland,et al.  Computer Intensive Statistical Methods: Validation, Model Selection, and Bootstrap , 1993 .

[25]  F. E. Satterthwaite An approximate distribution of estimates of variance components. , 1946, Biometrics.

[26]  A. Baron Experimental Designs , 1990, The Behavior analyst.

[27]  William G. Cochran,et al.  Experimental Designs, 2nd Edition , 1950 .