A Comparison of Bayesian Models of Heteroscedasticity in Nested Normal Data

We consider the fitting of a Bayesian model to grouped data in which observations are assumed normally distributed around group means that are themselves normally distributed, and consider several alternatives for accommodating the possibility of heteroscedasticity within the data. We consider the case where the underlying distribution of the variances is unknown, and investigate several candidate prior distributions for those variances. In each case, the parameters of the candidate priors (the hyperparameters) are themselves given uninformative priors (hyperpriors). The most mathematically convenient model for the group variances is to assign them inverse gamma distributed priors, the inverse gamma distribution being the conjugate prior distribution for the unknown variance of a normal population. We demonstrate that for a wide class of underlying distributions of the group variances, a model that assigns the variances an inverse gamma-distributed prior displays favorable goodness-of-fit properties relative to other candidate priors, and hence may be used as standard for modeling such data. This allows us to take advantage of the elegant mathematical property of prior conjugacy in a wide variety of contexts without compromising model fitness. We test our findings on nine real world publicly available datasets from different domains, and on a wide range of artificially generated datasets.

[1]  D. Bates,et al.  Mixed-Effects Models in S and S-PLUS , 2001 .

[2]  Viktor Witkovský,et al.  Between group variance component interval estimation for the unbalanced heteroscedastic one-way random effects model , 2003 .

[3]  Alain Dussauchoy,et al.  Parameter estimation of the generalized gamma distribution , 2008, Math. Comput. Simul..

[4]  Kun Luo,et al.  Inferences on the Among-Group Variance Component in Unbalanced Heteroscedastic One-Fold Nested Design , 2012, Commun. Stat. Simul. Comput..

[5]  P. Gustafson,et al.  Conservative prior distributions for variance parameters in hierarchical models , 2006 .

[6]  D. Rubin Bayesianly Justifiable and Relevant Frequency Calculations for the Applied Statistician , 1984 .

[7]  K. Krishnamoorthy,et al.  A parametric bootstrap approach for ANOVA with unequal variances: Fixed and random models , 2007, Comput. Stat. Data Anal..

[8]  Marie Davidian,et al.  Nonlinear Models for Repeated Measurement Data , 1995 .

[9]  Jeroen K. Vermunt,et al.  Growth models for categorical response variables : standard, latent-class, and hybrid approaches. , 2007 .

[10]  Julia Volaufova,et al.  Heteroscedastic ANOVA: old p values, new views , 2009 .

[11]  S. M. Sadooghi-Alvandi,et al.  One-way ANOVA with Unequal Variances , 2012 .

[12]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[13]  B. L. Welch THE SIGNIFICANCE OF THE DIFFERENCE BETWEEN TWO MEANS WHEN THE POPULATION VARIANCES ARE UNEQUAL , 1938 .

[14]  Noriaki Yanagi Exact Analysis of Variance with Unequal Variances , 1980 .

[15]  Eric R. Ziegel,et al.  Data: A Collection of Problems From Many Fields for the Student and Research Worker , 1987 .

[16]  M. Stephens EDF Statistics for Goodness of Fit and Some Comparisons , 1974 .

[17]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  T. W. Anderson On the Distribution of the Two-Sample Cramer-von Mises Criterion , 1962 .

[19]  Jessica Gurevitch,et al.  STATISTICAL ISSUES IN ECOLOGICAL META‐ANALYSES , 1999 .

[20]  Edward J. Dudewicz,et al.  Exact solutions to the Behrens–Fisher Problem: Asymptotically optimal and finite sample efficient choice among , 2007 .

[21]  Xiao-Li Meng,et al.  POSTERIOR PREDICTIVE ASSESSMENT OF MODEL FITNESS VIA REALIZED DISCREPANCIES , 1996 .

[22]  N. Smirnov Table for Estimating the Goodness of Fit of Empirical Distributions , 1948 .

[23]  Chul H. Ahn,et al.  Modified ANOVA for Unequal Variances , 2003 .

[24]  P. Diggle Analysis of Longitudinal Data , 1995 .

[25]  R. Fisher THE FIDUCIAL ARGUMENT IN STATISTICAL INFERENCE , 1935 .

[26]  Murali Chandrashekaran,et al.  Meta-Analysis with Heteroscedastic Effects , 1993 .

[27]  A. Rukhin,et al.  Maximum Likelihood Analysis for Heteroscedastic One‐Way Random Effects ANOVA in Interlaboratory Studies , 1999, Biometrics.

[28]  E. Kulinskaya,et al.  A Welch‐type test for homogeneity of contrasts under heteroscedasticity with application to meta‐analysis , 2004, Statistics in medicine.

[29]  Max A. Little,et al.  Accurate telemonitoring of Parkinson’s disease progression by non-invasive speech tests , 2009 .

[30]  Randall K Ten Haken,et al.  An application of Bayesian statistical methods to adaptive radiotherapy , 2006, Physics in medicine and biology.

[31]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[32]  A. Gelman Prior distributions for variance parameters in hierarchical models (comment on article by Browne and Draper) , 2004 .

[33]  Edward J. Dudewicz,et al.  New exact and asymptotically optimal heteroscedastic statistical procedures and tables, II , 1999 .

[34]  E. Růžička,et al.  Quantitative acoustic measurements for characterization of speech and voice disorders in early untreated Parkinson's disease. , 2011, The Journal of the Acoustical Society of America.

[35]  E. Dudewicz,et al.  Exact Analysis of Variance with Unequal Variances: Test Procedures and Tables , 1978 .

[36]  Welch Bl THE GENERALIZATION OF ‘STUDENT'S’ PROBLEM WHEN SEVERAL DIFFERENT POPULATION VARLANCES ARE INVOLVED , 1947 .

[37]  Edward J. Dudewicz,et al.  New exact and asymptotically optimal solution to the Behrens-Fisher problem, with tables , 1998 .

[38]  Max A. Little,et al.  Accurate Telemonitoring of Parkinson's Disease Progression by Noninvasive Speech Tests , 2009, IEEE Transactions on Biomedical Engineering.

[39]  Sam Weerahandi ANOVA under Unequal Error Variances , 1995 .

[40]  G. Casella,et al.  Explaining the Gibbs Sampler , 1992 .

[41]  I. Guttman The Use of the Concept of a Future Observation in Goodness‐Of‐Fit Problems , 1967 .

[42]  Scott M. Lynch,et al.  Bayesian Posterior Predictive Checks for Complex Models , 2004 .

[43]  W. G. Cochran Problems arising in the analysis of a series of similar experiments , 1937 .

[44]  Malay Ghosh,et al.  The behrens‐fisher problem revisited: A bayes‐frequentist synthesis , 2001 .

[45]  S. L. Andersen,et al.  Permutation Theory in the Derivation of Robust Criteria and the Study of Departures from Assumption , 1955 .

[46]  L. Skovgaard NONLINEAR MODELS FOR REPEATED MEASUREMENT DATA. , 1996 .

[47]  John K Kruschke,et al.  Bayesian data analysis. , 2010, Wiley interdisciplinary reviews. Cognitive science.