Assessing the pattern of covariance matrices via an augmentation multiple testing procedure

This paper extends the scedasticity comparison among several groups of observations, usually complying with the homoscedastic and the heteroscedastic cases, in order to deal with data sets laying in an intermediate situation. As is well known, homoscedasticity corresponds to equality in orientation, shape and size of the group scatters. Here our attention is focused on two weaker requirements: scatters with the same orientation, but with different shape and size, or scatters with the same shape and size but different orientation. We introduce a multiple testing procedure that takes into account each of the above conditions. This approach discloses a richer information on the data underlying structure than the classical method only based on homo/heteroscedasticity. At the same time, it allows a more parsimonious parametrization, whenever the patterned model is appropriate to describe the real data. The new inferential methodology is then applied to some well-known data sets, chosen in the multivariate literature, to show the real gain in using this more informative approach. Finally, a wide simulation study illustrates and compares the performance of the proposal using data sets with gradual departure from homoscedasticity.

[1]  S. S. Young,et al.  Resampling-Based Multiple Testing: Examples and Methods for p-Value Adjustment , 1993 .

[2]  Donald B. Rubin,et al.  Ensemble-Adjusted p Values , 1983 .

[3]  Salvatore Ingrassia,et al.  Constrained monotone EM algorithms for mixtures of multivariate t distributions , 2010, Stat. Comput..

[4]  David E. Booth,et al.  Multivariate statistical inference and applications , 1997 .

[5]  L. Wasserman,et al.  Exceedance Control of the False Discovery Proportion , 2006 .

[6]  Gérard Govaert,et al.  Gaussian parsimonious clustering models , 1995, Pattern Recognit..

[7]  S. P. Wright,et al.  Adjusted P-values for simultaneous inference , 1992 .

[8]  Brian D. Ripley,et al.  Pattern Recognition and Neural Networks , 1996 .

[9]  N. Campbell,et al.  A multivariate study of variation in two species of rock crab of the genus Leptograpsus , 1974 .

[10]  Alessio Farcomeni,et al.  A review of modern multiple hypothesis testing, with particular attention to the false discovery proportion , 2008, Statistical methods in medical research.

[11]  W. Berchtold,et al.  Angewandte multivariate Statistik. , 1984 .

[12]  P. Jolicoeur,et al.  Size and shape variation in the painted turtle. A principal component analysis. , 1960, Growth.

[13]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[14]  S. Holm A Simple Sequentially Rejective Multiple Test Procedure , 1979 .

[15]  Salvatore Ingrassia,et al.  Constrained monotone EM algorithms for finite mixture of multivariate Gaussians , 2007, Comput. Stat. Data Anal..

[16]  A. Tamhane,et al.  Multiple Comparison Procedures , 2009 .

[17]  B. Holland,et al.  An Improved Sequentially Rejective Bonferroni Test Procedure , 1987 .

[18]  Douglas M. Hawkins,et al.  A new test for multivariate normality and homoscedasticity , 1981 .

[19]  A. Tamhane,et al.  Multiple Comparison Procedures , 1989 .

[20]  J. Goeman,et al.  The Inheritance Procedure: Multiple Testing of Tree-structured Hypotheses , 2012, Statistical applications in genetics and molecular biology.

[21]  K. Gabriel,et al.  SIMULTANEOUS TEST PROCEDURES-SOME THEORY OF MULTIPLE COMPARISONS' , 1969 .

[22]  J. Goeman,et al.  The Sequential Rejection Principle of Familywise Error Control , 2010, 1211.3313.

[23]  B. Flury Common Principal Components in k Groups , 1984 .

[24]  Geoffrey J. McLachlan,et al.  Finite Mixture Models , 2019, Annual Review of Statistics and Its Application.

[25]  S. Dudoit,et al.  Multiple Testing Procedures with Applications to Genomics , 2007 .

[26]  O. Guilbaud,et al.  A recycling framework for the construction of Bonferroni‐based multiple tests , 2009, Statistics in medicine.

[27]  W. Brannath,et al.  A graphical approach to sequentially rejective multiple test procedures , 2009, Statistics in medicine.

[28]  Yogendra P. Chaubey Resampling-Based Multiple Testing: Examples and Methods for p-Value Adjustment , 1993 .

[29]  M. J. van der Laan,et al.  Augmentation Procedures for Control of the Generalized Family-Wise Error Rate and Tail Probabilities for the Proportion of False Positives , 2004, Statistical applications in genetics and molecular biology.

[30]  Salvatore Ingrassia,et al.  Weakly Homoscedastic Constraints for Mixtures of t-Distributions , 2008, GfKl.

[31]  M. Bartlett Properties of Sufficiency and Statistical Tests , 1992 .

[32]  Y. Benjamini Discovering the false discovery rate , 2010 .

[33]  K. Gabriel,et al.  On closed testing procedures with special reference to ordered analysis of variance , 1976 .

[34]  J. Shaffer Multiple Hypothesis Testing , 1995 .

[35]  Adrian E. Raftery,et al.  Fitting straight lines to point patterns , 1984, Pattern Recognit..

[36]  A. Raftery,et al.  Model-based Gaussian and non-Gaussian clustering , 1993 .

[37]  B. Flury Common Principal Components and Related Multivariate Models , 1988 .

[38]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[39]  W. Gautschi,et al.  An algorithm for simultaneous orthogonal transformation of several positive definite symmetric matrices to nearly diagonal form , 1986 .

[40]  Geoffrey J. McLachlan,et al.  Robust mixture modelling using the t distribution , 2000, Stat. Comput..

[41]  G. Constantine,et al.  The F‐G Diagonalization Algorithm , 1985 .

[42]  Tony Springall Common Principal Components and Related Multivariate Models , 1991 .

[43]  David J. Sheskin,et al.  Handbook of Parametric and Nonparametric Statistical Procedures , 1997 .

[44]  K. V. Mardia,et al.  Mardia's Test of Multinormality , 2004 .