Two‐sample test of high dimensional means under dependence

The paper considers in the high dimensional setting a canonical testing problem in multivariate analysis, namely testing the equality of two mean vectors. We introduce a new test statistic that is based on a linear transformation of the data by the precision matrix which incorporates the correlations between the variables. The limiting null distribution of the test statistic and the power of the test are analysed. It is shown that the test is particularly powerful against sparse alternatives and enjoys certain optimality. A simulation study is carried out to examine the numerical performance of the test and to compare it with other tests given in the literature. The results show that the test proposed significantly outperforms those tests in a range of settings.

[1]  T. W. Anderson An Introduction to Multivariate Statistical Analysis , 1959 .

[2]  S. Berman A Law of Large Numbers for the Maximum in a Stationary Gaussian Sequence , 1962 .

[3]  S. Berman Limit Theorems for the Maximum Term in Stationary Sequences , 1964 .

[4]  T. W. Anderson An Introduction to Multivariate Statistical Analysis, 2nd Edition. , 1985 .

[5]  T. W. Anderson,et al.  An Introduction to Multivariate Statistical Analysis , 1959 .

[6]  A. Zaitsev,et al.  On the gaussian approximation of convolutions under multidimensional analogues of S.N. Bernstein's inequality conditions , 1987 .

[7]  P. Hall On convergence rates of suprema , 1991 .

[8]  K. Worsley,et al.  THE DETECTION OF LOCAL SHAPE CHANGES VIA THE GEOMETRY OF HOTELLING’S T 2 FIELDS 1 , 1999 .

[9]  Z. Bai,et al.  EFFECT OF HIGH DIMENSION: BY AN EXAMPLE OF A TWO SAMPLE PROBLEM , 1999 .

[10]  G. Zhang,et al.  Application of adaptive time-frequency decomposition in ultrasonic NDE of highly-scattering materials. , 2000, Ultrasonics.

[11]  B. Clymer,et al.  Texture detection of simulated microcalcification susceptibility effects in magnetic resonance imaging of breasts , 2001, Journal of magnetic resonance imaging : JMRI.

[12]  Y. Baraud Non-asymptotic minimax rates of testing in signal detection , 2002 .

[13]  M. Yuan,et al.  Model selection and estimation in the Gaussian graphical model , 2007 .

[14]  P. Hall,et al.  PROPERTIES OF HIGHER CRITICISM UNDER STRONG DEPENDENCE , 2008, 0803.2095.

[15]  M. Srivastava,et al.  A test for the mean vector with fewer observations than the dimension , 2008 .

[16]  Bin Yu,et al.  Model Selection in Gaussian Graphical Models: High-Dimensional Consistency of boldmathell_1-regularized MLE , 2008, NIPS 2008.

[17]  Q. Shao,et al.  THE ASYMPTOTIC DISTRIBUTION AND BERRY-ESSEEN BOUND OF A NEW TEST FOR INDEPENDENCE IN HIGH DIMENSION WITH AN APPLICATION TO STOCHASTIC OPTIMIZATION , 2008, 0901.2468.

[18]  Martin J. Wainwright,et al.  Model Selection in Gaussian Graphical Models: High-Dimensional Consistency of l1-regularized MLE , 2008, NIPS.

[19]  Adam J. Rothman,et al.  Sparse permutation invariant covariance estimation , 2008, 0801.4837.

[20]  K. Worsley,et al.  Random fields of multivariate test statistics, with applications to shape analysis , 2008, 0803.1708.

[21]  B. Laurent,et al.  Non asymptotic minimax rates of testing in signal detection with heterogeneous variances , 2009, 0912.2423.

[22]  Muni S. Srivastava,et al.  A test for the mean vector with fewer observations than the dimension under non-normality , 2009, J. Multivar. Anal..

[23]  P. Bickel,et al.  Covariance regularization by thresholding , 2009, 0901.3079.

[24]  Harrison H. Zhou,et al.  Optimal rates of convergence for covariance matrix estimation , 2010, 1010.3866.

[25]  Ming Yuan,et al.  High Dimensional Inverse Covariance Matrix Estimation via Linear Programming , 2010, J. Mach. Learn. Res..

[26]  P. Hall,et al.  Innovated Higher Criticism for Detecting Sparse Signals in Correlated Noise , 2009, 0902.3837.

[27]  Song-xi Chen,et al.  A two-sample test for high-dimensional data with applications to gene-set testing , 2010, 1002.4547.

[28]  Yuxin Chen Model Selection in Gaussian Graphical Models from High-Dimensional Missing Data , 2011 .

[29]  Weidong Liu,et al.  Adaptive Thresholding for Sparse Covariance Matrix Estimation , 2011, 1102.2237.

[30]  E. Candès,et al.  Global testing under sparse alternatives: ANOVA, multiple comparisons and the higher criticism , 2010, 1007.1434.

[31]  T. Cai,et al.  A Constrained ℓ1 Minimization Approach to Sparse Precision Matrix Estimation , 2011, 1102.2233.

[32]  Harrison H. Zhou,et al.  MINIMAX ESTIMATION OF LARGE COVARIANCE MATRICES UNDER ℓ1-NORM , 2012 .

[33]  M. Yuan,et al.  Adaptive covariance matrix estimation through block thresholding , 2012, 1211.0459.

[34]  B. Nadler,et al.  HIGH DIMENSIONAL SPARSE COVARIANCE ESTIMATION : ACCURATE THRESHOLDS FOR THE MAXIMAL DIAGONAL ENTRY AND FOR THE LARGEST CORRELATION COEFFICIENT , 2012 .

[35]  T. Cai,et al.  Two-Sample Covariance Matrix Testing and Support Recovery in High-Dimensional and Sparse Settings , 2013 .

[36]  John,et al.  Instantaneous spectral analysis : Detection of low-frequency shadows associated with hydrocarbons , 2022 .