A generalized likelihood ratio test for normal mean when p is greater than n

The problem of testing the population mean vector of high-dimensional multivariate data is considered. Inspired by Roy's union-intersection test, a generalized high-dimensional likelihood ratio test for the normal population mean vector is proposed. The p -value for the test is obtained by using randomization method, which does not rely on assumptions about the structure of the covariance matrix. An interpretation of the new statistic is given, which does not rely on the normality assumption. Hence the proposed test is also available for non-normal multivariate population. Simulation studies show that the new test offers higher power than other two competing tests when the variables are dependent and performs particularly well for non-normal multivariate population.

[1]  Jun Zhu,et al.  Shrinkage-based regularization tests for high-dimensional data with application to gene set analysis , 2011, Comput. Stat. Data Anal..

[2]  Kshitij Khare,et al.  Wishart distributions for decomposable covariance graph models , 2011, 1103.1768.

[3]  M. Pourahmadi,et al.  BANDING SAMPLE AUTOCOVARIANCE MATRICES OF STATIONARY PROCESSES , 2009 .

[4]  R. Tibshirani,et al.  Sparse inverse covariance estimation with the graphical lasso. , 2008, Biostatistics.

[5]  Carlos M. Carvalho,et al.  FLEXIBLE COVARIANCE ESTIMATION IN GRAPHICAL GAUSSIAN MODELS , 2008, 0901.3267.

[6]  Fan Yang,et al.  Likelihood Ratio Tests for High‐Dimensional Normal Distributions , 2013, 1306.0254.

[7]  A. Dempster A HIGH DIMENSIONAL TWO SAMPLE SIGNIFICANCE TEST , 1958 .

[8]  Muni S. Srivastava,et al.  Asymptotic distributions of some test criteria for the mean vector with fewer observations than the dimension , 2013, J. Multivar. Anal..

[9]  Måns Thulin,et al.  A high-dimensional two-sample test for the mean using random subspaces , 2013, Comput. Stat. Data Anal..

[10]  Edgard M. Maboudou-Tchao,et al.  Tests for mean vectors in high dimension , 2013, Stat. Anal. Data Min..

[11]  Y. Fujikoshi,et al.  Asymptotic Results of a High Dimensional MANOVA Test and Power Comparison When the Dimension is Large Compared to the Sample Size , 2004 .

[12]  Jianqing Fan,et al.  High dimensional covariance matrix estimation using a factor model , 2007, math/0701124.

[13]  Stanley P. Azen,et al.  Computational Statistics and Data Analysis (CSDA) , 2006 .

[14]  James R. Schott,et al.  Some high-dimensional tests for a one-way MANOVA , 2007 .

[15]  Pan Central limit theorem for signal-to-interference ratio of reduced rank linear receiver , 2008 .

[16]  A. Tsybakov,et al.  Estimation of high-dimensional low-rank matrices , 2009, 0912.5338.

[17]  J. Marron,et al.  The maximal data piling direction for discrimination , 2010 .

[18]  Weidong Liu,et al.  Two‐sample test of high dimensional means under dependence , 2014 .

[19]  E. Lehmann Testing Statistical Hypotheses , 1960 .

[20]  S. N. Roy On a Heuristic Method of Test Construction and its use in Multivariate Analysis , 1953 .

[21]  A. Cohen,et al.  Projected Tests for Order Restricted Alternatives , 1994 .

[22]  M. Srivastava Multivariate Theory for Analyzing High Dimensional Data , 2007 .

[23]  T. W. Anderson,et al.  An Introduction to Multivariate Statistical Analysis , 1959 .

[24]  Olivier Ledoit,et al.  Some hypothesis tests for the covariance matrix when the dimension is large compared to the sample size , 2002 .

[25]  P. Bickel,et al.  Regularized estimation of large covariance matrices , 2008, 0803.1909.

[26]  J. W. Silverstein,et al.  COVARIANCE MATRICES , 2022 .

[27]  Muni S. Srivastava,et al.  A two sample test in high dimensional data , 2013, Journal of Multivariate Analysis.

[28]  Song-xi Chen,et al.  A two-sample test for high-dimensional data with applications to gene-set testing , 2010, 1002.4547.

[29]  Pranab Kumar Sen,et al.  Two-Stage Likelihood Ratio and Union-Intersection Tests for One-Sided Alternatives Multivariate Mean with Nuisance Dispersion Matrix , 1999 .

[30]  Z. Bai,et al.  EFFECT OF HIGH DIMENSION: BY AN EXAMPLE OF A TWO SAMPLE PROBLEM , 1999 .

[31]  M. Srivastava,et al.  A test for the mean vector with fewer observations than the dimension , 2008 .

[32]  Marcelo J. Moreira,et al.  Asymptotic power of sphericity tests for high-dimensional data , 2013, 1306.4867.

[33]  R. Muirhead Aspects of Multivariate Statistical Theory , 1982, Wiley Series in Probability and Statistics.

[34]  Debashis Paul,et al.  A Regularized Hotelling’s T2 Test for Pathway Analysis in Proteomic Studies , 2011, Journal of the American Statistical Association.

[35]  Bin Yu,et al.  High-dimensional covariance estimation by minimizing ℓ1-penalized log-determinant divergence , 2008, 0811.3628.

[36]  S. Kullback,et al.  Some Aspects of Multivariate Analysis. , 1958 .

[37]  Jianqing Fan,et al.  High Dimensional Covariance Matrix Estimation in Approximate Factor Models , 2011, Annals of statistics.

[38]  T. Bengtsson,et al.  Estimation of high-dimensional prior and posterior covariance matrices in Kalman filter variants , 2007 .

[39]  J. L. Tomsky,et al.  A NEW CLASS OF MULTIVARIATE TESTS BASED ON THE UNION-INTERSECTION PRINCIPLE' , 1981 .

[40]  Ping-Shou Zhong,et al.  Tests alternative to higher criticism for high-dimensional means under sparsity and column-wise dependence , 2013, 1312.5103.

[41]  A. Dempster A significance test for the separation of two highly multivariate small samples , 1960 .

[42]  Runze Li,et al.  Statistical Challenges with High Dimensionality: Feature Selection in Knowledge Discovery , 2006, math/0602133.

[43]  Joseph P. Romano,et al.  Testing Statistical Hypotheses, Third Edition , 2008, Springer texts in statistics.

[44]  Jianqing Fan,et al.  Large covariance estimation by thresholding principal orthogonal complements , 2011, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[45]  J. Marron,et al.  The high-dimension, low-sample-size geometric representation holds under mild conditions , 2007 .

[46]  Jiajuan Liang,et al.  Generalized F-tests for the multivariate normal mean , 2009, Comput. Stat. Data Anal..

[47]  Michael L. Davidson,et al.  A note on the Union-Intersection character of some MANOVA procedures , 1974 .

[48]  Jianhua Z. Huang,et al.  Covariance matrix selection and estimation via penalised normal likelihood , 2006 .

[49]  Azeem M. Shaikh,et al.  Hypothesis Testing in Econometrics , 2009 .

[50]  Harrison H. Zhou,et al.  MINIMAX ESTIMATION OF LARGE COVARIANCE MATRICES UNDER ℓ1-NORM , 2012 .