Testing high dimensional covariance matrices via posterior Bayes factor

Abstract With the advent of the era of big data, high dimensional covariance matrices are increasingly encountered and testing covariance structure has become an active area in contemporary statistical inference. Conventional testing methods fail when addressing high dimensional data due to the singularity of the sample covariance matrices. In this paper, we propose a novel test for the prominent identity test and sphericity test based on posterior Bayes factor. For general population model with finite fourth order moment, the limiting null distribution of the test statistic is obtained. Furthermore, we derive the asymptotic power function when the sample size and dimension are proportional against spiked alternatives. When the dimension is much larger than the sample size, under general alternatives, the limiting alternative distribution together with the consistency of the new test is also obtained. Monte Carlo simulation results show that the limiting approximation is quite accurate under the null for finite sample, and the proposed test outperforms some well-known tests in the literature in terms of Type I error rate and the empirical power.

[1]  Tatsuya Kubokawa,et al.  Tests for covariance matrices in high dimension with less sample size , 2014, J. Multivar. Anal..

[2]  Jianfeng Yao,et al.  Testing the Sphericity of a covariance matrix when the dimension is much larger than the sample size , 2015, 1508.02498.

[3]  Thomas J. Fisher On testing for an identity covariance matrix when the dimensionality equals or exceeds the sample size , 2012 .

[4]  Z. Bai,et al.  CLT for linear spectral statistics of large dimensional sample covariance matrices with dependent data , 2017, Statistical Papers.

[5]  G. Pan,et al.  Central limit theorem for signal-to-interference ratio of reduced rank linear receiver , 2008, 0806.2768.

[6]  Jianfeng Yao,et al.  Joint Central Limit Theorem for Eigenvalue Statistics from Several Dependent Large Dimensional Sample Covariance Matrices with Application , 2018 .

[7]  Shurong Zheng,et al.  Substitution principle for CLT of linear spectral statistics of high-dimensional sample covariance matrices with applications to hypothesis testing , 2014, 1404.6633.

[8]  M. Aitkin Posterior Bayes Factors , 1991 .

[9]  Weiming Li,et al.  A robust test for sphericity of high-dimensional covariance matrices , 2015, J. Multivar. Anal..

[10]  H. Nagao,et al.  On Some Test Criteria for Covariance Matrix , 1973 .

[11]  U. Alon,et al.  Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Jian-Feng Yao,et al.  A note on the CLT of the LSS for sample covariance matrix from a spiked population model , 2013, J. Multivar. Anal..

[13]  Thomas Verdebout,et al.  On High-Dimensional Sign Tests , 2016 .

[14]  J. W. Silverstein,et al.  Eigenvalues of large sample covariance matrices of spiked population models , 2004, math/0408165.

[15]  S. John Some optimal multivariate tests , 1971 .

[16]  Xiaoqian Sun,et al.  A new test for sphericity of the covariance matrix for high dimensional data , 2010, J. Multivar. Anal..

[17]  Huijun Chen,et al.  A study of two high-dimensional likelihood ratio tests under alternative hypotheses , 2017 .

[18]  Olivier Ledoit,et al.  Some hypothesis tests for the covariance matrix when the dimension is large compared to the sample size , 2002 .

[19]  Z. Bai,et al.  Corrections to LRT on large-dimensional covariance matrix by RMT , 2009, 0902.0552.

[20]  J. Mauchly Significance Test for Sphericity of a Normal $n$-Variate Distribution , 1940 .

[21]  Z. Bai,et al.  Convergence to the Semicircle Law , 1988 .

[22]  Tiefeng Jiang,et al.  Likelihood ratio tests for covariance matrices of high-dimensional normal distributions , 2012 .

[23]  I. Johnstone On the distribution of the largest eigenvalue in principal components analysis , 2001 .

[24]  Song-xi Chen,et al.  Tests for High-Dimensional Covariance Matrices , 2010, Random Matrices: Theory and Applications.

[25]  G. Pan,et al.  CLT for linear spectral statistics of normalized sample covariance matrices with the dimension much larger than the sample size , 2015, 1506.00458.

[26]  Jianfeng Yao,et al.  On the sphericity test with large-dimensional observations , 2013, 1303.4035.

[27]  Zhi Liu,et al.  High-dimensional covariance matrices in elliptical distributions with application to spherical test , 2018, The Annals of Statistics.

[28]  M. Srivastava Some Tests Concerning the Covariance Matrix in High Dimensional Data , 2005 .