Note on Mean Vector Testing for High-Dimensional Dependent Observations

When testing for the mean vector in a high-dimensional setting, it is generally assumed that the observations are independently and identically distributed. However if the data are dependent, the existing test procedures fail to preserve type I error at a given nominal significance level. We propose a new test for the mean vector when the dimension increases linearly with sample size and the data is a realization of an M-dependent stationary process. The order M is also allowed to increase with the sample size. Asymptotic normality of the test statistic is derived by extending the Central Limit Theorem for M-dependent processes using two-dimensional triangular arrays. The cost of ignoring dependence among observations is assessed in finite samples through simulations.

[1]  Weidong Liu,et al.  Two‐sample test of high dimensional means under dependence , 2014 .

[2]  Muni S. Srivastava,et al.  A two sample test in high dimensional data , 2013, Journal of Multivariate Analysis.

[3]  Song-xi Chen,et al.  A two-sample test for high-dimensional data with applications to gene-set testing , 2010, 1002.4547.

[4]  Z. Bai,et al.  EFFECT OF HIGH DIMENSION: BY AN EXAMPLE OF A TWO SAMPLE PROBLEM , 1999 .

[5]  M. Srivastava,et al.  A test for the mean vector with fewer observations than the dimension , 2008 .

[6]  Ron Reeder,et al.  Estimation of the mean of functional time series and a two‐sample problem , 2011, 1105.0019.

[7]  I. Olkin,et al.  Inequalities: Theory of Majorization and Its Applications , 1980 .

[8]  H. Robbins,et al.  The Central Limit Theorem for Dependent Random Variables , 1948 .

[9]  Muni S. Srivastava,et al.  A test for the mean vector with fewer observations than the dimension under non-normality , 2009, J. Multivar. Anal..

[10]  J. Magnus The moments of products of quadratic forms in normal variables , 1978 .

[11]  Junyong Park,et al.  A test for the mean vector in large dimension and small samples , 2013 .

[12]  J. Steinebach,et al.  Testing for Changes in Multivariate Dependent Observations with an Application to Temperature Changes , 1999 .

[13]  Pearlly Yan,et al.  Statistical methods for detecting differentially methylated regions based on MethylCap-seq data , 2016, Briefings Bioinform..

[14]  A. Dempster A HIGH DIMENSIONAL TWO SAMPLE SIGNIFICANCE TEST , 1958 .

[15]  P. H. Diananda The central limit theorem for m-dependent variables , 1955, Mathematical Proceedings of the Cambridge Philosophical Society.

[16]  Ping-Shou Zhong,et al.  Tests alternative to higher criticism for high-dimensional means under sparsity and column-wise dependence , 2013, 1312.5103.

[17]  P. Hall,et al.  Martingale Limit Theory and its Application. , 1984 .

[18]  Andrej Yu. Garnaev,et al.  On widths of the Euclidean Ball , 1984 .

[19]  Matthew Reimherr,et al.  Two sample inference in functional linear models , 2009 .

[20]  Joseph P. Romano,et al.  A more general central limit theorem for m-dependent random variables with unbounded m , 2000 .

[21]  K. Berk A Central Limit Theorem for $m$-Dependent Random Variables with Unbounded $m$ , 1973 .