THE ASYMPTOTIC DISTRIBUTION AND BERRY-ESSEEN BOUND OF A NEW TEST FOR INDEPENDENCE IN HIGH DIMENSION WITH AN APPLICATION TO STOCHASTIC OPTIMIZATION

Let $\mathbf{X}_1,...,\mathbf{X}_n$ be a random sample from a $p$-dimensional population distribution. Assume that $c_1n^{\alpha}\leq p\leq c_2n^{\alpha}$ for some positive constants $c_1,c_2$ and $\alpha$. In this paper we introduce a new statistic for testing independence of the $p$-variates of the population and prove that the limiting distribution is the extreme distribution of type I with a rate of convergence $O((\log n)^{5/2}/\sqrt{n})$. This is much faster than $O(1/\log n)$, a typical convergence rate for this type of extreme distribution. A simulation study and application to stochastic optimization are discussed.

[1]  V. V. Petrov Sums of Independent Random Variables , 1975 .

[2]  P. Hall On the rate of convergence of normal extremes , 1979 .

[3]  J. Hüsler Extremes and related properties of random sequences and processes , 1984 .

[4]  L. Gordon,et al.  Two moments su ce for Poisson approx-imations: the Chen-Stein method , 1989 .

[5]  D. Donoho,et al.  Uncertainty principles and signal recovery , 1989 .

[6]  A. Sakhanenko,et al.  Berry-esseen type estimates for large deviation probabilities , 1991 .

[7]  E. Ziegel Extreme Value Theory and Applications , 1994 .

[8]  David L. Donoho,et al.  Aide-Memoire . High-Dimensional Data Analysis : The Curses and Blessings of Dimensionality , 2000 .

[9]  Gregory Piatetsky-Shapiro,et al.  High-Dimensional Data Analysis: The Curses and Blessings of Dimensionality , 2000 .

[10]  I. Johnstone On the distribution of the largest eigenvalue in principal components analysis , 2001 .

[11]  Xiaoming Huo,et al.  Uncertainty principles and ideal atomic decomposition , 2001, IEEE Trans. Inf. Theory.

[12]  Olivier Ledoit,et al.  Some hypothesis tests for the covariance matrix when the dimension is large compared to the sample size , 2002 .

[13]  Michael Elad,et al.  Optimally sparse representation in general (nonorthogonal) dictionaries via ℓ1 minimization , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Tiefeng Jiang,et al.  The asymptotic distributions of the largest entries of sample correlation matrices , 2004, math/0406184.

[15]  Jie Chen,et al.  Sparse representations for multiple measurement vectors (MMV) in an over-complete dictionary , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[16]  James R. Schott,et al.  Testing for complete independence in high dimensions , 2005 .

[17]  D. Donoho For most large underdetermined systems of equations, the minimal 𝓁1‐norm near‐solution approximates the sparsest near‐solution , 2006 .

[18]  Runze Li,et al.  Statistical Challenges with High Dimensionality: Feature Selection in Knowledge Discovery , 2006, math/0602133.

[19]  Emmanuel J. Candès,et al.  Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information , 2004, IEEE Transactions on Information Theory.

[20]  Michael Elad,et al.  Stable recovery of sparse overcomplete representations in the presence of noise , 2006, IEEE Transactions on Information Theory.

[21]  U. Einmahl,et al.  Characterization of LIL behavior in Banach space , 2006, math/0608687.

[22]  Wang Zhou Asymptotic distribution of the largest off-diagonal entry of correlation matrices , 2007 .