The Impossibility Region for Detecting Sparse Mixtures using the Higher Criticism

Abstract: Consider a multiple hypothesis testing setting involving rare/weak effects: relatively few tests, out of possibly many, deviate from their null hypothesis behavior. Summarizing the significance of each test by a P -value, we construct a global test against the null using the Higher Criticism (HC) statistics of these P-values. We calibrate the rare/weak model using parameters controlling the asymptotic distribution of non-null P -values near zero. We derive a region in the parameter space where the HC test is asymptotically powerless. Our derivation involves very different tools than previously used to show powerlessness of HC, relying on properties of the empirical processes underlying HC. In particular, our result applies to situations where HC is not asymptotically optimal, or when the asymptotically detectable region of the parameter space is unknown.

[1]  Higher Criticism to Compare Two Large Frequency Tables, with sensitivity to Possible Rare and Weak Differences , 2020, 2007.01958.

[2]  Jiashun Jin,et al.  Feature selection by higher criticism thresholding achieves the optimal phase diagram , 2008, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[3]  E. Candès,et al.  Global testing under sparse alternatives: ANOVA, multiple comparisons and the higher criticism , 2010, 1007.1434.

[4]  Yihong Wu,et al.  Optimal Detection of Sparse Mixtures Against a Given Null Distribution , 2014, IEEE Transactions on Information Theory.

[5]  Boaz Nadler,et al.  On the exact Berk-Jones statistics and their p-value calculation , 2013, 1311.3190.

[6]  D. Donoho,et al.  Higher criticism thresholding: Optimal feature selection when useful features are rare and weak , 2008, Proceedings of the National Academy of Sciences.

[7]  D. Donoho,et al.  Higher criticism for detecting sparse heterogeneous mixtures , 2004, math/0410072.

[8]  Jiashun Jin,et al.  Optimal detection of heterogeneous and heteroscedastic mixtures , 2011 .

[9]  E. Arias-Castro,et al.  The Sparse Poisson Means Model , 2015, 1505.01247.

[10]  Alexandre Berred,et al.  Limit results for ordered uniform spacings , 2008 .

[11]  P. Hall,et al.  Innovated Higher Criticism for Detecting Sparse Signals in Correlated Noise , 2009, 0902.3837.

[12]  H. Finner,et al.  Goodness of fit tests in terms of local levels with special emphasis on higher criticism tests , 2016, 1603.05461.

[13]  Higher Criticism for Discriminating Word-Frequency Tables and Testing Authorship , 2019, ArXiv.

[14]  Jian Li,et al.  Higher criticism: $p$-values and criticism , 2014, 1411.1437.