Signal identification in ERP data by decorrelated Higher Criticism Thresholding

Event-related potentials (ERPs) are intensive recordings of electrical activity along the scalp time-locked to motor, sensory, or cognitive events. A main objective in ERP studies is to select (rare) time points at which (weak) ERP amplitudes (features) are significantly associated with experimental variable of interest. The Higher Criticism Thresholding (HCT), as an optimal signal detection procedure in the " rare-and-weak " paradigm, appears to be ideally suited for identifying ERP features. However, ERPs exhibit complex temporal dependence patterns violating the assumption under which signal identification can be achieved efficiently for HCT. This article first highlights this impact of dependence in terms of instability of signal estimation by HCT. A factor modeling for the covariance in HCT is then introduced to decorrelate test statistics and to restore stability in estimation. The detection boundary under factor-analytic dependence is derived and the phase diagram is correspondingly extended. Using simulations and a real data analysis example, the proposed method is shown to estimate more efficiently the support of signals compared with standard HCT and other HCT approaches based on a shrinkage estimation of the covariance matrix.

[1]  Nathaniel J. Smith,et al.  Regression-based estimation of ERP waveforms: II. Nonlinear effects, overlap correction, and practical considerations. , 2015, Psychophysiology.

[2]  P. Hall,et al.  PROPERTIES OF HIGHER CRITICISM UNDER STRONG DEPENDENCE , 2008, 0803.2095.

[3]  Jiashun Jin,et al.  Feature selection by higher criticism thresholding achieves the optimal phase diagram , 2008, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[4]  Korbinian Strimmer,et al.  Signal identification for rare and weak features: higher criticism or false discovery rates? , 2011, Biostatistics.

[5]  D. Donoho,et al.  Higher criticism thresholding: Optimal feature selection when useful features are rare and weak , 2008, Proceedings of the National Academy of Sciences.

[6]  Nancy R. Zhang,et al.  Multiple hypothesis testing adjusted for latent variables, with an application to the AGEMAP gene expression data , 2013, 1301.2420.

[7]  Jeffrey T Leek,et al.  A general framework for multiple testing dependence , 2008, Proceedings of the National Academy of Sciences.

[8]  D. Donoho,et al.  Higher criticism for detecting sparse heterogeneous mixtures , 2004, math/0410072.

[9]  D. Guthrie,et al.  Significance testing of difference potentials. , 1991, Psychophysiology.

[10]  P. Hall,et al.  Innovated Higher Criticism for Detecting Sparse Signals in Correlated Noise , 2009, 0902.3837.

[11]  S. Hsieh,et al.  A factor-adjusted multiple testing procedure for ERP data analysis , 2012, Behavior research methods.

[12]  K. Strimmer,et al.  Feature selection in omics prediction problems using cat scores and false nondiscovery rate control , 2009, 0903.2003.

[13]  K. Strimmer,et al.  Statistical Applications in Genetics and Molecular Biology A Shrinkage Approach to Large-Scale Covariance Matrix Estimation and Implications for Functional Genomics , 2011 .

[14]  Nathaniel J. Smith,et al.  Regression-based estimation of ERP waveforms: I. The rERP framework. , 2015, Psychophysiology.

[15]  David Causeur,et al.  ACCOUNTING FOR TIME DEPENDENCE IN LARGE-SCALE MULTIPLE TESTING OF EVENT-RELATED POTENTIAL DATA , 2016 .

[16]  David Causeur,et al.  A factor model to analyze heterogeneity in gene expression , 2010, BMC Bioinformatics.

[17]  Jiashun Jin,et al.  Optimal detection of heterogeneous and heteroscedastic mixtures , 2011 .

[18]  Korbinian Strimmer,et al.  Gene ranking and biomarker discovery under correlation , 2009, Bioinform..

[19]  T W Picton,et al.  The P300 Wave of the Human Event‐Related Potential , 1992, Journal of clinical neurophysiology : official publication of the American Electroencephalographic Society.

[20]  Céline Bugli,et al.  Functional ANOVA with random functional effects: an application to event‐related potentials modelling for electroencephalograms analysis , 2006, Statistics in medicine.

[21]  John D. Storey,et al.  Capturing Heterogeneity in Gene Expression Studies by Surrogate Variable Analysis , 2007, PLoS genetics.

[22]  E. Gordon,et al.  THE TEST-RETEST RELIABILITY OF A STANDARDIZED NEUROCOGNITIVE AND NEUROPHYSIOLOGICAL TEST BATTERY: “NEUROMARKER” , 2005, The International journal of neuroscience.

[23]  David Causeur,et al.  Stability of feature selection in classification issues for high-dimensional correlated data , 2015, Statistics and Computing.

[24]  P. Bickel,et al.  Some theory for Fisher''s linear discriminant function , 2004 .

[25]  Chloé Friguet,et al.  A Factor Model Approach to Multiple Testing Under Dependence , 2009 .

[26]  Nancy Zhang,et al.  Multiple hypothesis testing , adjusting for latent variables , 2011 .