Does data cleaning improve brain state classification?

Background Neuroscientists routinely seek to identify and remove noisy or artifactual observations from their data. They do so guided by the belief that including such data would reduce power to detect effects of interest. Whereas standard methods can exclude certain well defined noise sources (e.g., line noise), most forms of noise do not easily separate from signals of interest. Here we ask how well methods routinely used to “clean” human electrophysiological recordings actually boost power to detect brain-behavior correlations. New Method This, to the authors’ knowledge, is the first large-scale study of the impact of intracranial EEG preprocessing on brain state classification. Results We find that several commonly used data cleaning methods (automated methods based on statistical properties of the signal and manual methods based on expert review) reduce statistical power for both univariate and multivariate classification of successful memory encoding, a behavioral state with very well-characterized electrophysiological biomarkers. Comparison with Existing Methods By reallocating resources towards collecting more within-patient data instead of attempting to “clean” data, neuroscientists may see increases in the statistical power to detect physiological phenomena. Conclusions These findings highlight the challenge of partitioning signal and noise in the analysis of brain-behavior relations. They also prescribe increases in sample size and numbers of observations, rather than data cleaning, as the best approach to improving statistical power.

[1]  M. Kahana,et al.  Synchronous and Asynchronous Theta and Gamma Activity during Episodic Memory Formation , 2013, The Journal of Neuroscience.

[2]  Josef Parvizi,et al.  Frontal and motor cortex contributions to response inhibition: evidence from electrocorticography. , 2016, Journal of neurophysiology.

[3]  M J Kahana,et al.  Widespread theta synchrony and high-frequency desynchronization underlies enhanced cognition , 2017, Nature Communications.

[4]  Michael J. Jutras,et al.  Direct Brain Stimulation Modulates Encoding States and Memory Performance in Humans , 2017, Current Biology.

[5]  Simon Hanslmayr,et al.  Data-driven re-referencing of intracranial EEG based on independent component analysis (ICA) , 2017, Journal of Neuroscience Methods.

[6]  Matt Stead,et al.  Human Neuroscience , 2022 .

[7]  Biyu J. He Scale-free brain activity: past, present, and future , 2014, Trends in Cognitive Sciences.

[8]  Vishnu Sreekumar,et al.  Signal Complexity of Human Intracranial EEG Tracks Successful Associative-Memory Formation across Individuals , 2018, The Journal of Neuroscience.

[9]  Kaustubh Supekar,et al.  Sparse logistic regression for whole-brain classification of fMRI data , 2010, NeuroImage.

[10]  Arne D. Ekstrom,et al.  Oscillations Go the Distance: Low-Frequency Human Hippocampal Oscillations Code Spatial Distance in the Absence of Sensory Cues during Teleportation , 2016, Neuron.

[11]  Daniel P. Kennedy,et al.  Differential electrophysiological response during rest, self-referential, and non–self-referential tasks in human posteromedial cortex , 2011, Proceedings of the National Academy of Sciences.

[12]  Murray Glanzer,et al.  Distance between related words in free recall: Trace of the STS , 1969 .

[13]  Nicole M. Long,et al.  Subsequent memory effect in intracranial and scalp EEG , 2014, NeuroImage.

[14]  Virginia R. de Sa,et al.  Preprocessing and Meta-Classification for Brain-Computer Interfaces , 2007, IEEE Transactions on Biomedical Engineering.

[15]  Bradley C. Lega,et al.  Closed-loop stimulation of temporal cortex rescues functional networks and improves memory , 2018, Nature Communications.

[16]  Per B. Sederberg,et al.  Oscillatory correlates of the primacy effect in episodic memory , 2006, NeuroImage.

[17]  Y. Ezzyat,et al.  Direct brain stimulation during episodic memory , 2018, Current Opinion in Biomedical Engineering.

[18]  Andrew C. Connolly,et al.  Interictal epileptiform discharges impair word recall in multiple brain areas , 2017, Epilepsia.

[19]  S. Raghavachari,et al.  Distinct patterns of brain oscillations underlie two basic parameters of human maze learning. , 2001, Journal of neurophysiology.

[20]  Josef Parvizi,et al.  Promises and limitations of human intracranial electroencephalography , 2018, Nature Neuroscience.

[21]  Brent M. Berry,et al.  Visually validated semi-automatic high-frequency oscillation detection aides the delineation of epileptogenic regions during intra-operative electrocorticography , 2018, Clinical Neurophysiology.

[22]  M. Kahana,et al.  Slow-Theta-to-Gamma Phase-Amplitude Coupling in Human Hippocampus Supports the Formation of New Episodic Memories. , 2016, Cerebral cortex.

[23]  E. Halgren,et al.  High-frequency neural activity and human cognition: Past, present and possible future of intracranial EEG research , 2012, Progress in Neurobiology.

[24]  R. B. Reilly,et al.  FASTER: Fully Automated Statistical Thresholding for EEG artifact Rejection , 2010, Journal of Neuroscience Methods.

[25]  Kazuyuki Aihara,et al.  Logistic Regression for Single Trial EEG Classification , 2006, NIPS.

[26]  U. Rutishauser,et al.  Human memory strength is predicted by theta-frequency phase-locking of single neurons , 2010, Nature.

[27]  Gabriel Curio,et al.  A novel method for reliable and fast extraction of neuronal EEG/MEG oscillations on the basis of spatio-spectral decomposition , 2011, NeuroImage.

[28]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[29]  Paul Ferrari,et al.  Self-paced movements induce high-frequency gamma oscillations in primary motor cortex , 2008, NeuroImage.

[30]  Biyu J. He,et al.  The Temporal Structures and Functional Significance of Scale-free Brain Activity , 2010, Neuron.

[31]  Brian Litt,et al.  Gamma Oscillations Distinguish True From False Memories , 2007, Psychological science.

[32]  K. Miller,et al.  Human Retrosplenial Cortex Displays Transient Theta Phase Locking with Medial Temporal Cortex Prior to Activation during Autobiographical Memory Retrieval , 2013, The Journal of Neuroscience.

[33]  Rajesh P. N. Rao,et al.  Spectral Changes in Cortical Surface Potentials during Motor Movement , 2007, The Journal of Neuroscience.

[34]  Manuel R. Mercier,et al.  Evaluation of cortical local field potential diffusion in stereotactic electro-encephalography recordings: A glimpse on white matter signal , 2017, NeuroImage.

[35]  Arne D. Ekstrom,et al.  Human hippocampal theta activity during virtual navigation , 2005, Hippocampus.

[36]  Richard Gao,et al.  Inferring synaptic excitation/inhibition balance from field potentials , 2016, NeuroImage.

[37]  Robert T. Knight,et al.  Cortical Spatio-temporal Dynamics Underlying Phonological Target Detection in Humans , 2011, Journal of Cognitive Neuroscience.

[38]  S. P. Pederson,et al.  On Robustness in the Logistic Regression Model , 1993 .

[39]  Rafi U. Haque,et al.  Cortical Low-Frequency Power and Progressive Phase Synchrony Precede Successful Memory Encoding , 2015, The Journal of Neuroscience.

[40]  Kareem A. Zaghloul,et al.  Decreases in theta and increases in high frequency activity underlie associative memory encoding , 2015, NeuroImage.

[41]  Peter Brown,et al.  Midline Frontal Cortex Low-Frequency Activity Drives Subthalamic Nucleus Oscillations during Conflict , 2014, The Journal of Neuroscience.

[42]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[43]  M J Stokes,et al.  EEG-based communication: a pattern recognition approach. , 2000, IEEE transactions on rehabilitation engineering : a publication of the IEEE Engineering in Medicine and Biology Society.

[44]  F. McCoy,et al.  Janus-faced PIDD: a sensor for DNA damage-induced cell death or survival? , 2012, Molecular cell.

[45]  Jonathan Winawer,et al.  Asynchronous Broadband Signals Are the Principal Source of the BOLD Response in Human Visual Cortex , 2013, Current Biology.

[46]  P. Nunez,et al.  Electric fields of the brain , 1981 .

[47]  B. Wandell,et al.  Gamma oscillations in visual cortex: the stimulus matters , 2015, Trends in Cognitive Sciences.

[48]  Simon Hanslmayr,et al.  Data-driven re-referencing of intracranial EEG based on independent component analysis (ICA) , 2017 .

[49]  Marc W Howard,et al.  Theta and Gamma Oscillations during Encoding Predict Subsequent Recall , 2003, The Journal of Neuroscience.

[50]  Brian Litt,et al.  Behavioral / Systems / Cognitive Hippocampal Gamma Oscillations Increase with Memory Load , 2010 .

[51]  Michael J. Kahana,et al.  Multivariate Stochastic Volatility Modeling of Neural Data , 2018 .

[52]  Shennan A. Weiss,et al.  A method for the topographical identification and quantification of high frequency oscillations in intracranial electroencephalography recordings , 2018, Clinical Neurophysiology.

[53]  Arne D. Ekstrom,et al.  A network approach for modulating memory processes via direct and indirect brain stimulation: Toward a causal approach for the neural basis of memory , 2016, Neurobiology of Learning and Memory.

[54]  Andrew W. Moore,et al.  Fast Robust Logistic Regression for Large Sparse Datasets with Binary Outputs , 2003, AISTATS.

[55]  Francesco Carlo Morabito,et al.  Semi-Automatic Artifact Rejection Procedure based on Kurtosis, Renyi's Entropy and Independent Component Scalp Maps , 2005, IEC.

[56]  Meng Wang,et al.  Optimal referencing for stereo-electroencephalographic (SEEG) recordings , 2018, NeuroImage.

[57]  M. Kahana,et al.  Neuronal and oscillatory activity during reward processing in the human ventral striatum , 2011, Neuroreport.

[58]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[59]  Nicole M. Long,et al.  Human intracranial high-frequency activity maps episodic memory formation in space and time , 2014, NeuroImage.

[60]  Jack J. Lin,et al.  Direct brain recordings reveal hippocampal rhythm underpinnings of language processing , 2016, Proceedings of the National Academy of Sciences.

[61]  Mathew H. Evans,et al.  Many Analysts, One Data Set: Making Transparent How Variations in Analytic Choices Affect Results , 2018, Advances in Methods and Practices in Psychological Science.

[62]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[63]  R. Knight,et al.  Shifts in Gamma Phase–Amplitude Coupling Frequency from Theta to Alpha Over Posterior Cortex During Visual Tasks , 2010, Front. Hum. Neurosci..

[64]  Leif D. Nelson,et al.  Data from Paper “False-Positive Psychology: Undisclosed Flexibility in Data Collection and Analysis Allows Presenting Anything as Significant” , 2014 .

[65]  Nicole M. Long,et al.  Successful memory formation is driven by contextual encoding in the core memory network , 2015, NeuroImage.

[66]  Rajesh P. N. Rao,et al.  Real-time functional brain mapping using electrocorticography , 2007, NeuroImage.

[67]  K. Grill-Spector,et al.  Electrical Stimulation of the Left and Right Human Fusiform Gyrus Causes Different Effects in Conscious Face Perception , 2014, The Journal of Neuroscience.

[68]  Nitin Tandon,et al.  Intracranial electroencephalography reveals different temporal profiles for dorsal- and ventro-lateral prefrontal cortex in preparing to stop action. , 2013, Cerebral cortex.

[69]  M. Sigman,et al.  Attention, in and Out: Scalp-Level and Intracranial EEG Correlates of Interoception and Exteroception , 2017, Front. Neurosci..

[70]  S. Raghavachari,et al.  Gating of Human Theta Oscillations by a Working Memory Task , 2001, The Journal of Neuroscience.

[71]  Gal Chechik,et al.  A unifying principle underlying the extracellular field potential spectral responses in the human cortex. , 2015, Journal of neurophysiology.

[72]  J. Jacobs,et al.  Traveling Theta Waves in the Human Hippocampus , 2015, The Journal of Neuroscience.

[73]  Jeremy R. Manning,et al.  Broadband Shifts in Local Field Potential Power Spectra Are Correlated with Single-Neuron Spiking in Humans , 2009, The Journal of Neuroscience.

[74]  Shennan A. Weiss,et al.  Utilization of independent component analysis for accurate pathological ripple detection in intracranial EEG recordings recorded extra- and intra-operatively , 2018, Clinical Neurophysiology.

[75]  Ata Kabán,et al.  Label-Noise Robust Logistic Regression and Its Applications , 2012, ECML/PKDD.

[76]  N. Crone,et al.  High-frequency gamma oscillations and human brain mapping with electrocorticography. , 2006, Progress in brain research.

[77]  R. Sekuler,et al.  Intracranial electroencephalography reveals two distinct similarity effects during item recognition , 2009, Brain Research.