Automagic: Standardized preprocessing of big EEG data

Electroencephalography (EEG) recordings have been rarely included in large-scale studies. This is arguably not due to a lack of information that lies in EEG recordings but mainly on account of methodological issues. In many cases, particularly in clinical, pediatric and aging populations, the EEG has a high degree of artifact contamination and the quality of EEG recordings often substantially differs between subjects. Although there exists a variety of standardized preprocessing methods to clean EEG from artifacts, currently there is no method to objectively quantify the quality of preprocessed EEG. This makes the commonly accepted procedure of excluding subjects from analyses due to exceeding contamination of artifacts highly subjective. As a consequence, P-hacking is fostered, the replicability of results is decreased, and it is difficult to pool data from different study sites. In addition, in large-scale studies, data are collected over years or even decades, requiring software that controls and manages the preprocessing of ongoing and dynamically growing studies. To address these challenges, we developed Automagic, an open-source MATLAB toolbox that acts as a wrapper to run currently available preprocessing methods and offers objective standardized quality assessment for growing studies. The software is compatible with the Brain Imaging Data Structure (BIDS) standard and hence facilitates data sharing. In the present paper we outline the functionality of Automagic and examine the effect of applying combinations of methods on a sample of resting EEG data. This examination suggests that applying a pipeline of algorithms to detect artifactual channels in combination with Multiple Artifact Rejection Algorithm (MARA), an independent component analysis (ICA)-based artifact correction method, is sufficient to reduce a large extent of artifacts.

[1]  C. Jack,et al.  Ways toward an early diagnosis in Alzheimer’s disease: The Alzheimer’s Disease Neuroimaging Initiative (ADNI) , 2005, Alzheimer's & Dementia.

[2]  Thorsten O. Zander,et al.  Evaluation of a Dry EEG System for Application of Passive Brain-Computer Interfaces in Autonomous Driving , 2017, Front. Hum. Neurosci..

[3]  J. Ford,et al.  Clinical application of the P3 component of event-related potentials. II. Dementia, depression and schizophrenia. , 1984, Electroencephalography and clinical neurophysiology.

[4]  A. Mognon,et al.  ADJUST: An automatic EEG artifact detector based on the joint use of spatial and temporal features. , 2011, Psychophysiology.

[5]  Marc Wildi,et al.  Test–retest reliability of resting EEG spectra validates a statistical signature of persons , 2007, Clinical Neurophysiology.

[6]  Michael P. Milham,et al.  A multi-modal approach to decomposing standard neuropsychological test performance: Symbol Search , 2017, bioRxiv.

[7]  Kyungmin Su,et al.  The PREP pipeline: standardized preprocessing for large-scale EEG analysis , 2015, Front. Neuroinform..

[8]  Robert T. Knight,et al.  Parameterizing neural power spectra , 2018, bioRxiv.

[9]  R. Barry,et al.  EOG correction: which regression should we use? , 2000, Psychophysiology.

[10]  Stefan Debener,et al.  Simultaneous EEG and fMRI: Recording, Analysis, and Application , 2010 .

[11]  Leif D. Nelson,et al.  False-Positive Psychology , 2011, Psychological science.

[12]  Jussi Korpela,et al.  Computational Testing for Automated Preprocessing 2: Practical Demonstration of a System for Scientific Data-Processing Workflow Management for High-Volume EEG , 2018, Front. Neurosci..

[13]  April R. Levin,et al.  BEAPP: The Batch Electroencephalography Automated Processing Platform , 2018, Front. Neurosci..

[14]  Mark C. W. van Rossum,et al.  Systematic biases in early ERP and ERF components as a result of high-pass filtering , 2012, Journal of Neuroscience Methods.

[15]  T. Insel,et al.  Why has it taken so long for biological psychiatry to develop clinical tests and what to do about it? , 2012, Molecular Psychiatry.

[16]  T. Insel,et al.  Toward the future of psychiatric diagnosis: the seven pillars of RDoC , 2013, BMC Medicine.

[17]  April R. Levin,et al.  The Harvard Automated Processing Pipeline for Electroencephalography (HAPPE): Standardized Processing Software for Developmental and High-Artifact Data , 2018, Front. Neurosci..

[18]  Jessica A. Turner,et al.  Sharing the wealth: Neuroimaging data repositories , 2016, NeuroImage.

[19]  L. Fraenkel,et al.  A four-step approach for developing diagnostic tests in psychiatry: EEG in ADHD as a test case. , 2005, The Journal of neuropsychiatry and clinical neurosciences.

[20]  Lucas C Parra,et al.  A resource for assessing information processing in the developing brain using EEG and eye tracking , 2016, Scientific Data.

[21]  J. Sweeney,et al.  Resting state EEG abnormalities in autism spectrum disorders , 2013, Journal of Neurodevelopmental Disorders.

[22]  Lucas C. Parra,et al.  Recipes for the linear analysis of EEG , 2005, NeuroImage.

[23]  P. Elliott,et al.  UK Biobank: An Open Access Resource for Identifying the Causes of a Wide Range of Complex Diseases of Middle and Old Age , 2015, PLoS medicine.

[24]  Robert Oostenveld,et al.  EEG-BIDS, an extension to the brain imaging data structure for electroencephalography , 2019, Scientific Data.

[25]  Sangmoon Kim,et al.  Depression, anxiety, and resting frontal EEG asymmetry: a meta-analytic review. , 2006, Journal of abnormal psychology.

[26]  Thea Radüntz,et al.  Signal Quality Evaluation of Emerging EEG Devices , 2018, Front. Physiol..

[27]  Michael H. Herzog,et al.  An automatic pre-processing pipeline for EEG analysis (APP) based on robust statistics , 2018, Clinical Neurophysiology.

[28]  S. Luck,et al.  How inappropriate high-pass filters can produce artifactual effects and incorrect conclusions in ERP studies of language and cognition. , 2015, Psychophysiology.

[29]  R. B. Reilly,et al.  FASTER: Fully Automated Statistical Thresholding for EEG artifact Rejection , 2010, Journal of Neuroscience Methods.

[30]  J Haueisen,et al.  Novel Multipin Electrode Cap System for Dry Electroencephalography , 2015, Brain Topography.

[31]  Kay A. Robbins,et al.  Preparing Laboratory and Real-World EEG Data for Large-Scale Analysis: A Containerized Approach , 2016, Front. Neuroinform..

[32]  W. Iacono,et al.  The status of spectral EEG abnormality as a diagnostic test for schizophrenia , 2008, Schizophrenia Research.

[33]  Alexandre Gramfort,et al.  Autoreject: Automated artifact rejection for MEG and EEG data , 2016, NeuroImage.

[34]  Masatoshi Nakamura,et al.  Technical quality evaluation of EEG recording based on electroencephalographers' knowledge. , 2005, Medical engineering & physics.

[35]  Arnaud Delorme,et al.  EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis , 2004, Journal of Neuroscience Methods.

[36]  G. Hajcak,et al.  The error-related negativity (ERN) and psychopathology: toward an endophenotype. , 2008, Clinical psychology review.

[37]  Marco Cecchi,et al.  A clinical trial to validate event-related potential markers of Alzheimer's disease in outpatient settings , 2015, Alzheimer's & dementia.

[38]  Jon Touryan,et al.  A Comparison of Electroencephalography Signals Acquired from Conventional and Mobile Systems , 2014 .

[39]  C. C. Duncan,et al.  Event-related potentials in clinical research: Guidelines for eliciting, recording, and quantifying mismatch negativity, P300, and N400 , 2009, Clinical Neurophysiology.

[40]  G. Sapiro,et al.  A collaborative framework for 3D alignment and classification of heterogeneous subvolumes in cryo-electron tomography. , 2013, Journal of structural biology.

[41]  G. A. Miller,et al.  Committee report: publication guidelines and recommendations for studies using electroencephalography and magnetoencephalography. , 2014, Psychophysiology.

[42]  Luke J. Chang,et al.  Building better biomarkers: brain models in translational neuroimaging , 2017, Nature Neuroscience.

[43]  T. Jung,et al.  Dry and Noncontact EEG Sensors for Mobile Brain–Computer Interfaces , 2012, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[44]  S. Luck,et al.  The effects of electrode impedance on data quality and statistical significance in ERP recordings. , 2010, Psychophysiology.

[45]  Ben Cowley,et al.  Computational testing for automated preprocessing: a Matlab toolbox to enable large scale electroencephalography data processing , 2017, PeerJ Comput. Sci..

[46]  T. Sejnowski,et al.  Removing electroencephalographic artifacts by blind source separation. , 2000, Psychophysiology.

[47]  Daniel P. Kennedy,et al.  The Autism Brain Imaging Data Exchange: Towards Large-Scale Evaluation of the Intrinsic Brain Architecture in Autism , 2013, Molecular Psychiatry.

[48]  J. Kowalski,et al.  Evidence-Based Evaluation of Diagnostic Accuracy of Resting EEG in Dementia and Mild Cognitive Impairment , 2009, Clinical EEG and neuroscience.

[49]  Stephanie Brandl,et al.  Robust artifactual independent component classification for BCI practitioners , 2014, Journal of neural engineering.

[50]  S. Sponheim,et al.  Internal consistency reliability of resting EEG power spectra in schizophrenic and normal subjects. , 1995, Psychophysiology.

[51]  Waltz,et al.  Descriptor : An open resource for transdiagnostic research in pediatric mental health and learning disorders , 2019 .

[52]  Robert Oostenveld,et al.  BIDS-EEG: an extension to the Brain Imaging Data Structure (BIDS) Specification for electroencephalography , 2018 .

[53]  M. Tangermann,et al.  Automatic Classification of Artifactual ICA-Components for Artifact Removal in EEG Signals , 2011, Behavioral and Brain Functions.

[54]  Kay A. Robbins,et al.  Automated EEG mega-analysis I: Spectral and amplitude characteristics across studies , 2018, NeuroImage.

[55]  Thomas E. Nichols,et al.  Best practices in data analysis and sharing in neuroimaging using MRI , 2017, Nature Neuroscience.

[56]  Tzyy-Ping Jung,et al.  Real-time modeling and 3D visualization of source dynamics and connectivity using wearable EEG , 2013, 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[57]  W. Klimesch EEG alpha and theta oscillations reflect cognitive and memory performance: a review and analysis , 1999, Brain Research Reviews.

[58]  F. Hatz,et al.  Reliability of fully automated versus visually controlled pre- and post-processing of resting-state EEG , 2015, Clinical Neurophysiology.

[59]  Terrence J. Sejnowski,et al.  Enhanced detection of artifacts in EEG data using higher-order statistics and independent component analysis , 2007, NeuroImage.

[60]  Satrajit S. Ghosh,et al.  The brain imaging data structure, a format for organizing and describing outputs of neuroimaging experiments , 2016, Scientific Data.

[61]  Tim Mullen,et al.  Automated EEG mega-analysis II: Cognitive aspects of event related features , 2018, NeuroImage.