Autoreject: Automated artifact rejection for MEG and EEG data

Abstract We present an automated algorithm for unified rejection and repair of bad trials in magnetoencephalography (MEG) and electroencephalography (EEG) signals. Our method capitalizes on cross‐validation in conjunction with a robust evaluation metric to estimate the optimal peak‐to‐peak threshold – a quantity commonly used for identifying bad trials in M/EEG. This approach is then extended to a more sophisticated algorithm which estimates this threshold for each sensor yielding trial‐wise bad sensors. Depending on the number of bad sensors, the trial is then repaired by interpolation or by excluding it from subsequent analysis. All steps of the algorithm are fully automated thus lending itself to the name Autoreject. In order to assess the practical significance of the algorithm, we conducted extensive validation and comparisons with state‐of‐the‐art methods on four public datasets containing MEG and EEG recordings from more than 200 subjects. The comparisons include purely qualitative efforts as well as quantitatively benchmarking against human supervised and semi‐automated preprocessing pipelines. The algorithm allowed us to automate the preprocessing of MEG data from the Human Connectome Project (HCP) going up to the computation of the evoked responses. The automated nature of our method minimizes the burden of human inspection, hence supporting scalability and reliability demanded by data analysis in modern neuroscience. Graphical abstract Symbol. No caption available. HighlightsA strategy for artifact rejection in M/EEG using peak‐to‐peak thresholds is proposedThe thresholds are estimated using cross‐validation with a robust error metricThe method detects and repairs outlier data segments for each sensorComparison with competing methods on 200 subjects with ground truth responses

[1]  E. Halgren,et al.  Dynamic Statistical Parametric Mapping Combining fMRI and MEG for High-Resolution Imaging of Cortical Activity , 2000, Neuron.

[2]  Karl J. Friston,et al.  EEG and MEG Data Analysis in SPM8 , 2011, Comput. Intell. Neurosci..

[3]  Jasper Snoek,et al.  Practical Bayesian Optimization of Machine Learning Algorithms , 2012, NIPS.

[4]  E. Somersalo,et al.  Visualization of Magnetoencephalographic Data Using Minimum Current Estimates , 1999, NeuroImage.

[5]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[6]  Mark W. Woolrich,et al.  MEG beamforming using Bayesian PCA for adaptive data covariance matrix regularization , 2011, NeuroImage.

[7]  Krzysztof J. Gorgolewski,et al.  A Practical Guide for Improving Transparency and Reproducibility in Neuroimaging Research , 2016, bioRxiv.

[8]  Yoshua Bengio,et al.  Algorithms for Hyper-Parameter Optimization , 2011, NIPS.

[9]  S. Taulu,et al.  Suppression of Interference and Artifacts by the Signal Space Separation Method , 2003, Brain Topography.

[10]  Jeffrey M. Hausdorff,et al.  Physionet: Components of a New Research Resource for Complex Physiologic Signals". Circu-lation Vol , 2000 .

[11]  Richard M. Leahy,et al.  Brainstorm: A User-Friendly Application for MEG/EEG Analysis , 2011, Comput. Intell. Neurosci..

[12]  R N Vigário,et al.  Extraction of ocular artefacts from EEG using independent component analysis. , 1997, Electroencephalography and clinical neurophysiology.

[13]  Satrajit S. Ghosh,et al.  The brain imaging data structure, a format for organizing and describing outputs of neuroimaging experiments , 2016, Scientific Data.

[14]  Quentin Noirhomme,et al.  Reanalysis of “Bedside detection of awareness in the vegetative state: a cohort study” , 2013, The Lancet.

[15]  B. Moor,et al.  Robust Cross-Validation Score Functions with Application to Weighted Least Squares Support Vector Machine Function Estimation ∗ , 2009 .

[16]  Alexandre Gramfort,et al.  Automated model selection in covariance estimation and spatial whitening of MEG and EEG signals , 2015, NeuroImage.

[17]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[18]  Mark W. Woolrich,et al.  Adding dynamics to the Human Connectome Project with MEG , 2013, NeuroImage.

[19]  Erich Schröger,et al.  Digital filter design for electrophysiological data – a practical approach , 2015, Journal of Neuroscience Methods.

[20]  Robert Oostenveld,et al.  FieldTrip: Open Source Software for Advanced Analysis of MEG, EEG, and Invasive Electrophysiological Data , 2010, Comput. Intell. Neurosci..

[21]  Lauri Parkkonen Instrumentation and Data Preprocessing , 2010 .

[22]  Uwe Pietrzyk,et al.  Integration of Amplitude and Phase Statistics for Complete Artifact Removal in Independent Components of Neuromagnetic Recordings , 2008, IEEE Transactions on Biomedical Engineering.

[23]  Jörn Diedrichsen,et al.  Detecting and adjusting for artifacts in fMRI time series data , 2005, NeuroImage.

[24]  Stanislas Dehaene,et al.  Automated Measurement and Prediction of Consciousness in Vegetative and Minimally Conscious Patients , 2015, ICML 2015.

[25]  M. Congedo,et al.  The Riemannian Potato: an automatic and adaptive artifact detection method for online experiments using Riemannian geometry , 2013 .

[26]  Erkki Oja,et al.  Independent component approach to the analysis of EEG and MEG recordings , 2000, IEEE Transactions on Biomedical Engineering.

[27]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[28]  Tim Sprosen,et al.  UK Biobank: from concept to reality. , 2005, Pharmacogenomics.

[29]  R. Ilmoniemi,et al.  Interpreting magnetic fields of the brain: minimum norm estimates , 2006, Medical and Biological Engineering and Computing.

[30]  R. B. Reilly,et al.  FASTER: Fully Automated Statistical Thresholding for EEG artifact Rejection , 2010, Journal of Neuroscience Methods.

[31]  J Gross,et al.  REPRINTS , 1962, The Lancet.

[32]  R. Ilmoniemi,et al.  Signal-space projection method for separating MEG or EEG into components , 1997, Medical and Biological Engineering and Computing.

[33]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[34]  Martin Luessi,et al.  MNE software for processing MEG and EEG data , 2014, NeuroImage.

[35]  B. Rockstroh,et al.  Statistical control of artifacts in dense array EEG/MEG studies. , 2000, Psychophysiology.

[36]  Richard N Henson,et al.  A multi-subject, multi-modal human neuroimaging dataset , 2015, Scientific Data.

[37]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[38]  D. H. Leung,et al.  Cross-validation in nonparametric regression with outliers , 2005 .

[39]  Martin Luessi,et al.  MEG and EEG data analysis with MNE-Python , 2013, Front. Neuroinform..

[40]  Alexandre Gramfort,et al.  Automated rejection and repair of bad trials in MEG/EEG , 2016, 2016 International Workshop on Pattern Recognition in Neuroimaging (PRNI).

[41]  Kyungmin Su,et al.  The PREP pipeline: standardized preprocessing for large-scale EEG analysis , 2015, Front. Neuroinform..

[42]  Arnaud Delorme,et al.  EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis , 2004, Journal of Neuroscience Methods.

[43]  Steen Moeller,et al.  The Human Connectome Project: A data acquisition perspective , 2012, NeuroImage.

[44]  Cam-CAN Group,et al.  The Cambridge Centre for Ageing and Neuroscience (Cam-CAN) data repository: Structural and functional MRI, MEG, and cognitive data from a cross-sectional adult lifespan sample , 2017, NeuroImage.

[45]  F. Perrin,et al.  Spherical splines for scalp potential and current density mapping. , 1989, Electroencephalography and clinical neurophysiology.

[46]  Kenneth Kreutz-Delgado,et al.  Hierarchical Event Descriptor (HED) tags for analysis of event-related EEG studies , 2013, 2013 IEEE Global Conference on Signal and Information Processing.

[47]  Srivas Chennu,et al.  Bedside detection of awareness in the vegetative state: a cohort study , 2011, The Lancet.

[48]  N. Birbaumer,et al.  BCI2000: a general-purpose brain-computer interface (BCI) system , 2004, IEEE Transactions on Biomedical Engineering.

[49]  S. Dehaene,et al.  A hierarchy of cortical responses to sequence violations in three-month-old infants , 2014, Cognition.

[50]  R D Pascual-Marqui,et al.  Standardized low-resolution brain electromagnetic tomography (sLORETA): technical details. , 2002, Methods and findings in experimental and clinical pharmacology.

[51]  Alan C. Evans,et al.  OMEGA: The Open MEG Archive , 2016, NeuroImage.

[52]  Alain de Cheveigné,et al.  Sensor noise suppression , 2008, Journal of Neuroscience Methods.

[53]  Robert T. Knight,et al.  Five-dimensional neuroimaging: Localization of the time–frequency dynamics of cortical activity , 2008, NeuroImage.

[54]  E. Parzen On Estimation of a Probability Density Function and Mode , 1962 .