Detecting abnormal electroencephalograms using deep convolutional networks

OBJECTIVES Electroencephalography (EEG) is a central part of the medical evaluation for patients with neurological disorders. Training an algorithm to label the EEG normal vs abnormal seems challenging, because of EEG heterogeneity and dependence of contextual factors, including age and sleep stage. Our objectives were to validate prior work on an independent data set suggesting that deep learning methods can discriminate between normal vs abnormal EEGs, to understand whether age and sleep stage information can improve discrimination, and to understand what factors lead to errors. METHODS We train a deep convolutional neural network on a heterogeneous set of 8522 routine EEGs from the Massachusetts General Hospital. We explore several strategies for optimizing model performance, including accounting for age and sleep stage. RESULTS The area under the receiver operating characteristic curve (AUC) on an independent test set (n = 851) is 0.917 marginally improved by including age (AUC = 0.924), and both age and sleep stages (AUC = 0.925), though not statistically significant. CONCLUSIONS The model architecture generalizes well to an independent dataset. Adding age and sleep stage to the model does not significantly improve performance. SIGNIFICANCE Insights learned from misclassified examples, and minimal improvement by adding sleep stage and age suggest fruitful directions for further research.

[1]  Olga Sourina,et al.  Large-Scale Automated Sleep Staging , 2017, Sleep.

[2]  T. Pedley Current Practice of Clinical Electroenceph‐alography , 1980, Neurology.

[3]  K. Jordan,et al.  Emergency EEG and Continuous EEG Monitoring in Acute Ischemic Stroke , 2004, Journal of clinical neurophysiology : official publication of the American Electroencephalographic Society.

[4]  C. V. van Donselaar,et al.  Value of the electroencephalogram in adult patients with untreated idiopathic first seizures. , 1992, Archives of neurology.

[5]  Owen Falzon,et al.  Brain Symmetry Index in Healthy and Stroke Patients for Assessment and Prognosis , 2017, Stroke research and treatment.

[6]  Michel J. A. M. van Putten,et al.  The revised brain symmetry index , 2007, Clinical Neurophysiology.

[7]  R. Chervin,et al.  The visual scoring of sleep and arousal in infants and children. , 2007, Journal of clinical sleep medicine : JCSM : official publication of the American Academy of Sleep Medicine.

[8]  Jonathan W. Schooler,et al.  Metascience could rescue the ‘replication crisis’ , 2014, Nature.

[9]  Arthur C. Grant,et al.  EEG interpretation reliability and interpreter confidence: A large single-center study , 2014, Epilepsy & Behavior.

[10]  G. B. Young,et al.  The EEG in Coma , 2000, Journal of clinical neurophysiology : official publication of the American Electroencephalographic Society.

[11]  Joseph Picone,et al.  The Temple University Hospital EEG Data Corpus , 2016, Front. Neurosci..

[12]  Stefan Carlsson,et al.  CNN Features Off-the-Shelf: An Astounding Baseline for Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[13]  Joseph Sullivan,et al.  Interrater agreement in the interpretation of neonatal electroencephalography in hypoxic‐ischemic encephalopathy , 2017, Epilepsia.

[14]  P. Kaplan The EEG in Metabolic Encephalopathy and Coma , 2004, Journal of clinical neurophysiology : official publication of the American Electroencephalographic Society.

[15]  Peter W. Kaplan,et al.  Generalized periodic discharges and ‘triphasic waves’: A blinded evaluation of inter-rater agreement and clinical significance , 2016, Clinical Neurophysiology.

[16]  M. Walker,et al.  Sleep and Human Aging , 2017, Neuron.

[17]  Tonio Ball,et al.  Deep learning with convolutional neural networks for decoding and visualization of EEG pathology , 2017, 2017 IEEE Signal Processing in Medicine and Biology Symposium (SPMB).

[18]  Wolfram Burgard,et al.  Deep learning with convolutional neural networks for EEG decoding and visualization , 2017, Human brain mapping.

[19]  S. Fujimoto,et al.  An intervention to improve the interrater reliability of clinical EEG interpretations , 2003, Psychiatry and clinical neurosciences.

[20]  C. Deacon,et al.  Triphasic Waves Versus Nonconvulsive Status Epilepticus: EEG Distinction , 2006, Canadian Journal of Neurological Sciences / Journal Canadien des Sciences Neurologiques.