Representation learning for improved interpretability and classification accuracy of clinical factors from EEG

Despite extensive standardization, diagnostic interviews for mental health disorders encompass substantial subjective judgment. Previous studies have demonstrated that EEG-based neural measures can function as reliable objective correlates of depression, or even predictors of depression and its course. However, their clinical utility has not been fully realized because of 1) the lack of automated ways to deal with the inherent noise associated with EEG data at scale, and 2) the lack of knowledge of which aspects of the EEG signal may be markers of a clinical disorder. Here we adapt an unsupervised pipeline from the recent deep representation learning literature to address these problems by 1) learning a disentangled representation using $\beta$-VAE to denoise the signal, and 2) extracting interpretable features associated with a sparse set of clinical labels using a Symbol-Concept Association Network (SCAN). We demonstrate that our method is able to outperform the canonical hand-engineered baseline classification method on a number of factors, including participant age and depression diagnosis. Furthermore, our method recovers a representation that can be used to automatically extract denoised Event Related Potentials (ERPs) from novel, single EEG trajectories, and supports fast supervised re-mapping to various clinical labels, allowing clinicians to re-use a single EEG representation regardless of updates to the standardized diagnostic system. Finally, single factors of the learned disentangled representations often correspond to meaningful markers of clinical factors, as automatically detected by SCAN, allowing for human interpretability and post-hoc expert analysis of the recommendations made by the model.

[1]  Elif Derya Übeyli,et al.  Multiclass Support Vector Machines for EEG-Signals Classification , 2007, IEEE Transactions on Information Technology in Biomedicine.

[2]  S. Smith EEG in the diagnosis, classification, and management of patients with epilepsy , 2005, Journal of Neurology, Neurosurgery & Psychiatry.

[3]  Alexander Lerchner,et al.  A Heuristic for Unsupervised Model Selection for Variational Disentangled Representation Learning , 2019, ICLR.

[4]  A. Cole,et al.  The Utility of Routine EEG in the Diagnosis of Sleep Disordered Breathing , 2012, Journal of clinical neurophysiology : official publication of the American Electroencephalographic Society.

[5]  Paul Bebbington,et al.  Adult Psychiatric Morbidity in England, 2007: Results of a Household Survey , 2009 .

[6]  Joachim M. Buhmann,et al.  The Balanced Accuracy and Its Posterior Distribution , 2010, 2010 20th International Conference on Pattern Recognition.

[7]  Michal Valko,et al.  Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning , 2020, NeurIPS.

[8]  G. Hajcak,et al.  Reduced neural response to reward and pleasant pictures independently relate to depression , 2020, Psychological Medicine.

[9]  Anthony J. Ries,et al.  Decoding P300 Variability Using Convolutional Neural Networks , 2019, bioRxiv.

[10]  Yann LeCun,et al.  Classification of patterns of EEG synchronization for seizure prediction , 2009, Clinical Neurophysiology.

[11]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[12]  Murray Shanahan,et al.  SCAN: Learning Hierarchical Compositional Visual Concepts , 2017, ICLR.

[13]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[14]  Geoffrey E. Hinton,et al.  A Simple Framework for Contrastive Learning of Visual Representations , 2020, ICML.

[15]  Arnoud Arntz,et al.  Inter-rater reliability of the Structured Clinical Interview for DSM-IV Axis I Disorders (SCID I) and Axis II Disorders (SCID II). , 2011, Clinical psychology & psychotherapy.

[16]  E Donchin,et al.  A new method for off-line removal of ocular artifact. , 1983, Electroencephalography and clinical neurophysiology.

[17]  Daan Wierstra,et al.  Stochastic Backpropagation and Approximate Inference in Deep Generative Models , 2014, ICML.

[18]  G. Hajcak,et al.  Using Multilevel Modeling to Examine Blunted Neural Responses to Reward in Major Depression. , 2018, Biological psychiatry. Cognitive neuroscience and neuroimaging.

[19]  Dan Foti,et al.  Reduced electrocortical response to threatening faces in major depressive disorder , 2010, Depression and anxiety.

[20]  E. Walker,et al.  Diagnostic and Statistical Manual of Mental Disorders , 2013 .

[21]  Elif Derya Übeyli,et al.  Recurrent neural networks employing Lyapunov exponents for EEG signals classification , 2005, Expert Syst. Appl..

[22]  Mark Chen,et al.  Generative Pretraining From Pixels , 2020, ICML.

[23]  Tiago H. Falk,et al.  Deep learning-based electroencephalography analysis: a systematic review , 2019, Journal of neural engineering.

[24]  Arnaud Delorme,et al.  EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis , 2004, Journal of Neuroscience Methods.

[25]  Kazuyuki Aihara,et al.  Logistic Regression for Single Trial EEG Classification , 2006, NIPS.

[26]  Yoshua Bengio,et al.  Better Mixing via Deep Representations , 2012, ICML.

[27]  Brechtje Jelles,et al.  Non-linear dynamical analysis of multichannel EEG: Clinical applications in dementia and Parkinson's disease , 2005, Brain Topography.

[28]  P. Lang International Affective Picture System (IAPS) : Technical Manual and Affective Ratings , 1995 .

[29]  Dan J Stein,et al.  Global, regional, and national incidence, prevalence, and years lived with disability for 301 acute and chronic diseases and injuries in 188 countries, 1990–2013: a systematic analysis for the Global Burden of Disease Study 2013 , 2015, The Lancet.

[30]  Max Welling,et al.  Auto-Encoding Variational Bayes , 2013, ICLR.

[31]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[32]  Christoph Reichert,et al.  Convolutional Neural Networks for Decoding of Covert Attention Focus and Saliency Maps for EEG Feature Visualization , 2019 .

[33]  Klaas E. Stephan,et al.  Ketamine Affects Prediction Errors about Statistical Regularities: A Computational Single-Trial Analysis of the Mismatch Negativity , 2019, The Journal of Neuroscience.

[34]  E. Fried,et al.  Depression is not a consistent syndrome: An investigation of unique symptom patterns in the STAR*D study. , 2015, Journal of affective disorders.

[35]  H. Parthasarathy,et al.  NemaFootPrinter: a web based software for the identification of conserved non-coding genome sequence regions between C. elegans and C. briggsae , 1981, Nature Immunology.

[36]  P. Bebbington,et al.  Mental Health and Wellbeing in England: the Adult Psychiatric Morbidity Survey 2014 , 2016 .

[37]  Hubert Cecotti,et al.  Convolutional Neural Networks for P300 Detection with Application to Brain-Computer Interfaces , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38]  S. Makeig,et al.  Mining event-related brain dynamics , 2004, Trends in Cognitive Sciences.

[39]  Matthew Botvinick,et al.  MONet: Unsupervised Scene Decomposition and Representation , 2019, ArXiv.

[40]  Mohammed Yeasin,et al.  Learning Representations from EEG with Deep Recurrent-Convolutional Neural Networks , 2015, ICLR.

[41]  T. Insel,et al.  Wesleyan University From the SelectedWorks of Charles A . Sanislow , Ph . D . 2010 Research Domain Criteria ( RDoC ) : Toward a New Classification Framework for Research on Mental Disorders , 2018 .

[42]  Anna Weinberg,et al.  Blunted Reward Processing in Remitted Melancholic Depression , 2017, Clinical psychological science : a journal of the Association for Psychological Science.

[43]  Abdulhamit Subasi,et al.  EEG signal classification using PCA, ICA, LDA and support vector machines , 2010, Expert Syst. Appl..

[44]  Greg Hajcak,et al.  The Utility of Event-Related Potentials in Clinical Psychology. , 2019, Annual review of clinical psychology.

[45]  Christopher Burgess,et al.  beta-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework , 2016, ICLR 2016.

[46]  R. Kotov,et al.  Diagnostic and Symptom-Based Predictors of Emotional Processing in Generalized Anxiety Disorder and Major Depressive Disorder: An Event-Related Potential Study , 2016, Cognitive Therapy and Research.

[47]  José A. Castellanos,et al.  Probabilistic Performance Evaluation for Multiclass Classification Using the Posterior Balanced Accuracy , 2013, ROBOT.

[48]  Mark Chen,et al.  Language Models are Few-Shot Learners , 2020, NeurIPS.

[49]  Bernadette A. Thomas,et al.  Global and regional mortality from 235 causes of death for 20 age groups in 1990 and 2010: a systematic analysis for the Global Burden of Disease Study 2010 , 2012, The Lancet.

[50]  T. Sejnowski,et al.  Removing electroencephalographic artifacts by blind source separation. , 2000, Psychophysiology.

[51]  W. Tam,et al.  Prevalence of Depression in the Community from 30 Countries between 1994 and 2014 , 2018, Scientific Reports.

[52]  Clemens Brunner,et al.  Better than random? A closer look on BCI results , 2008 .

[53]  Bertram E. Shi,et al.  Convolutional Neural Network for Target Face Detection using Single-trial EEG Signal , 2018, 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[54]  Miguel P. Eckstein,et al.  Single-Trial Classification of Event-Related Potentials in Rapid Serial Visual Presentation Tasks Using Supervised Spatial Filtering , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[55]  T. Vos,et al.  Global burden of disease attributable to mental and substance use disorders: findings from the Global Burden of Disease Study 2010 , 2013, The Lancet.

[56]  R. Kotov,et al.  Depression and reduced neural response to emotional images: Distinction from anxiety, and importance of symptom dimensions and age of onset. , 2016, Journal of abnormal psychology.

[57]  Pierre Baldi,et al.  Autoencoders, Unsupervised Learning, and Deep Architectures , 2011, ICML Unsupervised and Transfer Learning.

[58]  R C Burgess,et al.  Event related potentials I , 1992, Clinical Neurophysiology.

[59]  Ryan C. N. D'Arcy,et al.  Detection of event-related potentials in individual subjects using support vector machines , 2014, Brain Informatics.

[60]  R. Kessler,et al.  Sex and depression in the National Comorbidity Survey. I: Lifetime prevalence, chronicity and recurrence. , 1993, Journal of affective disorders.

[61]  R. B. Reilly,et al.  FASTER: Fully Automated Statistical Thresholding for EEG artifact Rejection , 2010, Journal of Neuroscience Methods.

[62]  Martin Luessi,et al.  MNE software for processing MEG and EEG data , 2014, NeuroImage.