Unsupervised Decoding of Long-Term, Naturalistic Human Neural Recordings with Automated Video and Audio Annotations

Fully automated decoding of human activities and intentions from direct neural recordings is a tantalizing challenge in brain-computer interfacing. Deploying brain-computer interfaces (BCIs) outside carefully controlled laboratory experiments requires adaptive and scalable strategies that need minimal supervision. Here we describe an unsupervised approach to decoding neural states from naturalistic human brain recordings. We analyzed continuous, long-term electrocorticography (ECoG) data recorded over many days from the brains of subjects in a hospital room, with simultaneous audio and video recordings. We discovered coherent clusters in the high-dimensional ECoG recordings using hierarchical clustering and automatically annotated them using speech and movement labels extracted from the audio and video. To our knowledge, this is the first time techniques from computer vision and speech processing have been used for natural ECoG decoding. Interpretable behaviors, including moving, speaking, and resting, were decoded from the ECoG data, and the results were assessed by comparison with manual annotation. Discovered clusters were projected back onto the brain, revealing features consistent with known functional areas and opening the door to automated functional brain mapping in natural settings.
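The cluster-then-annotate idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' pipeline: the average-linkage agglomerative clustering, the majority-vote annotation rule, the toy two-dimensional "features", and the function names `agglomerative` and `annotate` are all assumptions chosen for illustration. The per-window labels stand in for labels that would be extracted automatically from audio and video.

```python
import math
from collections import Counter
from itertools import combinations


def agglomerative(points, n_clusters):
    """Average-linkage agglomerative clustering (illustrative only).

    Returns one cluster id per input point.
    """
    clusters = [[i] for i in range(len(points))]

    def linkage(a, b):
        # Mean pairwise Euclidean distance between two clusters.
        total = sum(math.dist(points[i], points[j]) for i in a for j in b)
        return total / (len(a) * len(b))

    while len(clusters) > n_clusters:
        # Find the closest pair of clusters and merge them.
        a, b = min(
            combinations(range(len(clusters)), 2),
            key=lambda pair: linkage(clusters[pair[0]], clusters[pair[1]]),
        )
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]  # safe: combinations guarantees a < b

    assignment = [0] * len(points)
    for cluster_id, members in enumerate(clusters):
        for i in members:
            assignment[i] = cluster_id
    return assignment


def annotate(assignment, labels):
    """Name each cluster after the most common label among its members."""
    votes = {}
    for cluster_id, label in zip(assignment, labels):
        votes.setdefault(cluster_id, Counter())[label] += 1
    return {cid: counts.most_common(1)[0][0] for cid, counts in votes.items()}


# Toy stand-ins for per-window neural features (hypothetical values).
features = [(0.1, 0.0), (0.0, 0.2), (5.0, 0.1), (5.2, 0.0), (0.1, 5.0), (0.0, 5.3)]
# Toy stand-ins for labels derived from audio/video (hypothetical values).
labels = ["rest", "rest", "move", "move", "speak", "speak"]

assignment = agglomerative(features, n_clusters=3)
names = annotate(assignment, labels)
decoded = [names[c] for c in assignment]
print(decoded)
```

The clustering step never sees the labels; they are used only afterwards to name the clusters, which is what keeps the decoding itself unsupervised in this sketch.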
