Progress in speech decoding from the electrocorticogram

Recent advances in neuroimaging methods have improved our ability to explore the neural processes underlying speech and language. As a result of these investigations, it is now possible to decode aspects of speech directly from neural activity, a step toward neuroprosthetic devices for individuals with severe neuromuscular and communication disorders. Much of what is known about the neural correlates of speech articulation and perception is based on lesion and cortical electrical stimulation studies, as well as modern non-invasive neuroimaging. Though extremely important to the current understanding of brain function, traditional neuroimaging methods are limited chiefly by their spatial and temporal resolution. Electrocorticography (ECoG), the measurement of electrical activity directly from the cortex, offers several advantages over other neuroimaging modalities for characterization and real-time decoding of brain activity. Specifically, ECoG is well suited to the study of speech and language because its combination of spatial and temporal resolution allows it to accurately capture the fast-changing dynamics of the large cortical networks underlying speech processing. This review presents the current progress of ECoG-based speech characterization and decoding studies, including an overview of prior neuroimaging studies, ECoG representations of speech production and perception, and a discussion of future directions.
