Optimal selection of electrocorticographic sensors for voice activity detection

An effective speech brain machine interface requires selecting the best cortical recording sites and signal features for decoding speech production, but also minimal clinical risk for the patient. Motivated by this need to reduce patient risk, the purpose of this study is to detect voice activity (speech onset and offset) automatically from spatial-spectral features of electrocorticographic signals using the optimal number of sensors (minimal invasiveness). ECoG signals were recorded while a subject performed two different syllable repetition tasks. We found that the optimal frequency resolution for detecting voice activity is 8 Hz using 31 sensors out of 55, achieving 98.2% accuracy by employing support vector machines (SVM) as a classifier, and that acceptable accuracy of 96.7% was achieved using 15 sensors, which would permit a less invasive surgery for the placement of electrodes. The proposed voice activity detector may be utilized as a part of an ECoG-based automated natural speech BMI system.

[1]  H. Flor,et al.  A spelling device for the paralysed , 1999, Nature.

[2]  J. Wolpaw,et al.  Decoding two-dimensional movement trajectories using electrocorticographic signals in humans , 2007, Journal of neural engineering.

[3]  Rabab K Ward,et al.  A survey of signal processing algorithms in brain–computer interfaces based on electrical brain signals , 2007, Journal of neural engineering.

[4]  Igor Kononenko,et al.  Estimating Attributes: Analysis and Extensions of RELIEF , 1994, ECML.

[5]  E. W. Sellers,et al.  Toward enhanced P300 speller performance , 2008, Journal of Neuroscience Methods.

[6]  Bradley Greger,et al.  Decoding spoken words using local field potentials recorded from the cortical surface , 2010, Journal of neural engineering.

[7]  Brian N. Pasley,et al.  Reconstructing Speech from Human Auditory Cortex , 2012, PLoS biology.

[8]  D.J. McFarland,et al.  The wadsworth BCI research and development program: at home with BCI , 2006, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[9]  V. Gilja,et al.  Signal Processing Challenges for Neural Prostheses , 2008, IEEE Signal Processing Magazine.

[10]  Paul Boersma,et al.  Praat, a system for doing phonetics by computer , 2002 .

[11]  Makoto Sato,et al.  Single-trial classification of vowel speech imagery using common spatial patterns , 2009, Neural Networks.

[12]  Anastasios Bezerianos,et al.  Joint Spatial-Spectral Feature Space Clustering for Speech Activity Detection from ECoG Signals , 2014, IEEE Transactions on Biomedical Engineering.

[13]  G. Schalk,et al.  Decoding vowels and consonants in spoken and imagined words using electrocorticographic signals in humans , 2011, Journal of neural engineering.

[14]  E Donchin,et al.  The mental prosthesis: assessing the speed of a P300-based brain-computer interface. , 2000, IEEE transactions on rehabilitation engineering : a publication of the IEEE Engineering in Medicine and Biology Society.

[15]  N. Birbaumer,et al.  The thought-translation device (TTD): neurobehavioral mechanisms and clinical outcome , 2003, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[16]  D GOLDMAN,et al.  The clinical use of the "average" reference electrode in monopolar recording. , 1950, Electroencephalography and clinical neurophysiology.

[17]  S. Acharya,et al.  Connectivity Analysis as a Novel Approach to Motor Decoding for Prosthesis Control , 2012, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[18]  Christa Neuper,et al.  An asynchronously controlled EEG-based virtual keyboard: improvement of the spelling rate , 2004, IEEE Transactions on Biomedical Engineering.

[19]  F. Guenther,et al.  A Wireless Brain-Machine Interface for Real-Time Speech Synthesis , 2009, PloS one.

[20]  S. Acharya,et al.  Toward Electrocorticographic Control of a Dexterous Upper Limb Prosthesis: Building Brain-Machine Interfaces , 2012, IEEE Pulse.

[21]  Wei Wu,et al.  Spoken sentences decoding based on intracranial high gamma response using dynamic time warping , 2012, 2012 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[22]  Ciprian M. Crainiceanu,et al.  Dynamics of large-scale cortical interactions at high gamma frequencies during word production: Event related causality (ERC) analysis of human electrocorticography (ECoG) , 2011, NeuroImage.

[23]  J. Wolpaw,et al.  Decoding flexion of individual fingers using electrocorticographic signals in humans , 2009, Journal of neural engineering.

[24]  J. Selhorst,et al.  "Locked-in" syndrome. , 1987, Stroke.

[25]  X. Zeng,et al.  Geometric strategies for neuroanatomic analysis from MRI , 2004, NeuroImage.