Feasibility of Identity Vectors for use as subject verification and cohort retrieval of electroencephalograms

The success of Identity Vectors in speech recognition as a tool for subject verification, language detection, word recognition, and accent/dialect classification suggests the technique is a robust method of unsupervised learning on high dimensional data, such as electroencephalograms. Tests run on the PhysioNet EEG Motor Movement/Imagery corpus concerning the matching of subject specific trials showed an average verification of 99% for the 109 subject-trial tests. Further tests on the ability to cluster repeated subject-trials produced at least one matching subject-trial for 60% of the subjects. The driving component of the Identity Vector process is the creation of Universal Background Models derived from single dimension Gaussian mixtures of user defined sizes operating on cepstrum feature coefficients. Taken as a whole, the results of this work indicate that Identity Vectors can be effective at distinguishing between subjects and show promise when asked to generate cohorts of related data.

[1]  Larry P. Heck,et al.  MSR Identity Toolbox v1.0: A MATLAB Toolbox for Speaker Recognition Research , 2013 .

[2]  Jeffrey M. Hausdorff,et al.  Physionet: Components of a New Research Resource for Complex Physiologic Signals". Circu-lation Vol , 2000 .

[3]  I. Obeid,et al.  The TUH EEG CORPUS: A big data resource for automated EEG interpretation , 2014, 2014 IEEE Signal Processing in Medicine and Biology Symposium (SPMB).

[4]  Douglas A. Reynolds,et al.  The NIST 2014 Speaker Recognition i-vector Machine Learning Challenge , 2014, Odyssey.

[5]  R. Barry,et al.  EEG differences between eyes-closed and eyes-open resting conditions , 2007, Clinical Neurophysiology.

[6]  J. C. Sackellares,et al.  Inter-rater agreement on identification of electrographic seizures and periodic discharges in ICU EEG recordings , 2015, Clinical Neurophysiology.

[7]  Jesús B. Alonso,et al.  Electroencephalogram subject identification: A review , 2014, Expert Syst. Appl..

[8]  Robert J. Schalkoff,et al.  Standardized database development for EEG epileptiform transient detection: EEGnet scoring system and machine learning analysis , 2013, Journal of Neuroscience Methods.

[9]  Patrick Kenny,et al.  Front-End Factor Analysis for Speaker Verification , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[10]  Joseph Picone,et al.  Improved EEG event classification using differential energy , 2015, 2015 IEEE Signal Processing in Medicine and Biology Symposium (SPMB).

[11]  Patrick Kenny,et al.  Joint Factor Analysis Versus Eigenchannels in Speaker Recognition , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[12]  Douglas A. Reynolds,et al.  Universal Background Models , 2009, Encyclopedia of Biometrics.

[13]  Patrick Kenny,et al.  Eigenvoice modeling with sparse training data , 2005, IEEE Transactions on Speech and Audio Processing.

[14]  Patrick Kenny,et al.  A Study of Interspeaker Variability in Speaker Verification , 2008, IEEE Transactions on Audio, Speech, and Language Processing.