Sparse Data Analysis Strategy for Neural Spike Classification

Many of the multichannel extracellular recordings of neural activity consist of attempting to sort spikes on the basis of shared characteristics with some feature detection techniques. Then spikes can be sorted into distinct clusters. There are in general two main statistical issues: firstly, spike sorting can result in well-sorted units, but by with no means one can be sure that one is dealing with single units due to the number of neurons adjacent to the recording electrode. Secondly, the waveform dimensionality is reduced in a small subset of discriminating features. This shortening dimension effort was introduced as an aid to visualization and manual clustering, but also to reduce the computational complexity in automatic classification. We introduce a metric based on common neighbourhood to introduce sparsity in the dataset and separate data into more homogeneous subgroups. The approach is particularly well suited for clustering when the individual clusters are elongated (that is nonspherical). In addition it does need not to select the number of clusters, it is very efficient to visualize clusters in a dataset, it is robust to noise, it can handle imbalanced data, and it is fully automatic and deterministic.

[1]  Yi Zhou,et al.  Spike sorting based on automatic template reconstruction with a partial solution to the overlapping problem , 2004, Journal of Neuroscience Methods.

[2]  Gilles Laurent,et al.  Using noise signature to optimize spike-sorting and to assess neuronal classification quality , 2002, Journal of Neuroscience Methods.

[3]  R. Tibshirani,et al.  Discriminant Analysis by Gaussian Mixtures , 1996 .

[4]  Vincent Vigneron,et al.  Signal Subspace Separation Based on the Divergence Measure of a Set of Wavelets Coefficients , 2009, ICA.

[5]  Enrique H. Ruspini,et al.  Numerical methods for fuzzy clustering , 1970, Inf. Sci..

[6]  D. Nolan The excess-mass ellipsoid , 1991 .

[7]  Vincent Kanade,et al.  Clustering Algorithms , 2021, Wireless RF Energy Transfer in the Massive IoT Era.

[8]  David S. Johnson,et al.  Compressing Large Boolean Matrices using Reordering Techniques , 2004, VLDB.

[9]  J. Letelier,et al.  Spike sorting based on discrete wavelet transform coefficients , 2000, Journal of Neuroscience Methods.

[10]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[11]  Thomas Martinetz,et al.  Unsupervised spike sorting with ICA and its evaluation using GENESIS simulations , 2005, Neurocomputing.

[12]  T. Villmann,et al.  Mathematical Aspects of Divergence Based Vector Quantization Using Fr´ echet-Derivatives , 2009 .

[13]  George Zouridakis,et al.  Identification of reliable spike templates in multi-unit extracellular recordings using fuzzy clustering , 2000, Comput. Methods Programs Biomed..

[14]  Vladimir Batagelj,et al.  Generalized blockmodeling of two-mode network data , 2004, Soc. Networks.

[15]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[16]  W. Härdle Smoothing Techniques: With Implementation in S , 1991 .

[17]  Inderjit S. Dhillon,et al.  Information-theoretic co-clustering , 2003, KDD '03.

[18]  Phipps Arabie,et al.  The bond energy algorithm revisited , 1990, IEEE Trans. Syst. Man Cybern..

[19]  Christophe Pouzat,et al.  Improved spike-sorting by modeling firing statistics and burst-dependent spike amplitude attenuation: a Markov chain Monte Carlo approach. , 2004, Journal of neurophysiology.

[20]  M. Brusco,et al.  Heuristic Implementation of Dynamic Programming for Matrix Permutation Problems in Combinatorial Data Analysis , 2008 .

[21]  V. Batagelj Notes on blockmodeling , 1997 .

[22]  Guy A. Orban,et al.  Spike Recognition and On-Line Classification by Unsupervised Learning System , 1979, IEEE Transactions on Biomedical Engineering.

[23]  Yoshio Sakurai,et al.  Automatic sorting for multi-neuronal activity recorded with tetrodes in the presence of overlapping spikes. , 2003, Journal of neurophysiology.

[24]  Camille Brunet,et al.  Une famille de matrices sparses pour une modélisation multi-échelle par blocs , 2011, Fouille de données complexes.

[25]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[26]  Phipps Arabie,et al.  AN OVERVIEW OF COMBINATORIAL DATA ANALYSIS , 1996 .

[27]  Weixiong Zhang,et al.  Rearrangement Clustering: Pitfalls, Remedies, and Applications , 2006, J. Mach. Learn. Res..

[28]  M. Brusco,et al.  Inducing a blockmodel structure of two-mode binary data using seriation procedures , 2006 .

[29]  Partha P. Mitra,et al.  Automatic sorting of multiple unit neuronal signals in the presence of anisotropic and non-Gaussian variability , 1996, Journal of Neuroscience Methods.

[30]  G. Buzsáki,et al.  Characterization of neocortical principal cells and interneurons by network interactions and extracellular features. , 2004, Journal of neurophysiology.

[31]  Hakan Ferhatosmanoglu,et al.  Analysis of Basic Data Reordering Techniques , 2008, SSDBM.

[32]  A. Bowman,et al.  A look at some data on the old faithful geyser , 1990 .

[33]  B.C. Wheeler,et al.  Real-time multichannel neural spike recognition with DSPs , 1990, IEEE Engineering in Medicine and Biology Magazine.

[34]  Gilles Caraux,et al.  PermutMatrix: a graphical environment to arrange gene expression profiles in optimal linear order , 2005, Bioinform..

[35]  A. Raftery,et al.  Variable Selection for Model-Based Clustering , 2006 .

[36]  A.F. Atiya,et al.  Recognition of multiunit neural signals , 1992, IEEE Transactions on Biomedical Engineering.

[37]  Phipps Arabie,et al.  Combinatorial Data Analysis: Optimization by Dynamic Programming , 1987 .

[38]  Geoffrey J. McLachlan,et al.  Mixture models : inference and applications to clustering , 1989 .

[39]  Partha P. Mitra,et al.  Erratum: Automatic sorting of multiple unit neuronal signals in the presence of anisotropic and non-Gaussian variability (J. Neurosci. Methods 69 (1996) (175-188)) (PII: S0165020796000507) , 1997 .

[40]  Michael C. Hout,et al.  Multidimensional Scaling , 2003, Encyclopedic Dictionary of Archaeology.

[41]  Michael Hahsler,et al.  Getting Things in Order: An Introduction to the R Package seriation , 2008 .

[42]  Paul J. Schweitzer,et al.  Problem Decomposition and Data Reorganization by a Clustering Technique , 1972, Oper. Res..

[43]  M S Lewicki,et al.  A review of methods for spike sorting: the detection and classification of neural action potentials. , 1998, Network.

[44]  Frank E. Hanson,et al.  An artificial neural network for neural spike classification , 1997, Proceedings of the IEEE 23rd Northeast Bioengineering Conference.

[45]  You-Yin Chen,et al.  Decomposition of EEG Signals for Multichannel Neural Activity Analysis in Animal Experiments , 2010, LVA/ICA.

[46]  Christopher M. Bishop,et al.  Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[47]  Hans-Hermann Bock,et al.  Two-mode clustering methods: astructuredoverview , 2004, Statistical methods in medical research.

[48]  Stefan Niermann Optimizing the Ordering of Tables With Evolutionary Computation , 2005 .

[49]  Chun-Houh Chen GENERALIZED ASSOCIATION PLOTS: INFORMATION VISUALIZATION VIA ITERATIVELY GENERATED CORRELATION MATRICES , 2002 .