Compatibility Evaluation of Clustering Algorithms for Contemporary Extracellular Neural Spike Sorting

Deciphering useful information from electrophysiological data recorded from the brain, in-vivo or in-vitro, is dependent on the capability to analyse spike patterns efficiently and accurately. The spike analysis mechanisms are heavily reliant on the clustering algorithms that enable separation of spike trends based on their spatio-temporal behaviors. Literature review report several clustering algorithms over decades focused on different applications. Although spike analysis algorithms employ only a small subset of clustering algorithms, however, not much work has been reported on the compliance and suitability of such clustering algorithms for spike analysis. In our study, we have attempted to comment on the suitability of available clustering algorithms and performance capacity when exposed to spike analysis. In this regard, the study reports a compatibility evaluation on algorithms previously employed in spike sorting as well as the algorithms yet to be investigated for application in sorting neural spikes. The performance of the algorithms is compared in terms of their accuracy, confusion matrix and accepted validation indices. Three data sets comprising of easy, difficult, and real spike similarity with known ground-truth are chosen for assessment, ensuring a uniform testbed. The procedure also employs two feature-sets, principal component analysis and wavelets. The report also presents a statistical score scheme to evaluate the performance individually and overall. The open nature of the data sets, the clustering algorithms and the evaluation criteria make the proposed evaluation framework widely accessible to the research community. We believe that the study presents a reference guide for emerging neuroscientists to select the most suitable algorithms for their spike analysis requirements.

[1]  M. Salganicoff,et al.  Unsupervised waveform classification for multi-neuron recordings: a real-time, software-based system. I. Algorithms and implementation , 1988, Journal of Neuroscience Methods.

[2]  Hojjat Adeli,et al.  MUSIC-Expected maximization gaussian mixture methodology for clustering and detection of task-related neuronal firing rates , 2017, Behavioural Brain Research.

[3]  F. Öhberg,et al.  A neural network approach to real-time spike discrimination during simultaneous recording from several multi-unit nerve filaments , 1996, Journal of Neuroscience Methods.

[4]  Bruce L. McNaughton,et al.  The stereotrode: A new technique for simultaneous isolation of several single units in the central nervous system from multiple unit records , 1983, Journal of Neuroscience Methods.

[5]  S. Nahavandi,et al.  Neurotropism and behavioral changes associated with Zika infection in the vector Aedes aegypti , 2018, Emerging Microbes & Infections.

[6]  Richard A. Andersen,et al.  On the Separation of Signals from Neighboring Cells in Tetrode Recordings , 1997, NIPS.

[7]  Agnès Gruart,et al.  Spike sorting based on shape, phase, and distribution features, and K-TOPS clustering with validity and error indices , 2018, Scientific Reports.

[8]  Saeid Nahavandi,et al.  Hierarchical estimation of neural activity through explicit identification of temporally synchronous spikes , 2017, Neurocomputing.

[9]  Doheon Lee,et al.  Evaluation of the performance of clustering algorithms in kernel-induced feature space , 2005, Pattern Recognit..

[10]  Florian Mormann,et al.  Reliable Analysis of Single-Unit Recordings from the Human Brain under Noisy Conditions: Tracking Neurons over Hours , 2016, PloS one.

[11]  Tomoki Fukai,et al.  A novel view of the variational Bayesian clustering , 2009, Neurocomputing.

[12]  Cyrille Rossant,et al.  Spike sorting for large, dense electrode arrays , 2015 .

[13]  S. Leib,et al.  Embryonic Stem Cell-Derived Neurons Grown on Multi-Electrode Arrays as a Novel In vitro Bioassay for the Detection of Clostridium botulinum Neurotoxins , 2017, Front. Pharmacol..

[14]  Young-Rae Cho,et al.  Survey: Enhancing protein complex prediction in PPI networks with GO similarity weighting , 2013, Interdisciplinary Sciences: Computational Life Sciences.

[15]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[16]  Sudipto Guha,et al.  CURE: an efficient clustering algorithm for large databases , 1998, SIGMOD '98.

[17]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[18]  Andrei Novikov,et al.  PyClustering: Data Mining Library , 2019, J. Open Source Softw..

[19]  Raktim Sinha,et al.  MeV: MultiExperiment Viewer , 2010 .

[20]  Rodrigo Quian Quiroga,et al.  Past, present and future of spike sorting techniques , 2015, Brain Research Bulletin.

[21]  Liam Paninski,et al.  Kalman Filter Mixture Model for Spike Sorting of Non-stationary Data , 2010 .

[22]  R. Segev,et al.  A method for spike sorting and detection based on wavelet packets and Shannon's mutual information , 2002, Journal of Neuroscience Methods.

[23]  Yee Whye Teh,et al.  Dependent Dirichlet Process Spike Sorting , 2008, NIPS.

[24]  He Cui,et al.  Spike-timing pattern operates as gamma-distribution across cell types, regions and animal species and is essential for naturally-occurring cognitive states , 2018 .

[25]  D. Barbour,et al.  In Vitro Assay for the Detection of Network Connectivity in Embryonic Stem Cell-Derived Cultures , 2018, bioRxiv.

[26]  Kenneth D. Harris,et al.  High-Dimensional Cluster Analysis with the Masked EM Algorithm , 2013, Neural Computation.

[27]  Kenneth D Harris,et al.  Improving data quality in neuronal population recordings , 2016, Nature Neuroscience.

[28]  Eero P. Simoncelli,et al.  Journal of Neuroscience Methods , 2022 .

[29]  C. Koch,et al.  On the origin of the extracellular action potential waveform: A modeling study. , 2006, Journal of neurophysiology.

[30]  Eyke Hüllermeier,et al.  Comparing Fuzzy Partitions: A Generalization of the Rand Index and Related Measures , 2012, IEEE Transactions on Fuzzy Systems.

[31]  A Vassault,et al.  [Examination procedures]. , 2010, Annales de biologie clinique.

[32]  Partha P. Mitra,et al.  Automatic sorting of multiple unit neuronal signals in the presence of anisotropic and non-Gaussian variability , 1996, Journal of Neuroscience Methods.

[33]  Min Wu,et al.  A core-attachment based method to detect protein complexes in PPI networks , 2009, BMC Bioinformatics.

[34]  R. Quian Quiroga,et al.  Unsupervised Spike Detection and Sorting with Wavelets and Superparamagnetic Clustering , 2004, Neural Computation.

[35]  A. Luczak,et al.  Recording Large-scale Neuronal Ensembles with Silicon Probes in the Anesthetized Rat , 2011, Journal of visualized experiments : JoVE.

[36]  D. Hubel Tungsten Microelectrode for Recording from Single Units. , 1957, Science.

[37]  Matteo Carandini,et al.  Kilosort: realtime spike-sorting for extracellular electrophysiology with hundreds of channels , 2016, bioRxiv.

[38]  Tian Zhang,et al.  BIRCH: an efficient data clustering method for very large databases , 1996, SIGMOD '96.

[39]  R. Sharan,et al.  CLICK: a clustering algorithm with applications to gene expression analysis. , 2000, Proceedings. International Conference on Intelligent Systems for Molecular Biology.

[40]  George Zouridakis,et al.  Identification of reliable spike templates in multi-unit extracellular recordings using fuzzy clustering , 2000, Comput. Methods Programs Biomed..

[41]  Anil K. Jain,et al.  Simultaneous feature selection and clustering using mixture models , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[42]  Taejeong Kim,et al.  A new action potential detector using the MTEO and its effects on spike sorting systems at low signal-to-noise ratios , 2006, IEEE Transactions on Biomedical Engineering.

[43]  Vijay Balasubramanian,et al.  Fast, Scalable, Bayesian Spike Identification for Multi-Electrode Arrays , 2011 .

[44]  Saeid Nahavandi,et al.  A review on cluster estimation methods and their application to neural spike data , 2018, Journal of neural engineering.

[45]  Donald W. Bouldin,et al.  A Cluster Separation Measure , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[46]  Shigehiko Kanaya,et al.  Development and implementation of an algorithm for detection of protein complexes in large interaction networks , 2006, BMC Bioinformatics.

[47]  T. Blanche,et al.  Polytrodes: high-density silicon electrode arrays for large-scale multiunit recording. , 2005, Journal of neurophysiology.

[48]  Jeremy F. Magland,et al.  A Fully Automated Approach to Spike Sorting , 2017, Neuron.

[49]  T. Vicsek,et al.  Uncovering the overlapping community structure of complex networks in nature and society , 2005, Nature.

[50]  Sudipto Guha,et al.  ROCK: A Robust Clustering Algorithm for Categorical Attributes , 2000, Inf. Syst..

[51]  Andreas Hierlemann,et al.  Combination of High-density Microelectrode Array and Patch Clamp Recordings to Enable Studies of Multisynaptic Integration , 2017, Scientific Reports.

[52]  Vipin Kumar,et al.  Chameleon: Hierarchical Clustering Using Dynamic Modeling , 1999, Computer.

[53]  Chee Peng Lim,et al.  Statistical Modelling of Artificial Neural Network for Sorting Temporally Synchronous Spikes , 2015, ICONIP.

[54]  Ron Shamir,et al.  EXPANDER – an integrative program suite for microarray data analysis , 2005, BMC Bioinformatics.

[55]  Michael S. Lewicki,et al.  Bayesian Modeling and Classification of Neural Signals , 1993, Neural Computation.

[56]  Young-Rae Cho,et al.  Detecting protein complexes and functional modules from protein interaction networks: A graph entropy approach , 2011 .

[57]  Hagai Attias,et al.  Inferring Parameters and Structure of Latent Variable Models by Variational Bayes , 1999, UAI.

[58]  Larry D. Hostetler,et al.  The estimation of the gradient of a density function, with applications in pattern recognition , 1975, IEEE Trans. Inf. Theory.

[59]  Tomoki Fukai,et al.  Accurate spike sorting for multi‐unit recordings , 2010, The European journal of neuroscience.

[60]  M S Lewicki,et al.  A review of methods for spike sorting: the detection and classification of neural action potentials. , 1998, Network.

[61]  Tomoki Fukai,et al.  Spike detection from noisy neural data in linear‐probe recordings , 2014, The European journal of neuroscience.

[62]  A. Bhatti,et al.  Mechanism of docosahexaenoic acid in the enhancement of neuronal signalling , 2017 .

[63]  Hans-Peter Kriegel,et al.  OPTICS: ordering points to identify the clustering structure , 1999, SIGMOD '99.

[64]  Guosong Hong,et al.  Novel electrode technologies for neural recordings , 2019, Nature Reviews Neuroscience.

[65]  Roded Sharan,et al.  Center CLICK: A Clustering Algorithm with Applications to Gene Expression Analysis , 2000, ISMB.

[66]  Blatt,et al.  Superparamagnetic clustering of data. , 1998, Physical review letters.

[67]  Athanassios G. Siapas,et al.  Model-based spike sorting with a mixture of drifting t-distributions , 2017, Journal of Neuroscience Methods.

[68]  Sudipto Guha,et al.  ROCK: a robust clustering algorithm for categorical attributes , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[69]  Gary Look,et al.  Alzheimer's Therapeutics Targeting Amyloid Beta 1–42 Oligomers II: Sigma-2/PGRMC1 Receptors Mediate Abeta 42 Oligomer Binding and Synaptotoxicity , 2014, PloS one.

[70]  Adriano B L Tort,et al.  Spike sorting with Gaussian mixture models , 2018, Scientific Reports.

[71]  Shy Shoham,et al.  Robust, automatic spike sorting using mixtures of multivariate t-distributions , 2003, Journal of Neuroscience Methods.

[72]  Markus Meister,et al.  Multi-neuronal signals from the retina: acquisition and analysis , 1994, Journal of Neuroscience Methods.

[73]  Rodrigo Quian Quiroga,et al.  How many neurons can we see with current spike sorting algorithms? , 2012, Journal of Neuroscience Methods.

[74]  Eero P. Simoncelli,et al.  A Model-Based Spike Sorting Algorithm for Removing Correlation Artifacts in Multi-Neuron Recordings , 2013, PloS one.

[75]  Jason S. Prentice,et al.  Fast, Scalable, Bayesian Spike Identification for Multi-Electrode Arrays , 2010, PloS one.

[76]  Yucheng Dong,et al.  A Unified Framework , 2018, Linguistic Decision Making.

[77]  Gert Cauwenberghs,et al.  Independent Component Analysis for Fully Automated Multi-Electrode Array Spike Sorting , 2018, 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[78]  Cesar H. Comin,et al.  A Systematic Comparison of Supervised Classifiers , 2013, PloS one.

[79]  Timothy J. Blanche,et al.  Python for Large-Scale Electrophysiology , 2009, Front. Neuroinform..

[80]  B. McNaughton,et al.  Tetrodes markedly improve the reliability and yield of multiple single-unit isolation from multi-unit recordings in cat striate cortex , 1995, Journal of Neuroscience Methods.

[81]  J. Csicsvari,et al.  Accuracy of tetrode spike separation as determined by simultaneous intracellular and extracellular measurements. , 2000, Journal of neurophysiology.

[82]  Jean-Pierre Valentin,et al.  Predicting cardiac safety using human induced pluripotent stem cell-derived cardiomyocytes combined with multi-electrode array (MEA) technology: A conference report. , 2018, Journal of pharmacological and toxicological methods.

[83]  André Hardy,et al.  An examination of procedures for determining the number of clusters in a data set , 1994 .

[84]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.

[85]  R. Carter 11 – IT and society , 1991 .

[86]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[87]  Mark A. Girolami,et al.  Mercer kernel-based clustering in feature space , 2002, IEEE Trans. Neural Networks.

[88]  Gary D. Bader,et al.  An automated method for finding molecular complexes in large protein interaction networks , 2003, BMC Bioinformatics.

[89]  Ron Shamir,et al.  Clustering Gene Expression Patterns , 1999, J. Comput. Biol..

[90]  Geoffrey H. Ball,et al.  ISODATA, A NOVEL METHOD OF DATA ANALYSIS AND PATTERN CLASSIFICATION , 1965 .

[91]  Yingjie Tian,et al.  A Comprehensive Survey of Clustering Algorithms , 2015, Annals of Data Science.

[92]  Daniel Novak,et al.  Performance comparison of extracellular spike sorting algorithms for single-channel recordings , 2012, Journal of Neuroscience Methods.

[93]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[94]  Kwong-Sak Leung,et al.  Scalable model-based clustering for large databases based on data summarization , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[95]  Kunal J. Paralikar,et al.  New approaches to eliminating common-noise artifacts in recordings from intracortical microelectrode arrays: Inter-electrode correlation and virtual referencing , 2009, Journal of Neuroscience Methods.

[96]  Nicholas V. Swindale,et al.  Spike sorting for polytrodes: a divide and conquer approach , 2014, Front. Syst. Neurosci..

[97]  Alberto Diaspro,et al.  Brain Function: Novel Technologies Driving Novel Understanding , 2014 .

[98]  Gang Chen,et al.  Modifying the DPClus algorithm for identifying protein complexes based on new topological structures , 2008, BMC Bioinformatics.

[99]  David B. Dunson,et al.  Multichannel Electrophysiological Spike Sorting via Joint Dictionary Learning and Mixture Modeling , 2013, IEEE Transactions on Biomedical Engineering.

[100]  S. Nahavandi,et al.  Zika virus-induced hyper excitation precedes death of mouse primary neuron , 2018, Virology Journal.

[101]  M. Abdelguerfi,et al.  Introduction 1.2 Parallel Database Systems 1.2.1 Computation Model 2 1.2 Parallel Database Systems Introduction Select * from Employee, Department Where (employee.dept_no @bullet Department.dept_no) and (employee.position = "manager") (a) Sql Request 1.2.2 Engineering Model , 2022 .

[102]  C. P. Lim,et al.  Unified selective sorting approach to analyse multi-electrode extracellular data , 2016, Scientific Reports.