Automatic Clustering of Flow Cytometry Data with Density-Based Merging

Flow cytometry allows fast single-cell interrogation of large numbers of cells, which has made the technology ubiquitous and indispensable in clinical and laboratory settings. A current limit on its potential is the lack of automated tools for analyzing the resulting data. We describe methodology and software to automatically identify cell populations in flow cytometry data. Our approach advances from the paradigm of manually gating sequential two-dimensional projections of the data to a procedure that automatically produces gates based on statistical theory. It is nonparametric and can reproduce nonconvex subpopulations that are known to occur in flow cytometry samples but cannot be produced by current parametric model-based approaches. We illustrate the methodology with a sample of mouse spleen and peritoneal cavity cells.
