Probability state modeling theory

As the technology of cytometry matures, there is mounting pressure to address two major issues with data analyses. The first issue is to develop new analysis methods for high‐dimensional data that can directly reveal and quantify important characteristics associated with complex cellular biology. The other issue is to replace subjective and inaccurate gating with automated methods that objectively define subpopulations and account for population overlap due to measurement uncertainty. Probability state modeling (PSM) is a technique that addresses both of these issues. The theory and important algorithms associated with PSM are presented along with simple examples and general strategies for autonomous analyses. PSM is leveraged to better understand B‐cell ontogeny in bone marrow in a companion Cytometry Part B manuscript. Three short relevant videos are available in the online supporting information for both of these papers. PSM avoids the dimensionality barrier normally associated with high‐dimensionality modeling by using broadened quantile functions instead of frequency functions to represent the modulation of cellular epitopes as cells differentiate. Since modeling programs ultimately minimize or maximize one or more objective functions, they are particularly amenable to automation and, therefore, represent a viable alternative to subjective and inaccurate gating approaches. © 2015 International Society for Advancement of Cytometry

[1]  C B Bagwell Data Analysis Through Modeling , 2001, Current protocols in cytometry.

[2]  B. Wood,et al.  Multicolor immunophenotyping: human immune system hematopoiesis. , 2004, Methods in cell biology.

[3]  V. Maino,et al.  Probability state modeling of memory CD8⁺ T-cell differentiation. , 2013, Journal of immunological methods.

[4]  Horst Bischof,et al.  3D parallel coordinate systems—A new data visualization method in the context of microscopy‐based multicolor tissue cytometry , 2006, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[5]  C. Bruce Bagwell,et al.  Probability State Modeling: A New Paradigm for Cytometric Analysis , 2010 .

[6]  Teresa H. Y. Meng,et al.  CytoSPADE: high-performance analysis and visualization of high-dimensional cytometry data , 2012, Bioinform..

[7]  Matthew D. Cooper,et al.  Revealing Structure in Visualizations of Dense 2D and 3D Parallel Coordinates , 2006, Inf. Vis..

[8]  F. A. Seiler,et al.  Numerical Recipes in C: The Art of Scientific Computing , 1989 .

[9]  Edward J. Wegman,et al.  High Dimensional Clustering Using Parallel Coordinates and the Grand Tour , 1997 .

[10]  Wade T Rogers,et al.  Cytometric fingerprinting: Quantitative characterization of multivariate distributions , 2008, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[11]  Raviv Raich,et al.  Analysis of clinical flow cytometric immunophenotyping data by clustering on statistical manifolds: Treating flow cytometry data as high‐dimensional objects , 2009, Cytometry. Part B, Clinical cytometry.

[12]  R. Murphy Automated identification of subpopulations in flow cytometric list mode data using cluster analysis. , 1985, Cytometry.

[13]  C B Bagwell,et al.  Fluorescence Spectral Overlap Compensation for Any Number of Flow Cytometry Parameters , 1993, Annals of the New York Academy of Sciences.

[14]  B H Davis,et al.  Automated quantitation of fetomaternal hemorrhage by flow cytometry for HbF‐containing fetal red blood cells using probability state modeling , 2013, International journal of laboratory hematology.

[15]  B. Wood,et al.  Human B‐cell and progenitor stages as determined by probability state modeling of multidimensional cytometry data , 2015, Cytometry. Part B, Clinical cytometry.

[16]  C Bruce Bagwell,et al.  Automated analysis of GPI‐deficient leukocyte flow cytometric data using GemStone™ , 2012, Cytometry. Part B, Clinical cytometry.

[17]  M Roederer,et al.  Probability binning comparison: a metric for quantitating multivariate distribution differences. , 2001, Cytometry.

[18]  Sean C. Bendall,et al.  Single-Cell Trajectory Detection Uncovers Progression and Regulatory Coordination in Human B Cell Development , 2014, Cell.

[19]  Daniel Haley,et al.  Polychromatic flow cytometry: A rapid method for the reduction and analysis of complex multiparameter data , 2006, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[20]  John Ferbas,et al.  Mixture modeling approach to flow cytometry data , 2008, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[21]  D. Herbert,et al.  Automated analysis of flow cytometric data for CD34+ stem cell enumeration using a probability state model , 2012, Cytometry. Part B, Clinical cytometry.

[22]  C Bruce Bagwell,et al.  Breaking the dimensionality barrier. , 2011, Methods in molecular biology.

[23]  W. Gilchrist,et al.  Statistical Modelling with Quantile Functions , 2000 .

[24]  C Bruce Bagwell,et al.  Hyperlog—A flexible log‐like transform for negative, zero, and positive valued data , 2005, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[25]  Sean C. Bendall,et al.  Extracting a Cellular Hierarchy from High-dimensional Cytometry Data with SPADE , 2011, Nature Biotechnology.

[26]  C. Bruce Bagwell,et al.  Automated analysis of flow cytometric data for measuring neutrophil CD64 expression using a multi‐instrument compatible probability state model , 2015, Cytometry. Part B, Clinical cytometry.