From big flow cytometry datasets to smart diagnostic strategies: The EuroFlow approach.

The rise in the analytical speed of mutiparameter flow cytometers made possible by the introduction of digital instruments, has brought up the possibility to manage progressively higher number of parameters simultaneously on significantly greater numbers of individual cells. This has led to an exponential increase in the complexity and volume of flow cytometry data generated about cells present in individual samples evaluated in a single measurement. This increase demands for new developments in flow cytometry data analysis, graphical representation, and visualization and interpretation tools to address the new big data challenges, i.e. processing data files of ≥10-25 parameters per cell in samples with >5-10 million cells (= up to 250 million data points per cell sample) obtained in a few minutes. Here, we present a comprehensive review of some of the tools developed by the EuroFlow consortium for processing flow cytometric big data files in diagnostic laboratories, particularly focused on automated EuroFlow approaches for: i) identification of all cell populations coexisting in a sample (automated gating); ii) smart classification of aberrant cell populations in routine diagnostics; iii) automated reporting; together with iv) new tools developed to visualize n-dimensional data in 2-dimensional plots to support expert-guided automated data analysis. The concept of using reference data bases implemented into software programs, in combination with multivariate statistical analysis pioneered by EuroFlow, provides an innovative, highly efficient and fast approach for diagnostic screening, classification and monitoring of patients with distinct hematological and immune disorders, as well as other diseases.

[1]  A Orfao,et al.  Next Generation Flow for highly sensitive and standardized detection of minimal residual disease in multiple myeloma , 2017, Leukemia.

[2]  M. van der Burg,et al.  Defects in memory B-cell and plasma cell subsets expressing different immunoglobulin-subclasses in patients with CVID and immunoglobulin subclass deficiencies. , 2019, The Journal of allergy and clinical immunology.

[3]  E S Costa,et al.  A probabilistic approach for the evaluation of minimal residual disease by multiparameter flow cytometry in leukemic B‐cell chronic lymphoproliferative disorders , 2008, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[4]  Sean C. Bendall,et al.  Extracting a Cellular Hierarchy from High-dimensional Cytometry Data with SPADE , 2011, Nature Biotechnology.

[5]  M. van der Burg,et al.  Age‐associated distribution of normal B‐cell and plasma cell subsets in peripheral blood , 2018, The Journal of allergy and clinical immunology.

[6]  Y. Heyden,et al.  Robust statistics in data analysis — A review: Basic concepts , 2007 .

[7]  Geoffrey E. Hinton,et al.  Neighbourhood Components Analysis , 2004, NIPS.

[8]  T. Kalina,et al.  Frequent issues and lessons learned from EuroFlow QA. , 2018, Journal of immunological methods.

[9]  E S Costa,et al.  Automated pattern-guided principal component analysis vs expert-based immunophenotypic classification of B-cell chronic lymphoproliferative disorders: a step forward in the standardization of clinical immunophenotyping , 2010, Leukemia.

[10]  Leonid Hrebien,et al.  A statistical pattern recognition approach for determining cellular viability and lineage phenotype in cultured cells and murine bone marrow , 2007, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[11]  M. Pérez‐Andrés,et al.  Distribution of subsets of blood monocytic cells throughout life. , 2019, The Journal of allergy and clinical immunology.

[12]  B. Becher,et al.  The end of gating? An introduction to automated analysis of high dimensional cytometry data , 2016, European journal of immunology.

[13]  B. Coiffier,et al.  Bortezomib plus rituximab alone in patients with relapsed, rituximab-naive or rituximab-sensitive, follicular lymphoma: a randomised phase 3 Trial. , 2011 .

[14]  V. Marx Biology: The big challenges of big data , 2013, Nature.

[15]  Claus Aranha,et al.  Optimized bi-dimensional data projection for clustering visualization , 2013, Inf. Sci..

[16]  M. Roederer,et al.  OMIP‐051 – 28‐color flow cytometry panel to characterize B cells and myeloid cells , 2018, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[17]  Ryan R Brinkman,et al.  Rapid cell population identification in flow cytometry data , 2011, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[18]  R G Sweet,et al.  Fluorescence activated cell sorting. , 1972, The Review of scientific instruments.

[19]  Wade T. Rogers,et al.  FlowFP: A Bioconductor Package for Fingerprinting Flow Cytometric Data , 2009, Adv. Bioinformatics.

[20]  N. Aghaeepour,et al.  Automated analysis of multidimensional flow cytometry data improves diagnostic accuracy between mantle cell lymphoma and small lymphocytic lymphoma. , 2012, American journal of clinical pathology.

[21]  Brent L Wood,et al.  Principles of minimal residual disease detection for hematopoietic neoplasms by flow cytometry , 2016, Cytometry. Part B, Clinical cytometry.

[22]  Peng Qiu,et al.  Computational prediction of manually gated rare cells in flow cytometry data , 2015, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[23]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[24]  Caroline Peltier,et al.  Comparison of Canonical Variate Analysis and Principal Component Analysis on 422 descriptive sensory studies , 2015 .

[25]  Greg Finak,et al.  flowDensity: reproducing manual gating of flow cytometry data by automated density-based cell population identification , 2015, Bioinform..

[26]  Quentin Lecrevisse,et al.  Standardized flow cytometry for highly sensitive MRD measurements in B-cell acute lymphoblastic leukemia. , 2017, Blood.

[27]  Elena Blanco,et al.  The EuroFlow PID Orientation Tube for Flow Cytometric Diagnostic Screening of Primary Immunodeficiencies of the Lymphoid System , 2019, Front. Immunol..

[28]  J Verde,et al.  Automated database-guided expert-supervised orientation for immunophenotypic diagnosis and classification of acute leukemia , 2017, Leukemia.

[29]  Mario Cazzola,et al.  The 2016 revision to the World Health Organization classification of myeloid neoplasms and acute leukemia. , 2016, Blood.

[30]  W. Bonner,et al.  Cell Sorting: Automated Separation of Mammalian Cells as a Function of Intracellular Fluorescence , 1969, Science.

[31]  T. Kalina,et al.  EuroFlow standardization of flow cytometer instrument settings and immunophenotyping protocols , 2012, Leukemia.

[32]  Giulia Freer,et al.  Intracellular cytokine detection by fluorescence-activated flow cytometry: basic principles and recent advances. , 2013, Methods.

[33]  P. Chattopadhyay,et al.  OMIP‐050: A 28‐color/30‐parameter Fluorescence Flow Cytometry Panel to Enumerate and Characterize Cells Expressing a Wide Array of Immune Checkpoint Molecules , 2018, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[34]  Greg Finak,et al.  OpenCyto: An Open Source Infrastructure for Scalable, Robust, Reproducible, and Automated, End-to-End Flow Cytometry Data Analysis , 2014, PLoS Comput. Biol..

[35]  S. Crowe,et al.  Lymphocyte subset analysis by Boolean algebra: a phenotypic approach using a cocktail of 5 antibodies and 3 color immunofluorescence. , 1994, Cytometry.

[36]  Peter Kiesel,et al.  Time encoded multicolor fluorescence detection in a microfluidic flow cytometer. , 2012, Lab on a chip.

[37]  Hsuan-Tien Lin,et al.  Learning From Data , 2012 .

[38]  Holden T Maecker,et al.  Algorithmic Tools for Mining High-Dimensional Cytometry Data , 2015, The Journal of Immunology.

[39]  D. Sutherland,et al.  The ISHAGE guidelines for CD34+ cell determination by flow cytometry. International Society of Hematotherapy and Graft Engineering. , 1996, Journal of hematotherapy.

[40]  Shuicheng Yan,et al.  Graph Embedding and Extensions: A General Framework for Dimensionality Reduction , 2007 .

[41]  A Orfao,et al.  EuroFlow antibody panels for standardized n-dimensional flow cytometric immunophenotyping of normal, reactive and malignant leukocytes , 2012, Leukemia.

[42]  Andrew W. Mead Review of the Development of Multidimensional Scaling Methods , 1992 .

[43]  J. P. McCoy,et al.  Standardizing Flow Cytometry Immunophenotyping Analysis from the Human ImmunoPhenotyping Consortium , 2016, Scientific Reports.

[44]  Joachim L Schultze,et al.  Teaching 'big data' analysis to young immunologists , 2015, Nature Immunology.

[45]  E S Costa,et al.  A new automated flow cytometry data analysis approach for the diagnostic screening of neoplastic B-cell disorders in peripheral blood samples with absolute lymphocytosis , 2006, Leukemia.

[46]  Mario Roederer,et al.  SPICE: Exploration and analysis of post‐cytometric complex multivariate datasets , 2011, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[47]  J. Paul Robinson,et al.  Computational analysis of high-throughput flow cytometry data , 2012, Expert opinion on drug discovery.

[48]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[49]  A. Órfão,et al.  Clinically useful information provided by the flow cytometric immunophenotyping of hematological malignancies: current status and future directions. , 1999, Clinical chemistry.

[50]  Jorge Cadima,et al.  Principal component analysis: a review and recent developments , 2016, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[51]  S. Ehl,et al.  The European Society for Immunodeficiencies (ESID) Registry Working Definitions for the Clinical Diagnosis of Inborn Errors of Immunity. , 2019, The journal of allergy and clinical immunology. In practice.

[52]  Greg Finak,et al.  Critical assessment of automated flow cytometry data analysis techniques , 2013, Nature Methods.

[53]  Raphael Gottardo,et al.  Automated gating of flow cytometry data via robust model‐based clustering , 2008, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[54]  Alberto Orfao,et al.  Overview of clinical flow cytometry data analysis: recent advances and future challenges. , 2013, Trends in biotechnology.

[55]  R de Groot,et al.  Immunophenotyping of blood lymphocytes in childhood. Reference values for lymphocyte subpopulations. , 1997, The Journal of pediatrics.

[56]  H. Goldschmidt,et al.  International Myeloma Working Group consensus criteria for response and minimal residual disease assessment in multiple myeloma. , 2016, The Lancet. Oncology.