FlowAnd: Comprehensive Computational Framework for Flow Cytometry Data Analysis

Flow cytometry is a widely used high-throughput measurement technology in basic research and diagnostics. The amount of data generated by flow cytometry experiments has recently been increasing, both in the number of samples and in the number of parameters measured per cell. These highly multivariate datasets have become too large for tools that depend mainly on manual analysis. We have implemented a computational framework, FlowAnd, designed to analyze and integrate large-scale, multi-color flow cytometry data. The tool implements methods for data import, various transformations, several clustering algorithms for automatic gating, visualization tools, and straightforward statistical testing. We applied FlowAnd to a phosphoproteomics dataset from 37 chronic myeloid leukemia patients treated with two kinase inhibitors. Our results indicate high concordance between manual gating and automated gating with three clustering algorithms. Analysis of more than 70 flow cytometry experiments demonstrates the utility of FlowAnd features, such as a graphical tool for rapid validation of clustering results, in large-scale flow cytometry data analysis. The FlowAnd framework allows accurate, fast and well-documented analysis of multidimensional flow cytometry experiments. It provides several clustering algorithms for automatic gating and the possibility to add novel tools written in various programming languages, such as Java, R, Python or MATLAB, in an environment amenable to high-performance computing. FlowAnd can also be easily modified to accommodate various marker panels and parameter settings. FlowAnd, all data and the user guide are freely available under the GNU General Public License at http://csbi.ltdk.helsinki.fi/flowand.
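To make the idea of concordance between automated and manual gating concrete, the following minimal Python sketch may help. It is not FlowAnd code and does not reproduce any of the three clustering algorithms used in the study. It assumes synthetic single-channel intensities, an arcsinh transformation with a hypothetical cofactor of 150, a hypothetical manual gate at a raw intensity of 1000, and a simple one-dimensional two-means clustering as a stand-in for automatic gating.

```python
import math
import random


def arcsinh_transform(values, cofactor=150.0):
    """Apply an arcsinh transformation; the cofactor is an assumed value."""
    return [math.asinh(v / cofactor) for v in values]


def two_means_1d(values, iterations=50):
    """Stand-in automatic gate: two-means clustering on one channel.

    Returns a label per event, 0 for the dimmer cluster and 1 for the brighter.
    """
    lo, hi = min(values), max(values)  # initialise centres at the extremes
    labels = [0] * len(values)
    for _ in range(iterations):
        labels = [1 if abs(v - hi) < abs(v - lo) else 0 for v in values]
        low = [v for v, lab in zip(values, labels) if lab == 0]
        high = [v for v, lab in zip(values, labels) if lab == 1]
        if not low or not high:
            break  # degenerate case: only one cluster is populated
        new_lo, new_hi = sum(low) / len(low), sum(high) / len(high)
        if new_lo == lo and new_hi == hi:
            break  # centres stopped moving
        lo, hi = new_lo, new_hi
    return labels


def concordance(labels_a, labels_b):
    """Fraction of events that two gatings assign to the same population."""
    return sum(a == b for a, b in zip(labels_a, labels_b)) / len(labels_a)


# Synthetic events (assumption): a dim and a bright population.
rng = random.Random(0)
raw = [rng.gauss(100, 50) for _ in range(500)]
raw += [rng.gauss(5000, 1500) for _ in range(500)]

transformed = arcsinh_transform(raw)

# Hypothetical manual gate: events above raw intensity 1000 are positive.
manual_threshold = math.asinh(1000 / 150.0)
manual = [1 if v > manual_threshold else 0 for v in transformed]

auto = two_means_1d(transformed)
agreement = concordance(manual, auto)
print(f"Fraction of events gated identically: {agreement:.3f}")
```

In a real analysis the events would come from FCS files, the clustering would run on several markers at once, and agreement would typically be summarized per population and per sample rather than as a single fraction.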
