Agile workflow for interactive analysis of mass cytometry data

Motivation Single-cell proteomics technologies, such as mass cytometry, have enabled characterization of cell-to-cell variation and cell populations at a single cell resolution. These large amounts of data, however, require dedicated, interactive tools for translating the data into knowledge. Results We present a comprehensive, interactive method called Cyto to streamline analysis of large-scale cytometry data. Cyto is a workflow-based open-source solution that automatizes the use of of state-of-the-art single-cell analysis methods with interactive visualization. We show the utility of Cyto by applying it to mass cytometry data from peripheral blood and high-grade serous ovarian cancer (HGSOC) samples. Our results show that Cyto is able to reliably capture the immune cell sub-populations from peripheral blood as well as cellular compositions of unique immune- and cancer cell subpopulations in HGSOC tumor and ascites samples. Availability The method is available as a Docker container at https://hub.docker.com/r/anduril/cyto and the user guide and source code are available at https://bitbucket.org/anduril-dev/cyto Contact sampsa.hautaniemi@helsinki.fi Supplementary information Supplementary material is available and FCS files are hosted at flowrepository.org/id/FR-FCM-Z2LW

[1]  Malgorzata Nowicka,et al.  CyTOF workflow: differential discovery in high-throughput high-dimensional cytometry datasets. , 2019, F1000Research.

[2]  Sean C. Bendall,et al.  Extracting a Cellular Hierarchy from High-dimensional Cytometry Data with SPADE , 2011, Nature Biotechnology.

[3]  Mark D. Robinson,et al.  Comparison of Clustering Methods for High-Dimensional Single-Cell Flow and Mass Cytometry Data , 2016, bioRxiv.

[4]  M. Mearin,et al.  Mass Cytometry of the Human Mucosal Immune System Identifies Tissue- and Disease-Associated Immune Subsets. , 2016, Immunity.

[5]  B. Becher,et al.  The end of omics? High dimensional single cell analysis in precision medicine , 2019, European journal of immunology.

[6]  Nikesh Kotecha,et al.  Web‐Based Analysis and Publication of Flow Cytometry Experiments , 2010, Current protocols in cytometry.

[7]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[8]  Malgorzata Nowicka,et al.  CyTOF workflow: differential discovery in high-throughput high-dimensional cytometry datasets. , 2017, F1000Research.

[9]  Elmar Eisemann,et al.  Visual analysis of mass cytometry data by hierarchical stochastic neighbour embedding reveals rare cell types , 2017, Nature Communications.

[10]  Fabian J. Theis,et al.  destiny: diffusion maps for large-scale single-cell data in R , 2015, Bioinform..

[11]  Elmar Eisemann,et al.  Cytosplore: Interactive Immune Cell Phenotyping for Large Single‐Cell Datasets , 2016, Comput. Graph. Forum.

[12]  Greg Finak,et al.  OpenCyto: An Open Source Infrastructure for Scalable, Robust, Reproducible, and Automated, End-to-End Flow Cytometry Data Analysis , 2014, PLoS Comput. Biol..

[13]  Sean C. Bendall,et al.  viSNE enables visualization of high dimensional single-cell data and reveals phenotypic heterogeneity of leukemia , 2013, Nature Biotechnology.

[14]  Peng Qiu,et al.  Toward deterministic and semiautomated SPADE analysis , 2017, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[15]  Julia Casado,et al.  Anduril 2: upgraded large-scale data integration framework , 2019, Bioinform..

[16]  Jonas S Almeida Computational ecosystems for data-driven medical genomics , 2010, Genome Medicine.

[17]  Hao Chen,et al.  Cytofkit: A Bioconductor Package for an Integrated Mass Cytometry Data Analysis Pipeline , 2016, PLoS Comput. Biol..

[18]  Petter Brodin,et al.  The biology of the cell – insights from mass cytometry , 2018, The FEBS journal.

[19]  Alan J. Dix,et al.  Starting simple: adding value to static visualisation through simple interaction , 1998, AVI '98.