Minimizing Batch Effects in Mass Cytometry Data

Cytometry by Time-Of-Flight (CyTOF) uses antibodies conjugated to isotopically pure metals to identify and quantify a large number of cellular features with single-cell resolution. A barcoding approach allows for 20 unique samples to be pooled and processed together in one tube, reducing the intra-barcode technical variability. However, with only 20 samples per barcode, multiple barcode sets (batches) are required to address questions in robustly powered study designs. A batch adjustment procedure is required to reduce variability across batches and to facilitate direct comparison of runs performed across multiple barcodes run over weeks, months, or years. We describe a method using technical replicates that are included in each run to determine and apply an appropriate adjustment per batch without manual intervention. The use of technical replicate samples (i.e., anchors or reference samples) avoids assumptions of sample homogeneity among batches, and allows direct estimation of batch effects and appropriate adjustment parameters applicable to all samples within a batch. Quantification of cell subpopulations and mean signal intensity pre- and post-adjustment using both manual gating and unsupervised clustering demonstrate substantial mitigation of batch effects in the anchor samples used for this adjustment calculation, and in a second validation set of technical replicates.

[1]  N. L. Johnson,et al.  Linear Statistical Inference and Its Applications , 1966 .

[2]  Xing Qiu,et al.  The impact of quantile and rank normalization procedures on the testing power of gene differential expression analysis , 2013, BMC Bioinformatics.

[3]  Nikolay Samusik,et al.  Mass Cytometric Functional Profiling of Acute Myeloid Leukemia Defines Cell-Cycle and Immunophenotypic Properties That Correlate with Known Responses to Therapy. , 2015, Cancer discovery.

[4]  R. Tibshirani,et al.  Automated identification of stratifying signatures in cellular subpopulations , 2014, Proceedings of the National Academy of Sciences.

[5]  John D. Storey,et al.  Capturing Heterogeneity in Gene Expression Studies by Surrogate Variable Analysis , 2007, PLoS genetics.

[6]  Matthew R Clutter,et al.  Single‐cell mass cytometry adapted to measurements of the cell cycle , 2012, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[7]  Lisa M. Kronstad,et al.  Differential Induction of IFN-α and Modulation of CD112 and CD54 Expression Govern the Magnitude of NK Cell IFN-γ Response to Influenza A Viruses , 2018, The Journal of Immunology.

[8]  John C. Marioni,et al.  Testing for differential abundance in mass cytometry data , 2017, Nature Methods.

[9]  Mark M. Davis,et al.  Comparison of CyTOF assays across sites: Results of a six-center pilot study , 2017, Journal of immunological methods.

[10]  Andrzej Cichocki,et al.  Nonnegative Matrix and Tensor Factorization T , 2007 .

[11]  Eli R. Zunder,et al.  Transient partial permeabilization with saponin enables cellular barcoding prior to surface marker staining , 2014, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[12]  Sean C. Bendall,et al.  Normalization of mass cytometry data with bead standards , 2013, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[13]  Eli R. Zunder,et al.  Palladium-based mass tag cell barcoding with a doublet-filtering scheme and single-cell deconvolution algorithm , 2015, Nature Protocols.

[14]  Terence P. Speed,et al.  A comparison of normalization methods for high density oligonucleotide array data based on variance and bias , 2003, Bioinform..

[15]  Stuart B. Goodman,et al.  Clinical recovery from surgery correlates with single-cell immune signatures , 2014, Science Translational Medicine.

[16]  S. Dudoit,et al.  Normalization of RNA-seq data using factor analysis of control genes or samples , 2014, Nature Biotechnology.

[17]  Sean C. Bendall,et al.  Single-cell developmental classification of B cell precursor acute lymphoblastic leukemia at diagnosis reveals predictors of relapse , 2018, Nature Medicine.

[18]  Reema Baskar,et al.  Parallel analysis of tri-molecular biosynthesis with cell identity and function in single cells , 2019, Nature Communications.

[19]  David M. Simcha,et al.  Tackling the widespread and critical impact of batch effects in high-throughput data , 2010, Nature Reviews Genetics.

[20]  G. Nolan,et al.  Mass cytometry identifies a distinct monocyte cytokine signature shared by clinically heterogeneous pediatric SLE patients. , 2017, Journal of autoimmunity.

[21]  Robert Tibshirani,et al.  Proliferative tracing with single-cell mass cytometry optimizes generation of stem cell memory-like T cells , 2018, Nature Biotechnology.

[22]  Claire Duvallet,et al.  Correcting for batch effects in case-control microbiome studies , 2018, bioRxiv.

[23]  Sean C. Bendall,et al.  Single-cell systems-level analysis of human Toll-like receptor activation defines a chemokine signature in patients with systemic lupus erythematosus. , 2015, The Journal of allergy and clinical immunology.

[24]  R. A. van den Berg,et al.  Centering, scaling, and transformations: improving the biological information content of metabolomics data , 2006, BMC Genomics.

[25]  B. Walker,et al.  Standardization and quality control for high‐dimensional mass cytometry studies of human samples , 2016, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[26]  Sean C. Bendall,et al.  Data-Driven Phenotypic Dissection of AML Reveals Progenitor-like Cells that Correlate with Prognosis , 2015, Cell.

[27]  Andreas Grützkau,et al.  Stabilizing Antibody Cocktails for Mass Cytometry , 2019, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[28]  Andrew E. Jaffe,et al.  Bioinformatics Applications Note Gene Expression the Sva Package for Removing Batch Effects and Other Unwanted Variation in High-throughput Experiments , 2022 .

[29]  Vito R T Zanotelli,et al.  In-Depth Characterization of Monocyte-Derived Macrophages using a Mass Cytometry-Based Phagocytosis Assay , 2019, Scientific Reports.

[30]  Mario Roederer,et al.  Data File Standard for Flow Cytometry, version FCS 3.1 , 2009, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[31]  Cheng Li,et al.  Adjusting batch effects in microarray expression data using empirical Bayes methods. , 2007, Biostatistics.

[32]  Sean C. Bendall,et al.  Single-Cell Trajectory Detection Uncovers Progression and Regulatory Coordination in Human B Cell Development , 2014, Cell.

[33]  P. Good,et al.  Permutation Tests: A Practical Guide to Resampling Methods for Testing Hypotheses , 1995 .

[34]  Nicolas Servant,et al.  A comprehensive evaluation of normalization methods for Illumina high-throughput RNA sequencing data analysis , 2013, Briefings Bioinform..