Wishbone identifies bifurcating developmental trajectories from single-cell data

Recent single-cell analysis technologies offer an unprecedented opportunity to elucidate developmental pathways. Here we present Wishbone, an algorithm for positioning single cells along bifurcating developmental trajectories with high resolution. Wishbone uses multi-dimensional single-cell data, such as mass cytometry or RNA-Seq data, as input and orders cells according to their developmental progression, and it pinpoints bifurcation points by labeling each cell as pre-bifurcation or as one of two post-bifurcation cell fates. Using 30-channel mass cytometry data, we show that Wishbone accurately recovers the known stages of T-cell development in the mouse thymus, including the bifurcation point. We also apply the algorithm to mouse myeloid differentiation and demonstrate its generalization to additional lineages. A comparison of Wishbone to diffusion maps, SCUBA and Monocle shows that it outperforms these methods both in the accuracy of ordering cells and in the correct identification of branch points.

[1]  J. Haldane,et al.  “Introduction to Modern Genetics” , 1939, Nature.

[2]  C. Waddington An introduction to modern genetics, by C.H. Waddington ... , 1939 .

[3]  G. Pinkus,et al.  Myeloperoxidase: a specific marker for myeloid cells in paraffin sections. , 1991, Modern pathology : an official journal of the United States and Canadian Academy of Pathology, Inc.

[4]  T. Nakayama,et al.  CD69 cell surface expression identifies developing thymocytes which audition for T cell antigen receptor-mediated positive selection. , 1993, International immunology.

[5]  Joshua B. Tenenbaum,et al.  Global Versus Local Methods in Nonlinear Dimensionality Reduction , 2002, NIPS.

[6]  Benjamin M. Bolstad,et al.  affy - analysis of Affymetrix GeneChip data at the probe level , 2004, Bioinform..

[7]  Joshua B. Tenenbaum,et al.  Sparse multidimensional scaling using land-mark points , 2004 .

[8]  Ann B. Lee,et al.  Geometric diffusions as a tool for harmonic analysis and structure definition of data: diffusion maps. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Daniel G. Tenen,et al.  Transcription factors in myeloid development: balancing differentiation with transformation , 2007, Nature Reviews Immunology.

[11]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[12]  W. Paul,et al.  Distinct functions for the transcription factors GATA-3 and ThPOK during intrathymic differentiation of CD4+ T cells , 2008, Nature Immunology.

[13]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[14]  D. Koller,et al.  The Immunological Genome Project: networks of gene expression in immune cells , 2008, Nature Immunology.

[15]  A. Singer,et al.  Lineage fate and intense debate: myths, models and mechanisms of CD4- versus CD8-lineage choice , 2008, Nature Reviews Immunology.

[16]  Catalin C. Barbacioru,et al.  mRNA-Seq whole-transcriptome analysis of a single cell , 2009, Nature Methods.

[17]  J. Dick,et al.  Revised map of the human progenitor hierarchy shows the origin of macrophages and dendritic cells in early lymphoid development , 2010, Nature Immunology.

[18]  Masayuki Yamamoto,et al.  GATA factor switching during erythroid differentiation , 2010, Current opinion in hematology.

[19]  H. Nakauchi,et al.  Reprogramming adult hematopoietic cells , 2010, Current opinion in hematology.

[20]  F. Radtke,et al.  Mechanisms of T cell development and transformation. , 2011, Annual review of cell and developmental biology.

[21]  Sean C. Bendall,et al.  Single-Cell Mass Cytometry of Differential Immune and Drug Responses Across a Human Hematopoietic Continuum , 2011, Science.

[22]  A. Bhandoola,et al.  Signal integration and crosstalk during thymocyte migration and emigration , 2011, Nature Reviews Immunology.

[23]  G. Nolan,et al.  The transcriptional landscape of αβ T cell differentiation , 2013, Nature Immunology.

[24]  Sean C. Bendall,et al.  viSNE enables visualization of high dimensional single-cell data and reveals phenotypic heterogeneity of leukemia , 2013, Nature Biotechnology.

[25]  Ellen V. Rothenberg,et al.  Developmental gene networks: a triathlon on the course to T cell identity , 2014, Nature Reviews Immunology.

[26]  Sean C. Bendall,et al.  Single-Cell Trajectory Detection Uncovers Progression and Regulatory Coordination in Human B Cell Development , 2014, Cell.

[27]  Cole Trapnell,et al.  The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells , 2014, Nature Biotechnology.

[28]  E. Marco,et al.  Bifurcation analysis of single-cell gene expression data reveals epigenetic landscape , 2014, Proceedings of the National Academy of Sciences.

[29]  Sean C. Bendall,et al.  Data-Driven Phenotypic Dissection of AML Reveals Progenitor-like Cells that Correlate with Prognosis , 2015, Cell.

[30]  Hans Clevers,et al.  Single-cell messenger RNA sequencing reveals rare intestinal cell types , 2015, Nature.

[31]  Fabian J. Theis,et al.  Diffusion maps for high-dimensional single-cell analysis of differentiation data , 2015, Bioinform..

[32]  T. Egawa,et al.  Regulation of CD4 and CD8 coreceptor expression and CD4 versus CD8 lineage decisions. , 2015, Advances in immunology.

[33]  D. Pe’er,et al.  Trajectories of cell-cycle progression from fixed cell populations , 2015, Nature Methods.

[34]  Fabian J Theis,et al.  Decoding the Regulatory Network for Blood Development from Single-Cell Gene Expression Measurements , 2015, Nature Biotechnology.

[35]  Allon M. Klein,et al.  Droplet Barcoding for Single-Cell Transcriptomics Applied to Embryonic Stem Cells , 2015, Cell.

[36]  Evan Z. Macosko,et al.  Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets , 2015, Cell.

[37]  S. Teichmann,et al.  Computational and analytical challenges in single-cell transcriptomics , 2015, Nature Reviews Genetics.

[38]  David W. Nauen,et al.  Single-Cell RNA-Seq with Waterfall Reveals Molecular Cascades underlying Adult Neurogenesis. , 2015, Cell stem cell.

[39]  Fabian J Theis,et al.  Computational analysis of cell-to-cell heterogeneity in single-cell RNA-sequencing data reveals hidden subpopulations of cells , 2015, Nature Biotechnology.

[40]  I. Amit,et al.  Transcriptional Heterogeneity and Lineage Commitment in Myeloid Progenitors , 2015, Cell.