Dpath software reveals hierarchical haemato-endothelial lineages of Etv2 progenitors based on single-cell transcriptome analysis

Developmental, stem cell and cancer biologists are interested in the molecular definition of cellular differentiation. Although single-cell RNA sequencing represents a transformational advance for global gene analyses, novel obstacles have emerged, including the computational management of dropout events, the reconstruction of biological pathways and the isolation of target cell populations. We develop an algorithm named dpath that applies the concept of metagene entropy and allows the ranking of cells based on their differentiation potential. We also develop self-organizing map (SOM) and random walk with restart (RWR) algorithms to separate the progenitors from the differentiated cells and reconstruct the lineage hierarchies in an unbiased manner. We test these algorithms using single cells from Etv2-EYFP transgenic mouse embryos and reveal specific molecular pathways that direct differentiation programmes involving the haemato-endothelial lineages. This software program quantitatively assesses the progenitor and committed states in single-cell RNA-seq data sets in a non-biased manner.

[1]  Bin Zhou,et al.  Transcriptomic Profiling Maps Anatomically Patterned Subpopulations among Single Embryonic Cardiac Cells. , 2016, Developmental cell.

[2]  Bhairab N. Singh,et al.  Hedgehog Signaling during Appendage Development and Regeneration , 2015, Genes.

[3]  A. Oudenaarden,et al.  Validation of noise models for single-cell transcriptomics , 2014, Nature Methods.

[4]  Christos Faloutsos,et al.  Random walk with restart: fast solutions and applications , 2008, Knowledge and Information Systems.

[5]  M. Gertsenstein,et al.  Dominant-negative and targeted null mutations in the endothelial receptor tyrosine kinase, tek, reveal a critical role in vasculogenesis of the embryo. , 1994, Genes & development.

[6]  Sean C. Bendall,et al.  Wishbone identifies bifurcating developmental trajectories from single-cell data , 2016, Nature Biotechnology.

[7]  M. Kyba,et al.  Etv2 Is Expressed in the Yolk Sac Hematopoietic and Endothelial Progenitors and Regulates Lmo2 Gene Expression , 2012, Stem cells.

[8]  David R. Kelley,et al.  Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks , 2012, Nature Protocols.

[9]  E. Pierson,et al.  ZIFA: Dimensionality reduction for zero-inflated single-cell gene expression analysis , 2015, Genome Biology.

[10]  Janet Rossant,et al.  Failure of blood-island formation and vasculogenesis in Flk-1-deficient mice , 1995, Nature.

[11]  Zakary S. Singer,et al.  Single-cell transcriptome analysis reveals dynamic changes in lncRNA expression during reprogramming. , 2015, Cell stem cell.

[12]  P. Kharchenko,et al.  Bayesian approach to single-cell differential expression analysis , 2014, Nature Methods.

[13]  Aleksandra A. Kolodziejczyk,et al.  Accounting for technical noise in single-cell RNA-seq experiments , 2013, Nature Methods.

[14]  Daofeng Li,et al.  Induction of hematopoietic and endothelial cell program orchestrated by ETS transcription factor ER71/ETV2 , 2015, EMBO reports.

[15]  M. Kyba,et al.  ER71 directs mesodermal fate decisions during embryogenesis , 2011, Development.

[16]  Pablo Tamayo,et al.  Metagenes and molecular pattern discovery using matrix factorization , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[17]  G. Yancopoulos,et al.  Critical role of the TIE2 endothelial cell receptor in the development of definitive hematopoiesis. , 1998, Immunity.

[18]  E Leone,et al.  The strategy. , 2001, Hawaii dental journal.

[19]  B. Göttgens,et al.  The SCL 3' enhancer responds to Hedgehog signaling during hemangioblast specification. , 2006, Experimental hematology.

[20]  M. Yoder,et al.  VEGF and IHH rescue definitive hematopoiesis in Gata-4 and Gata-6-deficient murine embryoid bodies. , 2009, Experimental hematology.

[21]  B. Black,et al.  Vascular endothelial and endocardial progenitors differentiate as cardiomyocytes in the absence of Etsrp/Etv2 function , 2011, Development.

[22]  T. Davies,et al.  Staging of gastrulating mouse embryos by morphological landmarks in the dissecting microscope. , 1993, Development.

[23]  S. Orkin,et al.  Unsuspected role for the T-cell leukemia protein SCL/tal-1 in vascular development. , 1998, Genes & development.

[24]  T. Babak,et al.  Oct4 Is Required ∼E7.5 for Proliferation in the Primitive Streak , 2013, PLoS genetics.

[25]  M. Dyer,et al.  Indian hedgehog activates hematopoiesis and vasculogenesis and can respecify prospective neurectodermal cell fate in the mouse embryo. , 2001, Development.

[26]  S. Linnarsson,et al.  Cell types in the mouse cortex and hippocampus revealed by single-cell RNA-seq , 2015, Science.

[27]  Joshua W. Vincentz,et al.  Analysis of the Hand1 cell lineage reveals novel contributions to cardiovascular, neural crest, extra‐embryonic, and lateral mesoderm derivatives , 2010, Developmental dynamics : an official publication of the American Association of Anatomists.

[28]  P. Labosky,et al.  Endocardial cells are a distinct endothelial lineage derived from Flk1+ multipotent cardiovascular progenitors. , 2009, Developmental biology.

[29]  S. Severini,et al.  Cellular network entropy as the energy potential in Waddington's differentiation landscape , 2013, Scientific Reports.

[30]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[31]  Christos Boutsidis,et al.  SVD based initialization: A head start for nonnegative matrix factorization , 2008, Pattern Recognit..

[32]  S. Kageyama,et al.  The role of ETS transcription factors in transcription and development of mouse preimplantation embryos. , 2006, Biochemical and biophysical research communications.

[33]  Guoli Wang,et al.  LS-NMF: A modified non-negative matrix factorization algorithm utilizing uncertainty estimates , 2006, BMC Bioinformatics.

[34]  A. Hart,et al.  Identification, cloning and expression analysis of the pluripotency promoting Nanog genes in mouse and human , 2004, Developmental dynamics : an official publication of the American Association of Anatomists.

[35]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[36]  Jagdish Chandra Patra,et al.  Genome-wide inferring gene-phenotype relationship by walking on the heterogeneous network , 2010, Bioinform..

[37]  L. Zon,et al.  Signaling axis involving Hedgehog, Notch, and Scl promotes the embryonic endothelial-to-hematopoietic transition , 2012, Proceedings of the National Academy of Sciences.

[38]  Gábor Csárdi,et al.  The igraph software package for complex network research , 2006 .

[39]  M. V. Velzen,et al.  Self-organizing maps , 2007 .

[40]  C. Begley,et al.  Absence of yolk sac hematopoiesis from mice with a targeted disruption of the scl gene. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[41]  R. Hammer,et al.  Nkx2–5 transactivates the Ets-related protein 71 gene and specifies an endothelial/endocardial fate in the developing embryo , 2009, Proceedings of the National Academy of Sciences.

[42]  Fabian J. Theis,et al.  Diffusion maps for high-dimensional single-cell analysis of differentiation data , 2015, Bioinform..

[43]  Nicola K. Wilson,et al.  Resolving Early Mesoderm Diversification through Single Cell Expression Profiling , 2016, Nature.

[44]  Wolfgang Huber,et al.  Cell-to-cell expression variability followed by signal reinforcement progressively segregates early mouse lineages , 2013, Nature Cell Biology.

[45]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[46]  Lutgarde M. C. Buydens,et al.  Self- and Super-organizing Maps in R: The kohonen Package , 2007 .

[47]  J. Crump,et al.  Smarcd3b and Gata5 promote a cardiac progenitor fate in the zebrafish embryo , 2011 .

[48]  W. Pu,et al.  Endocardial and Epicardial Epithelial to Mesenchymal Transitions in Heart Development and Disease , 2012, Circulation research.

[49]  Philipp S. Hoppe,et al.  Circulation-independent differentiation pathway from extraembryonic mesoderm toward hematopoietic stem cells via hemogenic angioblasts. , 2014, Cell reports.

[50]  S. Nishikawa,et al.  Expressions of PDGF receptor alpha, c‐Kit and Flk1 genes clustering in mouse chromosome 5 define distinct subsets of nascent mesodermal cells , 1997, Development, growth & differentiation.

[51]  S. Orkin,et al.  Absence of blood formation in mice lacking the T-cell leukaemia oncoprotein tal-1/SCL , 1995, Nature.

[52]  A. Visel,et al.  Combinatorial Regulation of Endothelial Gene Expression by Ets and Forkhead Transcription Factors , 2008, Cell.

[53]  I. Ial,et al.  Nature Communications , 2010, Nature Cell Biology.

[54]  N. Neff,et al.  Reconstructing lineage hierarchies of the distal lung epithelium using single cell RNA-seq , 2014, Nature.

[55]  Ben D. MacArthur,et al.  Statistical Mechanics of Pluripotency , 2013, Cell.

[56]  Thomas N. Sato,et al.  Distinct roles of the receptor tyrosine kinases Tie-1 and Tie-2 in blood vessel formation , 1995, Nature.

[57]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[58]  F. Ginhoux,et al.  Mpath maps multi-branching single-cell trajectories revealing progenitor cell progression during development , 2016, Nature Communications.

[59]  D. Wilson,et al.  Localization of transcription factor GATA-4 to regions of the mouse embryo involved in cardiac development. , 1994, Developmental biology.

[60]  Catalin C. Barbacioru,et al.  Tracing the Derivation of Embryonic Stem Cells from the Inner Cell Mass by Single-Cell RNA-Seq Analysis , 2010, Cell stem cell.

[61]  R. Strasser,et al.  Phenotypic overlap between hematopoietic cells with suggested angioblastic potential and vascular endothelial cells. , 2002, Journal of hematotherapy & stem cell research.

[62]  H. Aburatani,et al.  Endocardiogenesis in embryoid bodies: novel markers identified by gene expression profiling. , 2007, Biochemical and biophysical research communications.

[63]  L. Zon,et al.  Cloche, an early acting zebrafish gene, is required by both the endothelial and hematopoietic lineages. , 1995, Development.

[64]  Mauro J. Muraro,et al.  De Novo Prediction of Stem Cell Identity using Single-Cell Transcriptome Data , 2016, Cell stem cell.

[65]  Janet Rossant,et al.  A Requirement for Flk1 in Primitive and Definitive Hematopoiesis and Vasculogenesis , 1997, Cell.

[66]  Cole Trapnell,et al.  The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells , 2014, Nature Biotechnology.

[67]  S. Linnarsson,et al.  Unbiased classification of sensory neuron types by large-scale single-cell RNA sequencing , 2014, Nature Neuroscience.

[68]  M. Ramialison,et al.  Defining the earliest step of cardiovascular progenitor specification during embryonic stem cell differentiation , 2011, The Journal of cell biology.

[69]  Ilya Shmulevich,et al.  Gene pair signatures in cell type transcriptomes reveal lineage control , 2013, Nature Methods.

[70]  M. Kyba,et al.  ER71 acts downstream of BMP, Notch, and Wnt signaling in blood and vessel progenitor specification. , 2008, Cell stem cell.