Single-cell RNA-seq of human induced pluripotent stem cells reveals cellular heterogeneity and cell state transitions between subpopulations

The heterogeneity of cell states within pluripotent cultures has not been described at the transcriptional level. Because gene expression varies widely between cells, single-cell RNA sequencing can be used to characterize how individual pluripotent cells function. Here, we present results from the analysis of single-cell RNA sequencing data from 18,787 individual WTC-CRISPRi human induced pluripotent stem cells. We developed an unsupervised clustering method and used it to identify four subpopulations distinguishable by their pluripotent state: a core pluripotent population (48.3%), a proliferative population (47.8%), a population early primed for differentiation (2.8%), and a population late primed for differentiation (1.1%). For each subpopulation, we identified the genes and pathways that define differences in pluripotent cell states. Our method identified four transcriptionally distinct predictor gene sets, composed of 165 unique genes, that denote the specific pluripotency states; using these sets, we developed a multigenic machine learning prediction method to accurately classify single cells into each of the subpopulations. Compared against a set of established pluripotency markers, our method increases prediction accuracy by 10% and specificity by 20%, and explains up to threefold more of the deviance in the prediction model. Finally, we developed an innovative method to predict cells transitioning between subpopulations, and we support our conclusions with results from two orthogonal pseudotime trajectory methods.
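The three quantities used to compare the predictor gene sets against established markers (accuracy, specificity, and proportion of deviance explained) can be illustrated with a minimal sketch. It assumes a binary one-versus-rest setting (a cell either belongs to a given subpopulation or does not), a 0.5 probability threshold, and a binomial deviance measured against an intercept-only null model; the abstract specifies none of these. The labels and probabilities are invented toy values, not data from the study.

```python
import math

def accuracy(y_true, y_pred):
    """Fraction of cells whose predicted label matches the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def specificity(y_true, y_pred):
    """True negatives divided by all true-negative-class cells."""
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    return tn / (tn + fp)

def binomial_deviance(y_true, probs, eps=1e-12):
    """-2 * log-likelihood of the labels under the predicted probabilities."""
    total = 0.0
    for t, p in zip(y_true, probs):
        p = min(max(p, eps), 1.0 - eps)  # keep log() finite
        total += t * math.log(p) + (1 - t) * math.log(1.0 - p)
    return -2.0 * total

def deviance_explained(y_true, probs):
    """1 - model deviance / null deviance (null = intercept-only model)."""
    base_rate = sum(y_true) / len(y_true)
    null_dev = binomial_deviance(y_true, [base_rate] * len(y_true))
    return 1.0 - binomial_deviance(y_true, probs) / null_dev

# Toy example (hypothetical values): 1 = cell is in the subpopulation.
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
probs = [0.9, 0.8, 0.4, 0.1, 0.2, 0.6, 0.3, 0.1]
y_pred = [1 if p >= 0.5 else 0 for p in probs]

print("accuracy:", accuracy(y_true, y_pred))        # 0.75
print("specificity:", specificity(y_true, y_pred))  # 0.8
print("deviance explained:", deviance_explained(y_true, probs))
```

Under these assumptions, comparing two gene sets means fitting a classifier on each and comparing the three numbers on the same held-out cells.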
