Stem cell transcriptome profiling via massive-scale mRNA sequencing

We developed a massive-scale RNA sequencing protocol, short quantitative random RNA libraries or SQRL, to survey the complexity, dynamics and sequence content of transcriptomes in a near-complete fashion. This method generates directional, random-primed, linear cDNA libraries that are optimized for next-generation short-tag sequencing. We surveyed the poly(A)+ transcriptomes of undifferentiated mouse embryonic stem cells (ESCs) and embryoid bodies (EBs) at an unprecedented depth (10 Gb), using the Applied Biosystems SOLiD technology. These libraries capture the genomic landscape of expression, state-specific expression, single-nucleotide polymorphisms (SNPs), the transcriptional activity of repeat elements, and both known and new alternative splicing events. We investigated the impact of transcriptional complexity on current models of key signaling pathways controlling ESC pluripotency and differentiation, highlighting how SQRL can be used to characterize transcriptome content and dynamics in a quantitative and reproducible manner, and suggesting that our understanding of transcriptional complexity is far from complete.

[1]  Gregory D. Schuler,et al.  ESTablishing a human transcript map , 1995, Nature Genetics.

[2]  angesichts der Corona-Pandemie,et al.  UPDATE , 1973, The Lancet.

[3]  W. Schmidt,et al.  CapSelect: a highly sensitive method for 5' CAP-dependent enrichment of full-length cDNA in PCR-mediated analysis of mRNAs. , 1999, Nucleic Acids Research.

[4]  O. Britanova,et al.  Amplification of cDNA ends based on template-switching effect and step-out PCR. , 1999, Nucleic acids research.

[5]  C. Bult,et al.  Functional annotation of a full-length mouse cDNA collection , 2001, Nature.

[6]  W. J. Kent,et al.  BLAT--the BLAST-like alignment tool. , 2002, Genome research.

[7]  Ji Huang,et al.  [Serial analysis of gene expression]. , 2002, Yi chuan = Hereditas.

[8]  Tyson A. Clark,et al.  Genomewide Analysis of mRNA Processing in Yeast Using Splicing-Specific Microarrays , 2002, Science.

[9]  Sydney Brenner,et al.  Massively parallel signature sequencing (MPSS) as a tool for in-depth quantitative gene expression profiling in all organisms. , 2002, Briefings in functional genomics & proteomics.

[10]  E. Birney,et al.  Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs , 2002, Nature.

[11]  J. Kawai,et al.  Cap analysis gene expression for high-throughput analysis of transcriptional starting point and identification of promoter usage , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Terrence S. Furey,et al.  The UCSC Genome Browser Database , 2003, Nucleic Acids Res..

[13]  S. Cawley,et al.  Unbiased Mapping of Transcription Factor Binding Sites along Human Chromosomes 21 and 22 Points to Widespread Regulation of Noncoding RNAs , 2004, Cell.

[14]  Y. Ishimaru,et al.  Determination of the capped site sequence of mRNA based on the detection of cap-dependent nucleotide addition using an anchor ligation method. , 2004, DNA research : an international journal for rapid publication of reports on genes and genomes.

[15]  Gordon K Smyth,et al.  Statistical Applications in Genetics and Molecular Biology Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments , 2011 .

[16]  S. Salzberg,et al.  The Transcriptional Landscape of the Mammalian Genome , 2005, Science.

[17]  J. Shendure,et al.  Materials and Methods Som Text Figs. S1 and S2 Tables S1 to S4 References Accurate Multiplex Polony Sequencing of an Evolved Bacterial Genome , 2022 .

[18]  John Quackenbush,et al.  The TIGR Gene Indices: clustering and assembling EST and known genes and integration with eukaryotic genomes , 2004, Nucleic Acids Res..

[19]  Yoshihide Hayashizaki,et al.  Disclosing hidden transcripts: mouse natural sense-antisense transcripts tend to be poly(A) negative and nuclear localized. , 2005, Genome research.

[20]  Tyson A. Clark,et al.  Alternative splicing and differential gene expression in colon cancer detected by a whole genome exon array , 2006, BMC Genomics.

[21]  J. Thierry-Mieg,et al.  AceView: a comprehensive cDNA-supported gene and transcripts annotation , 2006, Genome Biology.

[22]  Sin Lam Tan,et al.  Complex Loci in Human and Mouse Genomes , 2006, PLoS genetics.

[23]  E. Stanley,et al.  Transcriptional profiling of mouse and human ES cells identifies SLAIN1, a novel stem cell gene. , 2006, Developmental biology.

[24]  G. Church,et al.  Polony Multiplex Analysis of Gene Expression (PMAGE) in Mouse Hypertrophic Cardiomyopathy , 2007, Science.

[25]  A. Perkins,et al.  In vitro differentiation of murine embryonic stem cells toward a renal lineage. , 2007, Differentiation; research in biological diversity.

[26]  A. Perkins,et al.  Dynamic transcription programs during ES cell differentiation towards mesoderm in serum versus serum-freeBMP4 culture , 2007, BMC Genomics.

[27]  Vincent J. Magrini,et al.  Extending assembly of short DNA sequences to handle error , 2007, Bioinform..

[28]  William Stafford Noble,et al.  Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project , 2007, Nature.

[29]  P. Stadler,et al.  RNA Maps Reveal New RNA Classes and a Possible Function for Pervasive Transcription , 2007, Science.

[30]  W. Wong,et al.  A gene regulatory network in mouse embryonic stem cells , 2007, Proceedings of the National Academy of Sciences.

[31]  R. Lister,et al.  Highly Integrated Single-Base Resolution Maps of the Epigenome in Arabidopsis , 2008, Cell.

[32]  Geoffrey J Faulkner,et al.  A rescue strategy for multimapping short sequence tags refines surveys of transcriptional activity by CAGE. , 2008, Genomics.

[33]  M. Gerstein,et al.  The Transcriptional Landscape of the Yeast Genome Defined by RNA Sequencing , 2008, Science.

[34]  David Haussler,et al.  The UCSC Genome Browser Database: 2008 update , 2007, Nucleic Acids Res..