Landscape and variation of RNA secondary structure across the human transcriptome

In parallel to the genetic code for protein synthesis, a second layer of information is embedded in all RNA transcripts in the form of RNA structure. RNA structure influences practically every step in the gene expression program. However, the nature of most RNA structures or effects of sequence variation on structure are not known. Here we report the initial landscape and variation of RNA secondary structures (RSSs) in a human family trio (mother, father and their child). This provides a comprehensive RSS map of human coding and non-coding RNAs. We identify unique RSS signatures that demarcate open reading frames and splicing junctions, and define authentic microRNA-binding sites. Comparison of native deproteinized RNA isolated from cells versus refolded purified RNA suggests that the majority of the RSS information is encoded within RNA sequence. Over 1,900 transcribed single nucleotide variants (approximately 15% of all transcribed single nucleotide variants) alter local RNA structure. We discover simple sequence and spacing rules that determine the ability of point mutations to impact RSSs. Selective depletion of ‘riboSNitches’ versus structurally synonymous variants at precise locations suggests selection for specific RNA shapes at thousands of sites, including 3′ untranslated regions, binding sites of microRNAs and RNA-binding proteins genome-wide. These results highlight the potentially broad contribution of RNA structure and its variation to gene regulation.

[1]  Eduardo Eyras,et al.  DGCR8 HITS-CLIP reveals novel functions for the Microprocessor , 2012, Nature Structural &Molecular Biology.

[2]  Paul Ryvkin,et al.  Global analysis of RNA secondary structure in two metazoans. , 2012, Cell reports.

[3]  Michael D. Wilson,et al.  The Evolutionary Landscape of Alternative Splicing in Vertebrate Species , 2012, Science.

[4]  E. Wang,et al.  Analysis and design of RNA sequencing experiments for identifying isoform regulation , 2010, Nature Methods.

[5]  Aleksey Y. Ogurtsov,et al.  A periodic pattern of mRNA secondary structure created by the genetic code , 2006, Nucleic acids research.

[6]  Cole Trapnell,et al.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome , 2009, Genome Biology.

[7]  Howard Y. Chang,et al.  RNA SHAPE analysis in living cells. , 2013, Nature chemical biology.

[8]  Howard Y. Chang,et al.  Genome-wide measurement of RNA secondary structure in yeast , 2010, Nature.

[9]  D. Bartel MicroRNAs: Target Recognition and Regulatory Functions , 2009, Cell.

[10]  Brendan J. Frey,et al.  Deciphering the splicing code , 2010, Nature.

[11]  L. Lim,et al.  MicroRNA targeting specificity in mammals: determinants beyond seed pairing. , 2007, Molecular cell.

[12]  C. Burge,et al.  Conserved Seed Pairing, Often Flanked by Adenosines, Indicates that Thousands of Human Genes are MicroRNA Targets , 2005, Cell.

[13]  J. Ule,et al.  iCLIP reveals the function of hnRNP particles in splicing at individual nucleotide resolution , 2010, Nature Structural &Molecular Biology.

[14]  Steven L Salzberg,et al.  Fast gapped-read alignment with Bowtie 2 , 2012, Nature Methods.

[15]  Michael P Snyder,et al.  SeqFold: Genome-scale reconstruction of RNA secondary structure integrating high-throughput sequencing data , 2013, Genome research.

[16]  M. Gottesman,et al.  Sensitive measurement of single-nucleotide polymorphism-induced changes of RNA conformation: application to disease studies , 2012, Nucleic acids research.

[17]  Christoph Dieterich,et al.  doRiNA: a database of RNA interactions in post-transcriptional regulation , 2011, Nucleic Acids Res..

[18]  Toshihiro Tanaka The International HapMap Project , 2003, Nature.

[19]  Ray M. Marín,et al.  Analysis of the accessibility of CLIP bound sites reveals that nucleation of the miRNA:mRNA pairing occurs preferentially at the 3'-end of the seed match. , 2012, RNA.

[20]  Alain Laederach,et al.  Disease-Associated Mutations That Alter the RNA Structural Ensemble , 2010, PLoS genetics.

[21]  Howard Y. Chang,et al.  Understanding the transcriptome through RNA structure , 2011, Nature Reviews Genetics.

[22]  Gene W. Yeo,et al.  LIN28 binds messenger RNAs at GGAGA motifs and regulates splicing factor abundance. , 2012, Molecular cell.

[23]  K. Weeks,et al.  Selective 2′-hydroxyl acylation analyzed by primer extension (SHAPE): quantitative RNA structure analysis at single nucleotide resolution , 2006, Nature Protocols.

[24]  Alain Laederach,et al.  Evaluating our ability to predict the structural disruption of RNA by SNPs , 2012, BMC Genomics.

[25]  M. Kiebler,et al.  Faculty Opinions recommendation of Argonaute HITS-CLIP decodes microRNA-mRNA interaction maps. , 2009 .

[26]  Bryan R. Cullen,et al.  The Viral and Cellular MicroRNA Targetome in Lymphoblastoid Cell Lines , 2012, PLoS pathogens.