Sequence, Structure and Context Preferences of Human RNA Binding Proteins

Production of functional cellular RNAs involves multiple processing and regulatory steps principally mediated by RNA binding proteins (RBPs). Here we present the affinity landscapes of 78 human RBPs using an unbiased assay that determines the sequence, structure, and context preferences of an RBP in vitro from deep sequencing of bound RNAs. Analyses of these data revealed several interesting patterns, including unexpectedly low diversity of RNA motifs, implying frequent convergent evolution of binding specificity toward a relatively small set of RNA motifs, many with low compositional complexity. Offsetting this trend, we observed extensive preferences for contextual features outside of core RNA motifs, including spaced “bipartite” motifs, biased flanking nucleotide context, and bias away from or towards RNA structure. These contextual features are likely to enable targeting of distinct subsets of transcripts by different RBPs that recognize the same core motif. Our results enable construction of “RNA maps” of RBP activity without requiring crosslinking-based assays, and provide unprecedented depth of information on the interaction of RBPs with RNA.

[1]  G. Dreyfuss,et al.  Heterogeneous nuclear ribonucleoproteins: role in RNA splicing. , 1986, Science.

[2]  J. Keene,et al.  A common RNA recognition motif identified within a defined U1 RNA binding domain of the 70K U1 snRNP protein , 1989, Cell.

[3]  G. Dreyfuss,et al.  The pre-mRNA binding K protein contains a novel evolutionarily conserved motif. , 1993, Nucleic acids research.

[4]  D. Rio,et al.  Interaction of the sex‐lethal RNA binding domains with RNA. , 1995, The EMBO journal.

[5]  J. Valcárcel,et al.  Distinct binding specificities and functions of higher eukaryotic polypyrimidine tract-binding proteins. , 1995, Science.

[6]  P. Sharp,et al.  PUF60: a novel U2AF65-related splicing activity. , 1999, RNA.

[7]  D. L. Lyakhov,et al.  Optimized RNA Targets of Two Closely Related Triple KH Domain Proteins, Heterogeneous Nuclear Ribonucleoprotein K and αCP-2KL, Suggest Distinct Modes of RNA Recognition* , 2001, The Journal of Biological Chemistry.

[8]  J. Guhaniyogi,et al.  Regulation of mRNA stability in mammalian cells. , 2001, Gene.

[9]  N. Grishin,et al.  KH domain: one motif, two folds. , 2001, Nucleic acids research.

[10]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[11]  Jernej Ule,et al.  CLIP Identifies Nova-Regulated RNA Networks in the Brain , 2003, Science.

[12]  G. Crooks,et al.  WebLogo: a sequence logo generator. , 2004, Genome research.

[13]  K. Lindblad-Toh,et al.  Systematic discovery of regulatory motifs in human promoters and 3′ UTRs by comparison of several mammals , 2005, Nature.

[14]  L. Paillard,et al.  AU-rich elements and associated factors: are there unifying principles? , 2006, Nucleic acids research.

[15]  G. Ast,et al.  Comparative analysis identifies exonic splicing regulatory sequences--The complex definition of enhancers and silencers. , 2006, Molecular cell.

[16]  C. Gilbert,et al.  RNA Immunoprecipitation for Determining RNA‐Protein Associations In Vivo , 2006, Current protocols in molecular biology.

[17]  Frédéric H.-T. Allain,et al.  Sequence-specific binding of single-stranded RNA: is there a code for recognition? , 2006, Nucleic acids research.

[18]  Rodrigo Lopez,et al.  Clustal W and Clustal X version 2.0 , 2007, Bioinform..

[19]  Rolf Backofen,et al.  Pre-mRNA Secondary Structures Influence Exon Recognition , 2007, PLoS genetics.

[20]  J. Kitzman,et al.  Determinants of targeting by endogenous and exogenous microRNAs and siRNAs. , 2007, RNA.

[21]  Gabriele Varani,et al.  RNA is rarely at a loss for companions; as soon as RNA , 2008 .

[22]  L. Lim,et al.  MicroRNA targeting specificity in mammals: determinants beyond seed pairing. , 2007, Molecular cell.

[23]  J. Berglund,et al.  A comprehensive computational characterization of conserved mammalian intronic sequences reveals conserved motifs associated with constitutive and alternative splicing. , 2007, Genome research.

[24]  Novel Splicing Factor RBM25 Modulates Bcl-x Pre-mRNA 5′ Splice Site Selection , 2008, Molecular and Cellular Biology.

[25]  L. Regan,et al.  Structure and function of KH domains , 2008, The FEBS journal.

[26]  Eran Segal,et al.  Computational prediction of RNA structural motifs involved in posttranscriptional regulatory processes , 2008, Proceedings of the National Academy of Sciences.

[27]  Lourdes Peña Castillo,et al.  Rapid and systematic analysis of the RNA recognition specificities of RNA-binding proteins , 2009, Nature Biotechnology.

[28]  J Andrew Berglund,et al.  The protein factors MBNL1 and U2AF65 bind alternative RNA structures to regulate splicing , 2009, Proceedings of the National Academy of Sciences.

[29]  T. Madl,et al.  Structural Basis for Homodimerization of the Src-associated during Mitosis, 68-kDa Protein (Sam68) Qua1 Domain*♦ , 2010, The Journal of Biological Chemistry.

[30]  J Andrew Berglund,et al.  Role of RNA structure in regulating pre-mRNA splicing. , 2010, Trends in biochemical sciences.

[31]  Quaid Morris,et al.  Predicting in vivo binding sites of RNA-binding proteins using mRNA secondary structure. , 2010, RNA.

[32]  D. Patel,et al.  Structural insights into RNA recognition by the alternate-splicing regulator CUG-binding protein 1. , 2010, Structure.

[33]  Andrew R. Gehrke,et al.  Genome-wide analysis of ETS-family DNA-binding in vitro and in vivo , 2010, The EMBO journal.

[34]  Jingyue Ju,et al.  Quantitative evaluation of all hexamers as exonic splicing elements. , 2011, Genome research.

[35]  Jernej Ule,et al.  Understanding splicing regulation through RNA splicing maps , 2011, Trends in genetics : TIG.

[36]  R. Abagyan,et al.  Protein-RNA and protein-protein recognition by dual KH1/2 domains of the neuronal splicing factor Nova-1. , 2011, Structure.

[37]  An Xie,et al.  Role of RBM25/LUC7L3 in Abnormal Cardiac Sodium Channel Splicing Regulation in Human Heart Failure , 2011, Circulation.

[38]  Benjamin J. Raphael,et al.  Using positional distribution to identify splicing elements and predict pre-mRNA processing defects in human genes , 2011, Proceedings of the National Academy of Sciences.

[39]  Peter F. Stadler,et al.  ViennaRNA Package 2.0 , 2011, Algorithms for Molecular Biology.

[40]  M. Ares,et al.  Structural analysis of the quaking homodimerization interface. , 2012, Journal of molecular biology.

[41]  Zefeng Wang,et al.  Intronic Splicing Enhancers, Cognate Splicing Factors and Context Dependent Regulation Rules , 2012, Nature Structural &Molecular Biology.

[42]  Juw Won Park,et al.  MATS: a Bayesian framework for flexible detection of differential alternative splicing from RNA-Seq data , 2012, Nucleic acids research.

[43]  G. Brewer,et al.  The regulation of mRNA stability in mammalian cells: 2.0. , 2012, Gene.

[44]  R. Lührmann,et al.  Cwc2 and its human homologue RBM22 promote an active conformation of the spliceosome catalytic centre , 2012, The EMBO journal.

[45]  Gabriele Varani,et al.  Faculty Opinions recommendation of Systematic discovery of structural elements governing stability of mammalian messenger RNAs. , 2012 .

[46]  D. Rio Filter-binding assay for analysis of RNA-protein interactions. , 2012, Cold Spring Harbor protocols.

[47]  F. Allain,et al.  RRM-RNA recognition: NMR or crystallography…and new findings. , 2013, Current opinion in structural biology.

[48]  T. Cech,et al.  RNA seeds higher-order assembly of FUS protein. , 2013, Cell reports.

[49]  Sarah A. Smith,et al.  Paralogs hnRNP L and hnRNP LL Exhibit Overlapping but Distinct RNA Binding Constraints , 2013, PloS one.

[50]  F. Allain,et al.  FROM STRUCTURE TO FUNCTION OF RNA BINDING DOMAINS , 2013 .

[51]  Kiyoshi Asai,et al.  CapR: revealing structural specificities of RNA-binding protein target recognition using CLIP-seq data , 2014, Genome Biology.

[52]  Jernej Ule,et al.  RNAmotifs: prediction of multivalent RNA motifs that control alternative splicing , 2013, Genome Biology.

[53]  C. Burge,et al.  A Complex Network of Factors with Overlapping Affinities Repress Splicing through Intronic Elements , 2012, Nature Structural &Molecular Biology.

[54]  Brendan J. Frey,et al.  A compendium of RNA-binding motifs for decoding gene regulation , 2013, Nature.

[55]  J. Keene,et al.  Advancing the functional utility of PAR-CLIP by quantifying background binding to mRNAs and lncRNAs , 2014, Genome Biology.

[56]  Juan M. Vaquerizas,et al.  DNA-Binding Specificities of Human Transcription Factors , 2013, Cell.

[57]  Ellen T. Gelfand,et al.  The Genotype-Tissue Expression (GTEx) project , 2013, Nature Genetics.

[58]  S. Gerstberger,et al.  A census of human RNA-binding proteins , 2014, Nature Reviews Genetics.

[59]  Saeed Tavazoie,et al.  Systematic identification of regulatory elements in conserved 3' UTRs of human transcripts. , 2014, Cell Reports.

[60]  P. Sharp,et al.  RNA Bind-n-Seq: quantitative assessment of the sequence and structural binding specificity of RNA binding proteins. , 2014, Molecular cell.

[61]  Monika Heiner,et al.  The RNA-Binding Protein QKI Suppresses Cancer-Associated Aberrant Splicing , 2014, PLoS genetics.

[62]  Zefeng Wang,et al.  The splicing activator DAZAP1 integrates splicing control into MEK/Erk regulated cell proliferation and migration , 2014, Nature Communications.

[63]  W. Huber,et al.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2 , 2014, Genome Biology.

[64]  M. Ares,et al.  Context-dependent control of alternative splicing by RNA-binding proteins , 2014, Nature Reviews Genetics.

[65]  Jonathan Hall,et al.  Structural and mechanistic insights into poly(uridine) tract recognition by the hnRNP C RNA recognition motif. , 2014, Journal of the American Chemical Society.

[66]  D. Bartel,et al.  Predicting effective microRNA target sites in mammalian mRNAs , 2015, eLife.

[67]  Georg Seelig,et al.  Learning the Sequence Determinants of Alternative Splicing from Millions of Random Sequences , 2015, Cell.

[68]  T. Cech,et al.  Biochemical Properties and Biological Functions of FET Proteins. , 2015, Annual review of biochemistry.

[69]  Timothy R. Hughes,et al.  High-throughput characterization of protein–RNA interactions , 2014, Briefings in functional genomics.

[70]  F. Allain,et al.  One, Two, Three, Four! How Multiple RRMs Read the Genome Sequence. , 2015, Methods in enzymology.

[71]  I. Hofacker,et al.  Predicting RNA 3D structure using a coarse-grain helix-centered model , 2015, RNA.

[72]  I. Taylor,et al.  KH-RNA interactions: back in the groove. , 2015, Current opinion in structural biology.

[73]  Gene W. Yeo,et al.  Distinct and shared functions of ALS-associated proteins TDP-43, FUS and TAF15 revealed by multisystem analyses , 2016, Nature Communications.

[74]  Tomaz Curk,et al.  SR proteins are NXF1 adaptors that link alternative RNA processing to mRNA export , 2016, Genes & development.

[75]  Gene W. Yeo,et al.  Enhanced CLIP Uncovers IMP Protein-RNA Targets in Human Pluripotent Stem Cells Important for Cell Adhesion and Survival. , 2016, Cell reports.

[76]  Xintao Wei,et al.  Resources for the comprehensive discovery of functional RNA elements , 2015, bioRxiv.

[77]  Fátima Sánchez-Cabo,et al.  ATtRACT—a database of RNA-binding proteins and associated motifs , 2016, Database J. Biol. Databases Curation.

[78]  Peter H. Sudmant,et al.  RNA Sequence Context Effects Measured In Vitro Predict In Vivo Protein Binding and Regulation. , 2016, Molecular cell.

[79]  Yi Xing,et al.  Rbfox Proteins Regulate Splicing as Part of a Large Multiprotein Complex LASR , 2016, Cell.

[80]  Lior Pachter,et al.  Near-optimal probabilistic RNA-seq quantification , 2016, Nature Biotechnology.

[81]  Henning Urlaub,et al.  A Compendium of RNA-Binding Proteins that Regulate MicroRNA Biogenesis. , 2017, Molecular cell.

[82]  Angela N. Brooks,et al.  RBM25 is a global splicing factor promoting inclusion of alternatively spliced exons and is itself regulated by lysine mono-methylation , 2017, The Journal of Biological Chemistry.

[83]  I. Poser,et al.  Cellular differentiation state modulates the mRNA export activity of SR proteins , 2017, The Journal of cell biology.

[84]  Jianlin Lei,et al.  An Atomic Structure of the Human Spliceosome , 2017, Cell.

[85]  Kaitlin U Laverty,et al.  RNAcompete-S: Combined RNA sequence/structure preferences for RNA binding proteins derived from a single-step in vitro selection. , 2017, Methods.

[86]  H. Okano,et al.  Structural Insight into the Recognition of r(UAG) by Musashi-1 RBD2, and Construction of a Model of Musashi-1 RBD1-2 Bound to the Minimum Target RNA , 2017, Molecules.

[87]  Gene W. Yeo,et al.  A Large-Scale Binding and Functional Map of Human RNA Binding Proteins , 2017, bioRxiv.

[88]  R. Amann,et al.  Predictive Identification of Exonic Splicing Enhancers in Human Genes , 2022 .