Deciphering the protein‐RNA recognition code: Combining large‐scale quantitative methods with structural biology

RNA binding proteins (RBPs) are key factors for the regulation of gene expression by binding to cis elements, i.e. short sequence motifs in RNAs. Recent studies demonstrate that cooperative binding of multiple RBPs is important for the sequence‐specific recognition of RNA and thereby enables the regulation of diverse biological activities by a limited set of RBPs. Cross‐linking immuno‐precipitation (CLIP) and other recently developed high‐throughput methods provide comprehensive, genome‐wide maps of protein‐RNA interactions in the cell. Structural biology gives detailed insights into molecular mechanisms and principles of RNA recognition by RBPs, but has so far focused on single RNA binding proteins and often on single RNA binding domains. The combination of high‐throughput methods and detailed structural biology studies is expected to greatly advance our understanding of the code for protein‐RNA recognition in gene regulation, as we review in this article.

[1]  Michael Q. Zhang,et al.  Design and bioinformatics analysis of genome-wide CLIP experiments , 2015, Nucleic acids research.

[2]  Marco Blanchette,et al.  SR proteins control a complex network of RNA-processing events , 2015, RNA.

[3]  F. Gebauer,et al.  Breaking the protein-RNA recognition code , 2014, Cell cycle.

[4]  N. Rajewsky,et al.  A Variety of Dicer Substrates in Human and C. elegans , 2014, Cell.

[5]  P. Stadler,et al.  The coilin interactome identifies hundreds of small noncoding RNAs that traffic through Cajal bodies. , 2014, Molecular cell.

[6]  S. Gerstberger,et al.  A census of human RNA-binding proteins , 2014, Nature Reviews Genetics.

[7]  F. Gebauer,et al.  Structural basis for the assembly of the Sxl–Unr translation regulatory complex , 2014, Nature.

[8]  F. Gebauer,et al.  UNR facilitates the interaction of MLE with the lncRNA roX2 during Drosophila dosage compensation , 2014, Nature Communications.

[9]  Jernej Ule,et al.  Rbfox2-coordinated alternative splicing of Mef2d and Rock2 controls myoblast fusion during myogenesis. , 2014, Molecular cell.

[10]  Peng R. Chen,et al.  Genetically encoded cleavable protein photo-cross-linker. , 2014, Journal of the American Chemical Society.

[11]  R. Janowski,et al.  Structural basis for RNA recognition in roquin-mediated post-transcriptional gene regulation , 2014, Nature Structural &Molecular Biology.

[12]  F. Allain,et al.  A fly trap mechanism provides sequence-specific RNA recognition by CPEB proteins , 2014, Genes & development.

[13]  Liang Tong,et al.  The ROQ domain of Roquin recognizes mRNA constitutive decay element and double-stranded RNA , 2014, Nature Structural &Molecular Biology.

[14]  M. Sattler,et al.  The dynamic duo: Combining NMR and small angle scattering in structural biology , 2014, Protein science : a publication of the Protein Society.

[15]  R. Sprangers,et al.  In Vitro Reconstitution of a Cellular Phase-Transition Process that Involves the mRNA Decapping Machinery , 2014, Angewandte Chemie.

[16]  C. Dieterich,et al.  MOV10 Is a 5' to 3' RNA helicase contributing to UPF1 mRNA target degradation by translocation along 3' UTRs. , 2014, Molecular cell.

[17]  Michael Sattler,et al.  Transient electrostatic interactions dominate the conformational equilibrium sampled by multidomain splicing factor U2AF65: a combined NMR and SAXS study. , 2014, Journal of the American Chemical Society.

[18]  Abdullah Ozer,et al.  Comprehensive Analysis of RNA-Protein Interactions by High Throughput Sequencing-RNA Affinity Profiling , 2014, Nature Methods.

[19]  P. Sharp,et al.  RNA Bind-n-Seq: quantitative assessment of the sequence and structural binding specificity of RNA binding proteins. , 2014, Molecular cell.

[20]  M. Gorospe,et al.  The binding of TIA-1 to RNA C-rich sequences is driven by its C-terminal RRM domain , 2014, RNA biology.

[21]  Howard Y. Chang,et al.  Quantitative analysis of RNA-protein interactions on a massively parallel array for mapping biophysical and evolutionary landscapes , 2014, Nature Biotechnology.

[22]  Ahmed S. Moursy,et al.  Characterization of the RNA recognition mode of hnRNP G extends its role in SMN2 splicing regulation , 2014, Nucleic acids research.

[23]  Rainer Merkl,et al.  The NHL domain of BRAT is an RNA-binding domain that directly contacts the hunchback mRNA for regulation , 2014, Genes & development.

[24]  Cameron D. Mackereth,et al.  Protein chemical shift assignments of the unbound and RNA-bound forms of the alternative splicing factor SUP-12 from C. elegans , 2014, Biomolecular NMR assignments.

[25]  J. Steitz,et al.  The Noncoding RNA Revolution—Trashing Old Rules to Forge New Ones , 2014, Cell.

[26]  J. Valcárcel,et al.  Structure, dynamics and RNA binding of the multi-domain splicing factor TIA-1 , 2014, Nucleic acids research.

[27]  J. Buchner,et al.  Modulation of the Hsp90 chaperone cycle by a stringent client protein. , 2014, Molecular cell.

[28]  Julian König,et al.  Crosslinking-immunoprecipitation (iCLIP) analysis reveals global regulatory roles of hnRNP L , 2014, RNA biology.

[29]  A. Mele,et al.  Mapping Argonaute and conventional RNA-binding protein interactions with RNA at single-nucleotide resolution using HITS-CLIP and CIMS analysis , 2014, Nature Protocols.

[30]  Yu Zhou,et al.  De Novo Prediction of PTBP1 Binding and Splicing Targets Reveals Unexpected Features of Its RNA Recognition and Function , 2014, PLoS Comput. Biol..

[31]  J. Ule,et al.  Molecular basis of UG-rich RNA recognition by the human splicing factor TDP-43 , 2013, Nature Structural &Molecular Biology.

[32]  E. Hurt,et al.  Eukaryotic ribosome biogenesis at a glance , 2013, Journal of Cell Science.

[33]  Audrone Lapinaite,et al.  The structure of the box C/D enzyme reveals regulation of RNA methylation , 2013, Nature.

[34]  Michael E. Harris,et al.  Hidden specificity in an apparently non-specific RNA-binding protein , 2013, Nature.

[35]  Jens Keilwagen,et al.  A general approach for discriminative de novo motif discovery from high-throughput data , 2013, GCB.

[36]  R. Aebersold,et al.  Structural features of Argonaute–GW182 protein interactions , 2013, Proceedings of the National Academy of Sciences.

[37]  A. Dziembowski,et al.  U6 RNA biogenesis and disease association , 2013, Wiley interdisciplinary reviews. RNA.

[38]  Jeroen Krijgsveld,et al.  The RNA-binding protein repertoire of embryonic stem cells , 2013, Nature Structural &Molecular Biology.

[39]  Brendan J. Frey,et al.  A compendium of RNA-binding motifs for decoding gene regulation , 2013, Nature.

[40]  A. Krainer,et al.  Isolated pseudo–RNA-recognition motifs of SR proteins can regulate splicing using a noncanonical mode of RNA recognition , 2013, Proceedings of the National Academy of Sciences.

[41]  Timothy L Bailey,et al.  Defining the RGG/RG motif. , 2013, Molecular cell.

[42]  M. Rudolph,et al.  Recognition of two distinct elements in the RNA substrate by the RNA-binding domain of the T. thermophilus DEAD box helicase Hera , 2013, Nucleic acids research.

[43]  M. Sattler,et al.  Combining NMR and small angle X-ray and neutron scattering in the structural analysis of a ternary protein-RNA complex , 2013, Journal of biomolecular NMR.

[44]  Michael R. Green,et al.  Structure of phosphorylated SF1 bound to U2AF⁶⁵ in an essential splicing factor complex. , 2013, Structure.

[45]  M. Ascano,et al.  Multi-disciplinary methods to define RNA-protein interactions and regulatory networks. , 2013, Current opinion in genetics & development.

[46]  F. Allain,et al.  RRM-RNA recognition: NMR or crystallography…and new findings. , 2013, Current opinion in structural biology.

[47]  Julian König,et al.  Direct Competition between hnRNP C and U2AF65 Protects the Transcriptome from the Exonization of Alu Elements , 2013, Cell.

[48]  S. Sieber,et al.  Structure, phosphorylation and U2AF65 binding of the N-terminal domain of splicing factor 1 during 3′-splice site recognition , 2012, Nucleic acids research.

[49]  Roy Parker,et al.  Global Analysis of Yeast mRNPs , 2012, Nature Structural &Molecular Biology.

[50]  David Klenerman,et al.  Ubiquitin chain conformation regulates recognition and activity of interacting proteins , 2012, Nature.

[51]  A. Mele,et al.  Neuronal Elav-like (Hu) Proteins Regulate RNA Splicing and Abundance to Control Glutamate Levels and Neuronal Excitability , 2012, Neuron.

[52]  David G. Knowles,et al.  The GENCODE v7 catalog of human long noncoding RNAs: Analysis of their gene structure, evolution, and expression , 2012, Genome research.

[53]  Julian König,et al.  Analysis of CLIP and iCLIP methods for nucleotide-resolution studies of protein-RNA interactions , 2012, Genome Biology.

[54]  Philip Bradley,et al.  Atomistic modeling of protein-DNA interaction specificity: progress and applications. , 2012, Current opinion in structural biology.

[55]  Renato Paro,et al.  Mixture models and wavelet transforms reveal high confidence RNA-protein interaction sites in MOV10 PAR-CLIP data , 2012, Nucleic acids research.

[56]  A. Lambowitz,et al.  Structural basis for RNA duplex recognition and unwinding by the DEAD-box helicase Mss116p , 2012, Nature.

[57]  Richard Bonneau,et al.  The mRNA-bound proteome and its global occupancy profile on protein-coding transcripts. , 2012, Molecular cell.

[58]  Norman E. Davey,et al.  Insights into RNA Biology from an Atlas of Mammalian mRNA-Binding Proteins , 2012, Cell.

[59]  M. Sattler,et al.  Dynamics in multi-domain protein recognition of RNA. , 2012, Current opinion in structural biology.

[60]  Jimin Pei,et al.  Cell-free Formation of RNA Granules: Low Complexity Sequence Domains Form Dynamic Fibers within Hydrogels , 2012, Cell.

[61]  Jimin Pei,et al.  Cell-free Formation of RNA Granules: Bound RNAs Identify Features and Components of Cellular Assemblies , 2012, Cell.

[62]  F. van Roy,et al.  A flexible integrative approach based on random forest improves prediction of transcription factor binding sites , 2012, Nucleic acids research.

[63]  Jernej Ule,et al.  The RNA-binding landscapes of two SR proteins reveal unique functions and binding to diverse RNA classes , 2012, Genome Biology.

[64]  J. Valcárcel,et al.  hnRNP A1 proofreads 3' splice site recognition by U2AF. , 2012, Molecular cell.

[65]  J. Jenkins,et al.  Three RNA recognition motifs participate in RNA recognition and structural organization by the pro-apoptotic factor TIA-1. , 2012, Journal of molecular biology.

[66]  Thomas Conrad,et al.  Dosage compensation in Drosophila melanogaster: epigenetic fine-tuning of chromosome-wide transcription , 2012, Nature Reviews Genetics.

[67]  F. Allain,et al.  A syn–anti conformational difference allows SRSF2 to recognize guanines and cytosines equally well , 2012, The EMBO journal.

[68]  Zoltán Konthur,et al.  Probing the SELEX Process with Next-Generation Sequencing , 2011, PloS one.

[69]  H. Okano,et al.  Structure of Musashi1 in a complex with target RNA: the role of aromatic stacking interactions , 2011, Nucleic acids research.

[70]  R. Gregory,et al.  Molecular Basis for Interaction of let-7 MicroRNAs with Lin28 , 2011, Cell.

[71]  M. Gale,et al.  Structural basis of RNA recognition and activation by innate immune receptor RIG-I , 2011, Nature.

[72]  T. Edwards,et al.  Kinked β-strands mediate high-affinity recognition of mRNA targets by the germ-cell regulator DAZL , 2011, Proceedings of the National Academy of Sciences.

[73]  Patrick Linder,et al.  From unwinding to clamping — the DEAD box RNA helicase family , 2011, Nature Reviews Molecular Cell Biology.

[74]  D. Licatalosi,et al.  FMRP Stalls Ribosomal Translocation on mRNAs Linked to Synaptic Function and Autism , 2011, Cell.

[75]  Michael Sattler,et al.  Multi-domain conformational selection underlies pre-mRNA splicing regulation by U2AF , 2011, Nature.

[76]  R. Darnell,et al.  Mapping in vivo protein-RNA interactions at single-nucleotide resolution from HITS-CLIP data , 2011, Nature Biotechnology.

[77]  Markus Seiler,et al.  Translational Control via Protein-Regulated Upstream Open Reading Frames , 2011, Cell.

[78]  R. Maraia,et al.  3′ processing of eukaryotic precursor tRNAs , 2011, Wiley interdisciplinary reviews. RNA.

[79]  Stefan Stamm,et al.  Molecular basis of purine-rich RNA recognition by the human SR-like protein Tra2-β1 , 2011, Nature Structural &Molecular Biology.

[80]  J. Ule,et al.  Characterizing the RNA targets and position-dependent splicing regulation by TDP-43. , 2011, Nature neuroscience.

[81]  Michael Sattler,et al.  NMR and small-angle scattering-based structural analysis of protein complexes in solution. , 2011, Journal of structural biology.

[82]  J. Adams,et al.  Phosphorylation mechanism and structure of serine‐arginine protein kinases , 2011, The FEBS journal.

[83]  B. Kastner,et al.  Functional organization of the Sm core in the crystal structure of human U1 snRNP , 2010, The EMBO journal.

[84]  Grace X. Y. Zheng,et al.  Genome-wide identification of Ago2 binding sites from mouse embryonic stem cells with and without mature microRNAs , 2010, Nature Structural &Molecular Biology.

[85]  Hui Zhou,et al.  starBase: a database for exploring microRNA–mRNA interaction maps from Argonaute CLIP-Seq and Degradome-Seq data , 2010, Nucleic Acids Res..

[86]  D. Patel,et al.  Structural insights into RNA recognition by the alternate-splicing regulator CUG-binding protein 1. , 2010, Structure.

[87]  Oliver Bott,et al.  The Processing of Events , 2010 .

[88]  Robert B Darnell,et al.  HITS‐CLIP: panoramic views of protein–RNA regulation in living cells , 2010, Wiley interdisciplinary reviews. RNA.

[89]  Bradley M. Lunde,et al.  Structural insights into cis element recognition of non-polyadenylated RNAs by the Nab3-RRM , 2010, Nucleic Acids Res..

[90]  J. Ule,et al.  iCLIP reveals the function of hnRNP particles in splicing at individual nucleotide resolution , 2010, Nature Structural &Molecular Biology.

[91]  Scott B. Dewell,et al.  Transcriptome-wide Identification of RNA-Binding Protein and MicroRNA Target Sites by PAR-CLIP , 2010, Cell.

[92]  R. Sharan,et al.  I-TASSER: a unified platform for automated protein structure and function prediction , 2010, Nature Protocols.

[93]  Gene W. Yeo,et al.  Comprehensive discovery of endogenous Argonaute binding sites in Caenorhabditis elegans , 2010, Nature Structural &Molecular Biology.

[94]  I. Taylor,et al.  Structure of the Rna15 RRM–RNA complex reveals the molecular basis of GU specificity in transcriptional 3′-end processing factors , 2010, Nucleic acids research.

[95]  Gene W. Yeo,et al.  Genome-wide analysis of PTB-RNA interactions reveals a strategy used by the general splicing repressor to modulate exon inclusion or skipping. , 2009, Molecular cell.

[96]  M. Kiebler,et al.  Faculty Opinions recommendation of Argonaute HITS-CLIP decodes microRNA-mRNA interaction maps. , 2009 .

[97]  Lourdes Peña Castillo,et al.  Rapid and systematic analysis of the RNA recognition specificities of RNA-binding proteins , 2009, Nature Biotechnology.

[98]  C. Oubridge,et al.  Crystal structure of human spliceosomal U1 snRNP at 5.5 Å resolution , 2009, Nature.

[99]  Lili Wan,et al.  RNA and Disease , 2009, Cell.

[100]  V. Kim,et al.  Biogenesis of small RNAs in animals , 2009, Nature Reviews Molecular Cell Biology.

[101]  Tyson A. Clark,et al.  HITS-CLIP yields genome-wide insights into brain alternative RNA processing , 2008, Nature.

[102]  Markus Blatter,et al.  RNA recognition motifs: boring? Not quite. , 2008, Current opinion in structural biology.

[103]  Jun S. Liu,et al.  Extracting sequence features to predict protein–DNA interactions: a comparative study , 2008, Nucleic acids research.

[104]  Michael Sattler,et al.  U2AF-homology motif interactions are required for alternative splicing regulation by SPF45 , 2007, Nature Structural &Molecular Biology.

[105]  Gabriele Varani,et al.  RNA is rarely at a loss for companions; as soon as RNA , 2008 .

[106]  Florian C. Oberstrass,et al.  Solving the structure of PTB in complex with pyrimidine tracts: an NMR study of protein-RNA complexes of weak affinities. , 2007, Journal of molecular biology.

[107]  Y. Hargous,et al.  Molecular basis of RNA recognition and TAP binding by the SR proteins SRp20 and 9G8 , 2006, The EMBO journal.

[108]  Frédéric H.-T. Allain,et al.  Sequence-specific binding of single-stranded RNA: is there a code for recognition? , 2006, Nucleic acids research.

[109]  J. Ebert,et al.  The Crystal Structure of the Exon Junction Complex Reveals How It Maintains a Stable Grip on mRNA , 2006, Cell.

[110]  Michael R Green,et al.  Structural basis for polypyrimidine tract recognition by the essential pre-mRNA splicing factor U2AF65. , 2006, Molecular cell.

[111]  M. Hentze,et al.  Sex-lethal imparts a sex-specific function to UNR by recruiting it to the msl-2 mRNA 3' UTR: translational repression for dosage compensation. , 2006, Genes & development.

[112]  D. Black,et al.  Structure of PTB Bound to RNA: Specific Binding and Implications for Splicing Regulation , 2005, Science.

[113]  S. Cusack,et al.  X-ray crystallographic and NMR studies of the third KH domain of hnRNP K in complex with single-stranded nucleic acids. , 2005, Structure.

[114]  Armin Shmilovici,et al.  Identification of transcription factor binding sites with variable-order Bayesian networks , 2005, Bioinform..

[115]  T. Kiss Biogenesis of small nuclear RNPs , 2004, Journal of Cell Science.

[116]  S. Hart,et al.  Sortase-mediated protein ligation: a new method for protein engineering. , 2004, Journal of the American Chemical Society.

[117]  Jernej Ule,et al.  CLIP Identifies Nova-Regulated RNA Networks in the Brain , 2003, Science.

[118]  Michael Sattler,et al.  Structural basis for the molecular recognition between human splicing factors U2AF65 and SF1/mBBP. , 2003, Molecular cell.

[119]  J. Valcárcel,et al.  The splicing regulator TIA‐1 interacts with U1‐C to promote U1 snRNP recruitment to 5′ splice sites , 2002, The EMBO journal.

[120]  Ana C. Messias,et al.  Structural Basis for Recognition of the Intron Branch Site RNA by Splicing Factor 1 , 2001, Science.

[121]  Michael R. Green,et al.  A Novel Peptide Recognition Mode Revealed by the X-Ray Structure of a Core U2AF35/U2AF65 Heterodimer , 2001, Cell.

[122]  L. Gold,et al.  Interactions of Escherichia coli RNA with bacteriophage MS2 coat protein: genomic SELEX. , 2000, Nucleic acids research.

[123]  M. Hentze,et al.  Translational control of dosage compensation in Drosophila by Sex‐lethal: cooperative silencing via the 5′ and 3′ UTRs of msl‐2 mRNA is independent of the poly(A) tail , 1999, The EMBO journal.

[124]  Rahul C. Deo,et al.  Recognition of Polyadenylate RNA by the Poly(A)-Binding Protein , 1999, Cell.

[125]  D. Rio,et al.  Absence of interdomain contacts in the crystal structure of the RNA recognition motifs of Sex-lethal. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[126]  Kazuki Kurimoto,et al.  Structural basis for recognition of the tra mRNA precursor by the Sex-lethal protein , 1999, Nature.

[127]  R. Kelley,et al.  Sex lethal controls dosage compensation in Drosophila by a non-splicing mechanism , 1997, Nature.

[128]  T. Steitz,et al.  Crystal structure of the two RNA binding domains of human hnRNP A1 at 1.75 Å resolution , 1997, Nature Structural Biology.

[129]  G. Varani,et al.  Specificity of ribonucleoprotein interaction determined by RNA folding during complex formation , 1996, Nature.

[130]  Nobutoshi Ito,et al.  Crystal structure at 1.92 Å resolution of the RNA-binding domain of the U1A spliceosomal protein complexed with an RNA hairpin , 1994, Nature.

[131]  Hermann Schindelin,et al.  Universal nucleic acid-binding domain revealed by crystal structure of the B. subtilis major cold-shock protein , 1993, Nature.

[132]  Michael R. Green,et al.  The protein Sex-lethal antagonizes the splicing factor U2AF to regulate alternative splicing of transformer pre-mRNA , 1993, Nature.

[133]  J. Szostak,et al.  In vitro selection of RNA molecules that bind specific ligands , 1990, Nature.

[134]  L. Gold,et al.  Systematic evolution of ligands by exponential enrichment: RNA ligands to bacteriophage T4 DNA polymerase. , 1990, Science.

[135]  V. Ramakrishnan,et al.  Distribution of protein and RNA in the 30S ribosomal subunit. , 1986, Science.

[136]  B. Jacrot,et al.  REVIEW ARTICLE: The study of biological structures by neutron scattering from solution , 1976 .

[137]  M. Zavolan,et al.  PAR-CLIP (Photoactivatable Ribonucleoside-Enhanced Crosslinking and Immunoprecipitation): a step-by-step protocol to the transcriptome-wide identification of binding sites of RNA-binding proteins. , 2014, Methods in enzymology.

[138]  Anke Busch,et al.  Evolution of SR protein and hnRNP splicing regulatory factors , 2012, Wiley interdisciplinary reviews. RNA.

[139]  Udo Heinemann,et al.  RNA single strands bind to a conserved surface of the major cold shock protein in crystals and solution. , 2012, RNA.

[140]  J. Cáceres,et al.  The SR protein family of splicing factors: master regulators of gene expression. , 2009, The Biochemical journal.