Comprehensive Identification of RNA-Binding Domains in Human Cells

Summary Mammalian cells harbor more than a thousand RNA-binding proteins (RBPs), with half of these employing unknown modes of RNA binding. We developed RBDmap to determine the RNA-binding sites of native RBPs on a proteome-wide scale. We identified 1,174 binding sites within 529 HeLa cell RBPs, discovering numerous RNA-binding domains (RBDs). Catalytic centers or protein-protein interaction domains are in close relationship with RNA-binding sites, invoking possible effector roles of RNA in the control of protein function. Nearly half of the RNA-binding sites map to intrinsically disordered regions, uncovering unstructured domains as prevalent partners in protein-RNA interactions. RNA-binding sites represent hot spots for defined posttranslational modifications such as lysine acetylation and tyrosine phosphorylation, suggesting metabolic and signal-dependent regulation of RBP function. RBDs display a high degree of evolutionary conservation and incidence of Mendelian mutations, suggestive of important functional roles. RBDmap thus yields profound insights into native protein-RNA interactions in living cells.

[1]  T. Glisovic,et al.  RNA‐binding proteins and post‐transcriptional gene regulation , 2008, FEBS letters.

[2]  H. Urlaub,et al.  Investigation of protein-RNA interactions by mass spectrometry--Techniques and applications. , 2012, Journal of proteomics.

[3]  T. Petersen,et al.  A generic method for assignment of reliability scores applied to solvent accessibility predictions , 2009, BMC Structural Biology.

[4]  Zhenhua Li,et al.  NDRG1 expression is related to the progression and prognosis of gastric cancer patients through modulating proliferation, invasion and cell cycle of gastric cancer cells , 2014, Molecular Biology Reports.

[5]  M. Mann,et al.  Protocol for micro-purification, enrichment, pre-fractionation and storage of peptides for proteomics using StageTips , 2007, Nature Protocols.

[6]  Sarath Chandra Janga,et al.  A Screen for RNA-Binding Proteins in Yeast Indicates Dual Functions for Many Enzymes , 2010, PloS one.

[7]  Julia Salzman,et al.  Proteome-Wide Search Reveals Unexpected RNA-Binding Proteins in Saccharomyces cerevisiae , 2010, PloS one.

[8]  Silvia Domcke,et al.  Molecular mechanisms for the RNA-dependent ATPase activity of Upf1 and its regulation by Upf2. , 2011, Molecular cell.

[9]  J. Ebert,et al.  The Crystal Structure of the Exon Junction Complex Reveals How It Maintains a Stable Grip on mRNA , 2006, Cell.

[10]  Panagiotis K. Papasaikas,et al.  Functional splicing network reveals extensive regulatory potential of the core spliceosomal machinery. , 2014, Molecular cell.

[11]  Zsuzsanna Dosztányi,et al.  IUPred: web server for the prediction of intrinsically unstructured regions of proteins based on estimated energy content , 2005, Bioinform..

[12]  M. Mann,et al.  MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification , 2008, Nature Biotechnology.

[13]  S. Levine,et al.  Toll-like receptor, RIG-I-like receptors and the NLRP3 inflammasome: key modulators of innate immune responses to double-stranded RNA viruses. , 2011, Cytokine & growth factor reviews.

[14]  Jeroen Krijgsveld,et al.  The RNA-binding proteomes from yeast to man harbour conserved enigmRBPs , 2015, Nature Communications.

[15]  C. Thiele,et al.  Photo-leucine and photo-methionine allow identification of protein-protein interactions in living cells , 2005, Nature Methods.

[16]  C. Brangwynne,et al.  Getting RNA and Protein in Phase , 2012, Cell.

[17]  Kai-Wei Chang,et al.  RNA-binding proteins in human genetic disease. , 2008, Trends in genetics : TIG.

[18]  E. Conti,et al.  The conformational plasticity of eukaryotic RNA‐dependent ATPases , 2015, The FEBS journal.

[19]  John N. Hutchinson,et al.  An architectural role for a nuclear noncoding RNA: NEAT1 RNA is essential for the structure of paraspeckles. , 2009, Molecular cell.

[20]  J. Somarelli,et al.  Spliceosomal immunophilins , 2008, FEBS letters.

[21]  Robert D. Finn,et al.  InterPro in 2011: new developments in the family and domain prediction database , 2011, Nucleic acids research.

[22]  Ilan Davis,et al.  The new (dis)order in RNA regulation , 2016, Cell Communication and Signaling.

[23]  Y. Levy,et al.  Searching DNA via a "Monkey Bar" mechanism: the significance of disordered tails. , 2010, Journal of molecular biology.

[24]  J. Cieśla Metabolic enzymes that bind RNA: yet another level of cellular regulatory network? , 2006, Acta biochimica Polonica.

[25]  Erin M. Langdon,et al.  RNA Controls PolyQ Protein Phase Transitions. , 2015, Molecular cell.

[26]  Richard Bonneau,et al.  The mRNA-bound proteome and its global occupancy profile on protein-coding transcripts. , 2012, Molecular cell.

[27]  M. Hentze,et al.  Systemic iron homeostasis and the iron-responsive element/iron-regulatory protein (IRE/IRP) regulatory network. , 2008, Annual review of nutrition.

[28]  Michael Kinter,et al.  Two-site phosphorylation of EPRS coordinates multimodal regulation of noncanonical translational control activity. , 2009, Molecular cell.

[29]  D. Angelov,et al.  Crosslinking proteins to nucleic acids by ultraviolet laser irradiation. , 1991, Trends in biochemical sciences.

[30]  K A Brandt The GDB Human Genome Data Base: a source of integrated genetic mapping and disease data. , 1993, Bulletin of the Medical Library Association.

[31]  E. Huang,et al.  Mechanisms of FUS mutations in familial amyotrophic lateral sclerosis , 2016, Brain Research.

[32]  Tsutomu Suzuki,et al.  Hsc70/Hsp90 chaperone machinery mediates ATP-dependent RISC loading of small RNA duplexes. , 2010, Molecular cell.

[33]  C. Eyers Universal sample preparation method for proteome analysis , 2009 .

[34]  Gabriele Varani,et al.  RNA is rarely at a loss for companions; as soon as RNA , 2008 .

[35]  Timothy L Bailey,et al.  Defining the RGG/RG motif. , 2013, Molecular cell.

[36]  J. Mellor,et al.  Lysine Acetylation Controls Local Protein Conformation by Influencing Proline Isomerization , 2014, Molecular cell.

[37]  Reinout Raijmakers,et al.  Multiplex peptide stable isotope dimethyl labeling for quantitative proteomics , 2009, Nature Protocols.

[38]  Sarah E. Ewald,et al.  Nucleic acid recognition by the innate immune system. , 2011, Annual review of immunology.

[39]  Ulrich Rothbauer,et al.  A versatile assay for RNA-binding proteins in living cells , 2014, RNA.

[40]  E. Birney,et al.  Pfam: the protein families database , 2013, Nucleic Acids Res..

[41]  A. Cook,et al.  NF45 dimerizes with NF90, Zfr and SPNR via a conserved domain that has a nucleotidyltransferase fold , 2012, Nucleic acids research.

[42]  R. Cameron,et al.  Biomaterials Synthesis and Characterization Optimisation of Uv Irradiation as a Binding Site Conserving Method for Crosslinking Collagen-based Scaffolds , 2022 .

[43]  Jeroen Krijgsveld,et al.  System-wide identification of RNA-binding proteins by interactome capture , 2013, Nature Protocols.

[44]  G. Kozlov,et al.  Interdomain allostery promotes assembly of the poly(A) mRNA complex with PABP and eIF4G. , 2012, Molecular cell.

[45]  Jeroen Krijgsveld,et al.  The RNA-binding protein repertoire of embryonic stem cells , 2013, Nature Structural &Molecular Biology.

[46]  Norman E. Davey,et al.  Insights into RNA Biology from an Atlas of Mammalian mRNA-Binding Proteins , 2012, Cell.

[47]  Bernd Fischer,et al.  RNA-binding proteins in Mendelian disease. , 2013, Trends in genetics : TIG.

[48]  D. Patel,et al.  Structural insights into RNA recognition by the alternate-splicing regulator CUG-binding protein 1. , 2010, Structure.

[49]  S. Cusack,et al.  Large‐scale induced fit recognition of an m7GpppG cap analogue by the human nuclear cap‐binding complex , 2002, The EMBO journal.

[50]  Matthias Mann,et al.  Unbiased RNA–protein interaction screen by quantitative proteomics , 2009, Proceedings of the National Academy of Sciences.

[51]  Jimin Pei,et al.  Cell-free Formation of RNA Granules: Bound RNAs Identify Features and Components of Cellular Assemblies , 2012, Cell.

[52]  Y. Yuan,et al.  Structural insights into RISC assembly facilitated by dsRNA-binding domains of human RNA helicase A (DHX9) , 2013, Nucleic acids research.

[53]  D. Higgins,et al.  Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega , 2011, Molecular systems biology.

[54]  Jennifer McDowall,et al.  InterPro protein classification. , 2011, Methods in molecular biology.

[55]  M. Sternberg,et al.  Protein structure prediction on the Web: a case study using the Phyre server , 2009, Nature Protocols.

[56]  S. Gygi,et al.  FUS-SMN protein interactions link the motor neuron diseases ALS and SMA. , 2012, Cell reports.

[57]  Eszter Nagy,et al.  Glyceraldehyde-3-phosphate Dehydrogenase Selectively Binds AU-rich RNA in the NAD+-binding Region (Rossmann Fold) (*) , 1995, The Journal of Biological Chemistry.

[58]  M. Kinter,et al.  DAPK-ZIPK-L13a axis constitutes a negative-feedback module regulating inflammatory gene expression. , 2008, Molecular cell.

[59]  E. Laing,et al.  Conserved mRNA-binding proteomes in eukaryotic organisms , 2015, Nature Structural &Molecular Biology.

[60]  Leszek Rychlewski,et al.  Comprehensive classification of nucleotidyltransferase fold proteins: identification of novel families and their representatives in human , 2009, Nucleic acids research.

[61]  Panagiotis K. Papasaikas,et al.  Genome-wide identification of Fas/CD95 alternative splicing regulators reveals links with iron homeostasis. , 2015, Molecular cell.

[62]  Jimin Pei,et al.  Cell-free Formation of RNA Granules: Low Complexity Sequence Domains Form Dynamic Fibers within Hydrogels , 2012, Cell.

[63]  Albert J R Heck,et al.  Triplex protein quantification based on stable isotope labeling by peptide dimethylation applied to cell and tissue lysates , 2008, Proteomics.

[64]  G. Crooks,et al.  WebLogo: a sequence logo generator. , 2004, Genome research.

[65]  Johannes Griss,et al.  The Proteomics Identifications (PRIDE) database and associated tools: status in 2013 , 2012, Nucleic Acids Res..

[66]  Gordon K Smyth,et al.  Statistical Applications in Genetics and Molecular Biology Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments , 2011 .

[67]  F. Netter,et al.  Supplemental References , 2002, We Came Naked and Barefoot.

[68]  J. Rappsilber,et al.  Trim25 Is an RNA-Specific Activator of Lin28a/TuT4-Mediated Uridylation , 2014, Cell reports.

[69]  Shu-Hui Chen,et al.  Stable-isotope dimethyl labeling for quantitative proteomics. , 2003, Analytical chemistry.

[70]  Anne-Marie Alleaume,et al.  FASTKD2 is an RNA-binding protein required for mitochondrial RNA processing and translation , 2015, RNA.

[71]  C. Dieterich,et al.  MOV10 Is a 5' to 3' RNA helicase contributing to UPF1 mRNA target degradation by translocation along 3' UTRs. , 2014, Molecular cell.

[72]  P. S. Ray,et al.  WHEP domains direct noncanonical function of glutamyl-Prolyl tRNA synthetase in translational control of gene expression. , 2008, Molecular cell.

[73]  Gabriele Varani,et al.  RNA recognition by a Staufen double‐stranded RNA‐binding domain , 2000, The EMBO journal.

[74]  J. Darnell,et al.  Structure-function studies of FMRP RGG peptide recognition of an RNA duplex-quadruplex junction , 2011, Nature Structural &Molecular Biology.

[75]  Matthias W. Hentze,et al.  Metabolic Enzymes Enjoying New Partnerships as RNA-Binding Proteins , 2015, Trends in Endocrinology & Metabolism.

[76]  M. Walkinshaw,et al.  Crystallographic and mass spectrometric characterisation of eIF4E with N7-alkylated cap derivatives. , 2007, Journal of molecular biology.

[77]  Roy Parker,et al.  Global Analysis of Yeast mRNPs , 2012, Nature Structural &Molecular Biology.

[78]  Oliver Kohlbacher,et al.  Photo-cross-linking and high-resolution mass spectrometry for assignment of RNA-binding sites in RNA-binding proteins , 2014, Nature Methods.

[79]  J. Frydman,et al.  The Cotranslational Function of Ribosome-Associated Hsp70 in Eukaryotic Protein Homeostasis , 2013, Cell.