Photo-cross-linking and high-resolution mass spectrometry for assignment of RNA-binding sites in RNA-binding proteins

RNA-protein complexes play pivotal roles in many central biological processes. Although methods based on high-throughput sequencing have advanced our ability to identify the specific RNAs bound by a particular protein, there is a need for precise and systematic ways to identify RNA interaction sites on proteins. We have developed an experimental and computational workflow combining photo-induced cross-linking, high-resolution mass spectrometry and automated analysis of the resulting mass spectra for the identification of cross-linked peptides, cross-linking sites and the cross-linked RNA oligonucleotide moieties of such RNA-binding proteins. The workflow can be applied to any RNA-protein complex of interest or to whole proteomes. We applied the approach to human and yeast mRNA-protein complexes in vitro and in vivo, demonstrating its powerful utility by identifying 257 cross-linking sites on 124 distinct RNA-binding proteins. The open-source software pipeline developed for this purpose, RNPxl, is available as part of the OpenMS project.

[1]  S. Kostka,et al.  Sm protein–Sm site RNA interactions within the inner ring of the spliceosomal snRNP core structure , 2001, The EMBO journal.

[2]  H. Urlaub,et al.  Mass-spectrometric analysis of proteins cross-linked to 4-thio-uracil- and 5-bromo-uracil-substituted RNA. , 2011 .

[3]  Knut Reinert,et al.  OpenMS – An open-source software framework for mass spectrometry , 2008, BMC Bioinformatics.

[4]  Henning Urlaub,et al.  Protein Composition and Electron Microscopy Structure of Affinity-Purified Human Spliceosomal B Complexes Isolated under Physiological Conditions , 2006, Molecular and Cellular Biology.

[5]  H. Urlaub,et al.  The Prp8 RNase H-like domain inhibits Brr2-mediated U4/U6 snRNA unwinding by blocking Brr2 loading onto the U4 snRNA. , 2012, Genes & development.

[6]  H. Urlaub,et al.  Crystal structure of Cwc2 reveals a novel architecture of a multipartite RNA‐binding protein , 2012, The EMBO journal.

[7]  Sarah J. Wheelan,et al.  Transcriptome-Wide Binding Sites for Components of the Saccharomyces cerevisiae Non-Poly(A) Termination Pathway: Nrd1, Nab3, and Sen1 , 2011, PLoS genetics.

[8]  R. Roeder,et al.  Accurate transcription initiation by RNA polymerase II in a soluble extract from isolated mammalian nuclei. , 1983, Nucleic acids research.

[9]  Scott B. Dewell,et al.  Transcriptome-wide Identification of RNA-Binding Protein and MicroRNA Target Sites by PAR-CLIP , 2010, Cell.

[10]  S. Bryant,et al.  Open mass spectrometry search algorithm. , 2004, Journal of proteome research.

[11]  Stephan Wickles,et al.  Structural characterization of a eukaryotic chaperone—the ribosome-associated complex , 2012, Nature Structural &Molecular Biology.

[12]  P. Limbach,et al.  Application of fractional mass for the identification of peptide-oligonucleotide cross-links by mass spectrometry. , 2008, Journal of mass spectrometry : JMS.

[13]  M. Mann,et al.  MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification , 2008, Nature Biotechnology.

[14]  P. Cramer,et al.  A Cytoplasmic Complex Mediates Specific mRNA Recognition and Localization in Yeast , 2011, PLoS biology.

[15]  Sergey V. Melnikov,et al.  The structure of the eukaryotic ribosome at 3.0 angstrom resolution. , 2011 .

[16]  Richard Bonneau,et al.  The mRNA-bound proteome and its global occupancy profile on protein-coding transcripts. , 2012, Molecular cell.

[17]  J. Silberg,et al.  A transposase strategy for creating libraries of circularly permuted proteins , 2012, Nucleic acids research.

[18]  Sergey Melnikov,et al.  The Structure of the Eukaryotic Ribosome at 3.0 Å Resolution , 2011, Science.

[19]  M. Hentze,et al.  Enzymes as RNA-binding proteins: a role for (di)nucleotide-binding domains? , 1994, Trends in biochemical sciences.

[20]  Michael Sattler,et al.  Multi-domain conformational selection underlies pre-mRNA splicing regulation by U2AF , 2011, Nature.

[21]  R. Brimacombe,et al.  Identification and Sequence Analysis of Contact Sites between Ribosomal Proteins and rRNA in Escherichia coli 30 S Subunits by a New Approach Using Matrix-assisted Laser Desorption/Ionization-Mass Spectrometry Combined with N-terminal Microsequencing* , 1997, The Journal of Biological Chemistry.

[22]  Matthias Mann,et al.  Quantitative proteomic analysis reveals concurrent RNA–protein interactions and identifies new RNA-binding proteins in Saccharomyces cerevisiae , 2013, Genome research.

[23]  Knut Reinert,et al.  TOPP - the OpenMS proteomics pipeline , 2007, Bioinform..

[24]  D. N. Perkins,et al.  Probability‐based protein identification by searching sequence databases using mass spectrometry data , 1999, Electrophoresis.

[25]  Jürgen Cox,et al.  A systematic investigation into the nature of tryptic HCD spectra. , 2012, Journal of proteome research.

[26]  G. Hannon,et al.  Dogma derailed: the many influences of RNA on the genome. , 2013, Molecular cell.

[27]  M. Wickens,et al.  A 5′ cytosine binding pocket in Puf3p specifies regulation of mitochondrial mRNAs , 2009, Proceedings of the National Academy of Sciences.

[28]  Natalie I. Tasman,et al.  A Cross-platform Toolkit for Mass Spectrometry and Proteomics , 2012, Nature Biotechnology.

[29]  Johannes Griss,et al.  The Proteomics Identifications (PRIDE) database and associated tools: status in 2013 , 2012, Nucleic Acids Res..

[30]  U. Heinemann,et al.  High resolution crystal structure of domain I of the Saccharomyces cerevisiae homing endonuclease PI-SceI. , 2002, Nucleic acids research.

[31]  R. Terns,et al.  Non-coding RNAs: lessons from the small nuclear and small nucleolar RNAs , 2007, Nature Reviews Molecular Cell Biology.

[32]  H. Urlaub,et al.  Juzen-taiho-to (Shi-Quan-Da-Bu-Tang): Scientific Evaluation and Clinical Application , 2006, Evidence-based Complementary and Alternative Medicine.

[33]  T. Glisovic,et al.  RNA‐binding proteins and post‐transcriptional gene regulation , 2008, FEBS letters.

[34]  H. Urlaub,et al.  Investigation of protein-RNA interactions by mass spectrometry--Techniques and applications. , 2012, Journal of proteomics.

[35]  H. Urlaub,et al.  A novel Nop5-sRNA interaction that is required for efficient archaeal box C/D sRNP formation. , 2010, RNA.

[36]  M. Mann,et al.  Parts per Million Mass Accuracy on an Orbitrap Mass Spectrometer via Lock Mass Injection into a C-trap*S , 2005, Molecular & Cellular Proteomics.

[37]  C. Norbury,et al.  The Long and Short of MicroRNA , 2013, Cell.

[38]  Henning Urlaub,et al.  Structural and functional analysis of the E. coli NusB-S10 transcription antitermination complex. , 2009, Molecular cell.

[39]  B. Séraphin,et al.  A generic protein purification method for protein complex characterization and proteome exploration , 1999, Nature Biotechnology.

[40]  F. Allain,et al.  Solution structure of the HMG protein NHP6A and its interaction with DNA reveals the structural determinants for non‐sequence‐specific binding , 1999, The EMBO journal.

[41]  Jeroen Krijgsveld,et al.  System-wide identification of RNA-binding proteins by interactome capture , 2013, Nature Protocols.

[42]  Norman E. Davey,et al.  Insights into RNA Biology from an Atlas of Mammalian mRNA-Binding Proteins , 2012, Cell.

[43]  Lennart Martens,et al.  mzML—a Community Standard for Mass Spectrometry Data* , 2010, Molecular & Cellular Proteomics.

[44]  C. Lenz,et al.  Mapping the binding site of snurportin 1 on native U1 snRNP by cross-linking and mass spectrometry , 2010, Nucleic acids research.

[45]  D. P. Pomeranz Krummel,et al.  Architecture of the spliceosome. , 2012, Biochemistry.

[46]  Roy Parker,et al.  Global Analysis of Yeast mRNPs , 2012, Nature Structural &Molecular Biology.

[47]  J. Mattick,et al.  Structure and function of long noncoding RNAs in epigenetic regulation , 2013, Nature Structural &Molecular Biology.

[48]  Oliver Kohlbacher,et al.  TOPPView: an open-source viewer for mass spectrometry data. , 2009, Journal of proteome research.

[49]  H. Urlaub,et al.  Isolation of an active step I spliceosome and composition of its RNP core , 2008, Nature.

[50]  The UniProt Consortium,et al.  Reorganizing the protein space at the Universal Protein Resource (UniProt) , 2011, Nucleic Acids Res..

[51]  Knut Reinert,et al.  A geometric approach for the alignment of liquid chromatography - mass spectrometry data , 2007, ISMB/ECCB.