Bioinformatics tools for the structural elucidation of multi-subunit protein complexes by mass spectrometric analysis of protein-protein cross-links

Multi-subunit protein complexes are involved in many essential biochemical processes including signal transduction, protein synthesis, RNA synthesis, DNA replication and protein degradation. An accurate description of the relative structural arrangement of the constituent subunits in such complexes is crucial for an understanding of the molecular mechanism of the complex as a whole. Many complexes, however, lie in the mega-Dalton range, and are not amenable to X-ray crystallographic or nuclear magnetic resonance analysis. Techniques that are suited to structural studies of such large complexes, such as cryo-electron microscopy, do not provide the resolution required for a mechanistic insight. Mass spectrometry (MS) has increasingly been applied to identify the residues that are involved in chemical cross-links in compound protein assemblies, and have provided valuable insight into the molecular arrangement, orientation and contact surfaces of subunits within such large complexes. This approach is known as MS3D, and involves the MS analysis of cross-linked di-peptides following the enzymatic cleavage of a chemically cross-linked complex. A major challenge of this approach is the identification of the cross-linked di-peptides in a composite mixture of peptides, as well as the identification of the residues involved in the cross-link. These analyses require bioinformatics tools with capabilities beyond that of general, MS-based proteomic analysis software. Many MS3D software tools have appeared, often designed for very specific experimental methods. Here, we provide a review of all major MS3D bioinformatics programmes, reviewing their applicability to different workflows, specific experimental requirements and the computational approach taken by each.

[1]  Slavica Jonic,et al.  Protein structure determination by electron cryo-microscopy. , 2009, Current opinion in pharmacology.

[2]  Ruedi Aebersold,et al.  Identification of cross-linked peptides from large sequence databases , 2008, Nature Methods.

[3]  Leo C. McHugh,et al.  Harvest: an open-source tool for the validation and improvement of peptide identification metrics and fragmentation exploration , 2010, BMC Bioinformatics.

[4]  William Stafford Noble,et al.  Detecting cross-linked peptides by searching against a database of cross-linked peptide pairs. , 2010, Journal of proteome research.

[5]  M. Brinkley A brief survey of methods for preparing protein conjugates with dyes, haptens, and cross-linking reagents. , 1992, Bioconjugate chemistry.

[6]  D. Suckau,et al.  Screening for disulfide bonds in proteins by MALDI in-source decay and LIFT-TOF/TOF-MS. , 2002, Analytical chemistry.

[7]  R. Ambler,et al.  The use of thermolysin in amino acid sequence determination. , 1968, The Biochemical journal.

[8]  Zhongqi Zhang Prediction of low-energy collision-induced dissociation spectra of peptides. , 2004, Analytical chemistry.

[9]  Sándor Suhai,et al.  Fragmentation Pathways of Protonated Peptides , 2006 .

[10]  G. Anderson,et al.  Identification of Protein-Protein Interactions and Topologies in Living Cells with Chemical Cross-linking and Mass Spectrometry*S , 2009, Molecular & Cellular Proteomics.

[11]  J. Yates,et al.  Method to correlate tandem mass spectra of modified peptides to amino acid sequences in the protein database. , 1995, Analytical chemistry.

[12]  W. Lehmann,et al.  Protease‐catalyzed incorporation of 18O into peptide fragments and its application for protein sequencing by electrospray and matrix‐assisted laser desorption/ionization mass spectrometry , 1996, Electrophoresis.

[13]  Albert Sickmann,et al.  Command line tool for calculating theoretical MS spectra for given sequences , 2004, Bioinform..

[14]  I. Mavelli,et al.  Stoichiometry and topology of the complex of the endogenous ATP synthase inhibitor protein IF(1) with calmodulin. , 2010, Biochemistry.

[15]  Juri Rappsilber,et al.  Structural Analysis of Multiprotein Complexes by Cross-linking, Mass Spectrometry, and Database Searching*S , 2007, Molecular & Cellular Proteomics.

[16]  Christophe Geourjon,et al.  MSX-3D: a tool to validate 3D protein models using mass spectrometry , 2008, Bioinform..

[17]  James E Bruce,et al.  Mass spectrometry identifiable cross-linking strategy for studying protein-protein interactions. , 2005, Analytical chemistry.

[18]  A. Petrescu,et al.  Interface Analysis of the Complex between ERK2 and PTP-SL , 2009, PloS one.

[19]  I. Papayannopoulos,et al.  The interpretation of collision‐induced dissociation tandem mass spectra of peptides , 1996 .

[20]  M. Mann,et al.  Stable Isotope Labeling by Amino Acids in Cell Culture, SILAC, as a Simple and Accurate Approach to Expression Proteomics* , 2002, Molecular & Cellular Proteomics.

[21]  Gordon A Anderson,et al.  Informatics strategies for large-scale novel cross-linking analysis. , 2007, Journal of proteome research.

[22]  Jianwen Fang,et al.  A Three-dimensional Homology Model of Lipid-free Apolipoprotein A-IV Using Cross-linking and Mass Spectrometry* , 2008, Journal of Biological Chemistry.

[23]  Peter R Baker,et al.  Finding Chimeras: a Bioinformatics Strategy for Identification of Cross-linked Peptides* , 2009, Molecular & Cellular Proteomics.

[24]  George C Tseng,et al.  Statistical characterization of the charge state and residue dependence of low-energy CID peptide dissociation patterns. , 2005, Analytical chemistry.

[25]  Alexey I Nesvizhskii,et al.  Investigation of neutral loss during collision-induced dissociation of peptide ions. , 2005, Analytical chemistry.

[26]  Yong Tang,et al.  CLPM: A Cross-Linked Peptide Mapping Algorithm for Mass Spectrometric Analysis , 2005, BMC Bioinformatics.

[27]  C D Schwieters,et al.  The VMD-XPLOR visualization package for NMR structure refinement. , 2001, Journal of magnetic resonance.

[28]  T. Takao,et al.  Facile assignment of sequence ions of a peptide labelled with 18O at the carboxyl terminus. , 1991, Rapid communications in mass spectrometry : RCM.

[29]  D Fenyö A software tool for the analysis of mass spectrometric disulfide mapping experiments , 1997, Comput. Appl. Biosci..

[30]  Eugene A. Kapp,et al.  Mining a tandem mass spectrometry database to determine the trends and global factors influencing peptide fragmentation. , 2003, Analytical chemistry.

[31]  P. Roepstorff,et al.  Proposal for a common nomenclature for sequence ions in mass spectra of peptides. , 1984, Biomedical mass spectrometry.

[32]  Malin M. Young,et al.  A top down approach to protein structural studies using chemical cross-linking and Fourier transform mass spectrometry. , 2003, Rapid communications in mass spectrometry : RCM.

[33]  Jason Lanman,et al.  Synthesis of biotin-tagged chemical cross-linkers and their applications for mass spectrometry. , 2009, Rapid communications in mass spectrometry : RCM.

[34]  David R Goodlett,et al.  Pro-CrossLink. Software tool for protein cross-linking and mass spectrometry. , 2006, Analytical chemistry.

[35]  Jean-Loup Faulon,et al.  Exploring the conformational space of membrane protein folds matching distance constraints , 2003, Protein science : a publication of the Protein Society.

[36]  P. Højrup,et al.  Determination of protein conformation by isotopically labelled cross-linking and dedicated software: , 2007 .

[37]  Brett S Phinney,et al.  Shotgun cross-linking analysis for studying quaternary and tertiary protein structures. , 2007, Journal of proteome research.

[38]  M. Mann,et al.  Precision proteomics: The case for high resolution and high mass accuracy , 2008, Proceedings of the National Academy of Sciences.

[39]  Robertson Craig,et al.  TANDEM: matching proteins with tandem mass spectra. , 2004, Bioinformatics.

[40]  Andrea Sinz,et al.  Chemical cross-linking and mass spectrometry to map three-dimensional protein structures and protein-protein interactions. , 2006, Mass spectrometry reviews.

[41]  B. Roschitzki,et al.  Epitope mapping on bovine prion protein using chemical cross-linking and mass spectrometry. , 2008, Journal of mass spectrometry : JMS.

[42]  Frank Alber,et al.  Unraveling the interface of signal recognition particle and its receptor by using chemical cross-linking and tandem mass spectrometry. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[43]  Ning Zhang,et al.  Protein cross-linking analysis using mass spectrometry, isotope-coded cross-linkers, and integrated computational data processing. , 2006, Journal of proteome research.

[44]  J. Yates,et al.  Statistical characterization of ion trap tandem mass spectra from doubly charged tryptic peptides. , 2003, Analytical chemistry.

[45]  P. Cramer,et al.  Architecture of the RNA polymerase II–TFIIF complex revealed by cross-linking and mass spectrometry , 2010, EMBO Journal.

[46]  Birgit Schilling,et al.  MS2Assign, automated assignment and nomenclature of tandem mass spectra of chemically crosslinked peptides , 2003, Journal of the American Society for Mass Spectrometry.

[47]  Nathan E Hall,et al.  Characterization of an Antagonist Interleukin-6 Dimer by Stable Isotope Labeling, Cross-linking, and Mass Spectrometry* , 2002, The Journal of Biological Chemistry.

[48]  Peter Roepstorff,et al.  Mass spectrometric analysis of protein interactions. , 2005, Current opinion in chemical biology.

[49]  Sylvia Richardson,et al.  Using statistical models to identify factors that have a role in defining the abundance of ions produced by tandem MS. , 2007, Analytical chemistry.

[50]  Malin M. Young,et al.  Top-down characterization of nucleic acids modified by structural probes using high-resolution tandem mass spectrometry and automated data interpretation. , 2004, Analytical chemistry.

[51]  R. Aebersold,et al.  ProbID: A probabilistic algorithm to identify peptides through sequence database searching using tandem mass spectral data , 2002, Proteomics.

[52]  H. Steen,et al.  GPMAW--a software tool for analyzing proteins and peptides. , 2001, Trends in biochemical sciences.

[53]  Juergen Kast,et al.  Identification of protein‐protein interactions using in vivo cross‐linking and mass spectrometry , 2004, Proteomics.

[54]  B. Luke,et al.  Isotope Labeling in MS-based Proteomics , 2009 .

[55]  John R Yates,et al.  Cleavage N-terminal to proline: analysis of a database of peptide tandem mass spectra. , 2003, Analytical chemistry.

[56]  A. Sinz Investigation of protein–protein interactions in living cells by chemical crosslinking and mass spectrometry , 2010, Analytical and bioanalytical chemistry.

[57]  V. Schnaible,et al.  SearchXLinks. A program for the identification of disulfide bonds in proteins from mass spectra. , 2006, Analytical chemistry.

[58]  Peter R. Baker,et al.  Role of accurate mass measurement (+/- 10 ppm) in protein identification strategies employing MS or MS/MS and database searching. , 1999, Analytical chemistry.

[59]  Malin M. Young,et al.  High throughput protein fold identification by using experimental constraints derived from intramolecular cross-links and mass spectrometry , 2000, Proc. Natl. Acad. Sci. USA.

[60]  T. Sixma,et al.  Identification of cross-linked peptides for protein interaction studies using mass spectrometry and 18O labeling. , 2002, Analytical chemistry.

[61]  M. Meldal,et al.  Substrate preferences of glutamic-acid-specific endopeptidases assessed by synthetic peptide substrates based on intramolecular fluorescence quenching. , 1992, European journal of biochemistry.

[62]  Navdeep Jaitly,et al.  Decon2LS: An open-source software package for automated processing and visualization of high resolution mass spectrometry data , 2009, BMC Bioinformatics.

[63]  Morgan C. Giddings,et al.  Fragmentation characteristics of collision-induced dissociation in MALDI TOF/TOF mass spectrometry. , 2007, Analytical chemistry.

[64]  B. Luke,et al.  18O stable isotope labeling in MS-based proteomics. , 2009, Briefings in functional genomics & proteomics.

[65]  Zhongqi Zhang,et al.  Prediction of low-energy collision-induced dissociation spectra of peptides with three or more charges. , 2005, Analytical chemistry.

[66]  D. N. Perkins,et al.  Probability‐based protein identification by searching sequence databases using mass spectrometry data , 1999, Electrophoresis.

[67]  Frank Kjeldsen,et al.  Relative specificities of water and ammonia losses from backbone fragments in collision-activated dissociation. , 2007, Journal of proteome research.

[68]  Haruki Nakamura,et al.  Announcing the worldwide Protein Data Bank , 2003, Nature Structural Biology.

[69]  Frank Kjeldsen,et al.  Backbone carbonyl group basicities are related to gas-phase fragmentation of peptides and protein folding. , 2007, Angewandte Chemie.

[70]  Michael A. Freitas,et al.  Identification and characterization of disulfide bonds in proteins and peptides from tandem MS data by use of the MassMatrix MS/MS search engine. , 2008, Journal of proteome research.

[71]  Alexander Scherl,et al.  Characterization of protein cross-links via mass spectrometry and an open-modification search strategy. , 2008, Analytical chemistry.

[72]  Shan S. Wong,et al.  Chemistry of Protein Conjugation and Cross Linking , 1991 .

[73]  Gerald J Wyckoff,et al.  CrossSearch, a User-friendly Search Engine for Detecting Chemically Cross-linked Peptides in Conjugated Proteins*S , 2008, Molecular & Cellular Proteomics.

[74]  Jacek Otlewski,et al.  Trypsin specificity as elucidated by LIE calculations, X‐ray structures, and association constant measurements , 2004, Protein science : a publication of the Protein Society.

[75]  Hua Xu,et al.  A mass accuracy sensitive probability based scoring algorithm for database searching of tandem mass spectrometry data , 2007, BMC Bioinformatics.

[76]  C. G. Koster,et al.  Computer‐assisted mass spectrometric analysis of naturally occurring and artificially introduced cross‐links in proteins and protein complexes , 2006, The FEBS journal.

[77]  D. Hume,et al.  Protein structure determination using a combination of cross-linking, mass spectrometry, and molecular modeling. , 2008, Methods in molecular biology.

[78]  W. Herr,et al.  Structural flexibility in transcription complex formation revealed by protein-DNA photocrosslinking. , 1997, Proceedings of the National Academy of Sciences of the United States of America.