Computational Methods in Mass Spectrometry-Based Protein 3D Studies

Mass Spectrometry (MS)-based strategies featuring chemical or biochemical probing represent powerful and versatile tools for studying structural and dynamic features of proteins and their complexes. In fact, they can be used both as an alternative for systems intractable by other established high-resolution techniques, and as a complementary approach to these latter, providing different information on poorly characterized or very critical regions of the systems under investigation (Russell et al., 2004). The versatility of these MS-based methods depends on the wide range of usable probing techniques and reagents, which makes them suitable for virtually any class of biomolecules and complexes (Aebersold et al., 2003). Furthermore, versatility is still increased by the possibility of operating at very different levels of accuracy, ranging from qualitative high-throughput fold recognition or complex identification (Young et al., 2000), to the fine detail of structural rearrangements in biomolecules after environmental changes, point mutations or complex formations (Nikolova et al.,1998; Millevoi et al., 2001; Zheng et al., 2007). However, these techniques heavily rely upon the availability of powerful computational approaches to achieve a full exploitation of the information content associated with the experimental data. The determination of three-dimensional (3D) structures or models by MS-based techniques (MS3D) involves four main activity areas: 1) preparation of the sample and its derivatives labelled with chemical probes; 2) generation of derivatives/fragments of these molecules for further MS analysis; 3) interpretation of MS data to identify those residues that have reacted with probes; 4) derivation of 3D structures consistent with information from previous steps. Ideally, this procedure should be considered the core of an iterative process, where the final model possibly prompts for new validating experiments or helps the assignment of ambiguous information from the mass spectra interpretation step. Both the overall MS3D procedure and its different steps have been the subject of several accurate review and perspective articles (Sinz, 2006; Back et al., 2003; Young et al., 2000; Friedhoff, 2005, Renzone, et al., 2007a). However, with the partial exception of a few recent papers (Van Dijk et al., 2005; Fabris et al., 2010; Leitner et al., 2010), the full computational detail behind 3D model building (step 4) has generally received less attention than the former three steps. Structural derivation in MS3D, in fact, is considered a special case of structural determination from sparse/indirect constraints (SD-SIC). Nevertheless, information for modelling derivable from MS-based experiments exhibits some peculiar

[1]  Jianwen Fang,et al.  A Three-dimensional Homology Model of Lipid-free Apolipoprotein A-IV Using Cross-linking and Mass Spectrometry* , 2008, Journal of Biological Chemistry.

[2]  M. Jaramillo,et al.  The crystal structure and dimerization interface of GADD45γ , 2008, Proceedings of the National Academy of Sciences.

[3]  W. Taylor,et al.  Global fold determination from a small number of distance restraints. , 1995, Journal of molecular biology.

[4]  A. Petrescu,et al.  Interface Analysis of the Complex between ERK2 and PTP-SL , 2009, PloS one.

[5]  R. Aebersold,et al.  Mass spectrometry-based proteomics , 2003, Nature.

[6]  O. Schueler‐Furman,et al.  Progress in Modeling of Protein Structures and Interactions , 2005, Science.

[7]  K N Houk,et al.  Quantitative evaluation of the lengths of homobifunctional protein cross‐linking reagents used as molecular rulers , 2001, Protein science : a publication of the Protein Society.

[8]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[9]  Friedrich Förster,et al.  Integration of small-angle X-ray scattering data into structural modeling of proteins and their assemblies. , 2008, Journal of molecular biology.

[10]  Chris G de Koster,et al.  Chemical cross-linking and mass spectrometry for protein structural modeling. , 2003, Journal of molecular biology.

[11]  Miriam Eisenstein,et al.  Weighted geometric docking: Incorporating external information in the rotation‐translation scan , 2003, Proteins.

[12]  Mapping protein–protein interactions by bioinformatics and cross-linking , 2005, Analytical and bioanalytical chemistry.

[13]  D. Sept,et al.  Mapping the cofilin binding site on yeast G-actin by chemical cross-linking. , 2008, Journal of molecular biology.

[14]  Frank Alber,et al.  A structural perspective on protein-protein interactions. , 2004, Current opinion in structural biology.

[15]  Leszek Rychlewski,et al.  FFAS03: a server for profile–profile sequence alignments , 2005, Nucleic Acids Res..

[16]  A. Scaloni,et al.  Structural characterization of the functional regions in the archaeal protein Sso7d , 2007, Proteins.

[17]  Michelle D. Brazas,et al.  Providing web servers and training in Bioinformatics: 2010 update on the Bioinformatics Links Directory , 2010, Nucleic Acids Res..

[18]  A. Gronenborn,et al.  Determination of three‐dimensional structures of proteins from interproton distance data by hybrid distance geometry‐dynamical simulated annealing calculations , 1988, FEBS letters.

[19]  A. Brünger,et al.  Torsion angle dynamics: Reduced variable conformational sampling enhances crystallographic structure refinement , 1994, Proteins.

[20]  D. Kirsch,et al.  Mapping Protein-Protein Interactions between MutL and MutH by Cross-linking* , 2004, Journal of Biological Chemistry.

[21]  A. Sali,et al.  Modeller: generation and refinement of homology-based protein structure models. , 2003, Methods in enzymology.

[22]  Irwin D Kuntz,et al.  The collaboratory for MS3D: a new cyberinfrastructure for the structural elucidation of biological macromolecules and their assemblies using mass spectrometry-based approaches. , 2008, Journal of proteome research.

[23]  M. Emmett,et al.  Chemical cross-linking of the urease complex from Helicobacter pylori and analysis by Fourier transform ion cyclotron resonance mass spectrometry and molecular modeling , 2004 .

[24]  J. Skolnick,et al.  MONSSTER: a method for folding globular proteins with a small number of distance restraints. , 1997, Journal of molecular biology.

[25]  SödingJohannes Protein homology detection by HMM--HMM comparison , 2005 .

[26]  R. Portmann,et al.  Structural model of the CopA copper ATPase of Enterococcus hirae based on chemical cross-linking , 2009, BioMetals.

[27]  W. Taylor,et al.  Protein modeling by multiple sequence threading and distance geometry , 1997, Proteins.

[28]  Andrej Sali,et al.  Fold assessment for comparative protein structure modeling , 2007, Protein science : a publication of the Protein Society.

[29]  Sean R. Eddy,et al.  Profile hidden Markov models , 1998, Bioinform..

[30]  L. Grivell,et al.  A structure for the yeast prohibitin complex: Structure prediction and evidence from chemical crosslinking and mass spectrometry , 2002, Protein science : a publication of the Protein Society.

[31]  M. Chance,et al.  Complementary structural mass spectrometry techniques reveal local dynamics in functionally important regions of a metastable serpin. , 2008, Structure.

[32]  Andrzej Kolinski,et al.  Protein structure prediction: Combining de novo modeling with sparse experimental data , 2007, J. Comput. Chem..

[33]  R. Aebersold,et al.  Probing Native Protein Structures by Chemical Cross-linking, Mass Spectrometry, and Bioinformatics* , 2010, Molecular & Cellular Proteomics.

[34]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[35]  Andrea Sinz,et al.  Chemical cross-linking and mass spectrometry to map three-dimensional protein structures and protein-protein interactions. , 2006, Mass spectrometry reviews.

[36]  C D Schwieters,et al.  Internal coordinates for molecular dynamics and minimization in structure determination and refinement. , 2001, Journal of magnetic resonance.

[37]  S. Colowick,et al.  Methods in Enzymology , Vol , 1966 .

[38]  C. Griesinger,et al.  Structural insights into the calmodulin-Munc13 interaction obtained by cross-linking and mass spectrometry. , 2009, Biochemistry.

[39]  Frank Alber,et al.  Unraveling the interface of signal recognition particle and its receptor by using chemical cross-linking and tandem mass spectrometry. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[40]  Jennifer L. Martin,et al.  Modelling the structure of latexin-carboxypeptidase A complex based on chemical cross-linking and molecular docking. , 2006, Protein engineering, design & selection : PEDS.

[41]  Richard Hughey,et al.  Hidden Markov models for detecting remote protein homologies , 1998, Bioinform..

[42]  D. Baker,et al.  De novo protein structure determination using sparse NMR data , 2000, Journal of biomolecular NMR.

[43]  M. Kvaratskhelia,et al.  An Allosteric Mechanism for Inhibiting HIV-1 Integrase with a Small Molecule , 2009, Molecular Pharmacology.

[44]  M. Zacharias,et al.  Accounting for loop flexibility during protein–protein docking , 2005, Proteins.

[45]  A. Scaloni,et al.  Probing the dimeric structure of porcine aminoacylase 1 by mass spectrometric and modeling procedures. , 2003, Biochemistry.

[46]  Abhinandan Jain,et al.  Constant temperature constrained molecular dynamics: The Newton-Euler inverse mass operator method , 1996 .

[47]  A. Scaloni,et al.  Topology of the calmodulin-melittin complex. , 1998, Journal of molecular biology.

[48]  A T Brünger,et al.  Torsion-angle molecular dynamics as a new efficient tool for NMR structure calculation. , 1997, Journal of magnetic resonance.

[49]  Hugh Nymeyer,et al.  Atomic Simulations of Protein Folding, Using the Replica Exchange Algorithm , 2004, Numerical Computer Methods, Part D.

[50]  Pedro Alexandrino Fernandes,et al.  Protein–protein docking dealing with the unknown , 2009, J. Comput. Chem..

[51]  M. Karplus,et al.  CHARMM: A program for macromolecular energy, minimization, and dynamics calculations , 1983 .

[52]  Alexandre M J J Bonvin,et al.  Data‐driven docking for the study of biomolecular complexes , 2005, The FEBS journal.

[53]  Nicholas Furnham,et al.  Conformer generation under restraints. , 2006, Current opinion in structural biology.

[54]  S. Arena,et al.  Mass Spectrometry-Based Approaches for Structural Studies on Protein Complexes at Low-Resolution , 2007 .

[55]  Karl Mechtler,et al.  Annexin A2/P11 interaction: New insights into annexin A2 tetramer structure by chemical crosslinking, high‐resolution mass spectrometry, and computational modeling , 2007, Proteins.

[56]  A. Sali,et al.  Alignment of protein sequences by their profiles , 2004, Protein science : a publication of the Protein Society.

[57]  Jeffrey J. Gray,et al.  Protein-protein docking with simultaneous optimization of rigid-body displacement and side-chain conformations. , 2003, Journal of molecular biology.

[58]  Salvatore Papa,et al.  Insights into the Structural Basis of the GADD45β-mediated Inactivation of the JNK Kinase, MKK7/JNKK2* , 2007, Journal of Biological Chemistry.

[59]  S. Millevoi,et al.  Atypical binding of the neuronal POU protein N-Oct3 to noncanonical DNA targets. Implications for heterodimerization with HNF-3 beta. , 2001, European journal of biochemistry.

[60]  A. Gronenborn,et al.  Determination of three-dimensional structures of proteins by simulated annealing with interproton distance restraints. Application to crambin, potato carboxypeptidase inhibitor and barley serine proteinase inhibitor 2. , 1988, Protein engineering.

[61]  Jianwen Fang,et al.  A three-dimensional molecular model of lipid-free apolipoprotein A-I determined by cross-linking/mass spectrometry and sequence threading. , 2005, Biochemistry.

[62]  K. Wüthrich,et al.  Torsion angle dynamics for NMR structure calculation with the new program DYANA. , 1997, Journal of molecular biology.

[63]  Chris Bailey-Kellogg,et al.  Geometric Analysis of Cross-Linkability for Protein Fold Discrimination , 2003, Pacific Symposium on Biocomputing.

[64]  K. Soman,et al.  Conformationally variable Rab protein surface regions mapped by limited proteolysis and homology modelling. , 1998, Biochemical Journal.

[65]  Abhinandan Jain,et al.  Protein simulations using techniques suitable for very large systems: The cell multipole method for nonbond interactions and the Newton‐Euler inverse mass operator method for internal coordinate dynamics , 1994, Proteins.

[66]  Rong Chen,et al.  Generating properly weighted ensemble of conformations of proteins from sparse or indirect distance constraints. , 2008, The Journal of chemical physics.

[67]  Timothy F. Havel,et al.  The combinatorial distance geometry method for the calculation of molecular conformation. I. A new approach to an old problem. , 1983, Journal of theoretical biology.

[68]  David R Goodlett,et al.  Chemical cross-linking and mass spectrometry as a low-resolution protein structure determination technique. , 2010, Analytical chemistry.

[69]  S. Vajda,et al.  Protein docking along smooth association pathways , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[70]  Malin M. Young,et al.  High throughput protein fold identification by using experimental constraints derived from intramolecular cross-links and mass spectrometry , 2000, Proc. Natl. Acad. Sci. USA.

[71]  F E Cohen,et al.  Pairwise sequence alignment below the twilight zone. , 2001, Journal of molecular biology.

[72]  J. Augsburger,et al.  A new approach to an old problem. , 1999, Survey of ophthalmology.