Computational methods in mass spectrometry-based structural proteomics for studying protein structure, dynamics, and interactions

Mass spectrometry (MS) has made enormous contributions to comprehensive protein identification and quantification in proteomics. MS is also gaining momentum for structural biology in a variety of ways, complementing conventional structural biology techniques. Here, we will review how MS-based techniques, such as hydrogen/deuterium exchange, covalent labeling, and chemical cross-linking, enable the characterization of protein structure, dynamics, and interactions, especially from a perspective of their data analyses. Structural information encoded by chemical probes in intact proteins is decoded by interpreting MS data at a peptide level, i.e., revealing conformational and dynamic changes in local regions of proteins. The structural MS data are not amenable to data analyses in traditional proteomics workflow, requiring dedicated software for each type of data. We first provide basic principles of data interpretation, including isotopic distribution and peptide sequencing. We then focus particularly on computational methods for structural MS data analyses and discuss outstanding challenges in a proteome-wide large scale analysis.

[1]  Eunok Paek,et al.  MODplus: robust and unrestrictive identification of post-translational modifications using mass spectrometry. , 2019, Analytical chemistry.

[2]  Sayan Gupta,et al.  Oxidative footprinting in the study of structure and function of membrane proteins: current state and perspectives. , 2015, Biochemical Society transactions.

[3]  R. Vachet,et al.  Covalent labeling and mass spectrometry reveal subtle higher order structural changes for antibody therapeutics , 2019, mAbs.

[4]  J. Eng,et al.  Comet: An open‐source MS/MS sequence database search tool , 2013, Proteomics.

[5]  Yingming Zhao,et al.  PTMap—A sequence alignment software for unrestricted, accurate, and full-spectrum identification of post-translational modification sites , 2009, Proceedings of the National Academy of Sciences.

[6]  Danny E Miller,et al.  HDXFinder: Automated Analysis and Data Reporting of Deuterium/Hydrogen Exchange Mass Spectrometry , 2012, Journal of The American Society for Mass Spectrometry.

[7]  Pavel A. Pevzner,et al.  Universal database search tool for proteomics , 2014, Nature Communications.

[8]  Heejin Park,et al.  Unrestrictive Identification of Multiple Post-translational Modifications from Tandem Mass Spectrometry Using an Error-tolerant Algorithm Based on an Extended Sequence Tag Approach*S , 2008, Molecular & Cellular Proteomics.

[9]  Stephen E. Stein,et al.  The Hybrid Search: A Mass Spectral Library Search Method for Discovery of Modifications in Proteomics , 2017 .

[10]  R. Appel,et al.  Popitam: Towards new heuristic strategies to improve protein identification from tandem mass spectrometry data , 2003, Proteomics.

[11]  Alexey I Nesvizhskii,et al.  Analysis and validation of proteomic data generated by tandem mass spectrometry , 2007, Nature Methods.

[12]  Peter R Baker,et al.  In-depth Analysis of Tandem Mass Spectrometry Data from Disparate Instrument Types*S , 2008, Molecular & Cellular Proteomics.

[13]  Lars Konermann,et al.  Mass spectrometry combined with oxidative labeling for exploring protein structure and folding. , 2010, Mass spectrometry reviews.

[14]  Lars Konermann,et al.  Hydrogen exchange mass spectrometry for studying protein structure and dynamics. , 2011, Chemical Society reviews.

[15]  Brett S Phinney,et al.  Shotgun cross-linking analysis for studying quaternary and tertiary protein structures. , 2007, Journal of proteome research.

[16]  Christodoulos A. Floudas,et al.  A Novel Approach for Untargeted Post-translational Modification Identification Using Integer Linear Optimization and Tandem Mass Spectrometry* , 2010, Molecular & Cellular Proteomics.

[17]  C. Sevier,et al.  Formation and transfer of disulphide bonds in living cells , 2002, Nature Reviews Molecular Cell Biology.

[18]  Peter R Baker,et al.  Finding Chimeras: a Bioinformatics Strategy for Identification of Cross-linked Peptides* , 2009, Molecular & Cellular Proteomics.

[19]  Michael R Shortreed,et al.  Enhanced Global Post-translational Modification Discovery with MetaMorpheus. , 2018, Journal of proteome research.

[20]  R. Vachet,et al.  Probing protein structure by amino acid-specific covalent labeling and mass spectrometry. , 2009, Mass spectrometry reviews.

[21]  Christoph H Borchers,et al.  Crosslinking combined with mass spectrometry for structural proteomics. , 2010, Mass spectrometry reviews.

[22]  Gordon A Anderson,et al.  Informatics strategies for large-scale novel cross-linking analysis. , 2007, Journal of proteome research.

[23]  Lloyd M. Smith,et al.  Identification of MS-Cleavable and Noncleavable Chemically Cross-Linked Peptides with MetaMorpheus. , 2018, Journal of proteome research.

[24]  D. Tabb,et al.  TagRecon: high-throughput mutation identification through sequence tagging. , 2010, Journal of proteome research.

[25]  Michael Götze,et al.  Automated Assignment of MS/MS Cleavable Cross-Links in Protein 3D-Structure Analysis , 2014, Journal of The American Society for Mass Spectrometry.

[26]  B. Bothner,et al.  The Role of Mass Spectrometry in Structural Studies of Flavin-Based Electron Bifurcating Enzymes , 2018, Front. Microbiol..

[27]  L. T. Ten Eyck,et al.  Automated extraction of backbone deuteration levels from amide H/2H mass spectrometry experiments , 2006, Protein science : a publication of the Protein Society.

[28]  Leland Mayne,et al.  ExMS: Data Analysis for HX-MS Experiments , 2011, Journal of the American Society for Mass Spectrometry.

[29]  S. Scheres,et al.  How cryo-EM is revolutionizing structural biology. , 2015, Trends in biochemical sciences.

[30]  Lutz Fischer,et al.  False discovery rate estimation and heterobifunctional cross-linkers , 2017, bioRxiv.

[31]  Friedrich Förster,et al.  False discovery rate estimation for cross-linked peptides identified by mass spectrometry , 2012, Nature Methods.

[32]  C. Borchers,et al.  Comparative higher-order structure analysis of antibody biosimilars using combined bottom-up and top-down hydrogen-deuterium exchange mass spectrometry. , 2016, Biochimica et biophysica acta.

[33]  Andrej Sali,et al.  Integrative Structural Biology , 2013, Science.

[34]  David Goldberg,et al.  Lookup peaks: a hybrid of de novo sequencing and database search for protein identification by tandem mass spectrometry. , 2007, Analytical chemistry.

[35]  Kasper D. Rand,et al.  Measuring the hydrogen/deuterium exchange of proteins at high spatial resolution by mass spectrometry: overcoming gas-phase hydrogen/deuterium scrambling. , 2014, Accounts of chemical research.

[36]  S. Lim,et al.  Elucidating in Vivo Structural Dynamics in Integral Membrane Protein by Hydroxyl Radical Footprinting* , 2009, Molecular & Cellular Proteomics.

[37]  R. Vachet,et al.  Covalent labeling-mass spectrometry with non-specific reagents for studying protein structure and interactions. , 2018, Methods.

[38]  Edward L. Huttlin,et al.  A mass-tolerant database search identifies a large proportion of unassigned spectra in shotgun proteomics as modified peptides , 2015, Nature Biotechnology.

[39]  Steven P Gygi,et al.  Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry , 2007, Nature Methods.

[40]  Devin K. Schweppe,et al.  Architecture of the human interactome defines protein communities and disease networks , 2017, Nature.

[41]  Magnus Palmblad,et al.  Automatic analysis of hydrogen/deuterium exchange mass spectra of peptides and proteins using calculations of isotopic distributions , 2001, Journal of the American Society for Mass Spectrometry.

[42]  S Walter Englander,et al.  Protein hydrogen exchange at residue resolution by proteolytic fragmentation mass spectrometry analysis , 2013, Proceedings of the National Academy of Sciences.

[43]  Jeddidiah Bellamy-Carter,et al.  PepFoot: a software package for semi-automated processing of protein footprinting data. , 2019, Journal of proteome research.

[44]  Guillaume Fertin,et al.  SpecOMS: A Full Open Modification Search Method Performing All-to-All Spectra Comparisons within Minutes. , 2017, Journal of proteome research.

[45]  Yan Fu,et al.  DeltAMT: A Statistical Algorithm for Fast Detection of Protein Modifications From LC-MS/MS Data* , 2011, Molecular & Cellular Proteomics.

[46]  R. Aebersold,et al.  Crosslinking and Mass Spectrometry: An Integrated Technology to Understand the Structure and Function of Molecular Machines. , 2016, Trends in biochemical sciences.

[47]  Ruixiang Sun,et al.  Open MS/MS spectral library search to identify unanticipated post-translational modifications and increase spectral identification rate , 2010, Bioinform..

[48]  M. J. Chalmers,et al.  HDX Workbench: Software for the Analysis of H/D Exchange MS Data , 2012, Journal of The American Society for Mass Spectrometry.

[49]  Eunok Paek,et al.  New algorithm for the identification of intact disulfide linkages based on fragmentation characteristics in tandem mass spectra. , 2010, Journal of proteome research.

[50]  Ruedi Aebersold,et al.  The complete structure of the large subunit of the mammalian mitochondrial ribosome , 2014, Nature.

[51]  Thomas E. Wales,et al.  Identification and characterization of EX1 kinetics in H/D exchange mass spectrometry by peak width analysis , 2006, Journal of the American Society for Mass Spectrometry.

[52]  John R Engen,et al.  Analysis of Overlapped and Noisy Hydrogen/Deuterium Exchange Mass Spectra , 2013, Journal of The American Society for Mass Spectrometry.

[53]  Jan Steyaert,et al.  Structure and flexibility of the endosomal Vps34 complex reveals the basis of its function on membranes , 2015, Science.

[54]  Andrea Ilari,et al.  Protein structure determination by x-ray crystallography. , 2008, Methods in molecular biology.

[55]  R. Kumar,et al.  H/D Exchange Centroid Monitoring is Insufficient to Show Differences in the Behavior of Protein States , 2013, Journal of The American Society for Mass Spectrometry.

[56]  M. Mann,et al.  The abc's (and xyz's) of peptide sequencing , 2004, Nature Reviews Molecular Cell Biology.

[57]  Iain D G Campuzano,et al.  An Integrated Native Mass Spectrometry and Top-Down Proteomics Method that Connects Sequence to Structure and Function of Macromolecular Complexes , 2017, Nature chemistry.

[58]  Diogo B Lima,et al.  SIM-XL: A powerful and user-friendly tool for peptide cross-linking analysis. , 2015, Journal of proteomics.

[59]  A. Nesvizhskii A survey of computational methods and error rate estimation procedures for peptide and protein identification in shotgun proteomics. , 2010, Journal of proteomics.

[60]  Kun Zhang,et al.  pFind-Alioth: A novel unrestricted database search algorithm to improve the interpretation of high-resolution MS/MS data. , 2015, Journal of proteomics.

[61]  Scott A. Busby,et al.  HD desktop: An integrated platform for the analysis and visualization of H/D exchange data , 2009, Journal of the American Society for Mass Spectrometry.

[62]  R. Aebersold,et al.  Structural Probing of a Protein Phosphatase 2A Network by Chemical Cross-Linking and Mass Spectrometry , 2012, Science.

[63]  J. Gorman,et al.  Protein disulfide bond determination by mass spectrometry. , 2002, Mass spectrometry reviews.

[64]  Lennart Martens,et al.  Xilmass: A New Approach toward the Identification of Cross-Linked Peptides. , 2016, Analytical chemistry.

[65]  J. Lakbub,et al.  Recent mass spectrometry-based techniques and considerations for disulfide bond characterization in proteins , 2018, Analytical and Bioanalytical Chemistry.

[66]  N. Bulleid,et al.  Multiple ways to make disulfides. , 2011, Trends in biochemical sciences.

[67]  Martial Rey,et al.  Probing protein interactions with hydrogen/deuterium exchange and mass spectrometry-a review. , 2012, Analytica chimica acta.

[68]  Integrative methods in structural biology. , 2019, Journal of biomolecular NMR.

[69]  Pavel A. Pevzner,et al.  Protein identification by spectral networks analysis , 2007, Proceedings of the National Academy of Sciences.

[70]  Samuel H Payne,et al.  Multi-species Identification of Polymorphic Peptide Variants via Propagation in Spectral Networks , 2016, Molecular & Cellular Proteomics.

[71]  Kai Zhang,et al.  The structure of the dynactin complex and its interaction with dynein , 2015, Science.

[72]  Albert J R Heck,et al.  Proteome-wide profiling of protein assemblies by cross-linking mass spectrometry , 2015, Nature Methods.

[73]  M. Chance,et al.  Protein Footprinting Comes of Age: Mass Spectrometry for Biophysical Structure Assessment* , 2017, Molecular & Cellular Proteomics.

[74]  P. Radivojac,et al.  XLSearch: a Probabilistic Database Search Algorithm for Identifying Cross-Linked Peptides. , 2016, Journal of proteome research.

[75]  Eunok Paek,et al.  Software eyes for protein post-translational modifications. , 2015, Mass spectrometry reviews.

[76]  Dekel Tsur,et al.  Identification of post-translational modifications by blind search of mass spectra , 2005, Nature Biotechnology.

[77]  B. Ma,et al.  PeaksPTM: Mass spectrometry-based identification of peptides with unspecified modifications. , 2011, Journal of proteome research.

[78]  T. Burzykowski,et al.  Computational methods and challenges in hydrogen/deuterium exchange mass spectrometry. , 2017, Mass spectrometry reviews.

[79]  Michael J MacCoss,et al.  Kojak: efficient analysis of chemically cross-linked protein complexes. , 2015, Journal of proteome research.

[80]  M. Mann,et al.  Comparative Proteomic Analysis of Eleven Common Cell Lines Reveals Ubiquitous but Varying Expression of Most Proteins* , 2012, Molecular & Cellular Proteomics.

[81]  Xin Zhou,et al.  HDX-Analyzer: a novel package for statistical analysis of protein structure dynamics , 2011, BMC Bioinformatics.

[82]  Eunok Paek,et al.  Fast Multi-blind Modification Search through Tandem Mass Spectrometry* , 2011, Molecular & Cellular Proteomics.

[83]  P. Andrews,et al.  A spectral clustering approach to MS/MS identification of post-translational modifications. , 2008, Journal of proteome research.

[84]  Henry Lam,et al.  Hunting for unexpected post-translational modifications by spectral library searching with tier-wise scoring. , 2014, Journal of proteome research.

[85]  L. M. Jones,et al.  Fast photochemical oxidation of proteins (FPOP): A powerful mass spectrometry–based structural proteomics tool , 2019, The Journal of Biological Chemistry.

[86]  Guozhong Xu,et al.  Hydroxyl radical-mediated modification of proteins as probes for structural proteomics. , 2007, Chemical reviews.

[87]  Alexander Scherl,et al.  Characterization of protein cross-links via mass spectrometry and an open-modification search strategy. , 2008, Analytical chemistry.

[88]  M. Mann,et al.  Andromeda: a peptide search engine integrated into the MaxQuant environment. , 2011, Journal of proteome research.

[89]  M. Dong,et al.  Identification of cross-linked peptides from complex samples , 2012, Nature Methods.

[90]  Albert J R Heck,et al.  RNA targeting by the type III-A CRISPR-Cas Csm complex of Thermus thermophilus. , 2014, Molecular cell.

[91]  M. Mann,et al.  Proteomic analysis of post-translational modifications , 2003, Nature Biotechnology.

[92]  L. Mayne,et al.  ExMS2: An Integrated Solution for Hydrogen-Deuterium Exchange Mass Spectrometry Data Analysis. , 2019, Analytical chemistry.

[93]  A. Leitner Cross-linking and other structural proteomics techniques: how chemistry is enabling mass spectrometry applications in structural biology , 2016, Chemical science.

[94]  B. Searle,et al.  Identification of protein modifications using MS/MS de novo sequencing and the OpenSea alignment algorithm. , 2005, Journal of proteome research.

[95]  Philip Lössl,et al.  The diverse and expanding role of mass spectrometry in structural and molecular biology , 2016, The EMBO journal.

[96]  M. Mann,et al.  Mass spectrometry–based proteomics turns quantitative , 2005, Nature chemical biology.

[97]  Mikhail M Savitski,et al.  ModifiComb, a New Proteomic Tool for Mapping Substoichiometric Post-translational Modifications, Finding Novel Types of Modifications, and Fingerprinting Complex Protein Mixtures* , 2006, Molecular & Cellular Proteomics.

[98]  David C. Schriemer,et al.  Hydra: software for tailored processing of H/D exchange data from MS or tandem MS analyses , 2009, BMC Bioinformatics.

[99]  R. Aebersold,et al.  Probing Native Protein Structures by Chemical Cross-linking, Mass Spectrometry, and Bioinformatics , 2010, Molecular & Cellular Proteomics.

[100]  Derek J. Bailey,et al.  The One Hour Yeast Proteome* , 2013, Molecular & Cellular Proteomics.

[101]  Juri Rappsilber,et al.  Structural Analysis of Multiprotein Complexes by Cross-linking, Mass Spectrometry, and Database Searching*S , 2007, Molecular & Cellular Proteomics.

[102]  L. Kay,et al.  Nuclear magnetic resonance spectroscopy of high-molecular-weight proteins. , 2004, Annual review of biochemistry.

[103]  David Paramelle,et al.  Chemical cross‐linkers for protein structure studies by mass spectrometry , 2013, Proteomics.

[104]  J. Rappsilber,et al.  Quirks of Error Estimation in Cross-Linking/Mass Spectrometry , 2017, Analytical chemistry.

[105]  R. Aebersold,et al.  Mass spectrometry-based proteomics , 2003, Nature.

[106]  Guodong Chen,et al.  Mapping the Energetic Epitope of an Antibody/Interleukin-23 Interaction with Hydrogen/Deuterium Exchange, Fast Photochemical Oxidation of Proteins Mass Spectrometry, and Alanine Shave Mutagenesis. , 2017, Analytical chemistry.

[107]  Scott A. Busby,et al.  The Deuterator: software for the determination of backbone amide deuterium levels from H/D exchange MS data , 2007, BMC Bioinformatics.

[108]  K. Luger,et al.  Chaperone Nap1 shields histone surfaces used in a nucleosome and can put H2A-H2B in an unconventional tetrameric form. , 2013, Molecular cell.

[109]  Ruedi Aebersold,et al.  Identification of cross-linked peptides from large sequence databases , 2008, Nature Methods.

[110]  William Stafford Noble,et al.  Detecting cross-linked peptides by searching against a database of cross-linked peptide pairs. , 2010, Journal of proteome research.

[111]  Markus Müller,et al.  Unrestricted identification of modified proteins using MS/MS , 2010, Proteomics.

[112]  Andrew N. Holding,et al.  XL-MS: Protein cross-linking coupled with mass spectrometry. , 2015, Methods.

[113]  A. Heck,et al.  Facilitating Protein Disulfide Mapping by a Combination of Pepsin Digestion, Electron Transfer Higher Energy Dissociation (EThcD), and a Dedicated Search Algorithm SlinkS* , 2014, Molecular & Cellular Proteomics.

[114]  Eunok Paek,et al.  deMix: Decoding Deuterated Distributions from Heterogeneous Protein States via HDX-MS , 2019, Scientific Reports.

[115]  Derek J. Wilson,et al.  Bottom-up hydrogen deuterium exchange mass spectrometry: data analysis and interpretation. , 2017, The Analyst.

[116]  Quantitative Analysis of Protein Covalent Labeling Mass Spectrometry Data in the Mass Spec Studio. , 2019, Analytical chemistry.

[117]  Robert J. Chalkley,et al.  Matching Cross-linked Peptide Spectra: Only as Good as the Worse Identification* , 2013, Molecular & Cellular Proteomics.

[118]  Michal Sharon,et al.  Chemical cross‐linking and native mass spectrometry: A fruitful combination for structural biology , 2015, Protein science : a publication of the Protein Society.

[119]  S. Guan,et al.  Enhancement of the effective resolution of mass spectra of high-mass biomolecules by maximum entropy-based deconvolution to eliminate the isotopic natural abundance distribution , 1997 .

[120]  David R Goodlett,et al.  xComb: a cross-linked peptide database approach to protein-protein interaction analysis. , 2010, Journal of proteome research.

[121]  David C Schriemer,et al.  Quantitating the statistical distribution of deuterium incorporation to extend the utility of H/D exchange MS data. , 2006, Analytical chemistry.

[122]  Gordon A Anderson,et al.  Xlink-identifier: an automated data analysis platform for confident identifications of chemically cross-linked peptides using tandem mass spectrometry. , 2011, Journal of proteome research.

[123]  R. Aebersold,et al.  The Evolving Contribution of Mass Spectrometry to Integrative Structural Biology , 2016, Journal of The American Society for Mass Spectrometry.

[124]  P. Biggin,et al.  Mass spectrometry reveals synergistic effects of nucleotides, lipids, and drugs binding to a multidrug resistance efflux pump , 2013, Proceedings of the National Academy of Sciences.

[125]  Eunok Paek,et al.  Characterization of disulfide bonds by planned digestion and tandem mass spectrometry. , 2015, Molecular bioSystems.

[126]  N. Garcia,et al.  Current Trends in Biotherapeutic Higher Order Structure Characterization by Irreversible Covalent Footprinting Mass Spectrometry. , 2019, Protein and peptide letters.

[127]  F. Hamprecht,et al.  Hexicon 2: Automated Processing of Hydrogen-Deuterium Exchange Mass Spectrometry Data with Improved Deuteration Distribution Estimation , 2014, Journal of The American Society for Mass Spectrometry.

[128]  Lan Huang,et al.  Cross-Linking Mass Spectrometry: An Emerging Technology for Interactomics and Structural Biology. , 2018, Analytical chemistry.

[129]  Michael A. Freitas,et al.  Identification and characterization of disulfide bonds in proteins and peptides from tandem MS data by use of the MassMatrix MS/MS search engine. , 2008, Journal of proteome research.