In-Search Assignment of Monoisotopic Peaks Improves the Identification of Cross-Linked Peptides

Cross-linking/mass spectrometry has undergone a maturation process akin to standard proteomics by adapting key methods such as false discovery rate control and quantification. A poorly evaluated search setting in proteomics is the consideration of multiple (lighter) alternative values for the monoisotopic precursor mass to compensate for possible misassignments of the monoisotopic peak. Here, we show that monoisotopic peak assignment is a major weakness of current data handling approaches in cross-linking. Cross-linked peptides often have high precursor masses, which reduces the presence of the monoisotopic peak in the isotope envelope. Paired with generally low peak intensity, this generates a challenge that may not be completely solvable by precursor mass assignment routines. We therefore took an alternative route by ‘”in-search assignment of the monoisotopic peak” in the cross-link database search tool Xi (Xi-MPA), which considers multiple precursor masses during database search. We compare and evaluate the performance of established preprocessing workflows that partly correct the monoisotopic peak and Xi-MPA on three publicly available data sets. Xi-MPA always delivered the highest number of identifications with ∼2 to 4-fold increase of PSMs without compromising identification accuracy as determined by FDR estimation and comparison to crystallographic models.

[1]  J. Rappsilber,et al.  Optimizing the Parameters Governing the Fragmentation of Cross-Linked Peptides in a Tribrid Mass Spectrometer , 2017, Analytical chemistry.

[2]  Rosa Viner,et al.  Optimized fragmentation schemes and data analysis strategies for proteome-wide cross-link identification , 2017, Nature Communications.

[3]  P. Cramer,et al.  Architecture of the RNA polymerase II–TFIIF complex revealed by cross-linking and mass spectrometry , 2010, EMBO Journal.

[4]  Oliver Brock,et al.  Blind testing cross-linking/mass spectrometry under the auspices of the 11 th critical assessment of methods of protein structure prediction (CASP11) , 2016, Wellcome open research.

[5]  Luis Mendoza,et al.  Trans‐Proteomic Pipeline, a standardized data processing pipeline for large‐scale reproducible proteomics informatics , 2015, Proteomics. Clinical applications.

[6]  Juri Rappsilber,et al.  Protein Tertiary Structure by Crosslinking/Mass Spectrometry , 2018, Trends in biochemical sciences.

[7]  J. Eng,et al.  Comet: An open‐source MS/MS sequence database search tool , 2013, Proteomics.

[8]  Otto Hudecz,et al.  Structural prediction of protein models using distance restraints derived from cross-linking mass spectrometry data , 2018, Nature Protocols.

[9]  P. Bork,et al.  Capturing protein communities by structural proteomics in a thermophilic eukaryote , 2017, Molecular systems biology.

[10]  S. Bryant,et al.  Open mass spectrometry search algorithm. , 2004, Journal of proteome research.

[11]  William Stafford Noble,et al.  Semi-supervised learning for peptide identification from shotgun proteomics datasets , 2007, Nature Methods.

[12]  Albert J R Heck,et al.  Proteome-wide profiling of protein assemblies by cross-linking mass spectrometry , 2015, Nature Methods.

[13]  Michael J MacCoss,et al.  Kojak: efficient analysis of chemically cross-linked protein complexes. , 2015, Journal of proteome research.

[14]  M. Dong,et al.  Identification of cross-linked peptides from complex samples , 2012, Nature Methods.

[15]  R. Aebersold,et al.  Expanding the Chemical Cross-Linking Toolbox by the Use of Multiple Proteases and Enrichment by Size Exclusion Chromatography , 2012, Molecular & Cellular Proteomics.

[16]  Otto Hudecz,et al.  Comprehensive Cross-Linking Mass Spectrometry Reveals Parallel Orientation and Flexible Conformations of Plant HOP2-MND1. , 2015, Journal of proteome research.

[17]  Friedrich Förster,et al.  False discovery rate estimation for cross-linked peptides identified by mass spectrometry , 2012, Nature Methods.

[18]  Juri Rappsilber,et al.  Structural Analysis of Multiprotein Complexes by Cross-linking, Mass Spectrometry, and Database Searching*S , 2007, Molecular & Cellular Proteomics.

[19]  J. Rappsilber,et al.  Quirks of Error Estimation in Cross-Linking/Mass Spectrometry , 2017, Analytical chemistry.

[20]  Chao Liu,et al.  pParse: A method for accurate determination of monoisotopic peaks in high‐resolution mass spectra , 2012, Proteomics.

[21]  Knut Reinert,et al.  OpenMS – An open-source software framework for mass spectrometry , 2008, BMC Bioinformatics.

[22]  Jin Liang,et al.  MaXLinker: Proteome-wide Cross-link Identifications with High Specificity and Sensitivity* , 2019, Molecular & Cellular Proteomics.

[23]  G. Degliesposti,et al.  Structure of the Fanconi anemia monoubiquitin ligase complex , 2019, Nature.

[24]  Robert Burke,et al.  ProteoWizard: open source software for rapid proteomics tools development , 2008, Bioinform..

[25]  C. Iacobucci,et al.  To Be or Not to Be? Five Guidelines to Avoid Misassignments in Cross-Linking/Mass Spectrometry. , 2017, Analytical chemistry.

[26]  M. Mann,et al.  MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification , 2008, Nature Biotechnology.

[27]  Ruedi Aebersold,et al.  Lysine-specific chemical cross-linking of protein complexes and identification of cross-linking sites using LC-MS/MS and the xQuest/xProphet software pipeline , 2013, Nature Protocols.

[28]  Juri Rappsilber,et al.  Optimized Fragmentation Regime for Diazirine Photo-Cross-Linked Peptides , 2016, Analytical chemistry.

[29]  Andreas Quandt,et al.  An automated pipeline for high-throughput label-free quantitative proteomics. , 2013, Journal of proteome research.

[30]  Jüergen Cox,et al.  The MaxQuant computational platform for mass spectrometry-based shotgun proteomics , 2016, Nature Protocols.

[31]  K. Reinert,et al.  OpenMS: a flexible open-source software platform for mass spectrometry data analysis , 2016, Nature Methods.

[32]  J. Rappsilber,et al.  In Situ Structural Restraints from Cross-Linking Mass Spectrometry in Human Mitochondria , 2019, Journal of proteome research.

[33]  Judith A J Steen,et al.  When less can yield more – Computational preprocessing of MS/MS spectra for peptide identification , 2009, Proteomics.

[34]  Alexey I Nesvizhskii,et al.  Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search. , 2002, Analytical chemistry.

[35]  Robertson Craig,et al.  TANDEM: matching proteins with tandem mass spectra. , 2004, Bioinformatics.

[36]  Chao Liu,et al.  Trifunctional cross-linker for mapping protein-protein interaction networks and comparing protein conformational states , 2016, eLife.