Detecting cross-linked peptides by searching against a database of cross-linked peptide pairs.

Mass spectrometric identification of cross-linked peptides can provide valuable information about the structure of protein complexes. We describe a straightforward database search scheme that identifies spectra from cross-linked peptides and assigns statistical confidence estimates to them. The method is well suited to targeted analysis of a single protein complex and does not require an isotope labeling strategy. Our approach uses a SEQUEST-style search procedure in which the database consists of single peptides, with and without attached linkers, together with cross-linked products. In contrast to several previous approaches, we generate theoretical spectra that account for all of the expected peaks from a cross-linked product, and we employ an empirical curve-fitting procedure to estimate statistical confidence measures. We show that our fully automated procedure successfully reidentifies spectra from a previous study, and we provide evidence that our statistical confidence estimates are accurate.
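
The composition of the search database described above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the choice of lysine as the linkable residue, and the two linker mass shifts are assumptions made for illustration (the shifts are roughly those of a DSS/BS3-type linker and should be replaced with the values for the linker actually used). The sketch also ignores link positions, missed cleavages, and self-loops.

```python
from itertools import combinations_with_replacement

# Monoisotopic residue masses (Da) for the few amino acids used in this example.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "K": 128.09496,
    "L": 113.08406, "E": 129.04259, "R": 156.10111,
}
WATER = 18.01056  # mass of H2O added to the residue sum of a free peptide

# Assumed linker mass shifts (Da); illustrative values, not taken from the paper.
XLINK_MASS = 138.06808     # linker bridging two peptides
DEAD_END_MASS = 156.07864  # linker attached at one end only (hydrolyzed)


def peptide_mass(seq):
    """Neutral monoisotopic mass of an unmodified peptide."""
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER


def build_candidates(peptides, link_residue="K"):
    """Enumerate (kind, peptides, mass) entries for the three product classes."""
    linkable = [p for p in peptides if link_residue in p]
    candidates = []
    # Single peptides without a linker.
    for p in peptides:
        candidates.append(("linear", (p,), peptide_mass(p)))
    # Single peptides carrying a dead-end linker.
    for p in linkable:
        candidates.append(("dead-end", (p,), peptide_mass(p) + DEAD_END_MASS))
    # Unordered pairs of linkable peptides joined by one linker.
    for a, b in combinations_with_replacement(linkable, 2):
        mass = peptide_mass(a) + peptide_mass(b) + XLINK_MASS
        candidates.append(("cross-link", (a, b), mass))
    return candidates


def candidates_in_window(candidates, precursor_mass, tol):
    """Keep candidates whose mass lies within tol Da of the precursor mass."""
    return [c for c in candidates if abs(c[2] - precursor_mass) <= tol]


if __name__ == "__main__":
    db = build_candidates(["GAK", "SLR", "KEL"])
    target = peptide_mass("GAK") + peptide_mass("KEL") + XLINK_MASS
    for kind, peps, mass in candidates_in_window(db, target, tol=0.01):
        print(kind, "-".join(peps), round(mass, 4))
```

In a SEQUEST-style search, each spectrum would then be scored only against the candidates that fall inside its precursor mass window. Because the number of cross-linked pairs grows quadratically with the number of linkable peptides, this kind of enumeration fits the targeted, single-complex setting described above.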
