A Mixed Integer Linear Optimization Framework for the Identification and Quantification of Targeted Post-translational Modifications of Highly Modified Proteins Using Multiplexed Electron Transfer Dissociation Tandem Mass Spectrometry

Here we present a novel methodology for identifying the targeted post-translational modifications present in highly modified proteins using mixed integer linear optimization and electron transfer dissociation (ETD) tandem mass spectrometry. For a given ETD tandem mass spectrum, the complete set of modified forms whose mass matches that of the precursor ion, within a specified tolerance, is rigorously enumerated by solving a feasibility problem via mixed integer linear optimization. Enumerating this entire set of modified forms enables the method to normalize the relative contributions of the individual modification sites. Given the entire set of modified forms, a superposition problem is then formulated using mixed integer linear optimization to determine the relative fractions of the modified forms that are present in the multiplexed ETD tandem mass spectrum. Chromatographic information in the mass and time dimensions is used to assess the likelihood of the assigned modification states, to average several tandem mass spectra for confident identification of lower-abundance forms, and to infer the modification states of partially assigned spectra. The utility of the proposed computational framework is demonstrated on an entire LC-MS/MS ETD experiment corresponding to a mixture of highly modified histone peptides. This new computational method will facilitate LC-MS/MS ETD analysis of many hypermodified proteins at a previously unattainable scale and offer novel biological insight into these understudied systems.
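The enumeration step can be illustrated with a minimal sketch. It is not the authors' formulation: it replaces the mixed integer linear feasibility problem with a brute-force search over all site assignments, which is only practical for a handful of sites. The site names, the allowed modifications per site, the unmodified peptide mass, and the tolerance are all hypothetical values chosen for illustration; the mass shifts are standard monoisotopic values for methylation and acetylation.

```python
from itertools import product

# Monoisotopic mass shifts in Da; "none" marks an unmodified site.
MOD_SHIFTS = {
    "none": 0.0,
    "me1": 14.01565,   # monomethylation
    "me2": 28.03130,   # dimethylation
    "me3": 42.04695,   # trimethylation
    "ac": 42.01057,    # acetylation
}


def enumerate_modified_forms(sites, allowed, unmodified_mass, precursor_mass, tol_da):
    """Return every site-to-modification assignment whose total mass lies
    within tol_da of the precursor mass (brute force, not MILP)."""
    forms = []
    for combo in product(*(allowed[site] for site in sites)):
        mass = unmodified_mass + sum(MOD_SHIFTS[mod] for mod in combo)
        if abs(mass - precursor_mass) <= tol_da:
            forms.append(dict(zip(sites, combo)))
    return forms


# Hypothetical peptide with three modifiable lysines (illustrative only).
sites = ["K9", "K14", "K18"]
allowed = {site: ["none", "me1", "me2", "me3", "ac"] for site in sites}
unmodified_mass = 1000.0
# Precursor consistent with one acetylation plus one monomethylation.
precursor_mass = unmodified_mass + MOD_SHIFTS["ac"] + MOD_SHIFTS["me1"]

forms = enumerate_modified_forms(sites, allowed, unmodified_mass, precursor_mass, tol_da=0.01)
for form in forms:
    print(form)
```

With a 0.01 Da tolerance, the search separates acetylation from trimethylation (the two differ by about 0.036 Da), so only the positional isomers carrying one acetyl and one methyl group survive. These isomers share a precursor mass and must be resolved from the fragment ions, which is the role of the superposition problem described above.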
