In silico proteome analysis to facilitate proteomics experiments using mass spectrometry

Proteomics experiments typically involve protein or peptide separation steps coupled to the identification of many hundreds to thousands of peptides by mass spectrometry. Development of methodology and instrumentation in this field is proceeding rapidly, and effective software is needed to link the different stages of proteomic analysis. We have developed an application, proteogest, written in Perl that generates descriptive and statistical analyses of the biophysical properties of multiple (e.g. thousands) protein sequences submitted by the user, for instance protein sequences inferred from the complete genome sequence of a model organism. The application also carries out in silico proteolytic digestion of the submitted proteomes, or subsets thereof, and the distribution of biophysical properties of the resulting peptides is presented. proteogest is customizable, the user being able to select many options, for instance the cleavage pattern of the digestion treatment or the presence of modifications to specific amino acid residues. We show how proteogest can be used to compare the proteomes and digested proteome products of model organisms, to examine the added complexity generated by modification of residues, and to facilitate the design of proteomics experiments for optimal representation of component proteins.

[1]  J. Yates,et al.  Direct analysis of protein complexes using mass spectrometry , 1999, Nature Biotechnology.

[2]  M. Mann,et al.  Analysis of proteins and proteomes by mass spectrometry. , 2001, Annual review of biochemistry.

[3]  R. Aebersold,et al.  Mass spectrometry-based proteomics , 2003, Nature.

[4]  N. Sherman,et al.  Protein Sequencing and Identification Using Tandem Mass Spectrometry: Kinter/Tandem Mass Spectrometry , 2000 .

[5]  Andrew Emili,et al.  De novo peptide sequencing and quantitative profiling of complex protein mixtures using mass-coded abundance tagging , 2002, Nature Biotechnology.

[6]  S. Gygi,et al.  Quantitative analysis of complex protein mixtures using isotope-coded affinity tags , 1999, Nature Biotechnology.

[7]  M. MacCoss,et al.  Shotgun proteomics: tools for the analysis of complex biological systems. , 2002, Current opinion in molecular therapeutics.

[8]  John I. Clark,et al.  Shotgun identification of protein modifications from protein complexes and lens tissue , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[9]  S. Gygi,et al.  Correlation between Protein and mRNA Abundance in Yeast , 1999, Molecular and Cellular Biology.

[10]  Ruedi Aebersold,et al.  Quantitative Proteome Analysis by Solid-phase Isotope Tagging and Mass Spectrometry Beads Photocleavable Linker Isotope Tag Reactive Group , 2022 .

[11]  J. Yates,et al.  An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database , 1994, Journal of the American Society for Mass Spectrometry.

[12]  K. H. Wolfe,et al.  Molecular evidence for an ancient duplication of the entire yeast genome , 1997, Nature.

[13]  M. Tyers,et al.  From genomics to proteomics , 2003, Nature.

[14]  Peter R. Baker,et al.  Role of accurate mass measurement (+/- 10 ppm) in protein identification strategies employing MS or MS/MS and database searching. , 1999, Analytical chemistry.

[15]  S. Fields,et al.  Protein analysis on a proteomic scale , 2003, Nature.

[16]  R. Aebersold,et al.  Mass spectrometry in proteomics. , 2001, Chemical reviews.

[17]  M. Mann,et al.  Proteomic analysis of post-translational modifications , 2003, Nature Biotechnology.

[18]  B. Futcher,et al.  A Sampling of the Yeast Proteome , 1999, Molecular and Cellular Biology.

[19]  J. Shabanowitz,et al.  Phosphoproteome analysis by mass spectrometry and its application to Saccharomyces cerevisiae , 2002, Nature Biotechnology.

[20]  Andrew Emili,et al.  PRISM, a Generic Large Scale Proteomic Investigation Strategy for Mammals*S , 2003, Molecular & Cellular Proteomics.

[21]  S. Gygi,et al.  Proteomics: the move to mixtures. , 2001, Journal of mass spectrometry : JMS.

[22]  J. Yates,et al.  Large-scale analysis of the yeast proteome by multidimensional protein identification technology , 2001, Nature Biotechnology.

[23]  P. Schultz,et al.  Profiling of tyrosine phosphorylation pathways in human cells using mass spectrometry , 2003, Proceedings of the National Academy of Sciences of the United States of America.