Multidimensional protein identification technology: current status and future prospects

Protein profiling by high-throughput tandem mass spectrometry has become a powerful method for analyzing changes in global protein expression patterns in cells and tissues as a function of developmental, physiologic and disease processes. This review summarizes the utility and practical application of multidimensional protein identification technology as a platform for comprehensive proteomic profiling of complex biologic samples. The strengths and limitations of the technology are discussed, with emphasis on one of the biggest challenges currently facing large-scale expression profiling projects: data analysis. Complementary bioinformatic data mining strategies, such as clustering, functional annotation and statistical inference, are also discussed, as these are increasingly necessary for interpreting the results of global proteomic profiling studies.
