Tandem Mass Spectral Libraries of Peptides in Digests of Individual Proteins: Human Serum Albumin (HSA) *

This work presents a method for creating a mass spectral library containing tandem spectra of identifiable peptide ions in the tryptic digestion of a single protein. Human serum albumin (HSA1) was selected for this purpose owing to its ubiquity, high level of characterization and availability of digest data. The underlying experimental data consisted of ∼3000 one-dimensional LC-ESI-MS/MS runs with ion-trap fragmentation. In order to generate a wide range of peptides, studies covered a broad set of instrument and digestion conditions using multiple sources of HSA and trypsin. Computer methods were developed to enable the reliable identification and reference spectrum extraction of all peptide ions identifiable by current sequence search methods. This process made use of both MS2 (tandem) spectra and MS1 (electrospray) data. Identified spectra were generated for 2918 different peptide ions, using a variety of manually-validated filters to ensure spectrum quality and identification reliability. The resulting library was composed of 10% conventional tryptic and 29% semitryptic peptide ions, along with 42% tryptic peptide ions with known or unknown modifications, which included both analytical artifacts and post-translational modifications (PTMs) present in the original HSA. The remaining 19% contained unexpected missed-cleavages or were under/over alkylated. The methods described can be extended to create equivalent spectral libraries for any target protein. Such libraries have a number of applications in addition to their known advantages of speed and sensitivity, including the ready re-identification of known PTMs, rejection of artifact spectra and a means of assessing sample and digestion quality.

[1]  Mark S Lowenthal,et al.  Quantitative bottom-up proteomics depends on digestion conditions. , 2014, Analytical chemistry.

[2]  Alexey I Nesvizhskii,et al.  Comprehensive analysis of protein digestion using six trypsins reveals the origin of trypsin as a significant source of variability in proteomics. , 2013, Journal of proteome research.

[3]  D. S. Hage,et al.  Review: Glycation of human serum albumin. , 2013, Clinica chimica acta; international journal of clinical chemistry.

[4]  J. Mira,et al.  Specific antioxidant properties of human serum albumin , 2013, Annals of Intensive Care.

[5]  S. Rappaport,et al.  Cys34 adducts of reactive oxygen species in human serum albumin. , 2012, Chemical research in toxicology.

[6]  Stephen Stein,et al.  Mass spectral reference libraries: an ever-expanding resource for chemical identification. , 2012, Analytical chemistry.

[7]  V. Trezza,et al.  Human serum albumin: from bench to bedside. , 2012, Molecular aspects of medicine.

[8]  Vassilios Ioannidis,et al.  ExPASy: SIB bioinformatics resource portal , 2012, Nucleic Acids Res..

[9]  J. Silberg,et al.  A transposase strategy for creating libraries of circularly permuted proteins , 2012, Nucleic acids research.

[10]  Bin Zhang,et al.  PhosphoSitePlus: a comprehensive resource for investigating the structure and function of experimentally determined post-translational modifications in man and mouse , 2011, Nucleic Acids Res..

[11]  The UniProt Consortium,et al.  Reorganizing the protein space at the Universal Protein Resource (UniProt) , 2011, Nucleic Acids Res..

[12]  E. Diamandis,et al.  Characterization of the seminal plasma proteome in men with prostatitis by mass spectrometry , 2012, Clinical Proteomics.

[13]  E. Go,et al.  A general protease digestion procedure for optimal protein sequence coverage and post-translational modifications analysis of recombinant glycoproteins: application to the characterization of human lysyl oxidase-like 2 glycosylation. , 2011, Analytical chemistry.

[14]  T. Köcher,et al.  Universal and confident phosphorylation site localization using phosphoRS. , 2011, Journal of proteome research.

[15]  Martin Kircher,et al.  Deep proteome and transcriptome mapping of a human cancer cell line , 2011, Molecular systems biology.

[16]  Ruedi Aebersold,et al.  Building and searching tandem mass (MS/MS) spectral libraries for peptide identification in proteomics. , 2011, Methods.

[17]  He Li,et al.  Profiling Cys34 Adducts of Human Serum Albumin by Fixed-Step Selected Reaction Monitoring* , 2010, Molecular & Cellular Proteomics.

[18]  M. Koga,et al.  Clinical impact of glycated albumin as another glycemic control marker. , 2010, Endocrine journal.

[19]  D. S. Hage,et al.  Quantitative analysis of glycation sites on human serum albumin using (16)O/(18)O-labeling and matrix-assisted laser desorption/ionization time-of-flight mass spectrometry. , 2010, Clinica chimica acta; international journal of clinical chemistry.

[20]  L. Deterding,et al.  Mass spectrometric identification of oxidative modifications of tryptophan residues in proteins: Chemical artifact or post-translational modification? , 2010, Journal of the American Society for Mass Spectrometry.

[21]  B. Kuster,et al.  Proteomics: a pragmatic perspective , 2010, Nature Biotechnology.

[22]  D. Tabb,et al.  TagRecon: high-throughput mutation identification through sequence tagging. , 2010, Journal of proteome research.

[23]  O. Lockridge,et al.  Reaction of human albumin with aspirin in vitro: mass spectrometric identification of acetylated lysines 199, 402, 519, and 545. , 2010, Biochemical pharmacology.

[24]  Birgit Schilling,et al.  Repeatability and reproducibility in proteomic identifications by liquid chromatography-tandem mass spectrometry. , 2010, Journal of proteome research.

[25]  Tao Jiang,et al.  MSOAR 2.0: Incorporating tandem duplications into ortholog assignment based on genome rearrangement , 2010, BMC Bioinformatics.

[26]  Birgit Schilling,et al.  Interlaboratory Study Characterizing a Yeast Performance Standard for Benchmarking LC-MS Platform Performance* , 2009, Molecular & Cellular Proteomics.

[27]  David L. Tabb,et al.  Performance Metrics for Liquid Chromatography-Tandem Mass Spectrometry Systems in Proteomics Analyses* , 2009, Molecular & Cellular Proteomics.

[28]  John R Yates,et al.  Identification of N-terminally arginylated proteins and peptides by mass spectrometry , 2009, Nature Protocols.

[29]  J. Mesirov,et al.  Prediction of high-responding peptides for targeted protein assays by mass spectrometry , 2009, Nature Biotechnology.

[30]  Yingming Zhao,et al.  PTMap—A sequence alignment software for unrestricted, accurate, and full-spectrum identification of post-translational modification sites , 2009, Proceedings of the National Academy of Sciences.

[31]  Stephen E. Stein,et al.  The Drosophila melanogaster PeptideAtlas facilitates the use of peptide data for improved fly proteomics and genome annotation , 2009, BMC Bioinformatics.

[32]  Felix Kratz,et al.  Albumin as a drug carrier: design of prodrugs, drug conjugates and nanoparticles. , 2008, Journal of controlled release : official journal of the Controlled Release Society.

[33]  Peter R Baker,et al.  In-depth Analysis of Tandem Mass Spectrometry Data from Disparate Instrument Types*S , 2008, Molecular & Cellular Proteomics.

[34]  H. V. Roohk,et al.  A Review of Glycated Albumin as an Intermediate Glycation Index for Controlling Diabetes , 2008, Journal of diabetes science and technology.

[35]  L. Hood,et al.  Proteomics cataloging analysis of human expressed prostatic secretions reveals rich source of biomarker candidates , 2008, Proteomics. Clinical applications.

[36]  Xinning Jiang,et al.  Large‐scale phosphoproteome analysis of human liver tissue by enrichment and fractionation of phosphopeptides with strong anion exchange chromatography , 2008, Proteomics.

[37]  Samuel H. Payne,et al.  Accurate annotation of peptide modifications through unrestrictive database search. , 2008, Journal of proteome research.

[38]  Ruedi Aebersold,et al.  The Implications of Proteolytic Background for Shotgun Proteomics*S , 2007, Molecular & Cellular Proteomics.

[39]  Nichole L. King,et al.  Development and validation of a spectral library searching method for peptide identification from MS/MS , 2007, Proteomics.

[40]  Steven P Gygi,et al.  Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry , 2007, Nature Methods.

[41]  Maolian Chen,et al.  Oxidation artifacts in the electrospray mass spectrometry of Abeta Peptide. , 2007, Analytical chemistry.

[42]  Rebekah L. Gundry,et al.  Investigation of an albumin‐enriched fraction of human serum and its albuminome , 2007, Proteomics. Clinical applications.

[43]  Jennifer A. Siepen,et al.  Prediction of missed cleavage sites in tryptic peptides aids protein identification in proteomics. , 2007, Journal of proteome research.

[44]  Zhongqi Zhang,et al.  Rearrangement of terminal amino acid residues in peptides by protease-catalyzed intramolecular transpeptidation. , 2006, Analytical biochemistry.

[45]  Steven P Gygi,et al.  A probability-based approach for high-throughput protein phosphorylation analysis and site localization , 2006, Nature Biotechnology.

[46]  Ping Wan,et al.  A Dataset of Human Fetal Liver Proteome Identified by Subcellular Fractionation and Multiple Protein Separation and Identification Technology*S , 2006, Molecular & Cellular Proteomics.

[47]  G. Aldini,et al.  Mass spectrometric characterization of covalent modification of human serum albumin by 4-hydroxy-trans-2-nonenal. , 2006, Journal of mass spectrometry : JMS.

[48]  William Stafford Noble,et al.  Analysis of peptide MS/MS spectra from large-scale proteomics experiments using spectrum libraries. , 2006, Analytical chemistry.

[49]  R. Beavis,et al.  Using annotated peptide mass spectrum libraries for protein identification. , 2006, Journal of proteome research.

[50]  D. Fenyö,et al.  Detection of artifacts and peptide modifications in liquid chromatography/mass spectrometry data using two-dimensional signal intensity map data visualization. , 2006, Rapid communications in mass spectrometry : RCM.

[51]  S. Berezenko Heterogeneity and oxidation status of commercial human albumin preparations in clinical use. , 2006, Critical care medicine.

[52]  R. Aebersold,et al.  Dynamic Spectrum Quality Assessment and Iterative Computational Analysis of Shotgun Proteomic Data , 2006, Molecular & Cellular Proteomics.

[53]  D. Slone,et al.  The formation and rapid clearance of a truncated albumin species in a critically ill patient. , 2006, Clinica chimica acta; international journal of clinical chemistry.

[54]  Kaoru Kobayashi Summary of recombinant human serum albumin development. , 2006, Biologicals : journal of the International Association of Biological Standardization.

[55]  D. S. Hage,et al.  Obtaining high sequence coverage in matrix-assisted laser desorption time-of-flight mass spectrometry for studies of protein modification: analysis of human serum albumin as a model. , 2006, Analytical biochemistry.

[56]  G. Allmaier,et al.  Characterization of cysteinylation of pharmaceutical-grade human serum albumin by electrospray ionization mass spectrometry and low-energy collision-induced dissociation tandem mass spectrometry. , 2005, Rapid communications in mass spectrometry : RCM.

[57]  R. Aebersold,et al.  A uniform proteomics MS/MS analysis platform utilizing open XML file formats , 2005, Molecular systems biology.

[58]  P. Pevzner,et al.  InsPecT: identification of posttranslationally modified peptides from tandem mass spectra. , 2005, Analytical chemistry.

[59]  G. Martin,et al.  Albumin: Biochemical properties and therapeutic potential , 2005, Hepatology.

[60]  M. Mann,et al.  Phosphotyrosine interactome of the ErbB-receptor kinase family , 2005, Molecular systems biology.

[61]  Kai A Reidegeld,et al.  Tryptic transpeptidation products observed in proteome analysis by liquid chromatography‐tandem mass spectrometry , 2005, Proteomics.

[62]  Robertson Craig,et al.  TANDEM: matching proteins with tandem mass spectra. , 2004, Bioinformatics.

[63]  D. Creasy,et al.  Unimod: Protein modifications for mass spectrometry , 2004, Proteomics.

[64]  S. Bryant,et al.  Open mass spectrometry search algorithm. , 2004, Journal of proteome research.

[65]  S. Pongor,et al.  Vicinal disulfide bridge conformers by experimental methods and by ab initio and DFT molecular computations , 2004, Proteins.

[66]  John R Yates,et al.  Influence of basic residue content on fragment ion peak intensities in low-energy collision-induced dissociation spectra of peptides. , 2004, Analytical chemistry.

[67]  P. Weinstein,et al.  Human Serum Albumin and its N-Terminal Tetrapeptide (DAHK) Block Oxidant-Induced Neuronal Death , 2004, Stroke.

[68]  M. Mann,et al.  Proteomic analysis of post-translational modifications , 2003, Nature Biotechnology.

[69]  A. Marina,et al.  Peptide rearrangement during quadrupole ion trap fragmentation: added complexity to MS/MS spectra. , 2003, Analytical chemistry.

[70]  G. Pallante,et al.  Effects of peptide chain length on the gas-phase proton transfer properties of doubly-protonated ions from bradykinin and its N-terminal fragment peptides , 2002 .

[71]  E. Boja,et al.  Overalkylation of a protein digest with iodoacetamide. , 2001, Analytical chemistry.

[72]  J. Yates,et al.  Large-scale analysis of the yeast proteome by multidimensional protein identification technology , 2001, Nature Biotechnology.

[73]  S. Brennan,et al.  Three truncated forms of serum albumin associated with pancreatic pseudocyst. , 2000, Biochimica et biophysica acta.

[74]  Peter R. Baker,et al.  Role of accurate mass measurement (+/- 10 ppm) in protein identification strategies employing MS or MS/MS and database searching. , 1999, Analytical chemistry.

[75]  I. Apostol,et al.  Carbamylation of cysteine: a potential artifact in peptide mapping of hemoglobins in the presence of urea. , 1999, Analytical biochemistry.

[76]  J. Ross,et al.  Changes in nutritional, functional, and inflammatory markers in advanced pancreatic cancer. , 1999, Nutrition and cancer.

[77]  B. Chait,et al.  Modification of cysteine residues by alkylation. A tool in peptide mapping and protein identification. , 1998, Analytical chemistry.

[78]  J. Yates,et al.  Method to compare collision-induced dissociation spectra of peptides: potential for library searching and subtractive analysis. , 1998, Analytical chemistry.

[79]  T. Peters,et al.  All About Albumin: Biochemistry, Genetics, and Medical Applications , 1995 .

[80]  P. Schnier,et al.  On the maximum charge state and proton transfer reactivity of peptide and protein ions formed by electrospray ionization , 1995, Journal of the American Society for Mass Spectrometry.

[81]  N. Dodsworth,et al.  Site-specific N-terminal auto-degradation of human serum albumin. , 1995, European journal of biochemistry.

[82]  W. Kuang,et al.  Molecular structure of the human albumin gene is revealed by nucleotide sequence within q11-22 of chromosome 4. , 1986, The Journal of biological chemistry.