Proteomic Database Search and Analytical Quantification for Mass Spectrometry

Technological development and the ever-increasing complexity of proteomic profiling studies requires automated tools for tandem mass spectra interpretation. The analysis of various methods used for accurate and sensitive quantitation adds another layer of complexity. Therefore, investigators need a more in-depth understanding of database searches and the calculation of statistical significance in order to find a proper bioinformatics approach for their study. In this chapter, we review tools and procedures that have been developed in two decades of collective experience in the proteomics field. However, the field is still in dynamic expansion thus intention of this chapter is to provide readers with background for reading about the newest tools and approaches developed for database searches.

[1]  Peter B. McGarvey,et al.  UniRef: comprehensive and non-redundant UniProt reference clusters , 2007, Bioinform..

[2]  Brendan MacLean,et al.  Bioinformatics Applications Note Gene Expression Skyline: an Open Source Document Editor for Creating and Analyzing Targeted Proteomics Experiments , 2022 .

[3]  Peter R. Baker,et al.  Role of accurate mass measurement (+/- 10 ppm) in protein identification strategies employing MS or MS/MS and database searching. , 1999, Analytical chemistry.

[4]  Sean L Seymour,et al.  The Paragon Algorithm, a Next Generation Search Engine That Uses Sequence Temperature Values and Feature Probabilities to Identify Peptides from Tandem Mass Spectra*S , 2007, Molecular & Cellular Proteomics.

[5]  Yoshifumi Kawamura,et al.  HGPD: Human Gene and Protein Database, 2012 update , 2012, Nucleic Acids Res..

[6]  A. Makarov,et al.  Interfacing the orbitrap mass analyzer to an electrospray ion source. , 2003, Analytical chemistry.

[7]  A. Burlingame,et al.  Mass spectrometric analysis of histone posttranslational modifications. , 2005, Methods.

[8]  J. Yates,et al.  Method to correlate tandem mass spectra of modified peptides to amino acid sequences in the protein database. , 1995, Analytical chemistry.

[9]  A. Makarov,et al.  The Orbitrap: a new mass spectrometer. , 2005, Journal of mass spectrometry : JMS.

[10]  Cathy H. Wu,et al.  The Universal Protein Resource (UniProt): an expanding universe of protein information , 2005, Nucleic Acids Res..

[11]  M. Wilm,et al.  Error-tolerant identification of peptides in sequence databases by peptide sequence tags. , 1994, Analytical chemistry.

[12]  M. Mann,et al.  Andromeda: a peptide search engine integrated into the MaxQuant environment. , 2011, Journal of proteome research.

[13]  R. Aebersold,et al.  Mass spectrometry-based proteomics , 2003, Nature.

[14]  Robertson Craig,et al.  TANDEM: matching proteins with tandem mass spectra. , 2004, Bioinformatics.

[15]  M. Senko,et al.  A two-dimensional quadrupole ion trap mass spectrometer , 2002, Journal of the American Society for Mass Spectrometry.

[16]  M. Mann,et al.  MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification , 2008, Nature Biotechnology.

[17]  M. Westphall,et al.  Medicago PhosphoProtein Database: a repository for Medicago truncatula phosphoprotein data , 2012, Front. Plant Sci..

[18]  P. Højrup,et al.  Rapid identification of proteins by peptide-mass fingerprinting , 1993, Current Biology.