Data analysis of assorted serum peptidome profiles

Discovery of biomarker patterns using proteomic techniques requires examination of large numbers of patient and control samples, followed by data mining of the molecular read-outs (e.g., mass spectra). Adequate signal processing and statistical analysis are critical for successful extraction of markers from these data sets. The protocol, specifically designed for use in conjunction with MALDI-TOF-MS-based serum peptide profiling, is a data analysis pipeline, starting with transfer of raw spectra that are interpreted using signal processing algorithms to define suitable features (i.e., peptides). We describe an algorithm for minimal entropy-based peak alignment across samples. Peak lists obtained in this way, and containing all samples, all peptide features and their normalized MS-ion intensities, can be evaluated, and results validated, using common statistical methods. We recommend visual inspection of the spectra to confirm all results, and have written freely available software for viewing and color-coding of spectral overlays.

[1]  G. Opiteck,et al.  In Vitro Biomarker Discovery for Atherosclerosis by Proteomics* , 2004, Molecular & Cellular Proteomics.

[2]  Patrick G. A. Pedrioli,et al.  A tool to visualize and evaluate data obtained by liquid chromatography-electrospray ionization-mass spectrometry. , 2004, Analytical chemistry.

[3]  Tao Liu,et al.  Submitted to Molecular and Cellular Proteomics Advances and Challenges in Liquid Chromatography-Mass Spectrometry Based Proteomic Profiling for Clinical Applications , 2006 .

[4]  Hua Tang,et al.  A statistical method for chromatographic alignment of LC-MS data. , 2007, Biostatistics.

[5]  Steven A Carr,et al.  Place of pattern in proteomic biomarker discovery. , 2005, Journal of proteome research.

[6]  P. Tempst,et al.  Automated serum peptide profiling , 2006, Nature Protocols.

[7]  Richard D. Smith,et al.  Robust algorithm for alignment of liquid chromatography-mass spectrometry analyses in an accurate mass and time tag data analysis pipeline. , 2006, Analytical chemistry.

[8]  P. Tempst,et al.  Correcting common errors in identifying cancer-specific serum peptide signatures. , 2005, Journal of proteome research.

[9]  M. Schrader,et al.  Composition of the peptide fraction in human blood plasma: database of circulating human peptides. , 1999, Journal of chromatography. B, Biomedical sciences and applications.

[10]  K. Markides,et al.  Chromatographic alignment by warping and dynamic programming as a pre-processing tool for PARAFAC modelling of liquid chromatography-mass spectrometry data. , 2002, Journal of chromatography. A.

[11]  Jack G. Dodd,et al.  Smoothing and Derivatives in Spectroscopy , 2006 .

[12]  P. Schellhammer,et al.  Serum protein fingerprinting coupled with a pattern-matching algorithm distinguishes prostate cancer from benign prostate hyperplasia and healthy men. , 2002, Cancer research.

[13]  Robert Tibshirani,et al.  Sample classification from protein mass spectrometry, by 'peak probability contrasts' , 2004, Bioinform..

[14]  Ruedi Aebersold,et al.  Improving mass and liquid chromatography based identification of proteins using bayesian scoring. , 2005, Journal of proteome research.

[15]  E. Petricoin,et al.  SELDI-TOF-based serum proteomic pattern diagnostics for early detection of cancer. , 2004, Current opinion in biotechnology.

[16]  Thomas P Conrads,et al.  SELDI-TOF MS for diagnostic proteomics. , 2003, Analytical chemistry.

[17]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[18]  T. Shaler,et al.  Quantification of proteins and metabolites by mass spectrometry without isotopic labeling or spiked standards. , 2003, Analytical chemistry.

[19]  Yu Shyr,et al.  Proteomic patterns of tumour subsets in non-small-cell lung cancer , 2003, The Lancet.

[20]  P. Tempst,et al.  Serum Peptidome Patterns That Distinguish Metastatic Thyroid Carcinoma from Cancer-free Controls Are Unbiased by Gender and Age*S , 2006, Molecular & Cellular Proteomics.

[21]  E. Holland,et al.  Serum peptide profiling by magnetic particle-assisted, automated sample processing and MALDI-TOF mass spectrometry. , 2004, Analytical chemistry.

[22]  K. Coombes,et al.  Direct tandem mass spectrometry reveals limitations in protein profiling experiments for plasma biomarker discovery. , 2005, Journal of proteome research.

[23]  A. Olshen,et al.  Differential exoprotease activities confer tumor-specific serum peptidome patterns. , 2005, The Journal of clinical investigation.

[24]  M. Gorenstein,et al.  Quantitative proteomic analysis by accurate mass retention time pairs. , 2005, Analytical chemistry.

[25]  Mark S Friedrichs,et al.  Changes in the protein expression of yeast as a function of carbon source. , 2003, Journal of proteome research.