Differential expression profiling of serum proteins and metabolites for biomarker discovery

Abstract A liquid chromatography-mass spectrometry (LC-MS) proteomics and metabolomics platform is presented for quantitative differential expression analysis. Proteome profiles obtained from 1.5 μL of human serum show ∼5000 de-isotoped and quantifiable molecular ions. Approximately 1500 metabolites are observed from 100 μL of serum. Quantification is based on reproducible sample preparation and linear signal intensity as a function of concentration. The platform is validated using human serum, but is generally applicable to all biological fluids and tissues. The median coefficient of variation (CV) for ∼5000 proteomic and ∼1500 metabolomic molecular ions is approximately 25%. For the case of C-reactive protein, results agree with quantification by immunoassay. The independent contributions of two sources of variance, namely sample preparation and LC-MS analysis, are respectively quantified as 20.4 and 15.1% for the proteome, and 19.5 and 13.5% for the metabolome, for median CV values. Furthermore, biological diversity for ∼20 healthy individuals is estimated by measuring the variance of ∼6500 proteomic and metabolomic molecular ions in sera for each sample; the median CV is 22.3% for the proteome and 16.7% for the metabolome. Finally, quantitative differential expression profiling is applied to a clinical study comparing healthy individuals and rheumatoid arthritis (RA) patients.

[1]  Roberto Brunelli,et al.  MASSACHUSETTS INSTITUTE OF TECHNOLOGY ARTIFICIAL INTELLIGENCE LABORATORY and CENTER FOR BIOLOGICAL AND COMPUTATIONAL LEARNING DEPARTMENT OF BRAIN AND COGNITIVE SCIENCES , 2001 .

[2]  E. Stein Laboratory surrogates for anti-atherosclerotic drug development. , 2001, The American journal of cardiology.

[3]  John B. Fenn,et al.  Electrospray ionization–principles and practice , 1990 .

[4]  Andrew Emili,et al.  De novo peptide sequencing and quantitative profiling of complex protein mixtures using mass-coded abundance tagging , 2002, Nature Biotechnology.

[5]  N. Anderson,et al.  Proteome and proteomics: New technologies, new concepts, and new words , 1998, Electrophoresis.

[6]  J. Yates,et al.  Direct analysis of protein complexes using mass spectrometry , 1999, Nature Biotechnology.

[7]  A. Karmen,et al.  Troponin‐T as a serum marker for myocardial infarction , 1997, Journal of clinical laboratory analysis (Print).

[8]  J. Yates,et al.  An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database , 1994, Journal of the American Society for Mass Spectrometry.

[9]  M. Lessl,et al.  Luteinizing hormone induces mouse vas deferens protein expression in the murine ovary. , 2000, Endocrinology.

[10]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[11]  C. A. Hastings,et al.  New algorithms for processing and peak detection in liquid chromatography/mass spectrometry data. , 2002, Rapid communications in mass spectrometry : RCM.

[12]  Thomas L. Isenhour,et al.  Time-warping algorithm applied to chromatographic peak matching gas chromatography/Fourier transform infrared/mass spectrometry , 1987 .

[13]  F. Regnier,et al.  Strategy for qualitative and quantitative analysis in proteomics based on signature peptides. , 2000, Journal of chromatography. B, Biomedical sciences and applications.

[14]  S. Gygi,et al.  Quantitative analysis of complex protein mixtures using isotope-coded affinity tags , 1999, Nature Biotechnology.

[15]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[16]  W. Weinmann,et al.  Ion suppression effects in liquid chromatography-electrospray-ionisation transport-region collision induced dissociation mass spectrometry with different serum extraction methods for systematic toxicological analysis with mass spectra libraries. , 2002, Journal of chromatography. B, Analytical technologies in the biomedical and life sciences.