Proteomic Biomarker Discovery in 1000 Human Plasma Samples with Mass Spectrometry.

The overall impact of proteomics on clinical research and its translation has lagged behind expectations. One recognized caveat is the limited size (subject numbers) of (pre)clinical studies performed at the discovery stage, the findings of which fail to be replicated in larger verification/validation trials. Compromised study designs and insufficient statistical power are consequences of the to-date still limited capacity of mass spectrometry (MS)-based workflows to handle large numbers of samples in a realistic time frame, while delivering comprehensive proteome coverages. We developed a highly automated proteomic biomarker discovery workflow. Herein, we have applied this approach to analyze 1000 plasma samples from the multicentered human dietary intervention study "DiOGenes". Study design, sample randomization, tracking, and logistics were the foundations of our large-scale study. We checked the quality of the MS data and provided descriptive statistics. The data set was interrogated for proteins with most stable expression levels in that set of plasma samples. We evaluated standard clinical variables that typically impact forthcoming results and assessed body mass index-associated and gender-specific proteins at two time points. We demonstrate that analyzing a large number of human plasma samples for biomarker discovery with MS using isobaric tagging is feasible, providing robust and consistent biological results.

[1]  Gil Alterovitz,et al.  Automation, parallelism, and robotics for proteomics , 2006, Proteomics.

[2]  Takuji Nishimura,et al.  Mersenne twister: a 623-dimensionally equidistributed uniform pseudo-random number generator , 1998, TOMC.

[3]  M. Visser,et al.  Elevated C-reactive protein levels in overweight and obese adults. , 1999, JAMA.

[4]  L. Sottrup-jensen,et al.  Characterization of human pregnancy zone protein. Comparison with human alpha 2-macroglobulin. , 1985, The Journal of biological chemistry.

[5]  R. Cole,et al.  The Plasma Proteome Identifies Expected and Novel Proteins Correlated with Micronutrient Status in Undernourished Nepalese Children 1 – 3 , 2013 .

[6]  Alan A. Doucette,et al.  Proteomic Workflows for Biomarker Identification Using Mass Spectrometry — Technical and Statistical Considerations during Initial Discovery , 2013, Proteomes.

[7]  Ludovic C. Gillet,et al.  Targeted Data Extraction of the MS/MS Spectra Generated by Data-independent Acquisition: A New Concept for Consistent and Accurate Proteome Analysis* , 2012, Molecular & Cellular Proteomics.

[8]  T. Larsen,et al.  The Diet, Obesity and Genes (Diogenes) Dietary Study in eight European countries – a comprehensive design for long‐term intervention , 2010, Obesity reviews : an official journal of the International Association for the Study of Obesity.

[9]  A. Astrup,et al.  Diets with high or low protein content and glycemic index for weight-loss maintenance. , 2010, The New England journal of medicine.

[10]  M. Kussmann,et al.  Proteomics of human plasma: A critical comparison of analytical workflows in terms of effort, throughput and outcome , 2013 .

[11]  N. Anderson,et al.  The clinical plasma proteome: a survey of clinical assays for proteins in plasma and serum. , 2010, Clinical chemistry.

[12]  D. Figeys,et al.  Multiplexed proteomic reactor for the processing of proteomic samples. , 2007, Analytical chemistry.

[13]  Joshua LaBaer,et al.  Statistical design for biospecimen cohort size in proteomics-based biomarker discovery and verification studies. , 2013, Journal of proteome research.

[14]  Amanda G. Paulovich,et al.  An Automated and Multiplexed Method for High Throughput Peptide Immunoaffinity Enrichment and Multiple Reaction Monitoring Mass Spectrometry-based Quantification of Protein Biomarkers* , 2009, Molecular & Cellular Proteomics.

[15]  D. N. Perkins,et al.  Probability‐based protein identification by searching sequence databases using mass spectrometry data , 1999, Electrophoresis.

[16]  David M. Simcha,et al.  Tackling the widespread and critical impact of batch effects in high-throughput data , 2010, Nature Reviews Genetics.

[17]  R. Aebersold,et al.  Mass spectrometry-based proteomics , 2003, Nature.

[18]  Stefan Enroth,et al.  Identification of genetic variants influencing the human plasma proteome , 2013, Proceedings of the National Academy of Sciences.

[19]  F. Regnier,et al.  A large, consistent plasma proteomics data set from prospectively collected breast cancer patient and healthy volunteer samples , 2011, Journal of Translational Medicine.

[20]  Ruedi Aebersold,et al.  Targeted proteomic strategy for clinical biomarker discovery , 2009, Molecular oncology.

[21]  N. Anderson,et al.  The Human Plasma Proteome , 2002, Molecular & Cellular Proteomics.

[23]  Steven A Carr,et al.  Protein biomarker discovery and validation: the long and uncertain path to clinical utility , 2006, Nature Biotechnology.

[24]  C. Silliman,et al.  Proteomic analyses of human plasma: Venus versus Mars , 2012, Transfusion.

[25]  Ruedi Aebersold,et al.  Quantitative variability of 342 plasma proteins in a human twin population , 2015 .

[26]  A. Astrup,et al.  Mechanisms of weight maintenance under high- and low-protein, low-glycaemic index diets. , 2011, Molecular nutrition & food research.

[27]  M. Kussmann,et al.  Comprehensive and Scalable Highly Automated MS-Based Proteomic Workflow for Clinical Biomarker Discovery in Human Plasma. , 2014, Journal of proteome research.

[28]  C. Longcope,et al.  Sex hormone-binding globulin concentration: differences among commercially available methods. , 2000, Clinical chemistry.

[29]  Bernhard Kuster,et al.  Robust and Sensitive iTRAQ Quantification on an LTQ Orbitrap Mass Spectrometer*S , 2008, Molecular & Cellular Proteomics.

[30]  K. Nelson,et al.  Urine Sample Preparation in 96-Well Filter Plates for Quantitative Clinical Proteomics , 2014, Analytical chemistry.

[31]  J. Grudzinskas,et al.  Circulating levels of pregnancy zone protein: normal range and the influence of age and gender. , 1981, Clinica chimica acta; international journal of clinical chemistry.

[32]  A. Tchernof,et al.  Sex Steroid Hormones, Sex Hormone-Binding Globulin, and Obesity in Men and Women , 2000, Hormone and metabolic research = Hormon- und Stoffwechselforschung = Hormones et metabolisme.

[33]  S. Pennington,et al.  Why have so few proteomic biomarkers “survived” validation? (Sample size and independent validation considerations) , 2014, Proteomics.

[34]  Martijn Pinkse,et al.  A high‐throughput sample preparation method for cellular proteomics using 96‐well filter plates , 2013, Proteomics.

[35]  D. Hochstrasser,et al.  Relative quantification of proteins in human cerebrospinal fluids by MS/MS using 6-plex isobaric tags. , 2008, Analytical chemistry.

[36]  Larry Gold,et al.  Beyond antibodies: New affinity reagents to unlock the proteome , 2014, Proteomics.

[37]  H. Langen,et al.  Antibody‐based proteomics and biomarker research—Current status and limitations , 2014, Proteomics.

[38]  B. Pedersen,et al.  Calprotectin — A Novel Marker of Obesity , 2009, PloS one.

[39]  David J Hunter,et al.  Sex hormone-binding globulin and risk of type 2 diabetes in women and men. , 2009, The New England journal of medicine.

[40]  Hasmik Keshishian,et al.  Multiplexed, Quantitative Workflow for Sensitive Biomarker Discovery in Plasma Yields Novel Candidates for Early Myocardial Injury* , 2015, Molecular & Cellular Proteomics.