Integrating biomarkers across omic platforms: an approach to improve stratification of patients with indolent and aggressive prostate cancer

Classifying indolent prostate cancer represents a significant clinical challenge. We investigated whether integrating data from different omic platforms could identify a biomarker panel with improved performance compared to individual platforms alone. DNA methylation, transcripts, protein and glycosylation biomarkers were assessed in a single cohort of patients treated by radical prostatectomy. Novel multiblock statistical data integration approaches were used to deal with missing data and modelled via stepwise multinomial logistic regression, or LASSO. After applying leave‐one‐out cross‐validation to each model, the probabilistic predictions of disease type for each individual panel were aggregated to improve prediction accuracy using all available information for a given patient. Through assessment of three performance parameters of area under the curve (AUC) values, calibration and decision curve analysis, the study identified an integrated biomarker panel which predicts disease type with a high level of accuracy, with Multi AUC value of 0.91 (0.89, 0.94) and Ordinal C‐Index (ORC) value of 0.94 (0.91, 0.96), which was significantly improved compared to the values for the clinical panel alone of 0.67 (0.62, 0.72) Multi AUC and 0.72 (0.67, 0.78) ORC. Biomarker integration across different omic platforms significantly improves prediction accuracy. We provide a novel multiplatform approach for the analysis, determination and performance assessment of novel panels which can be applied to other diseases. With further refinement and validation, this panel could form a tool to help inform appropriate treatment strategies impacting on patient outcome in early stage prostate cancer.

[1]  P. Tan,et al.  Diagnostic and Prognostic Utility of a DNA Hypermethylated Gene Signature in Prostate Cancer , 2014, PloS one.

[2]  Jouhyun Jeon,et al.  Targeted proteomics identifies liquid-biopsy signatures for extracapsular prostate cancer , 2016, Nature Communications.

[3]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[4]  Benjamin R Saville,et al.  Decision curve analysis. , 2015, JAMA.

[5]  L. Holmberg,et al.  Low levels of apolipoprotein A-I and HDL are associated with risk of prostate cancer in the Swedish AMORIS study , 2011, Cancer Causes & Control.

[6]  D. Stein,et al.  Evaluation of matrix effect and chromatography efficiency: new parameters for validation of method development , 2005, Journal of the American Society for Mass Spectrometry.

[7]  Ruedi Aebersold,et al.  Cancer genetics-guided discovery of serum biomarker signatures for diagnosis and prognosis of prostate cancer , 2011, Proceedings of the National Academy of Sciences.

[8]  Thomas Brendan Murphy,et al.  Evaluation of prediction models for the staging of prostate cancer , 2013, BMC Medical Informatics and Decision Making.

[9]  Ludmila I. Kuncheva,et al.  Combining Label Outputs , 2014 .

[10]  Tong Li,et al.  miR‐20a Promotes Prostate Cancer Invasion and Migration Through Targeting ABL2 , 2014, Journal of cellular biochemistry.

[11]  W. Gallagher,et al.  Commercialized biomarkers: new horizons in prostate cancer diagnostics , 2015, Expert review of molecular diagnostics.

[12]  A. Albert,et al.  On the existence of maximum likelihood estimates in logistic regression models , 1984 .

[13]  Madhusudana Shashanka,et al.  Linear Methods for Regression , 2009 .

[14]  Igor Jurisica,et al.  Tumour genomic and microenvironmental heterogeneity for integrated prediction of 5-year biochemical recurrence of prostate cancer: a retrospective cohort study. , 2014, The Lancet. Oncology.

[15]  J. A. Ferreira,et al.  Approximate Sample Size Calculations with Microarray Data: An Illustration , 2006, Statistical applications in genetics and molecular biology.

[16]  Yuanyuan Liang,et al.  Updating risk prediction tools: A case study in prostate cancer , 2012, Biometrical journal. Biometrische Zeitschrift.

[17]  John Aitchison,et al.  The Statistical Analysis of Compositional Data , 1986 .

[18]  Yvonne Vergouwe,et al.  Discrimination ability of prediction models for ordinal outcomes: Relationships between existing measures and a new measure , 2012, Biometrical journal. Biometrische Zeitschrift.

[19]  S. Hewitt,et al.  Gene expression and epigenetic discovery screen reveal methylation of SFRP2 in prostate cancer , 2013, International journal of cancer.

[20]  Xavier Robin,et al.  pROC: an open-source package for R and S+ to analyze and compare ROC curves , 2011, BMC Bioinformatics.

[21]  E. Elkin,et al.  Decision Curve Analysis: A Novel Method for Evaluating Prediction Models , 2006, Medical decision making : an international journal of the Society for Medical Decision Making.

[22]  M. Cooperberg,et al.  Validation of a cell-cycle progression gene panel to improve risk stratification in a contemporary prostatectomy cohort. , 2012, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[23]  Anton K. Formann Mixture analysis of multivariate categorical data with covariates and missing entries , 2007, Comput. Stat. Data Anal..

[24]  Anthony D'Amico,et al.  Prostate cancer. Clinical practice guidelines in oncology. , 2007, Journal of the National Comprehensive Cancer Network : JNCCN.

[25]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[26]  David L Rimm,et al.  Development and Clinical Validation of an In Situ Biopsy-Based Multimarker Assay for Risk Stratification in Prostate Cancer , 2015, Clinical Cancer Research.

[27]  M. Mann,et al.  Universal sample preparation method for proteome analysis , 2009, Nature Methods.

[28]  R. William G. Watson,et al.  Biomarker research in prostate cancer—towards utility, not futility , 2011, Nature Reviews Urology.

[29]  Thomas Brendan Murphy,et al.  Applying random forests to identify biomarker panels in serum 2D-DIGE data for the detection and staging of prostate cancer. , 2011, Journal of proteome research.

[30]  M. Schemper,et al.  A solution to the problem of separation in logistic regression , 2002, Statistics in medicine.

[31]  C. Creighton,et al.  Dysregulation of miRNAs-COUP-TFII-FOXM1-CENPF axis contributes to the metastasis of prostate cancer , 2016, Nature Communications.

[32]  A. D'Amico,et al.  NCCN clinical practice guidelines in oncology: prostate cancer. , 2010, Journal of the National Comprehensive Cancer Network : JNCCN.

[33]  M. Babjuk,et al.  Importance of miR-20a expression in prostate cancer tissue. , 2010, Anticancer research.

[34]  M. Loda,et al.  Stress-Related Signaling Pathways in Lethal and Nonlethal Prostate Cancer , 2015, Clinical Cancer Research.

[35]  T. Golub,et al.  mRNA expression signature of Gleason grade predicts lethal prostate cancer. , 2011, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[36]  C. Eyers Universal sample preparation method for proteome analysis , 2009 .

[37]  Ludmila I. Kuncheva Fusion of Label Outputs , 2004 .

[38]  Pauline M Rudd,et al.  Detailed structural analysis of N-glycans released from glycoproteins in SDS-PAGE gel bands using HPLC combined with exoglycosidase array digestions. , 2006, Methods in molecular biology.

[39]  S. Serrano,et al.  A 12-gene expression signature is associated with aggressive histological in prostate cancer: SEC14L1 and TCEB1 genes are potential markers of progression. , 2012, The American journal of pathology.

[40]  David J. Harvey,et al.  HPLC-based analysis of serum N-glycans on a 96-well plate platform with dedicated database software. , 2008, Analytical biochemistry.

[41]  D. McMillan,et al.  Systemic Inflammatory Response and Survival in Patients with Localised Prostate Cancer: 10-Year Follow-Up , 2010, Urologia Internationalis.

[42]  S. Klauck,et al.  Novel RNA Markers in Prostate Cancer: Functional Considerations and Clinical Translation , 2014, BioMed research international.

[43]  Yue Fan,et al.  Core fucosylation and alpha2-3 sialylation in serum N-glycome is significantly increased in prostate cancer comparing to benign prostate hyperplasia. , 2011, Glycobiology.

[44]  P. Febbo,et al.  A 17-gene assay to predict prostate cancer aggressiveness in the context of Gleason grade heterogeneity, tumor multifocality, and biopsy undersampling. , 2014, European urology.

[45]  Colin Campbell,et al.  A pathway-based data integration framework for prediction of disease progression , 2013, Bioinform..